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ATCAAAGATGAAGTGGAAAGACAAGGGCCCTCGGATGCAGCCGGA 
ATCTTGGTGTAGTCACCAAACCTCTTGAGTCrrcrTTCCCACATACAA 
CAAGGAAATAACCTAAATGACTAGGGCAGTGACAGTC 
GCAACCCCATGATTTATAGAAGCCAAATTACGTTCTCTGA^ 
5 GTTGTCTTTTGTTAGATCCACATACAGAAGTCACAGAAATGTTT^ 

TGTATACCAAAACACTATTTTAAACCACCTATCCCCTGGGTTAGAAATTATA^^ 
TGCCATTCTCCCATTTATTTAACTrCCTrCCCTCCATATTTGAAGTAATr 
CTGTG ATI T CI ' I XTCAGATTTTCAACACACAGTGG 
TAGTAATTTTATGTTGGGACATGTGATCTCCTTGACATTTTCT 
1 0 xAGTTAAACCATTACCACTTC^CriGTTATGACAGTGAGGC 

CAGAAAAATCTGGAACATATGGCCTCATCTATTGTCTAAATTTTTTCT 

TAAT AGTGACAATCATAAAT AGGGG ATTT AATCAAT ACTCTTTT AAAAAAAAAAGTCACAG AAAGAAAG CAATTGT ATTG AAAACC 
CCTTTCTAGTTTTTCCAGTAGGTTACATTAAATGGCCCCAA 
AAAAGATGACCAACAGGGGGAAACCTGAAACAGCCTITTAGGTA^ 
1 5 CATGTTAGAGCCAACAAAAAACATCTGTTGTTTTATTTATAGCCACAATT^ 
TTGACAGCTTATAATTCATAAAGTTACTGAGGCCTAATCT 
CCTTCCCTTTCTGCTGAAGACAGAAATGGTGCAAC^GAGGA^ 

GATGATAATGCAGAAAGGAGTACCCCGCAGTGATAAATGJVTTGCTTTCACCTCTGGGCCAT 
TTTCATTAACATTGTATGGACTTTCATTATAGGAAGCCAACATTTTGCTGTG 
2 0 CAGGAATCTAAAGTGTTAATTAAAACGACTGATTGAATAGTTCCCTCTC 
TTAGCTCTAGTCTATATTGACATAACTTTCTCCCTGCTGACrrCTTAAGC 
TCTGTATCTGTTGGCTTATTTCTGAGAATCCTTTTTGAGAACA 
TATACACACACGCACACACACACATAC^CTACATATATA^ 

TATGTATACATATATCTATATATGTATACATATAGATACATAGATATATAGATACATATCTATATAATGTATACATGTATACACAT 

25 GTACCTACATGTATACACATGTATCTGCATGTATAaiCATGTATACACATG 

GCATGTATACACATGTAGATACATGTAGATGCATGTATACATATATACATGTAGATGCATGTAGATAC^ 
GTAGATGCATGTAGATACATGTATACATATATACATGTAGATGCATGTAGATACATGTATACATATATACATGTAGATC 
ATACATGTGTACATATATACATGTAGATGCATGTAGATAOVTGTGTACATATATACATGTAGATGCATGTAGATAC^ 
GTAGATACATGTAGATACATGTATACATGTAGATACATGTAGATACATGTATACATATATACATGTAGATACATCT 

30 ATACATATATACATGTAGATACATGTATACATATATACATGTAGATACATGTATACATATATACATC 

ATACATGTAGATACATGTAGATACATGTATACATATATACATGTAGATACATGTAGATACATGTATACATATATACATGTAGA 

ATGTATATAGATACATACATAGATATATAGATACATAGATACATATATTTATATATGTACATATATCTACATATATATC^ 

ACATATATCTACATATATATCTATATATATATATCTAC^^ 

CTGCTATATCTATAGGGCCTTCTGTATATCACAGTTTTGTGTGTTGTTGTTTTTTTTA 
35 • AGTACTATTTATTATCCTGAGCTTTTTTGAGAATGTAGTGATAGGGTTCAACATCT 
CTAAAACCATAACCAGAAGAAGCCCATTAATTTCTTTAG 
ATTGTGAGATATGAAGATGTATTAGCCCACCAGAATTAATCACCTTTGTC^ 
CT CTGATTCCTTACACAATTGTTAAGGATTGTGTTTACTTTC 

GATCTTTTCAAACTTTTAAAATGAAGTCTGAATAAATATTATTTTCTTATGCATTTCT 
4 0 TACAATGTTTTCAGGCCTCTTGCCTITCTTGC 

TTTTTGATCATGGATGCTTCAACTCCCTrCATATATTTGTTAAGC^ 
CATTACAAAATATATATTTTTAAAAAGTGATTTGAAGGGCCCTCT^ 

TAAAAGTGATAAGTTAGCCAGCTATATTTAGCTGGATAAAACCCATTACATCCTTGGGGAAGAATGTGCTTTC 
GAAAATATTTAATAAATGCTGTCTTGAAAATATTTAGCCTGGCTGAGTGAGCTGGCAAGGCTTCAATACTGATAAAGCAGAG^ 

4 5 TGGTGAAAAGCTGGCCTGCC^GTCCAAGCTCAAGTTAAAAAAATGCACCTCTTGTCA 

GTCCATCAGTGTCACGT AAAGATTTTTTAACCTCACATATGGCATCATGCAGTTTGTCTGCCTAA^ CAACCT AGATT CACAG 
AGTGGTT ACAATTGGCTTTG AAGT C C CAT CTGTT CTG T AAAATC CT AG CTTT ATTT CTT ACTGCTG GT ATG ACTTT AGGT AACTTG 
TTTAATTTTTCTTAATCTCAGTTTTCTCATCTGTAAAATGGGGACAGTGATAATTTCT 
AAACGCTGAGCAGAGTTTCTTACACTAAGTGTTCACTACA^ 

5 0 TTCTGTCCCTCCCTAACTCCTCCCACCCCACAGGAGATACAAAGAGCAAAGTCCTCTTGAATGTG^ 

CATGGGCATCAGGAGGCCTGGGTCCTCGGCTTGAACTTCCTGTGGGACCTTGGGC 
ATCAGTACACAGTGCACAGGGCAGCAGAGCTCCTGGCTGCTAGTCTCTCTTTTGATACCTAATC 
TGATATGATAGCTAGACCCTCCGGTCCGTCAGTATCATTTGGGGTCTGGTGTAA 
CAAGTTATCAGTAAGAAGGTCAATCATTGGGATTCTGTCTCCCACAATGGTCTGGATTTTCTCT 

5 5 CAATCCTTGCTACTCAAAGTGTGATTGTGGTCCGAGGACCGGCAGCATGGGCATCACCTGGGAGCCTGTTGGAAATGCA 

AGACTGCACCCCGACCTCCTGAATCAGGCTGTTAATATGTACATTAAAGTTGGGGAAC 
ATATTCTGACACGCAGAGGGTATTTATTTACTAAATACTTACAGAATTTGTGTTTGA 
CCTCTCTTTGGGTGTAGTGTACTGATGTGTTAGGTCAGGGTTACCATCAGACTGGTG 
AAATCTATTTCTCACAGGGTGCC^TGTGGTTGGGTTCTGGTC 

6 0 ATCTCACATGGGAAGGGTGGGGAGAAAGGTCTCTGGTGCTGCTTCTTATAAAGGC^ 

TGACTTCCTGTAAACTCAGCTGCCTCCCAAAGGCCCATCTCCAAATATTATCACATTG 
GGGAAGGGACACAATTCAGTCTGTAGCAACTGACTTCCACATCTGACTTTAAAATGCATCTTTA 
ATATGTAAATAAAGCAAATATGAAAATGTTAATCGTAGAATCTrAGGTGAAAGATAA 
TTTTCAGGATATTTAAAAATGTGAGAGAAAAAACCCCCT 

6 5 TATTTCCATCCAGGCTTTCTTAAGAGTGCTTGTT 

TCACTTTCTCCTTGGTCAGCCAGTGGCCTGCTTCTGCACTGTCAG 
GTCTGCCAGTAGCATCTTTCTGTCCAGrrCCTCTC 
GACATCGCAGCTTCTCAAGTTACATTTTACAACTCTGCTT^ 
TGAAATTTATCATCACATGTAAGTAGGAAAGAAGTGTCTCCACAAGTTTGC 

7 0 CCCTGCCTTTCTCTGGTTATTAATAGGGGCTCCTATTGCrrTTTCC^ 

TGAGAGGGAATATATTAACTTGTTATTTGGGGAGGAAGTACAAAAACTGCTA 
AGTATTAAGGTTATTCCCTTGTGGGATTGACCATCAGATATCTGTGTGTTTCATAACAC 

CCAAACTAAATACATAGGCATTACTTAAGAGCTTTC 

75 TATACACCTTTGAACTGCTATCL^AAJIA^^ 
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AGTACTAATCATTCATTCAGCAAACAG CCTACCGCCCACTGTGTGTCATGGCCTrGTG ATTTGTC i i i-rI _ rL. l i i I rGGGGGc l TCT 
GGGTCATACTTCGTATGAAA TC T T TC T^ 

CAGGT AT AAATT ATTTTT CTGGTGAGGAGAGG AGGAAGTCC CTTT CTTCCC CATTT AC ATC CTG CCT AGCTATTTT AATTTG CTGT 

ACACCGTAAATTGTAATCTTAGAATCAAACTCCAATTT^ 
5 GTAATCTTGCATTTCAGTTACCTCCTAACTCTGAGGCITTAGCTA^ 

TAAGATCTTTGCCTGCTGTTTGTTTTCTATACCAGTCTGTTGTCTAGJUUVT 

TTTGTTTTTAGCTGTGCAG CAGGAGAGAT AAATGAG AG AAGATAAACTT AACGAT ACTAAAGACT 

TTGCAAACTTCTAGTTTAAATCTCTCGGCAACTCT^ 

GATTTATTTCCACATCTA CTL ' rr iXjTACTCTCATTTTCAGTTGCGCA^ 
1 0 TGTTrCGGCCACACTCAATATAATTTAGCACACAGTCATTCC^ 

TCAAATATATTAGTATTX3TGGTCCTTTTGGGGAAAGTATGATAAATTGAAAATAT1T 

TGTTTATATATACAAAGGCAAAATGCAAATATATACAAGG 

GAACAGTCTAAGTATGATAATTTCTGGTCTTAAAACCTGCAATTC 

ATGACTAATTCTAACAATAATAACTACCTATTAGTTTGTGCTTTCT 
1 5 TTATTTAATCCTTATTACTGCACATAATTACTCCCATTTTACAGA^ 

CTGAATTCAGTAAGTGGCAAAGCAAGGATTAAAACCTAAGCCT 

ATTAAACAGATGTGTCCGAAAATTAAGGAAGTCATCGTTG^ 

TTGTTGTAGTGTCACCCGTAAAGCTTAATAAGTTTAAAGTTCAGGGTGTTAATCA 

GGTTTATTTAAGAAAAGAATTCGGATTAGATTTGCCCACCAGAATATAC^ 
2 0 AACAG CTTTGACT CAG CAG CAGTTAG CAG AGAAACATCTTG AT AT AAT ACTG ATCCACACTTTGCAT AT ACACT^ C TTTG 

TATTACATTTTTAGAGCTAAGTGGCAGTTTTTCAC^ 

CTGGAGTCCCAGTGTTTCAGCTTAAACTTTCCAGTGGGCCCTGCCAGGCACT 

CCAGAAGGTGCAACTGTGTGGTGGTTTCATTTTGCACTTCATAACt^ 

ACACACCGCTGGTGCCCAAGGACTGTCGAGGCCTGTCCATGTTAGTGGTTTGGGAA^ 

2 5 CT ATTCCATGTG ATTGG CTGTCACCAGGG CAAAGGGGG AAAACT AG AACGAGTGG AAAATG CAGCATCGG ATGTGT CAT CAAATAT 

TCTTCCTAOVACCTX3GGACACCTAGTTCATTGGTGTTCT 

TCTTGGATTCACTACITTTAAACCTTTCTGCrcATC^ 

TTAGCTAGTGAAAAACTGGGATTGGGTTGTAGTAAAGGAATCAACAAATT^ 

CTT AT ACACTG TG CT ACAG CAAG CTTCACAACC AAAAC C CATT ACIT ATTCAGGT AT AAAAATCAAAAG CTG AACTGT A 

3 0 TTTAAATGTAAACTGTTGGAGGATGGTGCTCTGTTAATTTGTGTT^ 

TTTTTGTTTTTAGTACTATTATTTTTTGGTTTATCAAA 

AAATTTGGT AATTCCT AGCAATTGTCATTTC CTGTTGG AATTT CTT AAAG TGG AAT CT 
ATTTACTAAAATTAAAGGCATACTTGATTTTAGGATTGTCATAAAAATTAA 

CT AATTG ATCCTATTGTTGGAACT AGATGATAT AAACTTTAAAG CTTTTTGGGT AG AACTTTCACATTCACCATGT CTAAATATTT 

3 5 TAAAGAGGAAATGGATTGACTCX3CAGACATTTTGTACTAAGGAATAGA 

TAAAATTTACCATTTTAAACTCCAGTTGCAAAATCCTGATCAGCT 

AAAGCAATCTGGTGTGCCTTTTGAGAAAAGTTCCAGTGTTCTCACAGATCCTGACTTGGTGTAGCACAGTGCCT 
GTCCAACACCTGAATGAGTAATCTGTAAAACATATAATCTTTTGTAGCTCAGGA 

4 0 AAACATTGAAAAGACAAAGGAAAGCAAGGAAGTTTTCCTT^ 

TTTTGTATGGAG CATTTGC CAGCCCIAGCAAT AT AACAACTTTTGGGTGCATATTTCAATCA 
ATTTAATTTGACTTTCATATTTGCCTGCAGAAATCATTC 
TTTTCAGAGCCAACATTAAAGTTTITTTTAATTAAAAAAATG 
GTTGCATGAGAGTTGCATCGATATtSTATTTACTATTTAATCTTTAGC^ 
4 5 GGGGCCTAAGGAGTGAC^CAATAATGGATCTTAATAAATAAC 

TATATGCTGACAGGATGATATTTCATTGTATTCAGTGTAGAAGGTGTTTGTGAGAGAGGATACT^ 

AAGGGAAG TGGTTTAT AATTT AT CAGAGG AAATGAAATGGTG TG CT CTTT CATTCT AT AATATTT CTTTT AAAATTTT AT AATG CT 
TTTAATTGGACTGTCAAACCTTTAAGACATTTCTATTTCAAGTC 
GATGTCATTGCTC^ATTTTTTCATAGGATCACCCACATGAATTT^ 
50 AACAGGTTACTAGCCCATTCCTGGATGTTCCGATTCTGAAG^ 

GGTGGTGCTTAG CTCTGGCAAGTGTGGG AGAGACCAGAG AG AGGTGAAGTTCCCAC CCAGCCCTCTGCACAGTGA 
ATGGCCAGTCGTTTGTGCTTCTATGAAACCTAAGATGTGCTACAGTGATGTTGCAGTAG 
TAAACATTGCAGGGCTTTGAGTTCAAATATTGATTAGCTCTATGTTAAGATGCCT 
CTAACACAAAGCTGACTCGGGAATCTTTCATCTGCCATTGACAAAATTTGCCCT 

ACTCAATAACCtSCTGTGGCTAGTCATTC 

CTACTGTAGGATTTTCCCTCATAATATAGCTCTTCTCTTTTTATCCTGGGTAGTTGCAATG 
ATGCATCCATCTATCTACTGCACATTGCATACTGAAATCTGAAAGAAT 
AAAATTTAATAACATTACrrrCATAACGTGAATTTGTTCT 
6 0 TGGCCAGGTGCAGTGGCTCACACCTGTAATCCCAGCATTTTGAGAGATTC 

CIAGCCTGGGCAAT ATGGCAAAACCCCGTCTCTACTAAAAAT ATACAAAAA^ CACTGTAGTCG CAG 

CTACTCAGGAGGCTGGGGGGGGAGGATCACATGAACTCAGGAAGTTGAGGCTGCAG 

CTGGGTGACAGAGCGAGACCCTGTCTCAAAAAAAAAAAAAAAAAAGAAAAAG 

AAG AGCT AT ACTGGT AAAAT CTGACAG AAACAAAATTGC AAAT ATTT ACT CT TT CT CAG ATT AT AT AAATT ATT CCT AGAAAAGAC 

6 5 TCCAAGCAATX1AGGCT AATAAACAGGTTGTACTT ATATTTT CAT AATGACTTTAATTTACCTTTATTTTG CAGTTTCACAAATTT A 

AGATGAATATATTTAGCTCCCTCTTCTGGCATAATTTGAGAATTC 
ATAAGGTTATCAATGTAATGTTGAAGAATGTTATTTATGAGTGTCATCCAGTTAGAT 
GCATGCAAAAAGTTTACGAAAACACCACATTTTAAAAACTACTAATTGACCAAAAAATAC^ 
ACAGAAATTTATTTTGATATCAAGTGGGCAGTGATCATTCATATTGGCTTCT 

7 0 AGGCTTGGGGCAAGAATGGAAAAAGAGGCCCAAGTACCCTGTGCCTAACTATTTTTGT^ 

TCTGTCCCATCCTTTCTAGGCAGTTATTTCTTCCTAATGACCAGGAGGAGAGATCTGAATTCIAG^ 
TGCCCTAGAGTGTGGTATTATGGGGAGAGCnKjATCCTGGTTCAT^ 

CCACTCCCCCCACCTTTGGGGCCCAGGGGTGTACCTAGACAGGGGGGTTTGTCCTGGGGAGGGTGGACCTGGGGGA 
GAGGCCATTTAGCAGGAAATCCTGGAGCATTACTCTGAGTAGACCCTAC^TGAGAAGCAGTG 
7 5 TCCAAAAG CCCAG AG C^GGGGTTCTTCTTGCTCC-GGCCTGATGTAATTA^ 
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TCAACTATCCAAATTAAATACAATGGTGGGATTTTGTTGTTTGCAGCTTTGACTCTC 

TGACATTGCCCCGCCTCTTCTTATCAAGCTCGTGGAAGCCATTGAAAA^ 

GTTCrACCCTTATTTCATGGCrCTTATTATTTGTCTCTT 

CTGCTGGACACTCAAACTGTTTGGGGCT AAGT ACTGGGAAAAATGTCT CT AT AATTTCT AACTCT AG ATT AT CATGT AG TTT 
5 CO^TATGGAAACTATTAAATTCTTCATAA 

AACATGCCAGGTAAGATTAATAGTAGGAGTTGACITCCAAG^ 
TGTCTCTGGCAGCAGCCTCACAGGTTCTCCAGAGAGCT 
ATXHTCAACTCTATACAGAACACAGAGCTCCAGCAACCTG 
GAAATGCAAATGGGAAAGACAGGTCTTGGCTTTCTGTTTCAGGTAATGC^ 
1 0 AAATTTAAAGGGAACTCAAATCATGTACAGTTCTCCCTG 

GGGCTGCAAGTGTGTGCACGCCTCCCTGCAGGC^TGTTGGTCA 
CACCTCTTGGTCTTTCTCTCGCTXrrcTGTATCTACC^ 

AAAAAATATTTTTAAGCCTATCTCACTTAAGAAATGTCrC'rrCl'GACATCC T I T I TCTTTTATTACCTTTCCCTTCCATTTTCCTT 
TTCCGTTTCCC Tl T C TTTCTGGCTTGC C ' rrC ' l 1 111 1 1111 TGTGTTGAGTGCATTAAGTAACATCACGATATTGTTATGTTCAGT 
1 5 GGTGTGCTGATAAACCAACTCTCTAGGAAGAAAAAGGCCCTCGTT^ 
GGGCTTATTTCAAGCTAACCAACATGATCTCACTGGATCTGGA^ 
TTTAGGCAAGAGGGCTTTAGTACCCCAAACACCCACTCCCTCCTTTGGC^ 

TT CCAGCAAGTCATCTTCTT ATAAAT CTTTCAAGAT CCTT CATT CC CTCAT AACGCTT AAAAAAATT AT ATACTG C CTCACATCAT 
TTTAACTGAAATTAGGTGTTTAAGCCAAAAAGAGTGAAACTCAGTT^ 
20 CATGACAGTAAATGAGAATTTGAAGCATGAGGC^ 

AAAAGGAGGAGACTAGACCTTTATOIGTATCAGTAAATTCTTACACTTC 

GTTTAACTC CATTTTTC CTTTGTACATGG AAATCAGTTGCATG CGAACAGAG CAT ATCCT CTTTGCTG CTCTG CCT CCTGCACTT C 

CCACAGAGCAGGTCATTAGAAGTTAACATGCAGAACACTCAAGGTGAAAA^ 

GCTTTAAGGAAACGCTGAATTTTGTCCTTAGCCAATTG 

2 5 GTTTAGTCZACAGGAAGAGTTTAGACATCAGAGCCCGATGAAGTG^ 

ACACAAAATGATGCAGTTTAGTTATTTAAAAAAAAAAAAAAACT 
GTCAATGGTACATTTTCACCAACAGGTTTCCTTCCCCAGTGAGAT 
AAAAAAAGTGAAATTTGTGCCCCACCCCAAAAAATATAAATAAATAAATGATOT 
TCCAACTCAAAGATTTTTCAATACATAAAAACATTGTATAATATCCTGGAGGAAGCTC 

3 0 AAATTATGGCTTTTAACTAACCTTGTGAATTACCAGT 

TGTTATGCTAGTTCATCCATACATTAACTTTGTGATGTATGTCCT 
AAATAATTTCITTCACATTTTTCAGTGCCTTAGTG^ 

TGCCCTAAAATGAATTGTTTCTTTGTCTCGTGCAAGTTTTAATGATTCAGATTTTCAC 
ATTCAGGCTTTCTTAGCCTTGGTACTATTGACATT^ 
35 TGGGAAGCTTAGCCACATCCCTAACCTCTACCCATO 

ACATTGCC^GATGGCTCATGGCAGAGGGCAGGATCCCACTCCCACTGGGAACCACT 

CTCCATTCCAGGGGGAGGGAGGGAGGAGTGAAGGGTTGGAGAGTTTACTGGGTACCTGTCATGAGTAGACT 

TAGGAGAGAATGCAAACACCTCCAACCCAGACCTAAAGATTATGTAGCTCTTCAAAAG 

TGGAAGAGCACGTTAGGCCAGTTTCAATGCAGAAAAGGAT^ 

4 0 CTCTAGTAGAATTTGACATAAGGTAAGGTTACCCAAAGACCTGCATAGGATAATTTTATTCATATACTAT^ 

TACCTCATGTGTTGGAAAAGGGCTTTGATGAATTAACITCACT 

TTGTATGTGCTGGACACTGAGGGATGCCAGAAGAGCAGGTGACAG 

ACACATAATCCCACACTATGATTATACTCCTTGGGAAGTGTCTTGTTAGAGG 

AACCCTGCCTGAGGGAGCGATGTGTGCATATTTATTATCTTAACGTTTATACTGCATCACCAAATGT 

4 5 T AAGCAAATTCGGT AATATT AATTTT C CGGG CAG AG AG ACATT AG ATG AAACATGCGGGG C AAAGTG TATGGGAAAC CAG AAG CAC 

GGCCAAGAGTGGTGACCATAGAGAATTCCTATAGCAGCACACACACAATTTTATTTG^ 

T AC CTTCAACGT AG AT CCTTT ATG CAG AGGAT CT ACT ACAG TG GT ATT GC C CCTCCCA.CACTTGTT AAACATGTG AT ATGTCT CTG 
GTTGGTTGTTTGTAGGACATTTAAGTGATTTAACACTTCGCCTCTTCACC^ 

ATCCATATCCTATTCCAGACCTACTTTTCTTAGAGCTGTGTGGTTGTGTGACTCACTTTCGGAACGTCGGC 

5 0 AGTTTGGTTTATGATGATATGTCCAAGGAATTTTTGTAGTCTTAGG 

GTTATCTAGGATACTACCTTCTCCCACTCATTTATACTCAAACACT^ 

ACAGCCCAATGCAGTCAATCACCGAGTTATATGAGGTCTCGATAATATTTTGTACATTTCTAACTGCATTAGAATTCAGAC 

TCACCTTGATTCAGAGAACTAAATCTC^TAGTTTTACTTCTAGCAGCACTCCCTTCCAATTTATTCTTAAC^ 

GCTGTTACAAACGTAGTATCTTGCCCATCAGCCCACCTTTGCTCCACTGAGTACACAAGTCCCCAGAT^ 

5 5 CITCGCATTGCTAGTCTACATCCTTCAGCAGCAGCCAAAGTCCrrCT 

TTGTTTCTAGCATCCCTTTTCTTCrGCTGGGCATTGTC 

AGGGTGGTATTGATTAGTTTTACCCCCATTATTGGATCGCTTGACCTGTTTCCCAGCTGAGGGATTTACTG^ 
CITATTTTCTGAGATTGTCTGATATCCACAGGTTAGATTAGT 

ATTATTTTCATGGATATTCTTGAGATTGGTTGCTAAAAGTCTGTAGTATCTTACTGTTGAATC 

6 0 CATGAATTCTGAGACATATCAACTCAAGTCTTAGX3AAATCTGGGAATATC 

ATTTACTATATGACTAACCATTCCTAGTTAGAACCCTTTTCAGTC 
TTCTGTATTCGTTTGCTC^C^CAGTTATAACAAGTTACCACAGACTC 
TCACACCTGTAATCCCAGCACTTTGGGAGGCTt^GGCGGGTGGATCACGAGGT 
AAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGTACACACCT 

6 5 GJM5AAT06CTTGAACCCGG6AG6TQAAGGTTGCAGT6A6CCGAGATC^ 

CTCCATCTCAAAAAGCAGAAACTTACTTCTCGTGGTTCTGGAGGCTAGAAGTCCG 
TTCTGTCCTTGGCTTGTAGATGGTCACCTGCTCACTGCATCCTCACAGCGTTTCT 
TCTCTGTGTCCTCATTTCCCCTCCTAGAAGGATATCAGTTACAT^ 
CAGTCACTTTCTGAGGCATTAGGAGTTTGGGCTTAGAGGGGTGGGACAGAGGATGGGG 

7 0 TCAGCCC^TAACAAAGGCATTATTTTTTGTTGCCCCAAAGTGTGGAGGTCTGCTACT 

AAGTGCTTGC^TATGTCAGAGAGACCTCCACAGGAAAAATTGTTC 
TAACGAATACTTTGTGTCTTGTATTTATCCTCATATTGCTTCCTTATTTTTTTT^ 
AAAATTAGCCCAACTGATGTATTCTATAAAATAAATGTCTGAAATATTTCTTAAATTGTT^ 
GAAATGATCGATGTGCACGTTTTGGCTGACGCTTTCAAACGCTATCTCCTC^A 
7 5 CAGTGAAATC^TTTCrTTACCTC^GGTTTCTTTTTTC^ 
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ATATGTCTTGCATATATAGAAATAGTGGTTAGAAAA^ 

CTC CACCCAAG CATCCAAGAGAAGAC CCAGATTCTCA'lTl'GC i TGGGAAGC CCGTCTG CATGT CAG AATCAG AAATGGCT CATTGT 
TAACCTATTGGTCITAACAAAGTTTGTACGTCAATGAAGGACACGGTC^ 
GAATCCAGTGATGGGGGATGTTCTTGTCATCTTTCCTGAGTGGCCATAACAGAA^ 
5 AGATAAGGTTCAGTTAAGTGCTTTGGGGACAGGAAACTGGTTCGTAACCTTC^ 
TCTGAGTTAGCCAATCACACTTGAAGCTGCTCAATACTGGGTAAGTTGCTG 

CACTGAG CT'iT'f TGCATTTCCATTTGAC CTGAGATTTCTACTTTCTC AT AATG CG i~ri~rCTAATGAGAAGGACAAATGT ACTGTGT 

GCTTCTCCCAACAACTTTTTAAATGACTCCT 

GTAGTCG eTTACTCATTTCnxriTTTTTTTTTT^ 
1 0 GCTTATTAGGTCX3CCTAGCATACCTCATCAGTATTGGCTTACGCTTC^G 

CCAGCAAAAATCTGTTGAATGCAAGAGTACTCTCTGAAAT^ 

GGAGACAAACATGTATTTTGGGGTGATGAGGGTAGTCCTAAATGGATTGGTCA 

TGCATGTGCAGATGCTATCACACACATTCACTTGAGTGTATATAAATG^ 

CTCTCTTTCTTACAGCTCTCATAATACTGAA 
1 5 CTGCACCAGGTAATGCTTTTTGAGCATTTAACATTCTCT 

TTTGAAGTAACACAGCTTCTTATGGAATTACAGTTGGTAG 

GTATATGTTAATTATTATCCGAGTTTCAACATTTGTGGATTTTAAATT 

CCCTrCAACATTTATGCTTCAGGTTGTTGAAGAGATTTATGGAGTTCA 

TACAAAGTAGGGGATTTATAATTACTATATAATTTAAAGTATTTTAG 
2 0 TCAAAATAAACATTTTCAATAATGCAAGCAGTATGAAAC 

GCAGGCTG CTGTTCAGTT AGCCATTT AACAAAGCAGGAAATGT AG AC CATGTT AG CAATGCATTT AATTT AAACTGATG T Al rri-i" 

CTTCTACCGTGGAAAGTCTAAACAACCTCATTTTGCTATATACCAG 

ATTAGGGTACTATACGTGTGAAGTTTCAGGACCAGTAAAAGGTGTTGCTTGTACT 

TCCATGAATATATTTATTATAGGTAATATATGCAGATTACGATATTT^ 

2 5 CATGCAATGGAGAGTTAAGTTGTGCTTTTTCTO 

AAGAGAGGCCAGGGCCTAGACTCAAAACCCACCTGCJVGCAGGTAGGATAGAGGATAGGGTAGGTAAACA 

GGGAAAGGTTTATGTGGACAGGCCCAAAATGTACCAAAGGTGAGAGAGAGGTATCCGGTA 

TCTAAATCACAGCAAGCAGAATGTTTTAACAGGGGTGCCCAGAATTCTAGGTAAACT 

ACT AGGACTT CAGT CCAGGGGTTGG CCAAAAGG CTTCAT CTCX5TGTG AGT ACACAGGT C ACTGTG AATGG ATTT GGG AGCTCTGTG 
30 CTGGTGTTCAGAGGCTGCCTTrGGACITAGTGGAAAAGAATTGGGGT 

AGAGACTAGGTAGAGCTGGACCCTGCTCTC^GTGAAGGGCTAGTGGGTAGTCCTG 

GAGACACATGGAAG CCAAAG CCAGAC AGGGTTTGACAGATAC CC!AC AAATGAATTT ATCIACATGG AATTTTC^ AGAATTT CTACTA 
CACAAGGT ATAG C^AGAATCAGTGGT AG AGTTTGATT AGCCTGACTGGTGTAAACTA CCTAAT 
TCCACO^GTAAAAACGGGTATCTGCACTATTTTC^^ 

3 5 CTGCTGAGCAATTGGTGCGCCTGTTTCAAAGGAGGATTAGTTCCATCAGCTCTGCT 

GATCATCCCAGTATTTATGTGGCCTCTTAAAAAAAATTAA^ 
TTTGTGTGATTTGTATAGGACCTTTGGGATTAAATGGCTGTTTTAGTATCT 

AACAAAACGTCCCCATCCAGAACATCirCGTATTOVGGAATGGCTCCAGTGCAAATGACTGCTATG^ 

GATTCTT AGG C CTGGTTT AAAAAATGC AAT AATTTCTT AGAG TC CTT CAG CATTGC CACAG CATGTG CT AGT ACCAG ACAG AACTT 

4 0 TG AAAG CACAG CACCTT ACAT CTCCAGCG AATGGCCATTT ACAG T AACATTG TTGT G AACACAAATGAG T AC CTGAATGATGTTT A 

AGGG AG ACAAAAG CAAG ACT CTGTT AT AGGGCAG AT AG ACAG CATTT CT AATTATT AC CAG AT AG AACTTGAAATG ACAAAG CTT A 
TGATGTTTGAGGCAAAGTATCAGTTCAAATAAGAGCCATTAGCAC^ 
AAGTACAAAAAGAACTTTAGAAATGCAGCTGGTGACTTTCTAGGGTAATGG^ 
TGAAATGTGCCTTGAAGAATAGGTACAATTTGAATAGGCAAAGGAGAAATAGCATTTTGATG 

4 5 AATAAATTACACAGTACAAAATAATGGCTTGGTATTT^ 

TTAGAGTTGAAAGCATTGTCCAGCTTTACCAATTTGGACATGAGGAAACCGAGGCCC^ 

TTTGGAAACTTCGTGAGGAATGAGGCTGCTTTTAGTGTCCAATCAAAAACT 
ATGTGAAATTAGCCCCCATGGGTCACATATAGTAAACAAAGTCAGGGCAATGTTTATACTT^ 

5 0 CATGAGCTGTGCAGTGGAAAAAAAAGTGGGCTTCTGACTCAAATAGATCTGGATTGGAAT 

TTCCAAATTGTTTTCCAGACTGTGGAATTGGGTTAATGCCTCTCTCCT 
GCTATCTATCGCATCAACTCTAGCTTTTGAGGCCTTATGGGAGCTC 
CTCATTAGTTGTGTGAATCCTGTOlGGCCrTrTTCCTTACTGACGCCATGTTCCCAG 
TAGGGGCTAAAAGGGACACTCX3GGAAGTTGCCACCCT 

5 5 TGGCTTGGTGAATGGAGCAGCCTAGATTTCATCAC(^CTCCTCCATCTrGCCT 

CATACGTCGCTGCCCCAAC^AGTGCCATCCACTTrATCTGCACTCAAGTAGGTCCT 
ATTCAGTATGTTTATGCCITTAAGCTTCCTGTCACCTTATGTTTAACTGTCTATAGTTTTTCCTATT 
CTAAAGGGTAGGGGTAAAGGCTTTTTCTCCrrTTTGAGCTCTTCCTAGCAG 
CTTTC^TAAACTTCCAACTTTa\AAGAATAGTAATACATCAACA 

6 0 CATTTACAGAGGGAAGCTAAGATTTGGAAAAGGTAGGGCACAGAGTAAGCTCT 

GTTACTAAAAAGTCTTAACTTGTCGTTTGGCTITCAATA 

CAGGTAAAGTAGATTTCTGTAGGCAAACCAAGGTAAAAATAAGTCTATTTAACACA 
GTATTTACATATTTCAGATTTCATGGATATATTCAGAAATATTAATGTAATATAATTGA 

AGTGG AAAATAAATG ACT CTTTCT ACCCCTTGCCCTTCTACACCTGT CAAAGTTAATTTTTAACC^VCTGG CACATTGCAAAACATT 

6 5 CATCCTGAAGAGCCTGATTCATAAATCTTGTTTTAAATGCTCTC^GGC^ 

AGGGAAATGACATTCAAAGAGTTTTACTTGTCTCTGATTTAA 

AAAGGTGAAAAATGAAACTTTGAAAAATTACATGTTTGTAGATGCGT^ 

ATAGTTT^ATTTGTGTCCrGAGAGACACAGGTATTTTGGTATGTATGTG 

7 0 GATGCCTTTTATCAGAAACCTGCtTAGAATGTGTCAGAGTTTATACATACGTAAACCT 

TCCTTAGCATAATGTCAGTTAAGACCAGTGGATATGGAGCAAATAATTTGAATTACCCTAA 
AAGTTAAAATTATGAAGAGAGGTATTCATTTTACCCTTAGATGAAGATTTTGGTTAATTTT^ 

ctttgttcataaatgatgtccac^aaaagaagaagttttctaggttgaaa 
gtacatatgaaatatagtattcagtttgtaacttttgaacagt^ 

75 tgaaacaatc^gctctatttgcctctgajlataagtt^ 
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CAGAAGGTAAAGTTAGTTTTCCAGTGTGTCTAGGCCTTTGAGGTC CCAGAGAAATtKTIX^CATTAAAATAAAAGTTTTCCT ACAAC 

ACT AG CAGCTTCGTGAAAACTGTACTTTTGATTT ACTAATTAGTGAGT C AT AT AG AT CAAGTGACAG T AAT ACAATT ATGT ACATT 

AGTTGGTGTCAAATTTGGACATACGAAATATACAGAAAGTTGATGGTT^ 

CAGAGGGCTCC CiTX ' l IT AAGCTAATAAAGAACATGTGTCTTGOVGATCACTTrTT 
5 AGCTCACACTTTCCTAATATAATAGGACCTCTTTGGTAGATGATTGTACATATTG 

ATCrrATTTCAAGACTTTCTATACGTTGAGAGAGTACTGCTTTC 

TTAACAGAGCATTITAATCTTAGAAGGTTTTGACATGAAAC^^ 

ATGTTTCTAGCTTTAGAAAG<3CTATTTTTAAAAGTAG 

ATGTCCCTATGATCAGATGGGAGATTCACTTATAGACTGTTATTTCCAT^ 
1 0 GGAAAGTTTGTAAAATGACTTTCATTGTGGGTT^ 

GCCTGTGTGGACTTTATGTGACCACrcGATCACTTTCCCATC 

AATGAAATGTTTTGTTTTAATTTAAGCAAGCTAAAAATAGAAAACAGAAAAA^ 

ATGTGGCTTAAAAGATTAACTTCAGTTTTGTCCATTTTT^ 

GAGAAGGGCATTTGAAACCATCAGAAGATTG<3GTTCTAGTCTTAGGGCT 
1 5 ATTTCGTTTTCCCCTCTATGGCATGCAGGAATTGATTGAAGTGAACTCTAAATTACT^ 

G AT AAC CTT AAAG CT AT CT ATGCT CATGCAACTGAAAATTTTCTT A^ CAAATTTT AATT AAATAGT AGT CAGTAATCC CAAGT C 

TGGTCCATTGCCACTTTCTCAAATAAAGGGCAAACATCCGCTTAGAAAGG 

AGCTGGCACATAAACCTAATGCAGAGACACCITTrAATTAGCAG^ 

AAATAGCCGCAGTGTTGTTTTATTCTGAATCGTCACATGATCATGGAGTATAGTATGCATATG^ 
2 0 AGGATGCCCTCCTACTACTTACTATTGAACATTTTTC 

GCAGGGCTGTCTGTGCTGGTGTGTGGGAGGAGGTGCAAGAGGTGGACACGAGC^ 
CAGGCCTTTACAAGTGAGGCTTTGATTCTATTTTTTG 

TTCACTGTCGGCTACTTGAGCCCCXaCGAGCCAGCCAGCGCGCTCAGCAGCC^ 
CTCATTTCCCAGGCXaAAAATGAGCAAGTTGGTCTTGC^ 

2 5 CTCCCGAAATCCCTAGCTCAGCAAATATTTGAACCTGCCCAAGTTAAT 

CGCCTGGGCTCCTTTCCTCCCCGATACAGTAGCGAAATCCAGTTGGCTTCTCAATGAGG 
ACGCACTGCCGGGCGGGGCGTGGGGCGGAGGGACGAGCCGAGCCGAGCCAAGCGGAGCT 
TGGCGGCGCCCCCGCTCCTGCGCGCACTCTCGGCGCCGGACACGAGCACTGCCTGCCGGGAACA^ 
TTGGCCCACTTGGTGGAAGAACAGCTTTGGGGATTTT^ 

GGACAGTGGCTGGGAAAGATAGGGTTTAGGTTGGAATTTCA^ 

AGACTGGTTTGCACCGACGGTTCCCAAATGCATGG ACTGGC ACTCTG<^ CTGCTCGGAG AACTTGTCT 

3 5 CCTTGCTTCGGGGGAG^TCGACATTTTAATTA^ 

TTTTATTTGAATTTGAGTTCAGGTAAACTTAGGCAAATAGAAAAGTGGGAG 

AATACTGGGAAAATGAATTAAACTAGACGGGGGACGTCAGAGAACAG 

ATCCCAAATTTTACAGAAAGTCTCAAGGGTTTCTGTGTTTTTTAAATGGTCAAATATC 

GC CCCACTGT AGTTTTTT AAATG AGC C ACATTT ACCTTGGGGAAG AG CCCCTCCCCCCCCGC CAT C AT AATATTGTG AATGT AAAG 

4 0 AGAGATTGGTTCACAATGAGAATGTACAGTAGGCTCCTATAAGGGTGACAGGATTA 

GATTTATTTATGTTAAGTAGTGCTAAGAAGGAATATTTGTCCCGGAGTGA 
AAAAAAAAAAAAAAAAAGTGATTACATCATCTTCCCGTTCAAGCAGTATTAGTAGCGAAGATG 
AAAGATAAATATACCTCCACTCTGCATTCTGATCTATAGCCTCTTTCTTTTTGGAA 
CAGTCAC^TTAAAATTCTCATTAATAATGTTGACAGAC^TTACGTTTATTATTT 

4 5 ATTTTCATTGTTAGTCTATAAAGTCTAGTTTTAACrrTGGAATGTTTGGTA 

TTTCTAATGTGGTACATTTGTGACAGTGTTTGAATTCTGGGTTT^ 
CrGCTTCTCTTATACATGAGAGATATCTAAGGACTATAAAACATAACTTAA 

CCAAGG AATG ACTGGGCT ATCTAT AAACCCAGATGCZACT AAGTGGG<5 AG AAAAGCTCGG^ CTGCAGCATAT CAGCTTTTGTCAAC 
CTGCCAACAGC!AGAGCGTACCATGCTGCAAGGAGTAATAAACAAGGAATTAGAA 

5 0 CCTTTTTGAAAGGATGAG CATGT ATAT ATTGCAGTTCT ATTAAATGTAG CTTCTTAGTGAGGTTGGCATTATGTTAAGGCTGGTAT 

GGAAGAGAACTGATGAAGCAGGAGTGGTCTGGTGACATTTTTCTGACTTGATTGG CTGGGGC 
AGCCCCTTATAGGTTTTAAAATGAATTCCAAGACACCATTACAAAGAAAGCCGGACTCTTTTCTTA^ 

AACTCTTG CACAAATGT ACAAT ACTGTTTGG AAT AT GG AAG ACCTGG ATT T AG AATATGC CAAGACAG ATAT AAATTGTGG CACAG 
ACTTGATGTTTTATATAGAAATGGACCCACCAGGTAATACTGCAGTATTATTGTAGAGAGTTAGTTAATTT 

5 5 TTTCGAAAGCTACTGTAAAAGATCCTTTTTGGATTTCTGTTTTTATT 

AAAAATTAAACIAGATTGTTTGACTGTCTGTGGAAGCAAG<IAGCTC^ 
TGAATTGAACrCCAGTTTTTCACTGGTGTCTTTTC 

AATTAAAATGATGCATCTTTTGTGCTTGCTTTTCTCTGTTGCCAGTGGATGAGAACAGTGTAGCA 
ACTTTAGAAAGCATGTAAAATGTAGCAGTGATTACAACTCAGTTCTCTAAA^ 

6 0 TTATAATGAAAAAAGAAGTTGACTTTCCATTATTGTTAGTCTTTGTAA 

AGTTAAAATAGTAACAGTGCTTTAAAACTTATGAC^GAATTTACCTAAAAATCCT 

TATTCCAATGTTAGCrCTCCCCCTGCCCCCATTTAAGGTATrCAGGAATACTGCAGTCTTTTATTTGTCAC 

AATACTGATTTGACATTGAGGAAGGGGGATATCATTTTTAATCAGAC 

ATATTAAAGAGATTTAGGG<ACMATGAGAGTTTCTTAAAGTGGCTTTTC 

6 5 CCAATTGCTCCTCTTAAAACATACTGCTGTAGATGAGCCTTTTTATTACTG 

CTTTCGTAATTAAATAAGAGGAAATTAAAGCCAACTCATGTTATTCCTGCTACT 

TTGACCAGGCATGAAACTTTAATATAACTAGAATCTAGAAGTACAGAATGTCATGACTCTGGATTTACTTTGAAA^ 
GGCCAGCCCAATTTATTTGTTAGTTTCTAAGGCTC 

GAl " ri'Tri''l'l'AAGAAAArrATCrTCCATATTGCATGGAATTGTGAACTAAT^ 

7 0 TT AAAG TAAAAGTATTCATCTAAAGAAATTTAGTTCr AATGT AGTTGGGATTGCGAACAACT^^ 

CCTCCTAAACCACCAAAACCTACTACTGTAGCCAACAACGGTATGAATAACAATATGTCCITACA^ 
AGATATCrCGAGGTAAGGCTACAGAAACTTGATTT^ 
TTTTTAGGATATCATCCAACATAAGCATGAAGCATAGTTGGTTCT 
CTCTGGGTGCTGGATGCCACAGGAAATTAAATACCCGGCAAG 
7 5 GCACTGCCTAAGAACACAGTGTGAAGGCACTCTAT 



1775 



WO 03/008583 PCT/USO 1/5 1291 



TGCCATTTTATGTTAGCTTTGGGAAGGGGGAGTAAGGTTGGAGAAACTCTTTT 
AGATTGTAATGTTTGGTTGAAAAAATAAAAACTTAGTACCACAGAT^ 

ACTGCAATTGCT AACATTTCT ATTT AAACAAATT ATT AG CTCTT ATTAGTGAG CTCTG AAAATG CAATT CATT AATTTAAATCTAT 
GTGGGCAGGAGGAATATGGGCACTCACTGTACTTTCCACTTGATTT^ 
5 AAAAAAT AT AAATCTGTGGTCACTAAAC CTT AAGATGAG CATTGTlTTtjTti 1 1 1 i CATTTCAGGGAAGAAGTGAATGAAAAACTTC 
GAGATACAGCAGACGGGACCITTTTGGTACC^AGATG 
GAATATAGCTGAAATTAGGGTTTTGGGCTGATATTAAAACATATT^^ 

TTT ACAGG AAAGGGGGAAAT AACAAATT AAT CAAAATATTTCATCGAGATGGGAAATATGG LTT CTCTGACC CATT AACCTTCAGT 

TCTXH'GGTTGAATTAATAAACCACTACCGGAATGAATCTCTAGCrCAGTATAATCC 
1 0 ATCCAAATACCAACAGGTAATAAAAACTGAATGAATTATCCAGTTAC^ 

TGAAGCAGATGAATTACATGTAATCAAGTCTAAAAAACTTGACACTCGTAATTACATAAT^ 

AGCTATTTTGTTAAACAATTGTTATTTGATTAAATACCTTATCCATTGAATT^ 

AGATAATATTGAAGCTGTAGGGAAAAAATTACATGAATATAACACTC^ 

AAGAATATACCCGCACATCCCAGGTGAGTTTTCTATGAAAATCAGATTAAAAAATAA^ 
1 5 AAAAAAGAATTTAAAAGGTTGAGTTTTTACGAATGAGGTGGGGGTG 

AATATCTCTAAAGCTTTAAACCAAAAATTTTAAGTATCAGAATAATAAATTAAGTTCACGTGATA^ 

CAAGTTGATTTAATTCGTGATAGAAGGATCCTGTTTAGATTGGAAGTTAGTA^ 

GAAATGGAGATTTAGGCATTAATATATATCCTACCATGAAAAGGAAGGGCTATAT 

ACTGGTCACTCATGTATCTGGGAATGCCAGAGGGAACAGAAAACAACAGGGATGCTATA 
2 0 AACTATAGGAAATTGCTACGCAATCATTTTTCATAGATTGAGAAAATTCT 

TGGATTTTATGAATTTGAGTCCCACTCTGCTAATAATACAGATCAAATGTCCT 

AAGGTATGACATTATCTTTTTAAAATTATGTTGCAGGAAATCCAAATGA 

AATATTTGAAGAACAGTGCGAGACCCAAGAGCGGTACAGCA^ 

TACAAAGGTTGGTGTTTCCCTTGTTCTTGTGCTAGAGATAACCAAAATCCT 

2 5 TTTGAATGATCACGTGGACACAGGAAGGGGAATATCACTCTGGGGACTGTG^ 

AGATATACCTAATGCTAGATGACGAGTTAGTGGGTGCGGCGCACCAGCATGGCACATGTATACATATGTAACTAACCT 
TGCACATGTACCCTAAAACTTAAAGTATAATAAAAATAAAAAAATAAAATAAAATAAAATATC 

AAAATG ACAGG AAG AG AAG CCACG CTTTACCT AAGGAAAACTG CTGGG AAACCATAGTGAAACTTTTCATAAACTTTGGGGACCGT 
TCCTGATGTACCCAGATAATAACAAATACGTTTCTTTTGCCT^ 

3 0 GAAATTATTGACAGTAGAAGAAGATTGGAAGAAGACTTGAA 

CATTAAACCAGACCTTATCCAGCTGAGAAAGACGAGAGACCAA 
TTGGTGATTGCTACAATTCAGGATGAGTTAATGCGTTCTCTTTTCAAAA 

AAAAGAAGTTGAACGAGTGGTTGGGCAATGAAAACACTGAAGAGTAAGTAGTTACTAAAGATGGTGA 
TTTTAGGAAAATGCATGACTTGCTTTGTTTTTAGAACAAGTGAGGAATTT^ 

3 5 AAACTCAGTGCCATTATCCAGAATTTTAAAAAAAGAAAGCTTAG 

ACAACTCTCTGTGTCCATAATGATGTCCCTGAACATCTGAAAAATCCCAAAATATTT^ 
C!TTAATATATTTTAAAGCTTAAGTATATACTTTGTTTG 
TGGATAAACGAGATTGTTTTAATACCTTCCACTCTGCCCAGGTACCTGAGTC 
GGTAACTCTTTGTCCKKK5AGGTTGCACTGGAGGCTGAATACCAG 

4 0 C CAGT CTG ACTGG CTTGGT AGGGGC CAGGAGGGAAGTG ACGGGGGT ATG C CT AGGG AAGACAG CAAG G C AGG CTG ATGGCT CCTG C 

ACT CTTCATTTAGAAACTTTCTGTCCTGCCTGCCTAGCCAATATTCACTGGTGG AAGATGAG 
AAGACATGGAATGTTGGAAGCAGCAACCGAAACAAAGCTGAAAACCTGTTGCGAGGGAAGCGAGATGGC^ 
GAGCAGTAAACAGGGCTGCTATGCCTGCTCTGTAGTGTATCTATCTCC^^ 
TCATTTATTCATTCATTGAGTTTTGGGGGGAAAGATTTGACATAGT^ 

4 5 GAAACCAAAGCAGCTGTAATACCATTTTAAAGTCTAGCCAGAAGTGTATG 

AAGACTAGGTTTGGATTCCCATTCAGTGCTAATTAATCATGAGAAATCGCCAAGTCTTTTTTTCT 
AGTGCCTGCCTTTTTTGCCTCTCAAGGCTACTATAGTTAGAATCT 
CATCAGGAACAGGTGAAAAAAAAAAGGATTCTGATCATTATCCTTGAGATAGGTACAC 
CCAAGAAATTAATGTCCTCTAGTAGGGAATGGTTCTAAAAGTATGTTACACTCCATGATCACCT 

5 0 CCTACAGCAAAGAAGGGTACTTTTGTCCTTACTGTGCATAAGGGC 

Ca^AAAGAAGGAATCCTATGAATAGTCATAATGGGAGTAATAGTAAGACTGCTAGTCATGGAGCACTG 

CATGATTGGAAGCTACCACTTGTTGCATGCTTACTGCATGTCAGGCATTCT 

CTCAAAGTAACTGTATTACATGCATTTTACrATTGTCTTCCCTTTAA^ 

GGG TT AC CT ACCCAAGG CACT CGG C C AGGT GTCAGTTGT AACCT AGG ACCCTTT CCCC AAGTTG AG ACTG CAC AAT AATGCTTTTT 

5 5 ATTAAAGATGCCCTGAAGAGTTGTGAATTGTCTTGGACAAGCT 

AGTAAAGCATTGTGTCATAAACAAAAGAGCAACTGGCTATGGCTTTGCCGAGCCCT 
TGCTACATTACCAACACACCTCCClTGTGCAGCACAACGACTCCCrCAATG 

CG ATG AAGCGCTT ACTCTTTG ATCCTT CTCCTG AAGTTCAGCCACCCTG^ CTCCTCTCCAGTCTGA 
TCTGTGAATTGAGCTGCAGAAACGAAGCCATCTTTCTTTG^ 

6 0 TG CAG CCT AAGGCTGT ATG AT GAC C ACACGTTCCT AAG CTGGAG TG CTT AT C CCTT CTITTTT CTTTTTTT CTTTGGTTT AATTT AA 

AGCCACAACCACATACAACACAAAGAGAAAAAGAAATGCAAAAATCTCTGCG 
GTTAATGCTTTCTGAAGCTTTACCAGCTGAAAGTTGGGACTCT 
GGTTTGGTCCAGCCTGGTTTAGCCTGGATGTTGCTGTGCACGGTGG 
AAATGAA03ATATGTAGCAGAAAGGCACX3TCCACTCACAAGGGACGCrTTGG 

6 5 CTGTCATAGAAAGTGCCAGAAAGTGTTTAACTTGTCAAAAAACAAAAACCCAGCAACAG 

AAAATGACATTCAjGTATATAAAATATGTACATAATATTGGATGACTAACTATCAAATAGA 
TCTGTTTTGTTTTGCTGAAGGCTAAATTCACAGCGCTATGCAATTCTTAATTT^ 
TTCAGAATAAGCTTCCCCACCCCAGTTTTTGTTGCITGAAAATATTGTTGTCCCGGA 
TTTTTAAAAGTAAATGTACAGGATGCCAGTAAAAAAAAAAAATGGCrTCAGAATTAAAAC 

7 0 TACAGAGTACTTGGCTGTTAGCCCAAGGTTAAAAAGTTCATAAC^ 

TTCAAAGCTGCTTTATTCAATAAAAAAAAGAAATGAAAAAGATATATGAATATGAC 
TAAGACTCTTAAAATACGGTACCTGGCAATGTTTATTTCATAAAGAATTGTGAA 
AGGGATGTATCAAGTGGGGTGGTGGGAGGGGGAGGCAAGGTTATATGCACTTTCT 
GTGAAGTTGCTTCTTCTACTTCAGTCTTCTCTCACT^ 
75 7CTAA7ACAGTGTGACTGGTCAGGTATTTCAGTTCT 
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TTTGTTTAGAACACTGGTTTAAAGGGATAATCM 

AAAAAAAGCATGATCTGAAAAATACTTTTGGTGGTATGTTGGTTACCCT^ 
GACACAGTCATACTGTCACTGCTCTGG^ 

ATTCATTCTAATCATTGTATGTGCTTCACTACGGGGGGGAGAAGGAAACGTTAGCATCATG 
5 GGTCTCTCTTCCTGATTTAGATATGCAAAAGCTGGTATGTTCAGTAGGAACTG 

AACCATAATATGGTCACTACCCTAATAGACTAAATGAAATCTTGCAATTTCAAATTA 
CTATATTTCTGTTTAAGTACTAGGGTGAGGGTTTTCTGTTACTTTGTT^ 
CTGAGTGAAAAATCTGTGGAATGTATTATTTGTCCTCTTTACA 
CTGTTAGCCCTCAAACTTGACTCTACTGATCTGACCATTTCCCTCT 
1 0 GTCTCTGCTTTAAAGTTATTGTGATATCCTTCTAGATCATACAa^ 
GTGGGTATTTTTTTGGTAGCACATGTATGCTATTACATACAAA 

T AAAAGCT ACAGTTTTGTG AATCTTTG TG CTTCAACAl"!' CTTf G CAAGATG AT ACGGT ATTT AGG CATTTGCCTT ATTTTTGCATC 
TCACAAACATAAGTGCAATAGATCTTTTCATTGAACAGCAA^ 
TCTGCCCAATGCCTTTTTGATTACAGTGTAGCTTGCCCACCGCATTT 
1 5 CATCAGACAGTCCTAACAATATTGTCTGAACGGCTGAATATGA 

GCGTCAGTTTTTCCCACCCAGTCTAGCATCCTAAAGATAAAGCCAGAAGCTAAGCTGC 

GGAGCATTGGGACCTCACATTACACACACGAGAGATCATAACCATGTGAAAAGGCAAAAA^ 

TCATGGCCITTGATAAATGTATATATGTATATGTGCATGGACTGTGTTTCCAGTACA 

TGTTG AGTTCAAGT ACAT AAAGT ACAT AACAAG CG AACGTCTAG T ACAATT CTIT ACTT ATGTGT ATGGGATTTTTC CCTTTGA^ 
2 0 TGCTTTGTTTTGTCTTACAAAGGTGAAAATTGTTTGTAAGTGAAGTGAGAA 
GTTACTCCTTTTAGGGAGCTGGTCTGATGACTTGCTTAG 
GAGCTGTCACrGTGGGGAACCAATTGCTTTGTCATATAGCTGGTTA 

TTGGAAGAAAAAAT CTGCTGGTTTT AACAACTGTG CTTTTGCT ATGT ATGG CAGACACTGAG ATCT 
GTTTGAGTTTAGGGTCATTITTAGAAAGGGGCAGTTTAAAGCACAATGTCTC^ 

2 5 TTTTTTAAAAAGCTAGTTCTATAAAATACTGGTATTATGGGTGG 

AACATGAAACTTGTCACCATGAGATAGCATTAGCTGCCCAGGATGCTGCTATATATATATATATATAT 
GTGTXTTCTGTGTGTATATATATATATATATATATATATATATGTGT 

ATATATGTATATATATGCATATATATATATGTATATAAATAATAATCAAAACX3AAAAAAAACTCATTTATAC 
AAGCTACAATCTGTTCAATGTTTTTAAAAATCTGTTTATATGACATTGTTAAAATAA 

3 0 (^CGGTCAGTTGTAACTTTGCCTTCACAAGGCAACTGGGG 

AGATTCAATGGAGCTATGTCTTGTTTTAAGTTGCTTTAATGCA 

AAGTTTTGGGTTGAAATTTCACTTATCACCTAAGAAATCTT^ 

GTTGTCTCCCACCCACTCCCTTTTGATTGTTATCACAGTATCT 

GTAATACATATGAGCTGTCTTTCATAATTTCACTGTTTTTTGAGACGGAGTCGGAGTCTCACT 

3 5 TGGCGCTATCTCGGCTCACTGCAAGCTCCACCTCCT 

GCGCCCGCCACCATGTCCGGCTAATTTTTTGTATTTTTAGTA^ 
GACCTCGTGATCCACTCG 

HUMAN SEQUENCE - mRNA 

4 0 TACAACCAGGCTCAACTCTTGCATGGTAGCAGATTTGCAAACATGAGTGCTGA 

AAAGGAAAGAGAAGAAGATATTGACTTGCACTTGGGTCACATATTGACTC 
ATGGACAGGAAGCCAGGCCTGAAGAAATTGGCTGGTTAAATGGCTATAATGAAACCA 
TACGTAGAATATATTGGAAGGAAAAAAATCTCGCCTCCCACACCAAAGCCCCGGCCACCTCGGC 
TTCGAAAACTGAAGCAGATGTTGAACAACAAGCTTTGACrCTCCCGGATCTTGCAG 

4 5 CTCTTCTTATCAAGCTCGTGGAAGCCATTGAAAAGAAAGGTCTGGAATGTTCAACTCTATACAGAACA 

GCAGAATTACGACAGCTTCTTGATTGTGATACACCCTCCGTGGACTTGGAAATGATCG 
ACGCTATCTCCTGGACTTACCAAATCCTCTCATTC 

CCGAAGAATATATTCAGCTATTGAAGAAGCTTATTAC^TCGCCTAGCATACCT 
AAACATTTCTTCAAGCTCTCTCAAACCTCCAGCAAAAATCTGTTGAATGCAAGAGTACT 

5 0 CAGATTCTCAGCAGCCAGCTCTGATAATACTGAAAACCT 

AGCCTGCACCAGCACTG CCT CCT AAAC CACCAAAAC CT ACT A CTGT AG C CAACAACG GT ATG AAT AACAAT ATGTC CTT AC AAGAT 
GCTGAATGGTACTGGGGAGATATCTCGAGGGAAGAAGTGAATGAAAAACTTCGAGATACAGC^ 
TGCGTCTACTAAAATGCATGGTGATTATACTCITACACTAAGGAAAGGGGGAAATAACAAATTAA 
GGAAATATGGCITCTCTGACCCATTAACCTTCAGTTCTGTGGTTGAATTAATA 

5 5 AATCCCAAATTGGATGTGAAATTACTTTATCCAGTATCCAAATA 

AGGGAAAAAATT ACATGAAT AT AACACTCAGTTTCIAAG AAAAAAGT C G AGAAT ATGAT AGATT AT ATG AAG AAT AT ACC CG C ACAT 
C CCAGGAAATC CAAATGAAAAGGACAGCTATT GAAG CATTT AATG AAAC CATAAAAAT ATTTGAAG AACAGTGCCAG ACCCAAG AG 
CGGTACAGGAAAGAATACATAGAAAAGTTTAAACGTGAAGGCAAT^ 

GAAGTCTCGAATC^GTGAAATTATTGACAGTAGAAGAAGATTGGAAGAAGACTTGAAGAAGCAGGCAGCTGAGTAT 

6 0 ACAAACGTATGAACAGCATTAAACCIAGACCrTATCC 

G TTCGG CAAAAG AAGTTGAACGAGTGG TTGGG CAATG AAAACACTGAAG AC CAAT ATT CACTGG TGG AAG ATG ATGAAGATTTGCC 
CCATCATGATGAGAAGACATC<iAATGTTGGAAGCAGCAACCGAAACA 

TTCTTGTCCGGGAG AGCAGT AAACAGGGCTGCTATG CCTG CT CTGT AGTGGTCG ACGGCG AAGT AAAGCATTGTGTC AT AAACAAA 
ACAG CAACTGGCTATGGCTTTCCCGAGCCCrATAACrTGT ACAGCT CTCTG AAAG AACTGGTGCTACATTACCAACACACCTCCCT 
65 TCTGCAGCACAACGACTCCCTCAATGTCACACTAGCCT^ 

TTCTCCTGAAGTTCAGCCACCCTGAGGCCTCTGGAAAGCAAAGGGCTCCTCTCCAGTCTGATCTGTGA^ 
AGCCATCTTTCTTTGGATCGGACTAGAGCTTTCTTTCACAAAAAA 
ACACGTTCCrAAGCTGGAGTGCTTATCCC'l^CrrriTCrr i -rTTTCTTr 
AGAAAAAGAAATGCAAAAATCTCTGCGTGCAGGGACAAAGAGGCCTTTA^ 

7 0 GCTGAAAGTTGGGACTCTGGAJGAGCGGAGGAGAGAGAGGCAGAAGAAC^ 

C^ATGTTGCTGTGCACGGTGGACCCAGACACATCGCACTGTCGATTATTTCATT^ 
CACGTCCACTGACAAGGGACGCTTTC^GAGAATGTCAGT^ 
TTTAACTTGTCAAAAAACAAAAACCCAGCAACAGAAAAATGGAGTT^ 
TGTACATAATATTGCvATGACTAACTATCAAATAGATGGATTTGTATCAATAC 
7 S ATT CACAG CGCT ATGCAATT CTT AATTTTCATTAAGTTG TT ATTTCAGTTTTAAATGT ACCTTC A 
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TTTTTGTTG CTTG AAAATATTGTTGTCCCGGATrri'T'G T T AAT ATT CATTTTTGTT AT C C l'lTT' l T AAAAGTAAATGTACAGGATG 
CCAGT AAAAAAAAAAAATGGCTTCAG AATTAAAACT ATGAAAT ATTT^ CTGTT AG C C CA 
AGGTTAAAAAGTTCATAACAGATTTTTTTTGGACTC 
AAAA 

5 

HUMAN SEQUENCE - CODING 

ATGAGTGCTGAGGCGTACCAGTACAGAGCGCTCTATGATTATAAAAAGGAA^ 
ATTGACTGTGAATAAAGGGTCCTTAGTAGCTCTTGGATTCAGTC 
GCTATAATGAAACCACAGGGGAAAGGGGGGACTTTCCX3GGAAOT 
10 CCAAAGCCCCGGCCIACCTCGGCCTCTTCCrrcTTGC^ 

CCCGGATCTTGCAGAGCAGTTTGCCCCTCCTGACATTCCCCCGCCT 

TGGAATGTTCAACTCTATACAGAACACAGAGCTCCAGCAACCT 

OfcCTTGGAAATGATCGATGTGCACGTTTTG^ 

CGTTTACAGTGAAATGATTTCITTAGCrCCAGAAGTACAAAGCTCCGAA 
1 5 CTAGCATACCTCATCZAGTATTGGCTTACGCTTCAGTATTTGTTAAAACATT^ 

TTGAATGCAAGAXSTACrCrCTGAAATTTTCAGCCCTATGCT^ 

AAAAGTTATAGAAATTTTAATCTCAACTGAATGGAATGAACGAC^GCCrGCA 

CTCTAGCCAACAACX5GTATGAATAACAATATGTCCTTACAAGATGCrc 

GAAAAACTTCGAGATACAGCAGACGGGACCTTTTTGGTACGAGATGCGTCTACT 
2 0 GAAAGGGGGAAATAACAAATTAATCAAAATATTTCATCGAGATGGGAAATATGGCrTCT 

TTGAATTAATAAACCACTACCGGAATGAATCTCTAGCTCAGTATAATCCCAAATTGGATG 

TACCAACAGGATCAAGTTGTCAAAGAAGATAATATTGAAGCTGTAGG 

AAGTCGAGAATATGATAGATTATATGAAGAATATACCCGCACATCCCAGGAAATCCAA^ 

ATGAAACCATAAAAATATTTGAAGAACAGTGCCAGACCCIAAGAGCGGTACAGCAAAGAATAC^ 

2 5 AATGAGAAAGAAATACAAAGGATTATGCATAATTATGATAAGTTGAAGTCTCGAATCAGTGAAATTATTGACAG 

GGAAGAAGACITGAAGAAGCAGGCAGCTGAGTATCGAGAAATTGACAAACG 
GAAAGACGAGAGACCAATACTTGATGTGGTTGACTCAAAAAG<3TGTTCGGCAA 

ACTGAAGACCAATATTCACTG<3TGG AAGATGATGAAG ATTTG CCCCATCATGATG AGAAGACATGGAATGTTGGAAGCAG CAACCG 
AAACAAAGCTGAAAACCTGTTGCGAGGGAAGCGAGATGGCACTTTrCTTGTCCGGG 

3 0 CTGTACTGGTGGACGGCX3AAGTAAAGCATTGTGTCATAAA 

AGCrrCTCTGAAAGAACTGGTGCTACATTACCAACACACCTCCCTTGTGCAGCAC^ 
AG T AT ATG CACAG CAG AGGCGATG A 
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Table 86 

MOUSE NOMENCLATURE 
ICSGNM Braf 
Celera mCG4668 

5 

HUMAN NOMENCLATURE 
HGNC BRAF 
Celera hCG3843 0 

10 MOUSE SEQUENCE - GENOMIC 

GG AATG ACG CTITT ACACT ACG AGGTAGGCAAGGAGATGCGGATGAACA 

CCAGGAGGCTGGAGTGGGAGGTCCCGTTTTTTGGTGGGGTAAGGGGGGG 

CATTTGGCAGTCCAGGAACTTGCTACATAGACCAGTGTAGCCCCAAATT 

ACTGGGATCAAAGGCATGCX3CTCCCACACCCAGCAGGCACTCA 
1 5 CAAAAGCAAAGTGTACATTTGAAAATTAACATCCC^GAACTTAAGAA 

ATACrAGAGGTTTACTTTCTAAAGGTAClTAACTGTACTCAAGGATC^ 

AAGACTCTCATTACCATTCAGATTATATAAAAACCTCGGAAAGAGTGTTGO 

AAGAAATGAAGAAATGGAAATAAGATTTATCACTAAATATCCTAAGGAGACAGCAAGACAG 

AAAATGTATACGGCAATTTAACATACCAGAAAATGTTTATGTGATAAATGGATATATC 
2 0 AGCCGGGCATGGTGGTGCACGCCTTTAATCCCAGCACT 

ACAAAGTGAGTTCCAGGACAGCCAGGGCTACACAGAAAAACCCTGTCT 

CCCCCAAAAACCCAACAACAACAACAACAACAACAACAACI7VAAAAACCT 

TTCTTATTACCTGTGAACTATGATGTAAAACAATCCATACCGAAGTAT 

AAATT AAG AAAAAACAAAGG ATTGATTG AT AAGG AATTCAC CTT AGG CATT AT AACTCAAGAAGG CT ATTT AAT AG ACT AAT AGTA 

2 5 GTGACACACACAGCAGAAATGCCTGTCAGAGCATTCCTTAGAAACT 

GAAAATTTGAAATGTACAGAAATGGTAAAAAGGAAACAGC^ 
ATCTGTTATATTTCTTTTTATTTGTTCAGTTTGTAACAAGGTCTCATAGAGCCC^ 

TCAATTTTTTTCTG AC CAATTT AAAGT AAG TAGAAGACTTACTATTT CAAT CCT AAGCATCAGCGTTTAC CTT AT AAATTAAGGAT 

3 0 TTTCIATATTAATAATTAATACTAATTCACAACCATCACTGGACAAATCCT 

ACAAATAATAATAATAAAAACCAGAACAAAATTGTTCTC 

TCTTTTGTCTTTT AAA CAGCAACAG CC CCCT ACC CTTG ATTTTC CCC C CTG AGGTTG AG TTTGATGT CAT CT AT C CTGCAT AACG C 
CCCATGGTCTATGTTTGTTTCCTGTGGTTAC^AAACACACTGATA 

TGACGTGGAGTTTAGTCCTGTTGGTGATTGGTGATGCTAAGGCTCCCACATAACACAC^ 

3 5 ACACACACACTCACACACACACACACACACACTCACACACACACAC^ 

ACACACTCACACGCACGCACGGTGGTTAAGTAGATCTTACACAGTGGGAAAG 

AACAGTAAAAACCTATACTCAGAAAACTCTATTCTAATTGGGCCTAGAGTTTGCCTGGG^ 

GGAAGATGGCAAGTTCAAGGCTAGCCTAGGTAAGTTAGCAAGACCCTCT 

CTCAGT ACT AAAG CACTTG C CAACCGTG AGC AAAGTTCAATCTC CAAT ACT AC AAAT AATT ATG ACGCAAAAC CCT AT ATT CAAAA 

4 0 AGTAACTGAATTAAAAACCCTACATTCAAAGGATATTTGAGTTAATAATTTCCTCTGT 

TCCATTTTTTCATTTCTCATTTTCATCTATACT 

TATGAAGGTGAGCCCACGGGTGCCACAATATGGAGGGGAGAGTACATGTTTTGGAGTTGGTTTTCTCCT 
TAGTTTTGAACTTAGGGTGTCAGGTTGCACAGCAAGCACTTTGCTAGTAC^ 

CCATGCCTTCTAGATACTATTATTCTTCCAAACCCATATATACACATAGCTGTTTTCCCTTATCCAT 

4 5 TCCTAGGAGAAGCCTGAAACTGTAGATAGAAGCAAATTTTATATTACT 

AGCATTTAGGCAAAAATAACAGAGGGTAAATAGTAACTAAAATAACACT 
TTTTGGGTCTCTAAGTAAAATAAGGGTGAATTAAACATGATCAGTGTGATACTTTGACAGT^ 

CAGTG ACTCATGGGTCACAT ATCCTTT ATGGATGTG CCGTG C^UVAGGGAG ACTCCC ATTCCAGCTCTCATTCC^^GTTAGACAGiAA 
CAGGATGGTCCAAAGGGCTATAACTCTACTCGATGGAATAGAATTTAAAACGTGAATTATTGAG 

5 0 CGATACTCAGAAAAAAGTCAAAGGCZAGGAAGATGGCTCCAAGTTCAGGGTCTGCC 

GGCCAGACCCTTTCTCAACCTCTCTCCTCAAAAAAGAAAACAAAAGCAAAACTCT^ 
TTTCATTT AAT ATTT AAGGCCACAGATGACCACATGTAGCTAAAGCCAAAGGAAAC^ 

GCTCTT AGACT AT AGT C CT C CCTC C C CTCCT CTG CCAAG AGAT AACAT AT CATTCTATG CCTTCAAT AAAAGCAAGCAAAC AAACA 
AAAAACTTAGTATAGGAGACTAGAGAGATGGCTTAGCACTTAAGACCACTTACTGGTCTT 

5 5 CACCTACATCAGGTGGCTCACACTGTCTGTAACTCCTATTCTAGGGGATCTGATC 

CACACAGTGCACACACAGACAAGCATGCATACACATATGTAAATAAAAATAAAAATTA^ 
TTTTAAGAACTCTGTGGTGGTTTGGATATGAAGCGCTCCCAAAGGTTCGGAGATTTGA^ 
TTGATAGGATTAGAAGGATAAGTAGGTGTGACCITGATGGAGGAAGTGTGTCATTGG 
TGTTCTCTTCTTCTTCITCTTCTTCTTCTTCT^ 

6 0 TCCCTCTCCCTCTCCCTCTCCCTCTTCTCCTTCTCCTTCTCCTTCTCCTTCTCCTTCT 

CTTCTTCTTCITCrTCTTC , I " l , C , r X'CTTL''l' rCTTCn CI I XTTCTITTI'CITCTITTTTCCCCCCAAGTTGAATATACTAATGTGGG 
GTCCGATCATCTGACTAGTAGGATTATCTAAGGTCCATCAGAACGCAG^ATTCAAGATGGAAA 
TGCAGGGGAGACAGGTGAGGCCCAGCAGAGCCTGGCCTGTCTCAGTC^^ 
ACTTATTTGCCTCCAGTTTCTTGTGGCGGTGTGACTGCCCAAGAGGTGCAGTGG 

6 5 AAAAAATGTCTGTAAGAATTTAAAGTGCACrGGTTGTACTGTCrTGATGT^ 

GTCTGTCTAGATCACATTCTTGCTTGGCTCCTTAAGAACZAGCAA 

TATGG CT ACACT AGACTGTGGGAGGGAGT AC C CAAGAACAGACAGGCTGTG TT CAATG AAAAAGG ACTTGCGCCT ACTTTTTCATG 
AG GTG AGGG CT C AGAAG TTG AAAAGAT AAAGGGAGG G T ATG CTCCTT AGGCTACTGAGACACAG CTTGGTAATGCCCCACAGCAGA 
GCAGAGAGAACTGGACTTGCGTTCAATCTTTTTGGGACTGATTGAG 

7 0 TTGTTCTAAGAAAGACCCAGGGTAGTTTTAAAACAAGTCTGAGCCAGGCACG^ 

GAGTTTG AGGCCAGCCTGGTGTAT ATAGTGAGTTCCAGG CCAGTCAAAAAGTCTCACZACAGTG AGAT CCTCTCAAAAAACAGAACA 
AATGAAAAAAGCTCTAC^GTATCCAGAAGAGAAAC^GAGAGAAAGTGCAAACTT^ 
AACATTTTCACTACATAATCTCATAATGGGTAAACTCCAATCCCTGTTTATTGGG 
TTnCTATTGTTTTTAGATAGCATCTCACrTTATAATCrc 
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TCTAGGATTACATGCCACTCTACCTGGCTTTC 

GACAAGCTACATCAAACACAACAAAGAAGGATTTAAACTTATCTATTATATTACA 
CTTTTGCAAACTGTAAAAATTACCTTCAAAATTCAAAGTAAACT 
GCAGAAAGACTCAATGCCTAGGGCTTATCTTC^ 
5 AAACTGGAACCAAGCT AAGT ATC CAT ACACAATTG CACTACCTTCACG CACTCT CAG CTTCCT CAGT ATGG CTGCTTCAGGGCAGT 
CAGACTTCGGGCCTCTACGACTGTCACATGCCT 
AGGGCAC^CAGTATCACATCTGTTGTGTTTTATTATATACAG^ 

T CTGATGGAAG TTGGATCAAAGAATTTCCAGC CCTTTTTG IXiTI TTCT1T CCG AU'I'l'rr ATG ACAGGGG TTTCTCTGCAT AGCCCT 
GACTGTACTGGCTGTGGACAGTOVGGGGGGAATCTCTC 
1 0 ATTTATTTTCTTTTTTAATGATGGGACTTTCAA^ 

CTTT CTATTGTTGTGAT AAACACCACAAACAAAAG CAACHT AGGGAG AAAAGGTTG ATTTT AGTTTTT AGCTT AT AGTCC^ CGTG 
TATGAAGCTCGGGCAAGAACTCAAGCTGGAACCTG^ 

CG CG CTTTGCTCAGCCTGG CT ATT ATTT ACG AATTATGT ATT ATTGCAGAGGAAGAACAG CTTTT ATT AGAGGAAGTAGACAAAT A 
TATGTAGCAGGTATAGAGAAAAATGTCTTACACAAAACTGATCCCAGGCTCGAGAAGCTG 
1 5 GG CTTTTCAAG TTTCGGAAAGGT CTTAGGG CTGGT CAG TTTG CAG AAAGAC AC~GGGCC<; CTG ATTGGCCCACATCT ATTGTGT AAA 

CCTGTCCAGATCTGGCCTTGAACTTGTTTVAACCAGTCCATCCGCAGTGG 
TTTAAGCTGTCATGCTTTTGTCTCAGGTCAG 

GTCATGGCCCTTCrCCCrATATGTCrGCCrACCAAGGACTCTGTCT 
AACTGCTGTTAGGGGTTTGGGGGACAGGCTTCTCACTTGC^^ 
2 0 GCCTCAGAGGAGGCCTGACCTAAGGGCTCAGGATAAAACTTAACC^ 
TGTATATTCTTTTTGTTTGTTTGTTTCTTGTTT^ 
CCTGGAACTCAATCTGTAGGCCAGACCAACCTCTGATTTAG^ 

TG TG C CAC CAGTG CT CAG CTGGT AGT ATGT ATTCTTGAAGTGTCACATGAG AG ATGT AT CATCAGCAGT AGACATCTCTG C C CACG 
TACCAGAGGTAATGTCCAATGGAAGTGGGCTACATAGGCCAAATCT^ 

2 5 CAGCATCTAAGTGACAACTTTGATGAGTTGGTGGT^ 

TTTATTTACTITTTCTCTGCCAGTTAAAGAGTAAA^ 

CATTAGTTAACAAAACTAACCAACACACAAGGAAAAAAAAT^ 
AGCCATCCTGTGAAGAAATAATC!ACATAAGACGTGATCACTGTATAGACT 

3 0 GAAAATGTCTTAGAGGCAGGGCATGGGGOTACATGCCITTGATCCC^ 

AGTTCCAGGACAGCCAAGCGCTGTTAC^CAGAGAAACCCTGTCT 

TAAAAAAAAATGTTATAATGATACATTCTTTAGC^AATAATTTCCTAAAGTAAACCTGGACA 

ACAACAACAATACCCCAAAACCCCAACAGTTGTGTTTCAGAC^ 

TTAGTATGCITAGACATTATGTATTTATGGTTTTTAAA 

3 5 " CATCTCCTTCACTCGAATCTGTCAGATCATCTTCTGACTTA 

ATGTCATTGGCAAACACAGAATAGCTATTTGATTGGTGGCATTTTTT^ 

GCCACACGTG CACTGCTTACCTTGAAGCAAAG CATCAGTTGCCGGGCCTGTG CTG ATCAGACCAAATAGGC AGACATCCT ATTGTC 

AGAAGCTTTCTACCrTTTCAACTCATCCTAAATGCAATAAAGAGGTTGGTG^ 

ACAGACACACAGACAACTCAACTCAAGAGTTAAACTGGAAATTCCTGAATAGTGTTC 

TTTAATTTCTGCAACTTGGGCTTCTCT 

TCATGTGTTCTCCTGAC^TCCTAGTGGATGGAAGGAGCCAATTGATAGTAAATGACT 
AGCTTTAGATTTTCATTTATACTTCAACAGATCAAGTCCrTTAG^ 

4 5 GACAGTTCTATGAAATTTGGCTTGGCTGATGTTTCAACCTTGGCGTTTAAGGT 

CGGCAGTAATAAGTACTACA.GACTTTCCGGCCTTCTTCCCTCTAGGATTTTAACTTC^ 
AAATCCCACATGACAGAACAGGCTGAGAAAAAATAGGAAGCCGGAGCCATGGAAACTGTGC^ 
TGGTTGGAGGAGAAGGAACACTCCCCTGGCGACAGCGGCGCGAGG 
CCGTGAC^TGCGGGTGACGNNNNNNNNNNNNNNNNNNN 
50 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNCGGCGCT 

GG CCATCCCCCCCCAGTGCTCGCGCGTCIAGGGGCrCGGCGGGCCAGTGTTT^ CCCCG 

55 GCAGTGCTTGGTAGCCGGCGGGGAGGTGGTTATCGGCCGAGTGT^ 

TGGCCCAGGGGGGTTCCCCGGCCCCCACCCAGATGGGATTGGGGCAGTGGGGGCAGAGCTGGAG 
CTGGGTTGGAGAAGGTCACGCCTGGCTGAGGCTTCCAGCCCTGCATCCCTGAGATGTTTTCCTTTT^ 
GTCGAGGATGGAGCTTAGTGGCCAGGGCAGTACCCCTTGGTCCCCTAGCCCTAGGAACCGCCGTGTGCGATTACT 
GGGATGACGGAGCGGATCITTCCACTCTCCrCACTATCTTCCCTTTGCAGGCC 

6 0 GGATTTTTCTT T T G ACCGG TITTT GCCCAGGGGAT^ 

TCCGAAGCTTTTCCCATCCTGGTCTCTGATCCCCGAATAATACTGTTTGTCAGTGCTCCCTTGCTGACCC 
ATCCACCACCCCGCTCATTTCTCAACTGTACCTACTTGCTTCTAACCTTAGTTTCCACGGAAGA 
CTAGTCCCCTTGAGTGCCTCTACTTCATTCCACTGCACCGTCACCTTGTAAT 
TGCAAGCTATTCACTCCCTGAACCCCAAAGCTGTCAAGATTCCATCAGGTTTCACTTGGCCC^ 

6 5 TTTGGGTTAGTCTITCAGTGCACATACACACCTCAGTTC^TATCCTTC 

TTGCGTGCTAGCTTGTTTTTAGTGGGTTTAAGTGATCTGGCT 

AC TTGG CTTTT GGT AGTC CTTGT AT CATT CAACTT ATT AC ATTG T CAAATT AC CT AATTT ATTCTCTTTTCT CTGTATG T AAAACT 

TACTACATTTACCATTTGGGCACTGTATCGCATAAACAATACCATATCTT^ 

GTTGTCGAGAGTGACTTGTTTCTGTTGATTTTACTTGAAGATAAATTTTGTT^ 

7 0 TCGTACTACCTCCTTATCTCTACAGTGTTCnTTTTCCTGCTTC 

GTTAGCAGTCTGTTTGCAATAATGTTTGTTTTCCATCT 

ACCCTACATTTCTCACAG rr rrr'X'f TT XT T C 1 1 ATTTCTT AATTTG AAAT AATTC CAACG T AC C ACTTGAATT CATGCT CAGGTCA 
GTTTCCATTTGCATTTATTTCCCTCGGCTTCTTCTCCAGTCTAA^ 
CCTAGTTACCATCCATTCCGTCTTGTTTCCTGGCTTTTGGTACTTCACTTGCTGACA 
7 5 TTCCATCTTATTTATAAACAACTTTGTTCCGTT^ 
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GTCACATCTGCCQTCTATTTTCTCCAGGAATCGGATATG CCTTTGTCTTACTAGG CACAGGTGTCTCTGGGTTTT ACTATGCTGCA 

CATGCTTGGTTGTCTTGAACTGTATAGATGTGTGAGGACCAC^ATCCAAGT 

ATTCTGGGATGTGACACTGACTTGAGAACTTTGrc 

TTGGGAGAGCTAGAGCACTGGGGTGGGTCCCTTACCTTGGCCCCTCCTGTCTTCT 
5 TTCrcTTTGACACTTTCTAGGCAATCCCCATGGACCA 

AACTGAAGGCATGTTTGTATCTCCTGAAACTGTATTCTTTATTT 
AATGTTTGATGATGCTATAAAAAGAGAAAATATGACAAATATACATAACAAAATAA^ 
ATC CTTGTCAGAAACTCTGTGTGGCTGAGGCTGT AGTC 

GTGCCCTCTACCCAACAAGTTTAAAGTTAACATCAACTCAGAGGAAACTGAAGTATCCT 
1 0 TATGAACTTAAAGGAATATCTTTAGTTACTAAAGTAGATGGAAAGATGAAAATCTGTGAATGTG 
TTTTTATCTTACCTGGGTTTAAAAAAGTTTGTCAAAGAG 

TT ATTGAAAAATT AGGGAAAAAGTACAAAAAGTGC CCAT AATAT ATAATTG CT AT AAATATTTG CATGT ATT ACAACT AGTGCTTG 

CACAATGCATTTTATATATAGACTTGTAACTT1TGAAACAATAGTA<^ 

TGATAACTTTTTTGGAC^TTAGGTTATTATTATTT^ 
1 5 AAAATT ATG TT CXJ AG AAT AAAC AGATTTCATGGC CT ATTTGT ATT ACT AT AT AGG TTTCCAAAAT AAAAT ATTCTG AT AAATAACA 

AGAGTTT ACT AAATG TTT ATTCAG T AT CTTAT AATCTT ACTGTT AAGCATTT AATTCTG ACCATTT AT ATTT CCTTGATGGT ACTT 

TTTTGTGACTATATGGTAAGGCATGGCACATGGTATAATATTACCAGATGGTTAATA^ 

AAGGGTGTGATATTTATTACCTGTGCCITTCATGCrGTGTTTTCA 

ATTATITTTTGTTTTCAAGATCTCrrTAGCCAGCCTTGGCT 
2 0 TGGGATCTACTTTATAATGAGGCCCACTCTATT^ 

GTGGTATTCTCACTAAGGTAAGAAGGTGTGTGTGTGCA.CAGTAAATTTTAGCTG 

GATTTCCTTTGGAGGGGGAAGCTATGAAGTGGCTTC^SCT 

CCTG ACTT AACACCTTC CTTT AGGCATCCATTCAC C CTTTT ATTT AAAAAG AG TT AT CTTGTATCAG TTGCTTATAT ATG CTGGCA 
AT CTATTTTAAGTCGACCAAAAGGAATGTTGGA(TTGGACACTTGGATTA^ CAG ATTT G ATTTAAT 

2 5 TGTGTTTGGAGGCCAGAAAGATTACAGTAGTACAGCAGACGAG^ 

CAGGACAAAT AGTCTAATGGCAG CTG AAAAGCCATGTTCrCAGTGACAAAGAAGAAAGTCCGTGTTGCTlT CACTGTG CTCTGGTG 

TTTTCTAAGGCTTGTTACCTACCAAGTCACACAATGGAGCC^ 

GTGTTTTCTCCrTTAGAGCTGCAGAAATGTGTTTTG 

GCCATAATCCCCTTCC-TTTTAGAAATCTTGTTTGCTCCTATTCCT 

AAGCCCCAGTTATAGATCGTTAC!AGAACTTCGGAATTCA^ 
TGTCCTTTCITTACCATAGTCCrAGAGATCACTTCATCTT 
AAACATTTTGTAGATGCAATGACIAGTCTTGAAGGCAGGGAATGGTT 

ACTAAC CAATTTGCT AG CAT ACTT AGTG AG CCATGG T AGT CTT AAG C AACTTG ACAG T CT AT ACCT AACTTT AGG CAG TT ATTTCT 

3 5 TTCTCCCrCTATTGTTATCTTGCTTTCTGTGTTTTCTCTTGATCTTTA 

CATGCIATTCTCTGTGTAGCTCTGGCTGGCCTCAAACTC^ 

ATTACCACCGCCCGGCTTTCTCTTGATCTTTAGTTAGTAAGTAAGTAAGTAAGTAGATAATTGTTGTTC 

ACTATTGACrATGTAGTATCAAGAAATTGATGCCAAGTAATTTTACCTTCCT 

AAAATTGTTCTGAGCrCACTAAGCTT^ATTTCTTCTGA 

4 0 TTTTTTTTCCGAGACAGGGTTTCTGTGTGTTAACCCTGGCTGTCCT 

CTGCCTCCCAAGTGCTGGGATTAAAGGCGTGCXjCCACCACTGCCCCTCAGTTTC^ 

TTCTGGGGAAGATGTACCATAGTACGATGGTTTCCCAGTAACTCCAGTGCTACACTGGCTGACCAGTGGGCCCTGGGGAT 
GTCTTCTGCCTTCCAGCTTTGGCAGATGCn'AGGAATCCAAACTCA 

CCGCTCCCCAGCCCCCX3AGTTACTTTAATAAGTTCTTGATGGTACAGTTTTTGATAATGTATAAGG 

4 5 AGTATCCTATTTTCTATGAAGGCCAGATAGTTGTTTTTATAACT 

TGGTGCATACCTTCAATCGCAGTCCTCAGGAGTTGGGGCACX5TAG 

TGCAGAACAG CCAGGGCTAT AT AGAGTG AT CCTGCTTCAAACAAAG CAAAA C AAAAAT ACT ATTTT AATT ACTTCCAAGT AACT AA 
TTGGTTTCAGCTTTTGTTTCTCTGTGGGGTGAAGGGGTAGGGAGCTAGAGGAGTGAGGAGGC 

ATAGGCACAATCTGCCATGAGCTGTACCCACATTCTCATT CCCCCAG CAT ATTACT CT CCTTCCAGAGCTCATCAGGTAGGGAACA 

5 0 GAATATAATTCAGAGAATTCAC^AGGCCTTACACAGGC4GTATTCrAAGCT 

TGCCTATATTTCATCCATAGAGTCTTCCTACACATC^CTTCTCATATTAGCAGAAAGTTATACAA 

AATACAT AATTCATTCACAGCAAGCTTTTTGCTGG CACCTTCTTGATCTTCCCC<^CGCCTCCGTATGCTGTGCG CTG CACrTCTGT 
TAGTCO^AGTGTGGCTTAGAAGACTGCCGTGGGAATAAAGTGAGGTAATAAATATGT^ 

5 5 AGACAACTTAGGCTCTAGTTATGAAATTTTGAC^GTCCTTA 

CTTGAAACTATCTTTGAATCCAAGAAATACAAAGTGGCAACATATTATTGACTTACTGGATAA^ 
ATGCGATAGTACAAGGTCTAACTATGTTTTTATGGGTGATAAACTGACTGTGC^ 
TGCTGGCACCTAAAACTACATTGCTGACTTTTGCATTGAGTTATTTC^ 
CTACAATATGTCAGGTAATACTTTTGAAATTAGGAATGTATTTCAAAACAAAAATCT 

6 0 AGAGGCACATGTTAGGATTTGTGGGACAACAAATAAGCTCATAAGTAGACAAAAAGGAAAT 

ACCTGTAATC!AGTTCTGATGCTCAGGAGGTGGAAGGAAGAGGATCAGGTGTTGAAGGCT 
ACCAGCTGGTTCTACGTGAGACTGTTTCAAAAAACAAAATGAAACAAAjGCCAAAAA 
AAATTATTTTTGATGTATTTTATGTACATGAGTGTTTTGCCT^ 
ATCAGAAGATGTGTTGGATCCCTTG<3GACTAGAGTTACGGATGATTC^ 

6 5 CTGTAAGACA(IAAAAGAGCAACAAAAATTCTTAACCACTGAGCTGACACCCCAGA 

ATGTAATTAATTAAGGCTATTTTAGAATGTGTGCTCTGAACCCAA 

GTACTGGAGCTAAAGACATGCACCATCATGCCTAGCTTCAAAGAGGAAGTTATACAGTATGTTTGAAGATACTAAGTTCA 

AATTTCAAGGAAAGCAGAGTAGGCCTGGGAAGGTGGACTGAGGTTTAAAJVCAAGGGAGAG 

GGCATTCGAACAAGAGTTGGGAGAAAGAATX3AGAGTGCATTCCTGTAGAACT 

7 0 T AAGAAATAGT AAG AGGCCAGAGTTACTGGAAT AACTCAAAAGAATTG AAAG CCAGTGGG AGATGAAATTCCAG AGGG CTTGGGGG 

CATGGCCCAAAGACTGTTGTAAAGGAGTTTTAACATTTTAAAG<5AAAAGGAAAGC 
GATCTGATCGAACAGGTTTGTGTGTTTATTTGTTCATATC^ 
CATGTGGAAGCTGGAGGTTGATGTOVGCrGTCTrCCTCAATTCTG 
ATCTGGAACTTAGTCATTTGGCTAGACTGTTTAGCCAGCAA 
75 TATTC C CTGTCTAAAC7GGG CAAGT A CTTTO CC AA CAAG CCCCAGTTCTTCTCACAT CTTG AT AGGAATG CTTTGG TT AC AGT AT A 
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GAGAAT AGACT AACAGACAAGGT AAG AG AACT CAG ATAAAGGGGTCAC ATTTTG AATGG 

TTTGAAAGTAGCTCAAATCTGATGTGTTTTACCCTGCCCCC^^ 

AGAAATTGTACCACTGCACCACATCATAAGCCCTTGGGTTTGTTGATAGACT 

ATGTCAATGGGAAAG AGAAGACC CAGGTCAGGAATTCATTGTGTGGGATGCTAAGTTAGAGG CAGATAGCAAATGCTGTCTAAGAT 
5 TTTX3TATAACTAATTAGATTTCTGACTCTGGAGTTGAAACGAGGCC 

TTTTTCATACACAGCAAGTTAGCATTTTTACTTTGTAGTTAGGTA 

ATTAGGAGGTGTGGCCTTGTTGGAGTAGGTGTGACACTGTGGGTATGGGT^ 

TCTCTTAGCAGCCTTCAGATGAAGATGTAGAACrCr 

ATGATATTGGACTGAACCTCTGAACCTGTAAGCCAGCTACAATGAACTG 
1 0 CATCAGTAAAACCCTAACTAAGACAGTAGGTATGATCAAAGAATTGGGTTCCAGT^ 

ATTTCTAGATGTGGCACATATAAATAGCCTCC^TTCTAGCTC 

GTAGCTCCAGGCAGATCTCTGTGAGTTCAAGGGCAGC 

AGGCTGATAGAAAACTTATCATTTTTAAAATGGCTTGGAATC 

C^TGAAATAGAATGAAGGATATTTGTTTTGTTTTTGAGATAG 
1 5 ATAACTGGTTGAACTATGTATTATTGTTTTAAAAAAAACCTAGGAAA 

TGACACAATCAGAGGAGAATACCCXaTACTCTGGTTCT^ 

TTTT ACTTT AAAATT ATT CACCT CAAG TATTTTCTT AT AGCAC AAAATTGGTT AAG AGATGAAGAACTGG^ 
GAAAATAGAATCAAGAAATTATGTTTTTAATTTTAATTAATW 

AATTTGTAG ACGAGGCTGGCCTTAAATCCTGC CTGTCTCTGCCTCCTGAGTG CrcGGATCAAAGCAGTGCCCTTCCACCTAATTAA 
2 0 TITrrr AAAGGTTTTTGAGAGAAAGAATTGTTCTATGGTTCAAATTGACCT 

TTGGAATTGTATGCATGTGTTCCCATATAGCCTCAAGCAAAAATAAATTTTGTTTGGTC 

AGAAAGGAAATATTAGAGCAAAACACCAACTATTTCAGGAAGGATAAGAACT 

ACAGGGTTAGTGTATAAGTAGGAAGGATGGCTTCAATACGATC 

CTCAGGGTCTGTTTGGCTCGTGTCAGGCGTCGTCTCAGAGTAGAAAGTTTCATCCCAG^ 

2 5 TGTT'ACAATTCAGCGCAGCCCAGGAGAGAAGGGAGTAGAACAGATCGTAAAGTGG^ 

AGCACTGTGGCTTCCTAACCTTTGCTGTCACATCTGCACGGTTA 
CTCTCATTTTGACTCAAATAAAACATGAATTATTACTGCATGAGGTATCTATAA 
GAGATTTGGTGACTGATTTTTCTTTTTACGAAAATTTCTCTTCTGGGAGTGGTA^ 
AGATGCAGGTGGATTTCTTTGAGTTTAAGGTTAGCCTGGACT^^ 

3 0 GTCTCAAAAATGAAAAAAGAAAATAAAAAAAAAAGAAGAAAAGAAAAAAAGAAATCCT 

ATTG CAG TTTGTCTTTT ATG CTGTG CAGTG AAGCAGTTCCCTGT ATTTTT AAAA^ C CT ATGT AT ATG AGT ACAC 

TGTAGCTGTCTTTAGACACAACAGAACAGGGCACCAGATCCCATTACAGATGGITGT^ 

CTCAGGACCTCTAGAAGAGCAG CCAGTGCTCTTAACCACTG AGCCATCTCT CTAGCTCCCTTTCCTGTACTTTTTT ACACTTCAAA 
TTGTACTTGGTTTTGTTTACACTTTTC^ 

3 5 CAGTGAC CTGCCIATCTACTCT AGGAAGTATCCTTAATCCCCTCCTAAGCCTGCCC^ 

CGTTTTATTCACCATATTGCAAGTTTGTTTTTATTTTTGT^ 

AG ACTTT CTATAATGTTG AAAGTGTGCITTCTTGCCCAGTGGTAAGCTG CTGCTCATCTGTTCATTCCTTTG C CTG ACAGCACTG A 
TGCACTAAATTTATGTAATTTTCAGATTAGACAGTCTCGTG 
CTTTGCAGTTTTATTTTTAGATCTTTAAACT^ 
40 TCTCCCTCTCTCTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG^ 
CAACTTTATG^AGTTAATTCCTTATGCCTTTATGTGGGTTCCAG 
CTGATCCCAATGCTCTTAATCCAAAGAGCTAGCACTAATCCGGGGTG 
TTTTCCAGTTTGTTCTGAATTGTGTCGATTTATGCAGCrCTTAAATTC^ 
GCTTTTATTAAGTGCTCTTTACCATTCTAAGCCTG^TGTGTGACACATACCrrc 

4 5 GAATATTAAATTTTGT ACCAACCT AGGCTGTTTACAGAGTAAGTTTCCTAC CCATTCTGGATT ATCCAG CAGTCCCCTTTCTCAAA 

ACAAAAAGGCTCCTTGGGTTGATAGAATATTTTAATCAGTTGAGGACCCAAGGACCTAAGTTT 
ACCCATACATGTGAACAGACACACACITTTTCAAAGGACCCTAGC^ 
ATAACTCTAGGTCCAGGGGGTTTGGCACCCTCTTCTGGCCTCCAAGGTAC 
TATTTAAAAATAAACCrTTTTGTAAAAGCCAGGTGTGGTTGCTTGCAT^ 

5 0 CTGAGACTCACTGGTCAGCTAGCCTGACCTACTTGGCAATCTCCAGGACATGAGTGACT 

GCCTGAGGGGCCTATGCACACATACGAAGGAGTTCTTTCCAOTTTAGAAAGTCTCA^ 
TTGTGACTGAAATGGTTTCC!ATGGCAGTCTGAAGTTGTGTCAACAACCACAGTGATTCrTTTGTATAAG 
ATCATAATOVCCrGTACATTCTTGACTTATCTTCTTAAAAATT^ 
TACTAGCrrCTTAAAATTGTATCCAGCTAAATTATCTTTGAAATAGGATTTAC^ 

5 5 GTGGGCITTGTTTGTTTTAGG CTCTGTCTCTCGTAGGC CAAGCnTGCCCTAAACCT ACTATGT AGCTATATCTATCTATATCT ATA 

T CTATATCTATATCTATATCTATATCTATATCT ATATCTATATCTAT ATCTATATCT CTA 
CAAGCATGTGCTACCACCTTGGTAGGAATGTTATTTTAAAAGAAAAGAAAAAGAAGG 
GAAATTTGCTCTGCCAAAATCTAGGAAAAAGCACCCCTTCCACCCCCCAATAAGTAGCT 
TGCTGGTTAGTTATTGTCAGCTTAAGCTACTCAAGTGAGAAAATGCTCTCA 

6 0 GCATTTTCTTGATTGATGTTTGATGTGGGAGGGTGCAGCCCACTGTAGGAC 

CAAGCGAGGAGGAACAAGCAAGTAAGCAGCATTGCTTCATGGC 
ACTTTGTTTCCCCrrGGTGATGAGCTATAATGTGTGAACAAAATTCTTT 

ATGTTTT AG CTTTGGTGTCAGTGGACTTCATAAATTGAAGAGTACAAAAGT CAACACCT AAAGCTTATATTAAGTGAG AGTCAAAG 
ACAGTTTCrrTTAGGTAGGACCCTGAGGATTCCrrTCATTAGGTTTGCCTTTGAAGTAGAACTAAATTATCT 

6 5 TCAGAAATGTGAGACTCTAGAGAAGCTTGATTAGAAC^GAGGAATCAAAAGGAAAGTAAAAAAAAAAAATTGTGACTAT^ 

TCCTT CATATTGACTTTTAGAGCAGTTTTTGTTTGTTGTTTGGTGTGTGT CGTGCGTCTTTGCATGCATGGT ATTTG 

GTCrGAGTGGTCCTGTAGCTCAGAAGCrCTTTATCCTAGTTCCTAGTTGCAGAGATA 
GGGAAGTCACCTGTGGAGAAAGGCCCGCCCITACTAGGGAATTGTTACTAATAATTTTTCriT 
TTAATCTTCTGTATATGTTTGTGGGTCTGAAATGTGATACATGTGATCCA™ 

7 0 CCTATATCAGATGCCCTCGGACTGAAGTTACACACAGTGACCTTCTCCTACCCATGTGGGTG 

AACCTCTCTGGAGGAGCAGCAAGTGAATAAGCTAATACATTGT^ 
CACGTAAGAAAACACATATTCTGAGTCAAAAAAGGTCTTATTACAGTATAATT^ 

TAAAGGTAATTAAAACAAGACAAGGCAGTGTTCACTGTGTAGCCCTGGTGTTCTGGAGCATGCTGTGTAGACCA 
ACTCAAGATGCCCACCTCCCTTTGCCrCCCXSAGTGCrAGGGTTGGAAGTGTGCACCA 
75 GAAAAGAAAAAACACTAATAAAAAAGC*TG^ 
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TTTAAAATGAAGATAAACTATAAAAATGGCTT 

AAATACATGAGAAGCTTAACAGATTATTTTTGGGTGAAGAAGTGGTTGCTT 
ACAGTCAAAAAGATTGAAAATGCTGAAGAGAGCATAAAAGATAAGAAGA 
TCCAGAGTAAAATATTGCAAAGTTAATATTTCAAGAGTG{K3 
5 TGT AAGAAATG CACTGAATTTTTAACAGT ATAAG AAAAAACAAGT CTAT ACCCAGACAGATCAT AT AGT AAAACAAGG AG AAATCC 

TGAAAGCAGTGAAAGJUVCAGAAAGACGTCCTTTAAAGAAGCAACTCTGGGGCC^ 
ATGGAGCCTGAATTGAGAGAAGGAAAAjGAACAGACTCTTCACT 
ATCTACACACAGACGATAAATAAACATAATGAAAACATTTTAAA 

AAAT ATCAGGAGT ACAAATGT AGTT CAACGAATCATCAGC C CTTTGAAATT ATG AAGT ACAT AATT AAAG T AAAAAGTCTGAGATG 
1 0 CAAGGTG<1AGTGAAX1AGCAAAGAGGAGCATGCATAAGTGATAATTGCATA 
AGGCAGTGTTCTAAGACTTTTATITG<K5 

GATCTTGTCCTTTTGTAG<3ACCAAACAAACCATGAAACAAACAAAC^ 
TCCTGGCACCACAGAAGATCGGGAGGAAAGGAAATAGTGTGGTGTCT 
TTTCCATAGCATTTGTATCAAGTGAATTAATAGTAAGACAATTC 
1 5 GGAACACAAGTATGCTTTCTITAATTTATGTCTCCAGT^ 
AATTTCrrrATTTTTGTACTCTCATTATT^AGTAGGTAG 

TG CCTTCATGTGT AT ATGT ATCCAGGTCACC CCAACT C CAAACT AAAAAT ACCT CTG CTTG ACTTTGCTC CTATCCTTTTTTTTTT 
TTTTTTTTTTITAAATAACCACAGAACTGAAGATGTTGT^ 
TTACTTACTACCTGCGCTTACAGAGTTCTCA^^ 
2 0 CATCAGGCTACATGTGCACTGTAATGAGTATCTGCCTGCGTTTTCTG 
CACAAAGTCAGCAATTCAGTTCTAGACCCAGTTATCITTGCCTTTA^ 
TAGTAGAATTTACGCTGGATGCTCCATAGTAAGTTTCTCT 
GACTCACCATCTGCCCTGATGCTTCTATTTTATTCATCTCAATCACTCT 
TATTTTTGTTTTGCCCATAGTTTACTTCT^ 

2 5 TTAAG<jAATTTAGTAAC^TAGTAACAAACACCCTGTTGTTTACAAATTA 

TCCTGATCTAATAATCACAGTCCCGTCTGTCITTGTTCGTCTTCCTGTGTTTTATTCT 
TGAGCATGCTGAGCTCTCCTGTGATTGATTTACTTTGTCGTTTTGT^ 
ATTCCTGGCTGTTTTGGAACTCACTCTGTTTCAAGGCTGGCTTGAAACT 
TCACTGCCCAGCTATTTTCTTTAGGCTTTGCTAATATTTTTA^ 

3 0 CTTTCATAGAGTATTATTTGGACTACAGATTTGTTATTGAAAGAGAAGAGAATCTACTGCTGAATGCT 

TTGGGGAGAGAGGGATGAGGCTCrAAAGGATCTTGTGGACATGGATTTTAAAGTG 

GAGAAAAGGCAAGATGAATACAAAAGGCAAATAGTATGCAGAAAAGGTTGTGGATGGACTT^ 

TTGCTGGGACAGTTTCCTACTCTAAGGAGATGGCTCAGCAGGTGTA 

CACAGAGAAAGGAGCTTCAATGGACXJAAATGCCTCCATGAGATCCAGCTATAAGGCATTT^ 

3 5 TGTGGGTGGTACCATCTCTGGGCTGGNNNNNNNNN^ 

TGAAATATAGGCAATCACATAAAAACGAAGAGAAATGGATACTTCCAACAGCTTAGAATACCCCT 

CTGAATTATATTCTGTTCCCTACCTGATGAGCTCTGGAAGGAGAGTAAT^ 

AGATTGTGCCTATGACATACAGAGCCCAGGGTTCCCTGCCTCCTCACTCCTCTAGCTCCCTAC 

AAG CTG AAAC CAATT AGTT ACTTGGAAGTAATTAAAAT AGT ATTTTTGTTTTG CTTTG TTTG AAGC AGG AT CACTCT AT AT AG C CC 
40 TGGCTGTTCTGCAACTTACTGTGTAGACCAGGATGGCCTT 

AAGGTATGCACCATCATCATACCCAGCAAGATAGTACTTTTAACACCACCAAGAGGGATGCCGG^ 
ATCCCAGCACTTGGGAGGTAGAGGCAGGCGAATTTCTGAGTTCAAGGCCAGCCTGGTCT 

GCCAGTAAXIAAACATCTCTCCATGGCCTCTGCATCAAC^ 

4 5 TCGACAGCAATGCGGAAAGTGTAAGCTGAATAAACCCTTTCCT 

ACCCTGACTAAGACAGCAGGTAAGGGCATTTTTGGATAC^ 
ACTCAAAGTAAACAACAATCTCAGAACCTTCCAACTCCATT 
CAAAGCTGAAAGTAAATGTTGTATAAAATTAACAGAGTTGGGGACTGAAGC 
CTTCCAGAGAACCTGGGTTCAAGTCTAAGCACCCAAATGGCAACT 

5 0 CTGGCACCAGATATGTATATGATGCACAGACATTTACGCAGACAAAAATGCCTGTGC^ 

AGAGCATAATTTTATGAGCTCAAGACATTCTTTTGGTTTGTAGTGCTGGAGAT^ 

TGAGGC ATGCTTCT AGCCCAATAAAGAAC ri'l'T'lT'l'TT TT TT T '1 l'G<3TTTTTCGAG ACAGG<3TTTCTCTGTGT AGCCCTGGCTGTC 

CTGGCACTCACTTTGTAGACCAGGCTGGCCTTGAACTCAGAAATCCGCCTGCCTCT 

GCCACCACGCCCGGCAATAAAGAACTTTTTAAATGATTTAAAACCTCTAC^ 
5 5 TTAGAAATAGAGGGCCAGAGAGATGGCTCACCAGGTAAAAGTGCTTGTTGTGC 

CAGGTAAAG<5TGGAAGGAAAGACCAGATGCCAGAGGTGTCTTCTGACCrrCCACATGCGCACTGCAG 

TGCTTATCCACACCAAAAGAAAGAAAGAGAAGATAGAAACTTCTCTTTATAAATACCTTAT^ 

GCATAAAGTTCTTGCCATGTGTGGTGTTTTAAATAAGAATAGCCCCCATTCATGTGTTTGAATGCTAGTCCGTA 

GTTAGGAGGTGTGGCCTTGTTGAGGAAGTGTGTCACTGTAGAGGTGGGCTTTGAGGTCTCAAA 
60 CAGTCTCCTGTTGCTGCCTCCAGATCAAGATGTGGAACTCTCCGCTCCGCTCCTGTGTCT 

GACTAAACCTCAGAAACTGTAAGCCAGTCCCAATTAATTAAATG 

AGCAATAGAAACTCCTTTTTTTTTTAAAGATTTATTTATTTTATGTATGTGAGTACACT 

CATGTGTTTGTTG<K3AATTGAATTTTTAGGACCTG CTTGCTCTGGTCAACC CX5GCTCCCT CTGGTCAACCCTGCTTGCTCAGTCCC 
TGCTTGCTCTGGCCCAAAGATTTAATTATTTATTATTATACATAAGTACACTGTAGCTGTCTTC^ 
65 AGATCTCATTATGGGTGGTTGTGAGTCACCATGTGGTTGCTGGGATTTGAACTCAGAACCTTTG^ 

CCGCTGAGCCATCTCATCAGCCCTAGAAACTCTTAAGACACCATGAGGATCAGAGTTTAGATTCCCATCATCTCT 
CCTAATGGACATGGCAGTTTGCCTACAATTCCAGCTTT^ 

CAATATTGATGAGCACCGAGTTCCATTGAGAAAGATTCCTGGGCTGGAGAGATGGTGCAGCGTTT 
AACCTCAGGTCTCTCTCTATATATACATGTACCAAGGTACATATGAACAGGCATATATGC 

AAGGCATTTTGGAAACCTGGAAGAGAAGAGGGGCTAGGTGGCGAGAGAAAGAAATCCAG 

AATTTTACTGTTCTGCACGCAGTTATX3AAGGAAGGGG 

CCATGTGCACAGCTCCGAACAGCAAAGCGGCAGGTTCTA 

AAGCAGGTTTCAGGCTGGGGGAGGGGAGACTACAGACAAGATAAGTGAAGATTCTTGCAATCCATTAG 
7 5 ATTTACTACCCCCTAGTATTTTCA&AACAT 
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GGCATAGCTCTGACTCTAGGATCTCACAAGGCTAGAATTACCAAGGATGTAATC^ 
CCATGTTTCCTGATATGGATATAGCAGACTGCAGAAGACCCACITCCA^ 
GATTTTCAGAAGAGAGTCAAAGTTTTTTGTTTTTAAGAT^ 
ACACCAGAAGAAGGCATCGGATCCCATTGTGAGCCACCTTGTGGTTGCTGG^ 
5 GCTCTTAACCTCTGAGTCATCTCTCCAGCCCTACCATTGTCTTTTTAT^ 

TAACTCTTAAGCCTACACTCAAGGGATTCAGGTCATTAAGGACATTTTGGA 
AAAATATACATTAAGAATCAGCCATAGGAGATCAAGGTCTAGGGAA 
CTCAGTTATGTAAAAATGCCACAGTAAAAAAGACTGAGGATACAAATGCT 
TTG TGTTCAGGCCTT AATTCTT AAAACAAGAACAAACAAAAACT^ 
1 0 CTCAATGAAAGCCATTACTTTGTGTGCTGACCTACTAGGTTTTAAAGT 

TGl"rrrClTTGTGCTCCTAGACTGCCTTrATGAAGCTGGGAGGTGTGCCTGTAGAGAG 

G AGCAAATTTTCTAAGTTCTT CCATCACTTGAG AATGTG T AG ACT AGAGAGACT CAG CATGACT CAGGGCA 

TCCTACAACTTGTC^GCCAATTTGTGTCTGTTAGAGAGTGTTGTCTGCT 
TGGGGAGGAAATCAGGTGCTCCTACCTTCATTTGGAGAGGGCTTAA^ 
1 5 CTGTTTTTGAGGCAGTCATGCTTTATAGCC 

GAATTGAAACTAGGTTCTCTCAAAGGAAGACGTGACCACTATTTTATTAAATTG 

TTATATTTTTAACCACTCTTAACTGCTGATCCATCTCTTTAGCC^ 

AATGAGTAGCAAATTAGCTATCCTAATTCTAATTATTACTTACTATTAT 

CACT AT AAATCACTATAAAATTTTT ACT AT AAATG TG CCCAGCCCGTGGAAAGTCGAGA 
2 0 CTTATTGGGATAGTAAGACACCCCCCTAATCCCCACTCTGAATTAATCAAAAATAGAG 
CAGTGGGTAAAGTGCTTGCCGTTCAAGTGTGAGGTCCrcAGTGAGTT^ 
ATGGATCCCAGGAGCTTGGGGCTATCTAGTGTAACTGAAACTGAGAGGT^ 
ATAGAAGACACACAGCTCGATCTCTGGCCTCCACTCCAATGCAC<3AGCACATACCAC^ 
CATACACACACACTACATACACAAAAAATTAAAAGTGGTCTTTA 

2 5 GTAGGTAGGTAGATAGGTAGGTAGGTAGATAAGATAGAAGGATAGGACATGACCAGT 

AACAAT CAGG AAAATT AAAACAAT A^ ACT ATTTT AT AT AGATT AG ATTTG T AGAAATG AGG 
GATAATATGGAGAAGTAGGGGTTTCTITGATTTCTAGTGATAGTGTTCATTAGAAAA 
CTTCCTGTAGACAGTTATCTTACTCCAACTGTTTTCAAGCTTGAATC 
TTCTGATTAAAGAGGACTTGGGTGGAAACTGATAATTTGCTT^ 

3 0 CCTG CTTTGTG CITCAATCTTTTCCCTAG AGCAGTOT 

TGACCTCCAACCATAAAGTTATTGTGTTGCC^TTTCATAACTAATTTTGCT 
AAGGTACGGTGACCCACAGGTTGACAACTGCTGCCTTAGAGAAATGAAAGC^ 
GAGCACTCAGCGGTTAAGAGCACTGACTGCTCTTCCAGAGGTCCCGAGTTC^ 
TGTAATGGGATCTGATGCCCTCTTCCGGTGTGACTGAAGAGACC!AATGGTGTACT 

3 5 AAAAAAAGGCTTATTGAGGTCAGGTGGTGGTGGTGGTGGTGGTGGT 

GTAGCTCTGAGTTGGAGGCCAGCTGGTCTATAGAGTGAGTrCCAGGACAGCCAGGGCTACACAGAGAAAAT 
AAAACCACCACCACTACCAACAAAGAATVGTTGATTGCTATTAACCACCAAAT^ 
CATATTTGTGTGCTTACrTAGTGGTATATTATATATTTAATTACAGAATATAAAGCTAG 
GAAAAAAGCCCATGGATTTGAAAGAGAATAGATGTGTTGGGGTACCrGAGAGGTGTT 

4 0 AATTGTATTTTAATCTCAAACAAAAAGTTAAAACAAAAAGATATACAAATCrTAGTC^ 

CAAGTATATTAGTCAGGTATTAGAAGATATGGTGGTGTGGGTTTGTAATCCTAATGATGGCAAGATGGGA 
AATGACATCTGGAACTCTGAGCACTTCAGATGCTTACCTGTACCACATGTGCAC^C 
ACAAAAAACCAAGATGGAATTAATGATGTTGTTTAGGAATGCAAATATAGOTA^ 
GTGTGATAGTAATCAACTCATGGGAAGCAAGATTGTCAAGGCCAGTTTAACT 

4 5 TAACATTTTTATAAACTTACGATCTCCTTTTTAT 

CTTTGGATTTTTTTCTTITATTATTTGTGGACATCTr 

TGGAC^CAATATAACTTATTTGTCTTTTTATTTAGAATTCTTTTTTGTTTCT 
ACTATGTTGCATCTGATGAATTTAAGTTTACAAAAATTATGCTTITTTAAAATT^ 

GGG ACATGTAATT AT CCT AACAC CAT GTTTG CAT ATG C AT ATGATTTTGTTCTCATTGATTTGT AAAG CACCCTTG AGT AT AT ACC 

5 0 AGATTCTGTTTGTCTTTTAATAGTGTCATGTCCACATTACTA 

TTTGrrGATTTTGTGCAAATCrATACTAACTTACTCTTCATGAG 

GGATTGTTTTAGATTTTAATACTGTGAAGAGAAACCATGACCAAGGCAACTCTT 

CAGTTTCAGAGGTTCAGTCCATTATCATCATGGTGGGAGGCATGGCAGTGTGC^ 

T ACATCTTGATCTG CAGG CAC CAGGAG ACTGTGTCACACTGGGTGT AG CTTG AG C CT AGG AG ACCTC AAAGCC CATG CC CACTGT A 

5 5 ACACAC^CTCCAACAAGGCCACACCCCCTAATAGTACCACTCTCTTTGTTTTGT^ 

TATGTACATAAGTACACTGTCGCTATCTGCAGACACATCAGAAGAGGTC^ 
TGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAAAGCAGTTGGTGCTCTTAACT 
TCTATCAGCC!AAGTATTCAAATACATGAGTCTACAAGGGCCATACCTATTCAAGCCATCA 
GTTCATATAGGTTGATTTATAACTAAAATTCAGTTTCACAACrCATGAACATGGTCTT^ 

6 0 TTTTCCTTAATAGGTTTTACCTTTCTATCCAGTGCTCrTCAGTTTGG 

TTGTGAATGAGATGCTTTAAATACTTATTTTTATTA^ 

GCTGTGAGAATATGTGTCAGATCTTCTC^ACTTAGAGTTATAGGCAGTTTTCAGCAGTGGGATAAA 

GCAAGTGTTGTTAATCACAGAGTCATCTCCAGTCTTGAGATAATAAATAAAAATAGATAATAAATGAATATGAATGAA 

TAAAAATTAAAAAATAGGGGCTGGTGAGATGGCTCAGCGGTTAAGAGCGCCGACr 

6 5 GCAACCACATGGTGGCTC ACAACCATCTGT AACGAAATCTTATGCCCACTT CTGGAGTGTCTG AAGACAG CTACAGTGT ACTTACA 

CATAATAAATAAATAAATAAATAAATAAATATTAAAAAAATAAAGTTGTATGGAAGAAGCTGT^ 
ATATCTAGTTCTGATACAGTGCTTAGTATTTTTCAAGTTCACTAT^ 
AATAAATAAATAAATATTTAAAAAATTAAGTTGTATGGAAGAAGCTGTTC 
ACAGTGCTTAGTATTTTTCAAGTTCACTATTACATTGCCAGGATTTTAAA^ 

7 0 TAGTTATTATTATTCTAATACATTAGCTAGGGCCTTCCATATATAGTCATAAATG 

AAATGTATCTTTGGGCTGGAGAGATGATGGCTCAGTGATTAAGAGTACTGGTTGCT 
ACCCATATGGCAGCTTCACAACTGTCTGTAACTCCAGTTCCAG^ 
ATG CAGG CAAAAT ATT CAT ACATGTAAAATAAATATTTTTAAAAATTGT ATT CT 
CTATGTGAAGTTTTAGACTGTTTCrrGATTTGTAGTCTGCTA 
75 CCACCCATTATACTGAGGTGATCATATCACCTGTTGATGTAAGTCATTTCATTATTAGGC^ 
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TACTGGATATATTTTATTGCA CTGTT TT T AACT 
ACT AAAATGTCCGGATT ATAT ATT AAAAAG CCTCAAGGTGATGGA^ 
T ATG AT ATTCCAGGTGGGGACTGAGATGCTTGGAGAACCTGG AGGC CAGGT AT 
TTTCTTGATTTGTAGTCTGCTATGCAGTTCTTCTGAATC^ 
5 ATCATATCACCTGTTCATGTAAGTCATTTCATTATTAGGCATCCTAT^ 
ACTGTTTTTAACITATTTTGAAAGAGAGAAAGCTCATO 

ATT AT AT ATT AAAAAGCCTCAAGGTGATGGAAGGACAGCC CAGT CC CTGGG CTGGAAT AT CATTGTTGT AGATATGATATTCCAGG 

TGGGGACTGACATGCTTGGAGAACCTGGAGGCCAGGTATGTAAGATATGCAACTCAG 

ACCCAACAATTAGAGCTGATAAGTCTTTCAGATCT 
1 0 GTCTGTCTGTCTCTCTCTCTAAGATTGAGTGTCACTATATTGGTCTGTC^ 

TGGTCTTAAACTCACAGAAATGTTCCTCCTCCTGCATTCTCCAACTT^ 

CTGAGACTACACCriTCriTCAACTCTTTATGAT 

TGCATTCGAGACAGACAGAGAGACAGACAGAGACAAAAAT 

GATACAGAGGCAGTGGAGGAGCACTGCTTAATGGCTTGCTC^ 
1 5 AGTACCACAGTGAACTGGGCCCTCCCACATCAATCAAGAATATGTACTATAGGCT^ 

CGCAGTTGAGGTTCCTCTTCCAGAATGACTCTAGCTCTAGCTTGTGT^ 

GTGGTGGCACTAGG AACCAAACAAGTCTTGGAAAAGCAATAAGTGCTCCT CCAAG CCATCCCACCAGCCCCAAATTAAAG A 

GAAGAAAAGGTCTGTGTGTTTGTGTGTX3TGTGTGTGTGTGTGTGTGTGTGTG 
CATTGACACATACTTGAAGGTCAAAAGGATTACTTGACT 
2 0 CCAAGTCATCAAAGTTTTGCAAAGAGTGACTTTACG TGCAG AOT CTT CTGCTT CCAATGGT TTTGGG C CA 

AGGCGTTTATTCAGAAGGATGATGAGTCCACCCAAGGAAGAGATAGGTGAGGCT^ 
TTAATGATTTGATACTTATTTTCAGTCTACTCAGCTGACTTAATT^ 
AACAGGTCACCACCAGAACTGTACTCTTCCATCCTGTTCrc 
CCTACAGGGAAGCATTTGAAATGTTTGATTGTCATGAATTGGAGCTGCT 

2 5 ATGGCAAGAGTTACCCACTTCAAAGTGCTGCTTACXSGTCTCTATGA^ 

ATCATCCATCTATTTTTGAGGCAAGGTTTTACTTTATAGCCTAGTCTGACT^ 

CTGGG ATTGGAGGC CTATGC CACCATAT CAG ACTT CTGAATTT ATT AACTTTT AAAAATGATGTTG TG T CTGTGGGT ACATTACTG 

TGTGTGGGTGGGGTC^VAGCCATGATATGCATGTGGAAGTCAGACGACIAGCGTTGTAGAAT^ 

GGTCCCAGAGATTGAACTCAGGTCCCCAGACriTTCACAGCAAGAGCCIT^ 

3 0 T ATTTATTCTTTAAAGTTTAGTTCTGGGGCAGTG A CAAGCCTGACAAC CTGAGTTC 

AGTCTCTGGGTCCAGAAAGAGAAAACAGACTCTGGAAATGTX3TCCT 

ATAAATAAACAAATGTAAATAATGATAGTAAAATAGTAACrATTTAGTTCTAGCCAGTCATGA 

ACTCAG AATG CAGAGG CT AGCAG AT CT CTGAGTTTTGAGGCTGG CATGGTTT ACAT AACAAGTTCT AG AGTCAGG ACT ACAAAA 
TGAAATCCCGTTTTAAAAAAGCAAAAACAACAACAAAAACACTTTGATTATCT 

3 5 CTCCCCTTCCCCTTCTCCCGCCCCACATTCTTCCTGTTTGCT 

AACTATCTTGTAGTGTTCTTGATTAAGACACrATACCrGTACCTTACT 

TTGGTCCATGGTATATGTAATT' r i "ri"l"r<JTl"l ' l I 'AAATAATGTATCCTACAAATAAGTCATATATATATGCATATATACATATATA 

TATACATACATACATACATACATATATATATATATATAAAGAGTACACTGTAGCTG 

TCTCATTACAGATC^TTGTGAATCACCATGTGGTTTCTGGGATTT 

4 0 CTGAGCCATCTCTCCAGCCCCCGAGGTGTAGCTCTTAAAAGCCCTGCT 

GTGGGAAATAATGAAGGACTCACTTTACACACACACrrGTATTAT^ 

ATTTAAGGCTGGGATATAGTAGTGCATGCCTTCAGTCCCAACACrTGGGAGGTAAAGAAAGGC^ 

ACATGATCTAGATAGTGAGTTCCAGGACAGCCAGGGCTATATTGAGATACCCTA 

CAAATAAATAAATAAACAAAACCCCCAAAACAAACCACAGAATCAACTTAAG^ 

4 5 AGTCCTCCTGGAGCAGGCTTGGTGGCAGGAGCTTGAGGGAACTGGTCTCA 

CTGGAACTCAGCITTCTTTCTCCTlTrT 

ATGCTTAGAAGAAGACAGCAAGCACATTTTTAAAAAC^ 

AAGCTAGCTGGGTGGGATCTGGCrCTGAGATCACCATTCCTGCT^ 

TTCAGGAGTCTGCCTGCCTTTGTAAGCACAAATGAAAAAAACCGTACCTGGGTGGGATCT^ 

5 0 TGCTTTATCTCCTTCCTTAATATCTTTCTGACAAGCTCACTCATAGTGCAGCTGCCC 

CATATAGCCAATATTGCATGCTTATATTGTGCIATAGGTGAG 

ATCCTGAAGGTCACAGCA(TTTATCGTTTTGCTAAGGACATAGACATATCCTCACCITCCAGACTAGTGCT 

GAGTC^TTAATCTTTGCCAGGAGAACACCrrCCTGAGGAAGGAGCATAAAGAAATTATGTACATTATTATG 

TGGCAACCCATGAACACTCCTGGAGGCCCATGCCCTGCTGGCTCrGTTCCAGGGGCAAACCATGTCACTCT 

5 5 TGCCAGACTCAGCATCATAGAGCTGCAGAAATCCTTTAATACCCAGAAATGTTTTACAGTC 

TTAGTCAGATTGAGGAATTGAGTGCITAATTCAGTTGTATTTCAGTTTTTAACI^ 
TGAGCTCTATtlAATGTGTTATGCATGTAACACATGAATAAAGATTGATATGTCAAAAAACCT 
ATGATCCGACAGACTAACTTCTCCACK^TCATACAGCTAAGTGACAAAACT 
CAAATACTAAGCTITTTCTCAACAAGTCTTGAGAGGTAG 

6 0 AACTGGCCTGGGGCACCCTGCCTCTTGACCATATGTTCATGTTTCTCTGATACT 

GCTGTGGTTTATTTGTGTAATCAACTGTAAAAAACTGATTCTTCTTAATGGTT^ 
CCAGCTAACATCrrCTGCAAAAAGAGTATCTCAACTTGAGAATATACCCATTATGTAGATC^ 
TATTGTTGGAGCCATAATATTrAAAAAGAAATTTTTTGCTCTTTGAATT 
GAAAATATATTTTGCTATTAGGTTAAATGTCAGTCATATTTAATGGAAAATCATTG 

6 5 GTTTTAATGTTTTATGTGTAAGGTCTGTCTTTC 

GTGTAGACCTGCCTATCCTGTAGCTTTGTAGACCAGGCTGACCrCGAACTCCGAGATCACCr 
TTGAAGGTGTGCGCCGCCATGCTTGACnTCTGTTTACATTTTTTACrGTGATCTGAT 
CrGGCGGAAGGGGCTTCTTTTACATTCACATGAGTGGCGGGCTTATGAGGA 
AGTCTCAGTCATCTAAGTGCAGAGCTGGAGGACTACACAAAATGATTCTTCAGTCCCCC^ 

7 0 CTTGG CGCATCITTGGTG CATT ACAAGTTTGT AAGAGT ATT AAATCT ATC CAC AG A^ CTTTTTG CTG C 

TTATTGTAGTGTGCITTAATGCACrTGCCTCCCTTGTGTACCTT^ 

AGTATAAAATATAATCCTGACCAGATCAAAAGCCACTTCATAACAGTAAAAGATTGAATTA^ 
TTACATGTGGTCTCTGCAATAAGATGGCCATAAAACACATTACAAAAAAATATACATATACACA 
ATTCCAAAAGCGAGCTTAATGAAGATTCAGTAAAGATTGAGAGAAGGCCCTC^ 
75 TTT ATTT ATTG CTT AATTTGG AAAG CTG CTGAGGGAAG AJVTGTTC A.GGGAT AAGG AAT AGAATGT ATG AATTGTGTGGGGT AAG AG 
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ATGAAGGGACCAGTGCCTGATGAGAGAGAATGAAACAATCTTG^ 

TAAAGTGGTGAGATGTGTTCTAGGGATTCCCTTCTGACCAGACAATGGAGGAA 

CAATTGATACCrrTCTAAGTATTTTAGAGAGAGAAATGATAAT 

5 AGTAGTTGCTTTGTCAGTTCCTAGCACCCACATCAGATGGT^ 

AGGCACACGTACATATGGATAAAAAGAATAAACTCTAAAAATTTAGTCACATAGCT 
GAATGCITGGTGCACAAGGAGGCCAGAAGAGGGGGTTGGATT 

CTGG AATCAAACCCAGGTCTTCTAGAAGAG CAGCTGATG CTCTTAACTG CT AAG CAAACTCTCAAAACAAAATCTTTTTTTCAGAC 
AAGATTTGGAAGTCAGATGTGGTGGCACATGCCTTTCATACTAGCACACAAG 
1 0 GTCTGCCTGATCTATATATACAGCAAGTTCCAAGCCAGTCAAAGCTACAAACAG 
ATAAGAACCAAAGAAGGGCCAGAAAATTGAATATAGTTGTTTTAGATATTTGCT 
TTTTGGGTGTTGTTGGTTATATAAAGCATATTGTTTT^ 
TATTTTGACTTAGGAAACTTTGATAGAGATTTCACTGAATCTGTAT 

TACCAGTCCATGGAGATCTrTCCACCrAATTGCTTCAA CT r Cl 11 1 1 f CI 1 UCi ITTT T TTTTTl i TTCTTTI 1 1 GAG ACAGGGT 
1 5 TTCTCTGTATAGCCCTCGCTGTTCTGGAACTCA 

AAGTGCTGGGATTAAAGACGTGCACCACCACTGCCCAGCTCAACl M r 
TTCTTTGAATGTCCTrCATTGTCTGTTGTAGTATGCTA 

TTGGTTAGAGGGTAGGTGGTCTGGCTAGGCAAGCACTTTACTCCAGAGCTATATCACCCC^ 
TTTCTTTTTAACCTTACACTTACTTACCTATGGTGTATGTGGTGGTTGTG^ 

AGTGGCTTCTCTTCTTGCACTGTGTGATCTACAGATTGAGGCATTCCTCACGAC 
TGACTCAAACAGTTCATTTTAGACATATTTAAACACAGGACAATTTTGGAAACAGT^ 
AGAGTGTGACTATATTGAAACTCTTGCAAAACOTGTCTGGGACCTCT 
AAGAGTCTAGGAAGAATCTGGATAAATACAACAGTCTAGGGGATTCGGGCCCT^ 

2 5 TCCATTCCCAGCTTTCTGGGCAGAAACAGGTATAGATCCCTATCTCAGGAGGCCCTTGT 

TAGAAGGGTTCAACCTTCATGTGGGTACATTGTAAGACAGCT 
TTTTGGTGGTTTTGTTTTTCAAGACACGGTTTCTCTGTA 

CACTCTGTAGACCAGGCTGGCCTTAAACTTACTGAGTTCCATCTGCCTCTGCCTCCC^ 
ACCACCTGGCCAGTATCTTCTTTCATTAAAAAAATAATAATAATAAAAGTCTACTCA 

3 0 AG CACTGG CTG CTCTT CT AGAGGTCCTGAGTTCAATTCC CAG CGAC C ACTTGG T AG CT C ACAACCATTTG T AAT G AG ATCTG ATG C 

CAAGTTT AT ACTGAAG ATT AAC CAT CACAAT AACAAACT CAAAAAAAAAATTG CTTT CTGGTAGGGAG AT C CAAATTTG ATGG AAG 

ATTAAAAATAGATTTTCATGATTTATTCTGTATATGTATGTGTGTGTGTGTTTTAATTATATTTAAT^ 

GTATAGAAAATATAATGTTTGTATTTCTGAGTCTGGCTTATTTTATTTAATATAGTAATTGCTAGTTTTGAT 

3 5 ACTC^GGTGTTTGAATCTCITATAGATAGGGATGCAATACTAGATGGAAAAA 

CAATTGTCT AAAGCCCCGAGAACACTGG CAGTTACTG CAGTCTGTTGTCTACATTAAAATAGTTTTAATCCCT ATC CTAAGTG ATC 
TTTATTTAGATTCTTTTACTTTTTTAGGTATGGAATATCAAGC^ 
CAAATTTGGTGGAGAGCATAACCCACCATCAATATACCTGGAGGTAAGT^ 
GTATTAAAATGTTTTATAGTCATAGTTTTGGTTATTTAGAGAGCAGGATTGGAAACATTGG 

4 0 TTTTTTTCCTTAGTCITGTTTTCATT CACTTGTTG CAATCTG CIATGTATT AGTTTTATTTGGAAAATGTCAAAACTGAG AGGAATG 

TCTCCTATAAACTTCAAAATATTGCCTGATCGGCTGGTGAGATG CAC CCG ACTGCTCTTC CAAAGGTCCA 

GAGTTCAAATCCCAGCAACCACATGGTGGCTC^TAACCATCCGTAACGAGATCTGAT 
ACAGTGTACITACATATAATAATAAATAAATCTTTAAAAAAAAAAAAATTGCCTC 
AGAAAAAGCACACTGTTTTGGTAGTCAGGAGAATGAGTTACC^ 

4 5 T AAGTTTCCCAG AGTTGTTTTCTCCTTTCrTAT AG ATAGTTT ATAATCTTGGAATGG CCCAGTACTG CTC AGTCTTGCTGCTT AGT 

GATGCCTTGAGATTTGTATCATCTGCCTTGAGTGAGATTATAAATCTGTAGTTAGCCATTGTTCTTAAT 

ACTAGGTAGTCATAGATAGGCCTTCTTGCCCTTAGTTTC 

CAAGACTGCTTGCCACAGCTTTCTACTGCCTTTCTCCAGTAACGCITGCTCAG 

CAGGTCTGAGACTTTG AGGT CAACCTGGGCTAC ATAGT AAGTTCTAATAG AG ACACCTAAG CTACACAATGAG ACTCTGTCTCAAA 

5 0 AAAAATAAAATAAAATAAAATAAAACAAAGACCAAGTAAGCATTTGGCTTCAAAT 

TTGTAACTAAAATTTCTAATTTACCAATCTACTTTTTTTTTTAAG 
GACACACCAGAAGAGGGCATCAGATCCCAATACAGATGGTTGTGAGCCAC 
AAGATCAGTCAGTGCTCTTAACTGCTGAGCCATCTCTCGAGCCCCTC^ 
TGAGAAAGTTAAGTTTGTGTTTATAAAACAAACCTTGGAACTTTGGAAACACCAACT 

5 5 CTTATTCTGGATCACACTGCTGCrTATGCTATAAAATGTAGGACTAGAATTCTATCT 

CAAAGAAGTAAGAAGTAGTCTTAGTATAGTCTTAATGAAAGTGCrCATTTTTGTTT 

CTCTGTCTATTGGG CACTTTCCCACATTTGAAT CAATGAACATGTTATTTAT AGTCATGATGC ATTGTTTCAGCCAGACAC^ 

CTTCAGACATTTGAATACTTAAAAATTGATTAGTAAAAAGAAAAACATTTGTGCTTAT^ 

ATGCTTTTATTAATACTTACAATTACTTTTAGCAAATTC 

6 0 AGTAACTTACTTCAAGTGGTAGGTAACTATTGCTGAAGGGAGAAGGCAGATATCTAGCTTACAAATCr 

TTATTCTACAGCTTTGCTATCCTGTTGTGTCTCTGCCCTCGTGCTAAACATTTCTAATTTCTT^ 
TGTGGACTTTTTTTTCAAATGTATTTAATTGGATTGGTTTCTTTTTACAACT 
GTGTAACATCAAAGCAAAGTTTAATGGGTTTTAAGTATGTGACAA^ 
GAATATTTATGAAACATTTAGAAAAAAATTAGGATAGTA1TATATGTAATAGTTTAAG 
65 GTTCCTGAGATATGCTAAGGTTCTACAGATAGGGGTACTCACTGAATACCCTGTAATATTTATACT 
AAAAAAATGTGCCGGGTGTGGTAGTGCACACCTTTAATCCCAGCACTC^ 
GTCTC^TCTACAGAGTGAGTTCCAC^ACAGCCAGGGCTATACAGAGAAATCCTGTCT 
AGCCAAAAATCTGTATCTTCAAACAGATTGTGCTTCAGAGCAGGGAGGGACTTATA^ 
TATTAATATTAATTAGCATTTGATACACC^TGAAGAAGACTGACGGATGACAGGATGCT^ 

7 0 CAAGGGAAAAGACTCAGGGACTAAACTTTGAAGGCCTTCACTC^ 

CTTCAG GGTTTT AGGTT CACTCTTT AAG AACAG TTTTTT CCT AG AAT AAAAG ATACATT G AATTT AC CACTTT AAAAAAT AAA 
TTAGGGTTGGAGAGATAGCTCTGCAGTTAAGAGCACTGGCTGCrCTTCCAG 
TTACCATCTGTAATGGATACCCTCTTTTGATGTGTCTGAAGACAGTG 
GAATAAATAATTTTTTAGCAAATTCATTTTATTTTGAAA 
75 CCCCCGCCCT^GTGGCGCACACCTTTAAT^CCAGCACTTC 
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ACAGAGTGAGTTCGAGGACAGCCAC-GGCTATACAGAGAAACCCTGTCT 

GGACrcATCTTTTTACTTTACATATTAACTACCAGATCCACA'lT'rr LT I'CTGATTATGATCTGTAATAACAACTACATACTCCTGT 
GTCTGTGAATTTGACTGTTTTAAAAGAATCACATCAITTGTC^ 
TACATGTTGCAACATGGATAACACGTGTGTAAAAAGTCCCTTGCTTGCT 
5 GCTGCTTGAATTGCAAAAAATGATGAATGGCCATGGGTG 

AGGTTGGCCTCAAATTCAGAAATCCACCltSCCTCTGCCTCCCG 
AACAAGTTTTAAAATTATATTTGTAAACTTTCTGCACACATGCATGTG 
AATAAACAACTGGGAGGAGT(^GTTCTCAGGTCATCAGGGTTGGTGG 
1 0 TGATTTGTAATGTAGGCTTATCTTCTGCATAATTAACTAGTGATTACM 

GTGAGC^GTCGCATTGTTGATCCTGAGCAAAGTCGGGTCCTTAGGCTTAAACA^ 

CAGAAT CATACAGTGATTGTTTCATT ATTT AAAGGGACTCACACAC CTTTTT AT ACTGTGCCT ACACCATGTTACATTCTT ATT AT 
CAACGATTGCACAAGGATTCTTCTCTCTCCGTAGCCTCACTGACACT^ 
TGAATGTACAGTGATATTCTATTGTGGCITTGCTTTGCATTTCTCCAGTGACT 
1 5 TATTTGGACATCCTCTTTGGAGAAATGTCTGTTAGCTCTTTAATTT^ 

TT AAAG TT AATT AATTT ATTTTTTTCAT CATT GATTCATTCAATTTT AAAGCATCT CGCACGCACTTCT AT AGAGAAGAT ATT CAA 
GTT CATTGAGT AT AAGCAGGAAAATT AAAGAAACAG AAAAATCATT CATTGAT ATG TGTGGG CCAAAGG ACAACTTT AAAGAATTT 
TTCTCTTTCCAACCATGGTGTTAAAACCTAGGTTGTCAGG 

GACAAAATCCAGGACCAAAGCAAAACCAAACCCTAACAGTATAAAATTCTATATGA 
2 0 TGGCTCAGTCCCACCAGCTTTGCTGCCTGTGACATACATCTCCCTCTTGGGCTGATTCCACT 
G^TAGGTATCTCACAGCTCTGACATTGACAACATATTGGTCT 
AATTTATCTTGTTTGCTAGTCTCTGGCTCACTCATTAACTACCACAATCCT 
GTCCAGGATTTTTCTGTTTCTTTAATTTTCCTGCTTATACTCAATC 
AAATTGTTCACATATCTTTGTTTTATGGGCGTGTGCACTT 

2 5 AAACAGGACACGCTGAAGGAGTCAGCGCTGTCCTTTCATCACGTGGATCCCC^ 

GCACCTCTGAGTGCTGAGCGATCTTGCTAGCTTGAGATGCTTTTCTCCT 
GCAAAGAGAAAAATACGGAGCCAGAACTTAGTATTCTCTTCAAAGGCAGACCTAGTC 
GAGATTCAGATCAGCCAGGACTACATAATTAAAATCCrrCCCTCAAAAAAA 
TTATTATTTATAGCATTCTTATAATGCATGGCAGTCTTTTTCAT 

3 0 ATTTCTGCAATGAAAAGAATACCAGCATGTAGATCAGTTCATTGTATATATCTGTG 

GATTAAATTCAGTAAGAATAAGCTTAATACTTACTCTTTTTGTTTGT^ 

GTATAGCCCTGGCTGTCCTGGAACTCTCTCTGTAGACCAGGCTGGCCrCGAACTCAGAAATTCGCCT 

GAGATTAAAGCTGTGCACCACCACGCCCGGCAATACTTATT 

CTGAAGGAAGCAAATTCTAAGCTAATTCATGCAATTTAGCAACTCT 

3 5 TAAGTGAGAGTGTTTAGCT AGACT^CACGACATTCGCGGTTCAATTACTGGGACAGAAAGAGGGAGGGATGAACGTAATC 

GTGCAGGAGATCTGTGAACTGTGGAAGCCATTACCATAAACTCTCAAACAGTATTACTT^ 
TCCATTAGCATCTGTTCCTTCCAGTCCTCGGAATGACTGATCTTTTGATTGATGATAATGTTGGAATA 

GAAAG AGT AC AGT AT AG AC CAAAGT CTGGGAAG ATG TT AG AAAT ATTCATGTTG ACCTG AAACCT ATT CGTG TG C T AGTGAGATGT 
AAAGTCAAGGTGGAGTAGTGTTCATTCAAGGTTGTGAACGTCCCAGAGTCAGACTCTGGCATCATT^ 
40 TGGTATGAATATCTCTGGGTCTTGAAGACAGAATGGA 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TCCTGAATCACAGCCCGTTGAGGGAAGCCAAGGCIAGGAACTCAAGTGAACAAGAA^ 

CATGAAGGAGTGCTGCTT ACTGGCTTGTTCCTC^TGGC CTGCTCAG CCTGCTTCCTTTTATTTTTCCCCCTTTTATT AAAAATAGA 
TTT ATTTTTCAT AT AATCTATC CTGAT AACAGTCAC C C CGCCCT CT ACT C CT CCCAG AT CCT CC C C ACTTTCCCC CT CATCTG AAG 

4 5 CCACTCCCTTTCTGTCTCTCAGTAGAAAACATATAGGCTTCTCAAGGAATA^ 

AAAATTTTAAAAAAAAAACAAAACAGACAGACAGAAAGAAAAGAGCCCCC^ 
CCATTTATTCGAACACTAAACCCGAAGCCATGATATATATGCAAAGGATCAGTACGGTAAAAA 
AATACAAATTTTTAAAAAITAAGTGGTGAGGGCACTGTTGCTATAGAGAC^ 
TTGTTTTGCTTTTGTTTTCTGTTGCTCATCTACTGCCAGGCTTGCAGCATATCCri^ 

5 0 C CTITGAAAAAACAAAATTTT CATTTTCAG CCTGCT CTTTTT AT AG CATCCAG TACCAAGTCAGGGGG TGG CAC CACCT ACAATGGG 

CTGGGCCTTGTGCA.GG AAAT ATT AAAAAAGAACTGGCTCGTTCCCGAGATTGTGCCTG 

ATCCCCCCTGGGCCCTCTCTCACCTCCCCCTCCTCTTCTTCCGTCCAGCCCGGAAGTCTCCCCTACTTACCCAGTGATTTTCT 
TTACTCATTAGGGGATAGGTGACAAG^GTCACCAGTATACAAGACTCCATTCTCCCTGGAGAGTGGAATTAGCATCAAAATAC^ 
G CAACC C CTGGG CT AT CCAC CACCG GGCACTTCTAT ATCT AT T AAGG AAATGCCCT AC^ 

5 5 GGCATTTTCTCAGTTGGGATTCCCTCCTCTCAGATGACTGGCTTCT 

AGCCGAAGTAACTTGTTAAACrGAACTGTGAGAATGAAAAAAACTATTGATAATGTG 
AGAAATTGGTATATTTTCAAAATTTAACATTAATACTTTG 

TTTATAAGGCTAATTTTATATAATAACCATGrrTGCCTAGGCCTCAGTAATAAGAATATT^ 
CCCAGGCATTTCTAAACCTTTAAGGGTAAGAATTGTCTGCCTTGTCTGTTTTTGTAGTCTC^ 

6 0 TAAACAACGAATATTTATTGAATGAATATAGACTCAATAAATGAAGCATATTGAGAAAG 

TGAAAACTAAACAGAAAGATTTGAATTGATTTGATGGTAACTCTCTATTCT 
GGGAATTTTTCCAGAGTTCAGGCTCCTTACCTCAGTACITTCACT 
AAAGTTGTTCAGTAAGATAAATATTCAGTACACCAGAGTGCTGTAGCCTTTATTGAA^ 
TCAC^TAATTCCTGCGGTATAATTTCTGCTTATGAAATACAGATTTCCTAGTG 

6 5 TACCTGAC^TTCAGTTTAGAAATCrrGTGGGGTTCCCCCCACCCAGTACCTTAAAATATGTTCTTTG 

AGGCCTCTTTTTAAAGTTTTTATTGTGTGTTGTTTTATGTTGTACTCATGAGGCT 
ACAGCTTTCAGAAGTTAGTTCTTTCCTTTCATTGTGGGTTCTAGG^ 
CCTGCTGAGCCATCTTGCTAGCCCAGTAAACTTCTTGAGAAAAAGGAAA 
TCATAAATGTTTCTAGTACTGCATCTGTTTGGGATGTTAGGGAACA 

7 0 CTGAAAGAACTGTGGATATGTATATTCCAATCAATGCCATCTCTTTACTTTC^ 

ATATGGTAAGCCATTTTACTTTATAGCTTTTGTCAAAT 

TGCAGGGTATCAAATCAATTTTTACITGCTTAAGCAGCATTTGATGG 

ATATCCAGGATCATAGCTGTTACAGGAGACAGTGATGGCATCCTTGGTTTGTG 

T ATG CTG CTG CTGG AAACT ATGTGAAAGT CCATGCATG AT CT ATG CTGC CATTGACTG CTATGGG C AAGG AAG CTTCTG CAGTGGT 
75 AT CAATGACTC CAG ACT C AT ^ A CTG AG AATGAG AGATGTTGAAGG CTTCTGTGACAC TC CCT AC CC AC CCCCCTCCCCCC CAAAAA 
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AG CAGTCCAGACAAAT AG CT ATTG AAAAGACTG CTT ACAAATTGTG AT AAAGATGGTGAAGTGT AG CT CATC ACAGTTGAT AG CTT 
CTCTCAGTGG<yrrTGGGAAAGXaACTCTGT C ' r ' rrri TAGGCAGCTGGCCAG CAG AGTTTGACCATGCrTCAGTGAGTACATGAGCAA 
CACAAATTGGACTrGGATTTTTCTITCT^ 
TCTTATAAGGACAACATTTAGTTGGGGCTGGCTTA^ 
5 CATCCAGGCAGGCATGGTGCAGGAGCTGAGAGTTCTATGTCT^ 

GATGAGGGTCTTAAAATCTCTCTAACAAGGCCACACCrCCAAATAGTC 
CACTCCCTGGCCCCCATAGGCITGTTCAAACACC^ 
TCCAACTTCCAAAGTCCCCATAATCTCGAGC^GCATCAACAATC 
ACTGTAATCCCCAAATCAATACAGGAAACCATCTGGGTAAACTCaiAAT^ 

1 0 ATCTCCAACTCCTTTTTCATCATTCTTGACTGCA 

GAAATATCCTCTAGCTCTGGGGTCTCCAATGAGCAATGTTCAGGTCCCATGTG 

CACCAGCAAGCTAGATC^CCTTCAGCAAAGAGAACAG 
CTGCATCAATGGACACCGTTACATCCTCCTCCTCrrCTAGCCTTTCA 
GATGCATCACGCAACAACCCCAAGTCACCACAGAAACCTATOSTTAGAGTC^ 
1 5 GTATTTATCATCATCCTTTTGCTTTCTTTGTGnTGAA 

TAAGTTAGGGTAGAAAAGACATGATTATTTTATGTCAAGTCTATATACAGAGTTC 

GTTTGTTTGTTTTATGTATATGAGTACACTATTGCTCT 

CCACCATGTGGTTGCTGGATATTGAACTOVGGACCTCAGGAGGAA 

CTTTTT AATTTTT ATTTGTGT ATTTGTG TACATGGCTTCACT ATGT AG CCT CGG CTGTGGAACTC ACT ACAT AGAT C AGG CTGGC C 
2 0 TCTGACTAACAGCGACCGGCCTGCCn'CTGCCrrCrTCAAGTGATGGGATGTTAGCAT 
AATATAAGTACTAGTTAAACATCAGTCAGGAGCCAGGCAGTGGTGGCACACGCAT^ 
GACTGAAGAGGTGGCTCAGCAGTTCAGAGCACnt3GCTGCTCCT 
ACAACCATCTGTAATAACATTTTGAGGAAATTTGGTGCCCT 

AACAGG CAGGCAAAGT ACG CATAAG AAT AAAATAAAAACGAATGCATCmTTl' 1' AAAGACG AGATTAG AATTTCCAGATTTAAGC 

2 5 TTGTATCAGGTAAGATTCTTTCCrCTGTGTAATCrCCAGCCTAAATTACCT^ 

TGTGTATGTAACTCCTAAAAATTGTCACAAAATTGTTAGAAAC 

TCCTCAAGTAAAGCACATATTGTCAAGTTCAGCCTTCATAATTAACGTTCCT 

TGTGTCTGTGTGTGTGTGTGTCTGTGTGTGTGTGTGC^ 

GACTGGAGTTATAGGTAGTTAGTTTTGAGTCACCGGACAAGAAAGATCCTGGGGACTGAACTCT 

3 0 GTTCTTAACCACTGGTCCATCTCTCCAACTCTATATGATTGCT 

TAGTTTCTCTTTGTTTTTGCTGTGAGACAACAGAAATTCAA^ 
TAAAAAATGTTACTATAAAAGCATTTTGCCCTAGAGTCTAGAATTCCTGTC 

CTACCCACATATTTTTGGATTTTG ACCTTACTTAGAAAATACTGTTT^ IT AATCGAAGCA 

TTTGGGAAGCATTAGCAAGCAGAACTCTCTGTGTTCAAGACC^ 

3 5 CTGTCCGGGAACAAACAACAACCAAAAAAAAAAAGGAAAGAAGAAAATACT 

TTACACACT TT CT T CCTGCrCTTTCTTCCAGGAAGAGTTGAT^ 

AGCTCAATTTCTAAAACAAATCCAGTTTTGTCCTCTTCAAAAACCAGGA 

TCTGAAGCTTAAAAAGTGAGTGGACACCCACCCCCATCTCIT^ 

CAGATGACTTGGTGGTTAGAAGTACTGGCTrGCCTTTGCIAGAGAAGCCCGGTTCTGTTCT 

4 0 TCTGAAACTCCAGTTCCAAGGGACTCAATACTGTCTCrGATCTCTGTGGGCACTC 

TACATGCAGGCAGAACACACAGAGACAClAAGATGAAAGGAGTTTAGGGGATTGGCrr 
GTTCACTTTTAAAAATAATTAGGATAAGACrTTGTTATGTGTATACTTTAATAC^ 
AAAGTTAAAATATCTCTAACTAAGATGACCCAACCATCTATTTT^ 
GTTCTATTTCTCATAACTCCAAGTTCCTAATGTCCGTCAACAAGTAAATTG 

AAAAGTTTTTTGTTGTTGTTGTTTTTGTTTGTTTTTTTT^ 
GGGCTTTGGAGATGGCrCAGGAGTTAAGAGCACTGTCCTCT^ 

CAACTATCTGTAACrCCTGTTCTAGGGGATCCAACTCCCTCATACATGCAGGCAAA^ 
AAAAAAACTGT^TATATCTATATATAAAGAACATAGTTTGATACT^ 

5 0 CAAGAAAAATATGCTGTAGAAAAGAAGGTAGACAAAAATAGATGCATACTTTGTAATTCCATTTATATAA^ 

CITGGTTATATATGGTGGTGAATACATATAATACTCTCACTTAC^AGATGGTGGCAAGAGAATTAGGG 
ACATAGTTAAGTCTTTAAAAACAAAGGCAAATTTTTCTATAGTGACAGAAAGATGTT^ 
GGAAGGAGGTATTACATTGAAGTACAAGGAAACTTGCAGAAAATTATGCATATGTTCATTATTGTAAATAT^ 
TGGGATTTCAATTTCTATAATGCAACACTATGACGAAAGCAAGTTC 

5 5 GTTCATCCTCAAAGAGGTCAAGGCAGTAACTTAAACGGTAAGAACCT 

CTTATAGGCTCGCTCCCCATGACITATTTAACGTGCrrTTATTACAGAACTCA 
TACTGAGTCCTCCTAAGTCAATCACTACATAAGAAAATGCCrrACAGCCCAGTCT^ 

TGT C AGGTG ACT CT AG CTTGTG TG ACAG AAAAAGT AG C CAG AACAGG AT ATGTGTGGTT AT GT AC GGTGCGT ATT AAACT G GGTTC 
TCCACATCACTGCAGTTTATCCCTTACCTCATTCTTG<5TTTTCATTGTGAACCTT 

6 0 TTGTTTTTATTTTCTCTTATTTTATTATTACTATTTTAGATGCTTG 

GGT AGGTGG AG AAGTGGG AAGG AGCTGG AGG AGG GGAACTG AAAT C AG AAT ATATT AT ATG AAAAAATTATTTT CAAT AAAAAACA 
AAGAATGCTGTTAGTCATAGAAATATATGAAATAGATTTTAAGGAAAACAATC 
CACCAGACAATTAGAATACTAAGTATTGTATTTAAACAGTTCTTTGTCAGTT^ 
CTGTATTCTGTGCTGATCTTGTACTTAAGATCTTTCTACCT^ 
6 5 agCCTAGATGTCATTCTTAATATATTACAGGTAGGTCCTGTAGAGATTTAAAAAAATATA 
CTATATTTTTAGCAATAAGGAATATCCAAAAATGCAAATTCTCTGAACA 

CATGGTTTAGAATTGTAGTTACTCTCAGCATAGACTAAAAAATTCTAAGGGAAATTAGAAAGTA 

TGCATAAAATTGTAGAATATACAAAAAGCAATATAAAATTTGCCTTTTAATAATTTATATGAG 

TGAGAAATAAGATCAAAGTAATGTTAACAATCATAATAAAGCCCTAACT^ 

70 AAGGTCTGAGAGAGTAGCTCAGGAGATAAAGCATTTGTTGCTCAAGCCTGAG 

AAGGATAACCCGCATTGTGGCTACCTGTATTCTCATTACCCAGAAGATAGAATATGGACCCCT 
AGCAGGAATCCTCGTGCTCTCTCTTTTGCAAGTTACCTAGCTTCAAAAAATCAT 
TTGGGCCTCTATACAAACATGAGCCCATGAC^TGAACATACTCATATACTGACACCATACAC^ 
ACACACACACACACACACACACACTATGACTAGGGCTGAGGCAGTAACTCGGAGGGAGTACTTGCTA 

75 TCCAGTAGTACTAGAAGTAAATAAAAGTrCATAJ^CTTGA^^ 
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TATAGCTCAGTGCTAGAACTTTTACCTAGOVTGTGTGAGCT 

CACACATTTTCAAAAGCAAATAAAAAAAGAAATAGAAAATTGT AA 

GGGCAGAGATTCAAGGGAGCAAGAAGX3<lAAACn3GTGATGG 

ATAAATATCTTTTTTAAAGATGGAAGAAGCCTTGTAGCTACTGAAAG^ 
5 TGTGACATTTGCATCATGACAGGCATATGTTTTATTTGATATTAACATTC 

GCAAGTAGGTAGACATGAGAAAGTATATACCATTCAAGAAAGAATGTTGTA^ 

CAGTAAATTATAAAAAGATAAATATCTAAGGAAAATTAATAA 

AAGCTTKyiGTTCTCCAAGGAACAAGAACTTATTGTAT 

CTATCTGACTGTTAGATGTTATTGATTGTCTAG 
1 0 CTCCACTTCTCACCTGGTTTGGGATGTGTGGGTAGGTGGCTTTATC 

GGCCTGAGGACATC CTG AGGTGTG TCAGGAGT CATTT ACCTTC C'riTl'l' G AG AT AG 

AGGGTAAGCTGGTTGCCCAGTAAGCCCTCAAGATCCACTTTTCCCrc 

TTGCTTGATTATGTAGACCAGACTGGCCTAAAGTCACAGAGATC 

ACCACTGTTATTCCTGACTC^CATCTAACTTC 
1 5 TTT ACTAACTGAGCT CTCTCCTCAG CC CTTeAAGlTTlTTG AAAT AGGAT CTT ACT ATGTAGTGTTTT ATTCAGGGCAATCCTTCT 

GTCTCAACCTCTAAGTATTAGAATTATAGGCATGAGCCACCATGCCT 

TTTTTTTATGGCTATAATAAAGACCATGGCCAAAAGC^CTTG 

CAGGAGCTCAGGGAGGAACCTGGAGGGAAGAGCTGCAGCAGAGGCCATG^ 

CAATCTACTTTCTTAATCTCATCTAGGACriTGCCTGATG 
2 0 AAG AAAATG CTCGCGGTTCAAGG C^TTGC CTCAGTT AAT Al"riTTl^ CTTCTCAAATAGTTCTAT AT CAAG TTGTCAAGAAATCAC 

CCAGCATGCTGCTGCCAAAAGGATTTCTCTGAACTCCACATGT^ 

ATATATAAATAAGGGAATGTAATTTAAAAATGAAATCATGCCCTACTTGTATTTATGAAGTAAGAACT 

AATAAGATAAAGTTGTTTTGATTTTAGCCATTTTGACGGTGAGGAAGTAATAGTAGAAAC 

AAGTGTATGAAGTAGGTTAAGAATAAAAGGATATTATTAAATAGGATATATTCTACAGCCCCA 

2 5 TTTAGCCACAGTGTATTTTGCTTACCA^ 

GGGCAAGTGCTGAGTGCAGCACTGTGGAACCAGGACATTCTAGGTTC 
TTCTTTTGTCCTTGGCACGTTTTTTAACTGATTAACCT 
ATGGCCCTGGl"rL"ri'lXSCAAGTTAAATTCCGTTTGTCTAGGAAGTACTTTAA 
ATAAGAATCCTCTTAACAAGACTGTGTTGTAAGCATAGCATATCATAGTTT^ 

3 0 ATACAACTTCAAGATTAATTGTCATAAAATCACAAAGCTATGTCTTGGTCAGGCTTACTAT^ CGTG AT GAAACAC CGTGAC CAG 

AAGTGACTTGGAGAGAAAAGGGTTTATTTGGCTTAC^ 
AAACAGGGCAGGAACCTGGAGGCAGGAGTTGATACAAAGACCA^ 
TCAGCTTrCTGACAGAACCCAGGACCACCAATCCAGGGATGGCATC^ 
AGAAAATGTCTTAGAGGTTTGTCTACAGTTCAGTCTTATAGAAA 

3 5 GATTATACCTTGTGTTAGGTTGGC^TAAAACGAGCCAGAGCTAGCTGTATATTTAAAATT 

ATATGAGTCCTATGCTAGTTTACTTAAGAGCTGTTTCGTC 

GTGTATGTGATCTGAAATGCTGGGAAGCAGGAAGGACAGGAGTGCAGAGTATGGGTATGTAA 
GAGATCCTGTCTCGAAAAGAAAGAGAAAGAGAGAATACAATATCACACACATACT 
GATGTCCAACCAAATCTTTTTTTGGGTGGGGGAGGGGTTTGAGACAGGGTTTCTCTGTATAG 
40 TGTAGACCAAGCTGGCCTTGAACTCAGAAATCCACCTGCCTCTGC . 

GCTGTCCAGCCAAATCITAAAGTGAATCCTTAAAGTTrTGTAAAACATTGTACTATAGAA 
AATGACTTCTCATGGATTTTTAATCAATACTCATTTCAG^ 
TCAAGTGTGAATTGCCCTGTGGTAAATACATATTCAGACAAATA 
ATCCCTGTGGCTGAGGCCTGAATTAAGTTTCCCCAACACACACACACACACCT 

4 5 TTCTCTGGGCTTAGGAGCTCTAAACT13CACTTAAGTACTGTC 

GTAGGTAGG CTGGGAATAGGAATAGG AT ACAGATTTGTATTCTG AAAGCAG AAAGGTAA^ CAACCTTAGCTCA 

GACAGTACATGAATAGGTTTTTCACTGATCTTCATGTATCCATAAACACAATAAAG'rrTl'l"! 

ACACACATAATTCCTCACAAATAATGAGGGTTGCCTATATGTCTAGCTATATCT^ 

TATGAXZAAACTTAAGAT AGGG CCGCACTCTGTTAACCTGGCTGG CCTGGAACTCACAG AGCT AGGTCCTCCTG CTTTTTGCGTCCC 

5 0 AAGTGCTTGGATTAAAGGTGTGCACCATCATACCCAGCAATAGTTTCAGTC 

CCTACCTAGTAGCTTAAAGTAATTAAAGTGACTGTAGCAGTAATCTTCTATTACCTAAAACACCCCTCCT 

AAAG CTGTACTCCCTGTAATCTTTCTGCTTCTCTTT AGTC CCAGGT AGATG AGGCATGGGAAGGGCTAAACAATA 

GGATTAACCATTTCCCTTGCCCTCTCCTCCCAGTGTTACTTCrCCTTATAT 

TATTAATGTTCTAGGAAATATGTATCAATTGGCACACTCTTAACTACACTAAAACAGGAGCATCT 

5 5 CTTCAAATAAGACTCAGGTACAATTTAGAAGGTCATATATCTAGTCCTGGACCTATTGCTATC^TAAGGAATTA 

CATGTACATCATTGAATACTAGAAAAAGCTATTGATAGCCCCAATAGTAATATGAGAT^ 
GACACTAGGGAAGTAAGTATCGAATATCAGATTGATGCCTGTCTCTGTATC 
TCTGTCTCTCTCTCTTGGATGGTGGTGCATATCCACATGTGTATGTCT 
AGTTGTTTTTATCACTGTTCAACTTGTTTTT^ 

6 0 CCAGGTCCACACTGGAGTTAAAGATGTGCACTGCTACACTCAGCTTTTTAGTC 

GTACAGCAAAAGCTTCACCCACTGAGCATCCATCCAGCCTTCCCATATAGCCTAAGACACACATTTTCT^ 
CCATAAGGCATCTCTTAATAATCATCAGTGTAACCAAGGACTTATTATACA^ 
AGACATAGTAGTGTGTGCTTACAAGAGTTGTGAGGCTGACATAGGAATATCTAGAGAACCAGG 
TGCCTCAAAAAGCAGACAGCAACATTTTTTATTTTACTTTGAGTGT^ 
65 GGTATGTGCAATATAGTGGATATTATGTTTTCTATTCATGTATATGTGTATGTAGTTTTAATAG 
TCTTTGTTTCAAGCTGATAAAATAATAGTTACTTGGTATAATTT 

G AT ATTTTAAAG ATGTT ATAAGGT AAATGACTCAGTTTT AAAAAATG AT^ CT CTTG CATGT A 

TTTATGTCATAGGTACCCGCAAGATGTGGTGTTACAGTTCGAGACAGTCTAAAGAAAGCACTGAT 

ATGCTGTGCTGTTTACAGAATTCAGGATGGGTATGGTTTGTATGTGTTGTC 

7 0 TGGGGTTCTTTTGTAGTCTTTCrGCAAATCT 

GCrCATAGCCATCTGTAATTCCAATTCCAGGGATCTAAAATCTTGTTGTCTCC 
CAGGAAAACACCCATACACATTAGAAAAAGAGAGGGTATTTATTACT 
CTACTCTTGAAGAAATCTTGAAGACTATTGATAATTTATTTGGTT^ 
TAATTTCCTTAATTTGTGAACAATTAAAACAGCAGTTCTC^ 
75 ATTTA-C1 B .GTGTG ATTCAT^^ C AGT AGAAAATT ACATTT ATGAGTAG CAACAT AAATG TTTTT ATGG AT GGGGG ATTGCCACAAT AT 
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GAAGAACTGTATTAGAGGGTCACAGCIATTAGGAAGGTTGAGAACCACT^ 

ATAGTCTTCClXnTAAATACCAAGCTCATAGTTCCACAAAAAGTTACCAAAATTT 

AGTAATATTGTTCCGAGATGGTTATGTTTTGAATCTACATCrTGAGGCATC 

GGTAAGACCATACCTCTGTGTTTGTTTGTTTGTTTC 
5 TTTTTCTGGACCGACTTAAAAGTAAATATATTTATACTGCTCTTTGTTTAAA^ 

AAGACTITAATTTCTAATGATTGAAAAGTACAGAGCCAAACTTT 

ATATACTGAQATACAGTAATCAAGAAAGG TITVIG CCT^ 

CCTGGCTTACTGGAGAGGAGTTACATGTTGAAGTACTGGAGAATGTCCCAC^ 

TTGAATTGTCAAAGCTGGTCAGTT T LTriTGAA rm ' rrTA TTTA^ 
1 0 TGGCCTAAATTCTGTTCTCTAAAATCTTGTATTTGAAAAGTTA^ 

TCTTAAATGAACAAATAAAAACCAAACAAAATTAGTCTGATATGAAAATCTGCCTATG^ 

GTGTATAGATGCAGGTAACAGGATAGTTCTCTATGTACACAACTCCAGGGCAGT^ 

TTCCCATTCCTCTCTCTCTTTCAGGTGAAAACTGGGGTCA.TG 

CGGGTGTTAATTCTATTTCAGTACGGCTAGCTTTATTCTTGTAATGATTGCCT 
1 5 CT CTT ATTTTT ATTTTT AT AAAATGTAACATGT AG AGTT AGACTG C CT ATGT CTTGACATACAAATTGTT CG CAAAGATTAAGG GC 

TGGGTAGGAAGCAGCAGAGTTGGGTATTGAGTTCTCCTTGAGACTTTATC^ 

TTGTCACCAGCC CGCAACAG CCTC CTAACCACCC CCC ACTCAACT AAAACATCTTT AAAT AT ACTT AGAGT CTGT^ 
CATAAGTT ATGG CCAG AAGAAAT AATAG AT CTTTreGGT^ CTTT ATGGTTTGCAT AAAAT C CATC 
TTTAGTACnGCATACCCAGTACCTTTTACCCTTCTGCTGT^ 
2 0 CCCTGTGTACCTCGCCTGCATGGGCGTGTGTGTGTGTGTGTGTGTC 
TWTGCAGTACTAAAGATGCTTAATTrGGCAATTAAATATCAA^ 
GGAAAGGTAGAAACAAACATTTGCCTTTTTTCACATGC^^ 
AGTTCCTGAAAGTCCAGAATGTATTATGTGCCAGACTATAATT 
TTTGAGTGGGGCACAGTGGCACACTGGGAAGCAGAGACACACAAATG 

2 5 AAGGCCCTGTCTCAAAAAAAGAAAAAGTTTGAGAGCAGATGTGGCnTTCCATAGCT^ 

GG AT GGTTGAAAGTTCAAGACCAACTT CTG T CG CAAAAAC CATTTAAACAGAAAG AG AAATG TGATTTG GTGTT AAAAT CAAATGT 
ATCGTGAAAAGCATTTATACATAATAAAGGATATAAAATGCTAAGCTA 

ATT AGGT CAT AAT AAAAG AAT ACATTT ATT ACTT ATTTGATT GTG GTTTTT^ CAAGATG GTTG AAAAT AT AAAAAA 

GCCATCCTTGCCATCAAGATTCTGAAATAGCAGTGTTAGCAGGTTGTGTGTTGTTCT 

3 0 TTATATC^TTTCTGCCTGGGGGAGAAAACGTTCCCGTCATACACTGCAA 

ACTTTTTGATACTTTTGAACATTTATTTGTTTTGTTAT 

ACATTCTTCCAGAGGTGTCCTAACAATATGTTAAAGTCTGAAGGCAGAAGTT 

rTGAGAAGTTGCTGAATTTAGCAGTAATACATCCTGTGTTACAAA 

ACTACACCTTAATAAAATTACAATGTAGATTTCATGAAGCCATTGATAATTTTTAAT^ 

3 5 TTTTTATATTACTAAATATCTTTTGCCATGACTTCTTG 

TGGCTTGTTGCCAGATTATAGCATTCTTTCTCCT 

AACTGTCGCACTAGTGGTCACCTCCCTTAGCTTCCTTCCTTGTGTCGGGATTGAGTGATAGCTT^ 
AG CTCTT CTTTTTGT ACTTTTACTTTTTTT ATG CTGTO^ 
GAGAATTTCATGCATGAGTCCCTTTAAGTATTGCTTGTTGTCCATTTGAAAT^ 

4 0 TGTTGTATGCCTTGATTTTTGAGCTTTTATAT^^ 

GATGTGTGACTTTTTATGATACAAATATATAAAACGGATTCrTGCAAGTATCT 
GAAAACirTTTTCACCTTAGCATTTTGTGACTTTTGCCGAAAGCT 
TTCACCAGCGTTGTAGTACAGAGGTTCCACTGATGTGTGTAAATTATC 
TATTACTTTATAGTTAAATTTTCATACTATAAAATTAGTGTACrTTGAAAG 

4 5 GAATTTTTAAAAGAACGAATAGTCCACGCAACTCTTCTCCCTCAGATGAA 

CATACTCTTTCTAAAGTAACAAGTACTAGCCCTTAGAAATCACT 

GTTAACTATTGTCTTTCAGGACAATGAGTCTGATTCAGC^^ 

ACCCATATCTGTTTGTTTCATCTTATCCCTGTTCCCTTTATTTAAGGGAATACT^ 

CT"l'*I"l'"I-r CT I'CT 1' ATTTT AT AC3T AXG XCT AAT ACSJVT AX ACT AGATG TTATCAXTG X AOAAT AAXOX"! 1 ^ CM CX3 AT AACT 1' CTJ."1" -, 1"TGTTG 

5 0 CTGCTGAAATGGAATTTGACTTTAAAATTAATTATAATGATATA 

CTGTAAGTCCAAG ACCAGCCTGGTCCACAOIGTG AGTTCCAAG^ CATCTCAG AAAAAAACAAGAAGGAT AAT 

CTAACITACCACTAGATATTTTTCTTCrAGTTTTTAAACTAGAT 
GTTTTAGAGCTGAGCATGCCCTTGAAAGTATAATATAATTTTTTTAAAAA 
ACAGTTGTTTCTGAAAGTAAAACTTGATCCCAOlTTTTCT 

5 5 AGTTAGGTTTGTTTTGTTTTGTTTTGTlTl"rGCCTCACAGTTTG CTGTTTGTCTCCAAG TTCTTTG AGCAT CACCCAGT AC CACAG 

GAGGAGG CCTCCTTCCCAG AG ACTGCCCTTCCATCTGGATC CTCI^CCGC ACCCCCCTCAGACTCTACTGGGTAGGATTTGA 

TGCTGTCTAATGATATGCTAGCTGAACrGCCTTTCTTTGGGTCTTTTG 

TATAGATATTTTCCTCCAATAATGTATTTTATCACTAATGTACATGATTACAGCATTGA^ 

GGGATTAAAAGCCTGGTGTTAGAGTAGGT C ACIACCTTTAATCCCAGCACTCAAGAGGCTGAAACAAG CAGATCTCTGTG AGTGG AG 

6 0 CCTGACCGGATCTACACAGTGAGTTACIGTCTTGAAACACAC^ 

AAGATCAATTGTGAAGAAAGTCAACATAAAATTCTTATGCCTAGTCTTTTATT 
TTAAAAAATATTAACATATGAGTTGATAATTTTTCAAAACT 

TATTTTATATTTATAGAATGCAAGGAAATAGGATGTACTC!AAGAAAGCTTAGTAGAGTTGTGATTTA 

GCTGG CTG CCG GG CGT GGTGGCG CACA.CCTTT AATCCCAGCACT CGGG AGG CAGAG GCAGG T GG ATTT CTGAGTT C AAGG CCAG C C 

6 5 TGGTCTACAGAGTGAGTTCCAGGACAGCCAGGGCTGTTACATGGAGAAACCCTGTCT^^ 

GCCATAGACAGAGAGAGATGGGGGAAAGGGGGGAGGGAGGTGGAGAGAGAGAATGAGTTGCTGGA^ 

CAGAG CAAGTGTCCAG AG AAG AGAGCAGGTrTCAGAGCTAGTCACAAAG CTT ACCTAAGTTT ATAAGTAGTATGTG AGAGACTG AT 

ATGATGCACACGTCTAAAATCCCAGCACTCTGGAGGCTGAAGCAGAAAGGTGGTGAGTCTGA^ 

ATCTTAATCTCAAATAACAACAAAAAGTTGGAAGAAGAAAAGTACCATATGTGAGACCATG 

7 0 TGGTATTTGCTTGAGATATGAAGGTGTGGGAGAATAAAGATTTTCAAACAA^ 

ATTT ACTTAGCAGTGAAATACAG ATGG ACATGGAAAGCCTACGGTGAAGAAGACAG CAAAG CCT AAAGTTGAGGG AAGGCCGTGTG 
AAGAGAAGCAGGGCATTGACAAAGGACAAAAGGGA&GTG^ 
AATATCTCCAAGGCATTGCATTCATAGTAGAAAAAGAGACAATAATTTAAGC^ 
TTTCCCATTGGGTTAAGTCCrrAGAAGTATGTC<nTTTTAAAAAAAAATAAAAAG 
75 TCCCAGTACTCTGGAGTCAGACGCAGATGGATCTCT^ 
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GCTACACAOAOAAACTCTTATCTTG^ 

GAATAOTTCTGGACTGTTTAGTGCGGGGGTATTTATTTCACTATTACCCT 
ACCCTCAAAAACTTATTTATTTATGATTGATTGATTCTGTGGGCAGTGATATGT 
TGTGTACCTCTACACAGGGGACAGCTTTCATGACGCAGTTCTCTCCTTTCACTGTGGAA 
5 GGATCGCTCAAGTGGTCAGTCAGGGATCGCTCAAGTGGTCAGT 
AGTCAGGGATCGCTCAAGTGGTCAGTCAGGGATCGCTCAAGTGG 
GTGGTCAGTCAGGGATCGCTCAAGTGGTCAGTCAGGGATCGCTCAAGT 
GCTCAAGTGGTCAGTCAGGGATCACTCAAGTGGTCAGTCAGGGATCGCTCAAGTG 
CATCTTGCTAGCCTTTATTTCTTTAGACTTAAAGAAGGTTAACATTA^ 
1 0 CTTG CAATCT AGTTG AAAG AACTCGATTTC CAGGAT CAAGG AG AGACCCTTCCT CAGAT AAGACAGAAA3 CAATAGAGAAGGCTAC 

CTGGTACCCTCCTCCCCCATCACACATAGAATTGGAATGTGTGTGAGATGAGA 
AGGACTTTGTAAATTTGACAAAGGCrAAATGTGGTACAGCTGTAATCCCAGCA 

CAAAAGTCGAT ACAG CAT AGCAGT AACAAGAGACCCT ACCTCAAAAAT AG GT AAAAATTTG "Tl"!'!' CT T ACCTTCATGTG TG CATCA 

TGGCATGCACACACCTGCAAACAAGCACATAATAACAACAACAACAATAATAATTATA^ 
1 5 TATTACCTTTTTATATCACTAGAAAATACAACTGTATTGTCIAGATTGTAATAGTTATCT 

ClTlTI " n 'r n "l'T U TAGATGATCAGATCAAAATAATGTTTGGGATTTTA^ 

CTATAGATCTGTAATGCTTTTCTAATGACAGATGCCAGTTCTATGAACAGAGTTACAGAAC^ 

GGATCATAATGGTAACAGTACAGATTGTCTTCTAGTGAAGGGTTGCTG 

TTTTTTCTTAACTAGAGGGAAATGGATCTGGTCTTGAAAACTGTGCAAGATTCTCT 
2 0 CTGTGT AAAATT AC CTTTGT AAAGTTAG AAG C CAATTAGT CAAGGTTCT CT AGAGGAACAT AATTTAT ATAATGAACTT ATAG AAT 

ATGTGTATATGTATTATGTATTATATTTTTATATATGTACATATAAATACACACATATGTGT^ 

GTTTACAGATTATGGTCTAGCTAGTCTACTAATGGAAAGTCCAACAATCCAGTAGTTGTTCAGTCCATGAGGCTQ 

TGGTCITCAGTATCTGCCAGAATCCTGAAGAAGTAGGCTGACTCTAATGCCAGTAAAAGAATGGACTTGCTA 

AAGCAGGGAAAGAGAGCACGCTTCCTTCTCCATGTCCTTTATCTACGCTGCC 

2 5 TTTCTACCTCAAAGATCCAGATTAGAATGGGTTTTCCCACrT 

ATTCCAGACGTAGTCAAGTTAATAACAAAAAGTAGACACATCAGTTATATTCAATATC^ 

CAATTTTAAAAGTAATGGTATTTCTTTTTTCCTGGCTAGCTCGCACA 

ATAGCACAGTATTTGAACATTTATTAATCTATTTTAATCCTCTAAACTAATCT 

GTATTTTAGTATTGATGTGACTTTTTCTGCTCCAGGTGTATTCCCATAGTTGTCCCCTCAGCTCTCCT 

3 0 TCTCTTCTTTCTTTTCCCCTCACCTCTCT^ 

ATTGACAAATAAGAGGATAAATGAGAACAATGTTTACACAACATTGAGTCAGGAGATACTGAGAA 
CAGATATGGGGACAGAAAAATCAGCATTTGAATAATACAAGGATAATCTTTACACAGTGTTCAGTAAC^ 
TCTGACAACAACTATAACAGTCCTAAGACAAGTATGGGCATACCAAGAAAT 
AAAGAGATAATTACTTTCTTTTTATC7VCTTACAGAAAGTTACTGT 

3 5 CT AAATT ATAGTAGTTTT CT AAAG AG TGTT AG AATT AAAATTCTGTTT ATTTTTAATl'IT AAAATTTTTT 

AGGTATGTGG AAGGGTCZAGTACAAAT AT AT ACCTGTGGGAGAAAGC C AATT AGGTGATGG CCAAG ATCCTTCATAGGTT AGTTATA 
AATTGCCTACTTCAAGGAAAGAGTCCACAATATGCAGGTATGGCm 
TAAGTTGAGCAAAATCAGAACAAAGAATATACATGCCACCTAATCCAGACTTGTG 
GTAATGAGAAGCAAAATTATAGTTAGAGGATTGCTTTTGTTAAGCATTGAATTAAA^ 

4 0 GCTCAGAG TT ACC AT AT ACTTG C CTT CAAGGGT AGAG AAGTT AACTTTG AACT C AGG CAT AC CTTCAC C CTC CAG ATGG TATT AAT 

ACATCACATGGATTTTGTGTTTGAAGCCCTCTTCTTTTAC 
CCTCTGAAAACAGAGTTTTTATTTCTTACCC^ 

CCTGAATTAAAGGCACTATTGTCCTAAGACTGTGAGTAGCTTAACAGTTTTTATTTCTGAA 

4 5 TAATTTTTAGCTCTTTGAAAATTCCTTAGCAAAGGAATTTTTGTTTCT 

TTTAACAAAATTTCTCGATAACCAGATGAAACATAACTrCCCTCAAATCCAGTTCCA 
TTTTAGAGAATAGTGTTTG CATC CTGTT C^TTG AAG CAG ACTCG CAT AGTG AAT ATGAATGCCC^ 
CCTTATATCACGAGCTTCCTAGCCTGATGACGTGCTTTGTTTAAAAC^ 

TCTACCCTTTTCTATTGCTTTTTTTATTATATGTATTTAATATGACTGTGTGCATGTGTGTCT 

5 0 CCTTTGTGGAAGTTAGAAGACAGCTTTGTGGAATAAGGTCTCCTTCCACCACCT 

TAGTAGCAAGTACCTTTTGATCCACTGAGTGACCAATCTCATTGTCCCCTCCT 
CTTGTTATCTCAACAATGTC"1TTCTTGTCTAATAACAACCAAA 
GTATGTGAAGAGAACTTGGTCCATGTTGGACTTTCCATGGCCAGTTGACACTTAGG 
TAATCACTTCTCTGCTCCTGAGAATCTTTTCTGAGGCAGCTACAATCTTTT^ 

5 5 TAGGCCAGAACTCATAGATTTTATTATCTCTATCAAAA^ 

AATAATTTTAGGGCTGGGAAGATGACTCAGTGGCTAAAGTCCTTGTCATGCAA 
ACCTAAAAAGTTGGGTTGTGACTGTTGTTGAAACCCATGACCCCAGTGTTGCAGGATCACAC 
TACTGGGGGG AAAATT TTT OlGTT ATGTATGGTGTGGC T CAGGC CTT AGGACACACCTT T AATCC 
GCCCTTAGTACACATGTTTAATCCCAAACAATGAGATAAAGTTAGTTTGTAGAAGGAAGCACT 

6 0 AGAAGGATTTGACAGAATAGAAT ATGCCCAACTTGAG ACATTCTTC CTrCAACTGTGCTTCT A 

CAGAACTGGGTGCCCAACCCCTCTGACCCCAGTTGGCT^ 

CCTGCCACTGCCATGACCAATGCTGCTCCCACGAAAGTGTTGCCGAATTACCATGAGGA 
GAACTCGGAGTTACATGCCTCCTGCCTCTGTCTGTGGCTT^ 

CTTTCTCTG CCAATCT CTGATGAAG CTGTAG AACCAATG AAGTGGCTG AATTTTCCTGCCGGATATAAAGAAACCAG ACTGTGA 

6 5 CCTGGGAGAGCCGGCTGAATGCAGTCGAGTGTACACTGCACTTGGAAAGGAATGTGAATCAGGC^ 

ACTATGGACAAGAATGGTCCCCACTTATGTGACTTCACTGAGGCACATTATACCT 

GGTGAC CACGTG AC CAACTATGCAAG ATGGT AG CCC CTGAATCTGGCATGG CAG AATATCTCTTTGACAAGCACACCCTGGGACTC 

AGTGATGAAGCTAAGCTGACTTCCCCAAAGCCATATGACTTTACTGGTCACTGAGGCAGTGCATATATGTTGG^ 

TTTCTGTAAGTTGC^CCAAAACATCCrCTTAAGTTCTTTAATTTGTACC^ 

7 0 CCACCCCACCCCACCCCACCCCCAAAAGAATACCCAACTTAC^AGAGAAAAAATO 

GGCCGGGCGTGGTCGCGCACGCCTTTAATCCCAGCACTCGGGAGGCAG 
TCAGAGTGAGTTCCAGGACAGCCZAGGACTGCGCAGAGAAACCCTGTCT 
GAGAGAGAGCAGCATACrAGAGAAAAAAGGAGGGAGGTAGTTTTACTAGGTTACAGAGA 
AATGAGAAAGAGCCAG AAAATT AGACCATGTTGCCAAAGTTAGTATGAGTTCAAGCAGAGO 
75 cCAGATTCAGTCAGTCAGCTTGGAaaJIGAGTTT 
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AAAAAGTGAGCTTAGTCAGCAGTAAAGCTCAGAGGCT 

ATG ACTAGG CCT AG CTTAGCCX1ACTGAGGCCAT AAGCCTCCGAGACAATTT CTTC CAGGGAAT AAAAGAT ACTTTT ACAC C CCAGT 
CCTGGGAAGACAGAGATAAGAAGATTTTCGGAGTTTGCTGGCCA 
ACCCCGTCTTAAATAAGTGGAGATAAAGCAATTGACACCAGTTTCTG 
5 TTTTAGTTATTTTAAATATTTGCTTCA^ 

ATGAAAAAGATTTTGAAATATGGACAATAAATGGCAC 

AAAATTCATTCAATACTGGTATTAAGTAGTAATTCTATCTITAGGGTCCTAA 

TTTCTAACT ACCTTCrTCAGATTTTTATTTCAGT CAAC 

TAGTCTCTACCTGAAAC^TTTTAATTCTATATATAATTTTAAAGGATAATT 
1 0 GCCATTTGT AGATTCCTCTGGAG'rrX"r CTGGTCTGCTTTTGAG CAAAG CTTTGGCAGT ACT AGTTGTAGG T ATGTTmTIT AAAC 

CAATGC ITIT IX J TAAAGCTATTACTACTGGAATT^ 

TTAAATATTATTATTAATATAATTGCTCCAACTCTTATCTTCCCC^ 

CTCCTTCAAAATCC7VTTCCAATTCCACAGCCCTTCCGACCAGCAG 

TCCTCAGCTCCCAATGTTCATATAAACACAATTGAGCCTGTGAATATCGAT 
1 5 TGTGTTTCTCTGGTTTTCTGCTGT ACTTTT AATACAGATTGGACX3T 

ACATTATAGCTCCAACACCCTCTCACGGTCTCT^ 

TATGACACTATGTCrGAAAATGGTTTTAGTTAAAATGAC-lTTATCTTAATT 

GCCCATGGGAATTCATGAATTCTCTAAGA'l , 'l ill GTG CAATTAATGTGTT ATGAAAATG ACATT AAGAATTT AT AT1TTITT CTGT 
CATGAGTTTAGCATTAGAACCAACAAAAACAATTCTAAATTTG^ 
2 0 ATTAACTTTCAAAGTGCTTTATTATAAATTTTAT^ 
CTAGTTTTTATTAGGTGCCCCCCTATATTTTAGA 

TTCTCTCTCTATGTAGACCTGAACACACAGGAAGAGACAAGTGGGAAAGGGAGA 

CAACGGTATCTGTAGGTTAGGTGTGTTCTTGATGTGTTCAAGCT 

CITCCCCTTTTCTCTGCrTGTGACTGTCCTCCTGGTTTC^ 

2 5 TTrCATAACTGCCATAACTGTTTTTCCCCATTATGTCIT 

G AACCT CTGAATGCATTT ATTC C CACACATTTCC ACCTCGGTG CTCTCAC C GGG AT ACATTTTT ATG AT ACnTTTT AAAAAATATT 
TAATTGGGGCTCACTTACAGTTTCAGAGGTTCAGTCCATTTTCATCATCATGAGGAG 

CTTG AG CCGCAGGCAGAG AG ACAG AGTG T ATG AGTTTG AGT CTGG TGTGTG CTTTTGGAATTT CAAAG CCAAC CCCT AGTG AC ACA 
CITCCTCCAACAAGGCTACTTCCTCGTAACTAAGCATTCAAATATTATGAAGA 

3 0 GAAAACTAGTAGAAAGAAGCCAACCAGGCATTGTGGTATGTGCCTTTAGT 

AGTGAG AGG C CAGC CT ATGATACATGG TACCCTGGAGAAGG AAGG AGAAAGG AGAGAGTGAATG AGTTT AGCAATTACAG CTG ATC 

ACAGACAAGGGACTAGAAGAGATTAGGGAACAGTTGAAATCGTTTTAAAA 

CAGATGAACTGGGAGCTTTAAGAACACGTGAGATTGAGAAGTGTAAATGTGTGTTGGACCTGGCAGGAGTGGAAA 

3 5 GCTCATCTGT AG GTGTGGGGGG AGTTT CAG AG CT AGGG T AGG AAACAG G AAAAGGGATGT AACTGTTCTCTTG TGGGG CCAT CT AT 

CTAGAAAGCCCCTTGTTTTTATACATGCCTGTCCATTCT^ 
TACTAGTGTCATTTCTTTTGCACTATAGTCACCAATTCTCA^ 

TAGTCTTAATCTATGTCACTAAAAGCCTTGTCAATACCACTACATGGTAGTAAATTATAATAAACTGATGTATCTCT 
TGAAATAGTTACTTATTTTTCATGTGCCrACATCTCACrcTGTGTTTGTAAGCTCTC 

4 0 GACTTTTGTGTGTTGAAGGTTCTATGTAGTACACCAGTAAAGAATTCACTTTTTATTT^ 

CTAACCTTTTTAGAGACATTTGTGGCCTTTTAATCCC^ 

AT C CAAATTTCATT ACAT ATGG AATTT CC CCT C CTG CCAACATT ACTGT AAAAG CTG C ATTTT AAAT AT AG AGAC CAATTT AAACA 
AAATC CTTTTTT G AGCT ATTTT AAT CT CC C CT C CTTTTT AATG TTTCTT AT ACAT CCACATTTTTATGGT AGTATTCAGTGTG CTT 
TTAATTTTGTATTTTTTATTAAATGTAAACATGTTTCATGTTTTAAGTTT 

4 5 TGATACACCATAACCMAATATGTCAAAATGCCT ATCATAGGGGAG CTG CAC 

TAAACATCTTTGTGCATATCATTTTTTCTTCTGAATTATTTCCT^ 
TGAACATTTTTATGGCTCTTAATATGTGCCAGTAGGATTTTCTT^ 
GCATTTGACAAGGACTAATAATCCTTTCACATAGGTCTAGGTTGGCAAGT 
ATTTAATAGTCITTGCTTTGTGAAATAATTATAAGACAC^TATTGTTTTC 

5 0 TTGTTGTTGTTGCTGTTGTTATTACITTTTTACCATGTGCTTTGAATAAACATA 

ACCAGGGGTTTCGTGGTGATGGAGGTAAGTAGTGACTTTAGGGTTGTTGT 
TGTTGTTTTAAGGCAAAAGTGCAATTGCTTTGCTGCTTGTCTC 

ACACAGAGAAAAATGTGTAGAGAAAACCAAAGC^TTTTTGGTTTTTTGTGTGTATTTTTTTA^ 
CAG AATTT AAAATT AT CCATGACAAC^IAGAGAAAATGAGAAG AC ATGG AAGAAAGTCTTAAATGCT 

5 5 TCCACCAAGGAGGGTCCTTAATTCCITTGTCCACTTTTTAATTTTTGCT 

ACACCAAATACTCACAGATAGAGCTTAGAATCAGGTTAAATCATTCTATATCATC 

TGTCAGACTTTTTCAAGCATTTCCCACCCACTTTATAGATCTATTTGGAATACATATTTATT^ 

AAATTTAGTCACTCCCTTGTTGGTATAGTAAAAAATATATACAAA 

TGGTATAATAAGGAAAATATACAAATGAGAGATTATATTAAGATTCTTACAGAATTTCCATCTA^ 

6 0 TATTGTTTTAAGAAAATGCTTTTTGAGCCAAATGTGATAGCATATGCCTGTGATCC 

GCTAGCCCAGTATACATAGTATGGCTCTCTCAAAAAAAAACAAAACAAGCAAACT 
AGATTGCATTTGAGACTGTCTTCACATTTTCTTGCAAGA 
ACTGTAGGAGCTGTACACCATGCCTAGAGCTGGGTTCTTCACTGTGA 
TATTTTTATACTGAAAGTAAACATTGTAAGCCATATTTTGTTATGGTTCTGTACT 

6 5 CGAGCCTAAATTATGTACTTAATATTGACTTTATCACATATTCAAGAGT^ 

GTTGCTGCCAGTTTGACTTTCATATGTCTTAGTAACTGTGGCTGCTGATAATTTTGCCCAATAC^ 

TCGTGTTAGCATCACAGTTAACTTAGTAGTAAACAGCCTGCTTAGTACCTAAAGTCA^ 

ACTTTGAAAAAATTAATTGGGGTTTTTGTGAAAATATAAGCAGATTT^ 

AAGATACAGACTCTGTTTGTGCCTCCCTCACAGCGTGTGTTTGTAAGTGTGAGAGTTAAGTACT 

GAGTGCTCTGAATGTGCTTCTCACACCCTCACTACaSCAAGCCTCCTGTGTGCTGT 

CGGAGTTAACATTTCCATCTTGCTTAATX3TTGTCTTATTTCT 

7 5 A' lT { ^ V'i ^ V' £7 n^7*T^^^7GQJ^'T^^^J^.*T^I^T^J^T^T^^2^^T r ^^^^ T AAT ATTAATT'I' GTT T AAG AAAAT AAAAATGT AT ATGG AAACC 
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CCTATGAAAAAGGGTGTTTTATATTGGGCGTATGGCTCACATCTGA 
TGCAGGTTTGAGGCCAAAGTATTCTTATGAGACTTTTGATACT 
CCCTT AT ATATGAT AT AT ATTTT AGTT AAT CT ATAAACATG AGTT CAAAAAC^ 
AAGGAAAGTTAATAACTGTCCAGTATTTTACCATGTCAATTC 
5 TTAAAATATGTCTACAGTTATCITTATTGCTTTGATACACTGA^ 

CAACCCAGCTACTTTTCAGATAAACCATAATATAGTGGAAGTCATTCA^ 

AAG AACTGT GCCTGAGGACTX5GG CAAGTTGTT ACIG CT TGACCAGTGT AAAATGT ATTG TG CTC 
TAGTTCAAAG AAATTC CAGACACCATT CT CTTTT AGAAATTTG CAT ATACAATCAT AGAATTTTCAT ATGGTG CTATAACATAAT A 
TAGTGGCTACCTAGAAAATTCTACCTAGAAATCTCAGGTCTATTCACTGTAC^ 
1 0 AGATACTATACATATTGCTACTGAGTTAACTGGAAGTTTACAT^ 

GGTTCACITTGCATATAAAATAGTAGAATATATGTGAAATATGGATG 

AGGTTTCAGAAAGTCTTAAAG CCTCTTTAGTGGG ACTGG<K3TG<rrAG CTCATCAGTTTATAAG CACTGteCTGTTTCTCCATAGGAC 
G CGGTTCCCTTCCCAGCTCCACTGTTTGT AACTCTGG CTTCAGG AG CTCTG AT ACCTTCACACAG ATGG CACATACAG G CG AAG CA 
CCAGTGCACATAAGATAAAAATAAATAATTTTTCAAAAAGAAAGCTTCCTTAGCr 
1 5 AAAGAGT CT C CG T CT AATAGAAAAAC CTCACCAAC CAGTGACACTG CTCAC CGT CTGT CTCTT AAAT AAT AG AACATG T AGTCTGC 

TATCCTAGATAAATGTATGGCTAATAGGAAAGAAATACITAAGAGGAAGGCCATTAAAATG 
TTGAAATATTGAGGCTACTGAAATAAAAAATAAGTTTGT^ 
TTCTTGTTTTATCAGAC^GGGTCTTTGTATTATGTGGCTCTGGCT 

G AGATTT AC CT ACATCTGC CT C CCAAGTACT AGGATT AAAGGTGTGTGG CAC C ATGC CTGTGCATTTGTT ATG ATGTTTT AATT AT 
2 0 TTATCAAAGG CACAGGAACrTGGGAGTGAATTTTAGAAATTGTGTCTTGG CTTAGAGATTGTGTTTTTCT AAACTTGTTTATT ATT 

ATTTATTTTAAGGGTCnTTAGATTGGCCCAAAAAACAAAAC^ 

CC CTTTG TT AAGGGCCAAT ATT T AAAT ATG CACTCCATTTTT AAAGT CT AG AT CT CC C CACTTT CTGCCC CTG TTGTGTGGACAT A 

TTTGTTCCTCCrTAATGCTTACTCCrrCCCTAATGTGGATAATTTCCCCIT 

CCCCGCCTGCCTCATTACCIGGCTCACTCACTAACG 

2 5 TCTTCCT CAT CCTCGG AGGACAG AAGTCGGATGGT AAG CAT CTGTGCTGTGCTCCTCTAACTGTGACGCCGGGTT CCCAT CAC ATG 

GAATATTGCATAGCGCATATGCTCACAAAACTGAC!TTTTGCTCTGGTT^ 
ATGGTGTTCTGTTTCTAAAATGAGGAGAATGAGACTAAGCAAGTATTTA 
AAAGCTGGTGCAGAATTCAAGTGTCTACCAAATGGCATCCTTCCATATTTGAAAAATAA 
GAATTC<3TATATTGGTTTATTTATTTATTCCTTTATTTATTT 

3 0 AGCACTAGTTAGAGCATGGTAACTTGAATTGT^^ 

AGTTCTCCACATAGTTAAAAAGCCAGTTTATTTTAAGAATAACAAATGTATT 
TTTAATTAGTATTCTTTTGTGTGACTGTGTGTATAAGTCCAGCATGTACATC 

ATCTCCTGGAACTAGAGTAAGAATGTGAGCTGCTAGCTGAGTGCTGGGACTCGAACCTGGGTCCTCT 

CT AACTT CTG AACTATCTCTCC AGCACCAATT ATTT CT AATTTT AAG T AATT ATT C CAT C CTC ATATTGG CTCATTTT AAATG T AA 

3 5 CTTTATATTTTGAGTCACAAATATAACTATTACTTCCTCAACATGGATATTTTATGTT^ 

CTTGCTCTGTTCCTAACTTCCTGTCAGGCAGGTCAATATAGT^ 
TTTTGTACTCTATTTTTCGAGAAAACACTTGGTA 

AGAGAGAATTGGATCTGGGTCATTTGGAACTGTCTACAAGGGAAAGTGGCATGG^ 
GGGACATGGTATGTATAACACGATGCCIATTGGCACAAGTGGGATATATTAAGAAATGTTT 

4 0 GCTGGGTGTGATAGCACACATCCTTAAACCCAGCTXCTCAAGAGGCAGAGGCAAGTAGATCTGAGT^ 

GAGAAAGTTACAGTACAGAGAGGGCTACACAGATCTGTCTTC<K3^ 

CACTTAGGAGGTATAAGCGTGAGGATTAGGAATTCAAGGACATCCTTAGCTA^ 

GATTGTCTCAAAAACAAATATGAAGTAAAAATATT(!ACTGTTTAC^ 

GTTTCTAACAAAAATGGTGGACCATAAAAACTGGCTTCAGAGTAACTAAGATTGCTGTTATACTTTATATTA^ 

4 5 TGAAGGTCTTCTAGCTATCAGTCCCAAAGGAAGTTCGTTTTTCTTTT^ 

CAAAAGTTAACATAAAAAGTGTTCAGCCTCACTAGTTTTCT^ 

ATAGATTTTTTTTTTAATTGATCATGTGAAGATTGTGATTATAATCCTGTTTATTTTGT 

CACTGAGCCTAAGCACGTAATTGAACAGCAATGAATAGGTTGATGGTTGGGGAAGTATACTCACTAGC 

AATGAAAATAATGTGAGTACTCACTAGTAGAAGATTCAGTTAGCACATTGACACATCCATATATTGTTATATA^^ 

5 0 TATAATAGTTACATTGATATGGAAAGGTTTTAATATATTGATTTGTTTGAAACAG 

TAAAATGCATTTAACAGTATAAAAATGCTATCTGAATCTCTCAAGTTTATTATTTCGT 

ATATATAAGCATGTATATATGCATAAATATATATACACACATATATATGTATGGTATATATAAGTTTTAATTCTGAGTGCC^ 
TAG-GAG AATTTTTTTTAATGATTTACTCATGATTATGAGTGCTCTATTTGCATGTACA 
CCATTATAGATGCTTGTGTTCCATCATGTGTTTGCTGTGAATTGAAATCAAGACCTGGAAGAGC^ 

5 5 ATCTCACCATCTCXrTATGGAACATTTATGTTTTCCTAAAACATTTAAGTAA 

CTATAGAACirTGGTAGTTAAGAAAAATCAAGATTTATAGTGATTATGTTTAAATGTATTA 

TGAAGCTCACCAAGTG AAT AAATACATTTGTG ATTAAT AAATGTAAT ATTATT AG CCTTCTAG AAGGTCTCCACATGCCTTCTT CA 

TTCACTCACCCTGCAAAGACAACAAACATCCTAACCACAATAGAATATATTGTCT 

AATGTTGTTGGGTATGTTTGTGTTTTATTTGTCCCCTGAATAGCATT^ 

6 0 ATAATAAGCATTTGGGTACTTCCCACTTTGGTGCAATTCTGAATGATGTTGCTGTGTATA 

AT ATGTGTATTTTTT ACTGG CT AT ATGC CTAGGAGAG AATTTT AAAACTACACCAT ATG CAT ACTTG C AATTTTAT AT AT ACT C CC 
ATCGGTG CTG CTAAG T AATT AT AC CAGTTTAT AT AC CCAACAAAAGTG TAT AAAACACT AGAAG T ATTT AAACT ACGATT AT AATT 
TTAATAATTTTTCCAGTCCACATAAGTAAGCCATAACGTTTGGAA 
ACCTGTTTTGTTGTCCATGGTTCGAGCACTCACTGACCAGGTAACATTTT^ 
65 GTACTTAGTTACCTAAGTGTATTCCCAGGGTGAGGAAGTTC^CTATATTTTAGCATATATTTT 
TTGTTAGTTGCTTTGTGTGGGATTGAAGGATGTGCGTGCCCGATAGTATC 
TTAAATATAATTTATTTACAGTTTAATAGGGATATGAATTTAAAACTGACAATTAGCAAAA^ 
TCACAATTTGAAATCTTGGGTAACAATTTTCTTTCAGGTGATGTG^ 

GCTACAGGCCTTCAAAAATGAAGTAGGAGTGCTC^GGTGAGCGAGTGGCCACAGCTTTCCA 

7 0 GTGGCTGCTTAGACTTCACGTTCCCATTGTTGTC 

TGATTGTGTCTAATAGTAGTTCCTTTTAGGAAAGTAATACTCTA^ 
TCCTAGGCCCTAGGACATGAGGAACATTGACAAATGAGAGAGAGTAAATCCATGA 
AATCTTATGTGAGGGGAGTGTTCAACCAAACTGAGGTTTTAATCCAGAGAAGTGTCTTGTATTCA^ 
CTATTGTAAAGAACAGAACGAGGAAGCTAGCTGTGGTGGTTAGAGGAAAATGGGTTTACGTCC^ 
7 5 ttcGAaCaaCTTAAACAAAT^ 
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TTAAACTCAGTAATTAGAGCTCAACCTTGCTGTGTTTCTGGTACTTTTTCCTTGAGCA 

TTG TT ACTCATTGTCACG TTT CTTTTTGG CAGGAAAACTCGACATGTG AAT ATCCTCCTTTTCATGGG CT ATTCT ACAAAG C CACA 
ACTGGCAATTGTTACACAGTGGTGTGAGGGCTCCAGCTTATATC^ 

TTATAGATATTGCTCGGCAGACTGCACAGGGCATGGAGTAAGTTCAGTGCTGTTAAATCTTCA 
5 AAGGATACACTAAAGG TCCTGATGATAACCT CITATGGGTTG CAGAGG TACT AACT ACT CAAAATTGC 

TATTTCTCTAGTTCAGCCTTTAACATATATAAGGAAGAAGTAAGGTATITGATGA 
TAAGTGATTTATATAAACAGTGGAAGTACAGAAAAGCAC^AGCCCT 
AAAAAAAAAGCCATTAAAGTTAATAGGAAATATGTGACTATTATTTTAAATGAATGT 
AGTTGTAACAGGTATATCTACTGAATTTTACTTTGTAATTCAAAAGA^ 
10 GGGTAGGTGTGGTGGTGTACA 

MOUSE SEQUENCE - mRNA 

NCGGCGCTGGCGCCGCGGCCTCTTCGGCTGCCGACCCGGCCATTCCTGA 
CAGGAACATATAGAGGGCCTATTGGACAAATTTGGTGGAGAGCATAACCCACCATCAATATAC 
1 5 CAGCAAGCTAGATGCCCTTCAGCAAAGAGAACAGC 

CATCAATGGAC^CCGTTACATCCTCCTCCTCITCTAGCCTT^ 

G CATCACGGAACAACCCCAAGTCACCACAGAAACCT ATCG TT AG AG TCTT CCTGC CCAACAAACAG AGG ACAGTGG T ACGGAAAAC 
TTTTTTCACCTTAGCATTTTGTGACTTTTC 

AGCGTTGTAGTACAGAGGTTCCACTGATGTGTGTAAATTATGACCAACTTC 
2 0 TCCATTCCAATTCCACAGCCCTTCCGACCA 

CAATGTTCATATAAACACAATTGAGCCTGTGAATATCGATGATTTGA 
CAGGCTTGTCCGCCACCCCGCCTGCCTCATrACCrGGCTCACT 
GAAAGGAAGTCATCTTCTTCCTCATCCTCGGAGGACAGAAGTCGGATGAAAA 
GATTCCTGATGGACAGATTACAGTGGGACAGAGAATTGGATCTGGGTCAT^ 

2 5 TGG CAGTG AAAATGTTG AATGTG ACAG CAC CCACACCT CAAC AG CT ACAGGCCTTC AAAAATG AAGT AGG AGTG CTCAGGAAAACT 

CGACATGTGAATATCCTCCTTTTCATGGGCTATTCTACAAAGCCACA^ 
ATATCACCATCTCCAC^TCATTGAGACCAAATTTG 

ACTTACACGCCAAGTCAATCATCCACAGAGACCTCAAGAGTAATAGTATCCTTCCT^ 
ACX3GTAAAAATAGGTGACTTTGGTCTAGCCAGAGTGAAATCTCGGTGGAG 

3 0 TTTGTGGATGGCACCAGAAGTAATCAGAATGCAAGATAAAAACCCGTATAGCT 

TGTACGAACTGATGACCGGCCAGCTACCTTATTCAAACATCAACAA 

TCTCCAGATCTCAGTAAGGTACGGAGTAACTGTCCAAAAGCCATGAAGAGATTAATGGCAGAGTGC 

GAGACCACTCTTTCCCCAAATTCTCGCCTCCATTGAGCTGCTGGCCCGCTCAT^ 

CCTTGAATCGGGCTGGTTTCCAAACAGAAGATTTTAGTCTG 

ACATCAAAACACAGTTCTGTTCCTCAAAAAATTTTTTAAAGATACAAA^ 

TTC CTATT CAACAAAAGAGGG AAGAATGTTTT AGG AAC CAG AATTCT CTG CTG C C CGTGTTTCTT CTT CAACATAACT AT C ACGTG 
CATACAAGTCTGCC 

40 MOUSE SEQUENCE - CODING 

ATGATTAAGTTGACACIAGGAACATATAGAGGCCCTAT^ 

CTATGAAGAGTACACCAGCAAGCTAGATGCCCTTCAGCAAAG AGAACAG CAGCTTTTGG AATCC CTGGGTAATGGAGCAGATTTTT 

CTGTTTCTAGCrrCTGCATCAATGGACACaSTTAaVTCCrCCT 

CAAACTCCCACAGATGCATCACGGAACAACCCCAAGTCACCACAGAAACCT^ 

4 5 AGTGGTACGGAAAA<-n~i~i"iTCACCTTAGCAi"i 1 TGTGACTI i rGCCGAAAGCTGC 1 1 n CCAGGGTTTCCGTTGTCAAACATGTG 

GTTATAAATTTCACCAGCGTTGTAGTACAGAGGTTCC^CTGATGTGTC 
CC^TCTCCTTCAAAATCCATTCCAATTCCACAGCCCTTCCGACCAGCAGATG 
GTCCTCCTCAGCTCCCAATGTTCATATAAACACAATTGAGCCTGTGAATATCGATGATT^ 
ATGX^GGGTCAACCACAGGCTTGTCCGCCACCCCGCCTGCCTCATTACCT 

5 0 CCAGGTCCTCAGCGGGAAAGGAAGTCATCTTCTTCCTCATCCTCGGAGGACAG 

AAGTGATGACTGGGAGATTCCTGATGGACAGATTACAGTGGGACAGAGAATTGGATCT 

AGTGGCATGGTG ATGTGGCAGTGAAAATGTTGAATGTGACAGCACC C AC AC CTCAACAG CT ACAGGCCTTCAAAAATGAAGTAGGA 
G TG CTC AGG AAAACTCG ACATGTG AATATC CTCCTTTT CATGGG CTATT CT ACAAAG C CACAACTGG CAATTGTT AC AC AGTGGTG 
TGAGGGCTCCAGCTTATATCACCATCTCCACATCATTGAGACCAAATTTGAG 

5 5 CACAGGGCJATGGATTACTTACACGCCAAGTCTAATCATCCACAGAGACCTCAAGAGTAATAG 

CTTCATGAAGACCTCACGGTAAAAATAGGTGACTTTGGTCTAGCCACAGTGAAATCTCGCT 
GTTGTCTGGATCTATTTTGTGGATGGCACCAGAAGTAATCAGAATGCAAGATAAAAACCC^ 
CGTTTGGGATTGTTCTGTACGAACTGATGACCGGCCAGCTACCTTATTCAAA 

GGACGAGGATACCTATCTCCAGATCTCAGTAAGGTACGGAGTAACTGTCCAAAAGCCATGAAGAGAT^ 

6 0 AAAGAAAAGAGACGAGAGACCACTCTTTCCCCAAATTCTCGCCTCCATTGAGCTGCTG 

GTGCATCAGAACCTTCCTTGAATCGGGCTGGTTTCCAAAC^ 
CAAG CAGGGGG AT ATGG AG AATTTGCAGCCTT CAAGT AG 

HUMAN SEQUENCE - GENOMIC 

6 5 TGCCACCACAGCCCTGCTCCCACTGAGAAGTCTCCTGCTTATAAATTTTAAA 

AAAAAT CTTTGGG CCTGG TCTCATCCAAAACAATCCACTG CACTT ATGT C CAAAATGT ATT ATAT AACATGAAATGT AAAACG AT A 

AAATAGTCAATCTTCTCCTTAAAGTAACAGGGCTCCTCCTGGCTTCTGAAT^ 

CTAATAACTTTTCCTAGAGAGTGAATGATTGTCCTGTACAAAATCAGGCATGACATC 

7 0 TTCAAGCCATTCTCCTGCCTCAGTCTCCCGAGTAGCTGGGACTACAGGTGCCAACCACAC 

GAGATGGGGTTTCACCATGTTGGCCAGGATGGTCrCAAACTCTTAACCT 

GATTACAGACTTGAGCCACCACACCCAGCCTATATTTATGAAGAGA 

TTCTGAATAATGTAAGAGACyvrACATATATATATATATATAT^ 

AATATTTGCCTCCCAAGCCATTAGGTTACTTTAATGTATAAAATATATAAACTGTGATTGACATATATACT 
75 AT AT AG CT ATAAAATG TT AAATT AAT ATTTC-G C CT AT ATT AT CT AAT AATfi &GCT ATT AAAAGCATAAGG AAATG ATTAAGG CAG A 
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TGCTGAAACCAGAACTT CAAACCCAGGAGCAACTGAGATGG CCAG AGAAG T ATCAAAG T AATAAAT AAAAGAG TTTT AAAAGGG AA 
ATATTCATATTTTCCTTCAGTAGAGGACACCTrAATGTGGAAAGATG 

AAACTTCTGAAGT AG CTGAACTCAATTACTG AAAATGATTACAGGG ATT CAATGAAAG^ TT AT ATCTG CAACAG AGAA 
TAAAGGATAGAATTAGTATTTGTCTTTAAATAATAAAGTTAAAAGGTGGTAACTAACT 
5 GCTGATGAAGCAGAAAGTGGGAATTGTAGAGCTCAGCT^ 

T CT AGT AAGT AGTTGGAT ACAACAGGTTGAGG C CAGGG AAAACATT ATTGTAAAT AAAAC CAATGGAATGGGTAAGAGGGT AGAAG 

GAGTTCTGCAACAGACCAGATGTTTGTGTTCCCCCCCAACAAGTTCATAC 

CITTGGGAGGGTGATGAGGATGAAGCCCTCATGAA 

TTCTGCCACATGAGAACACAACGAGAAGACZAGCAGAGTGTAACACAGAAGAGGGCCCT 
1 0 GATCTCAGACTTTAGCCTCAAGAACrCTGAGAGATCATTTCTC 
CTGAACTGACTGAGACAAGTTCTGTAAGAAAAGAAATAGGCAA^ 

TCAAAACTGAC CCCATT AAGG AAAC CG AGAAAG AACTGGC CTTCAAGAAAAGAAGCAGGAGT ATG T AGCATCATG ACAT CT AAAGC 

AACAGAT ATTTTCAAGAAAAAGGAAAC AGC CAAAAGT ATCAAATGCAAAA^ CAG AAAAGG 

AGGTAAGAATGAACAAAAATGCATAGAAGTTTATAGTTGTAATTTGAAGCT^ 
1 5 TGGAGTGGG AAG CAAG ATCTCATAAAAACATAT CT AT ACCT AAGGAT CAAACACAAGAGCAAAGCACAG ATTTGAAAAT CGGCAT C 

AGAGAT ATT AAACACACAAAAACAGGC CAGGCACAGTGGCT CACG C CTGT AATCC CAG CATTTTGGGAGG CTG AAG CAGGCTG AT C 

ACAC GGT CAGG AGTTCAAGACCAG C CTG ACCAACATGG TGAAACCCCATCTCT ACTCAAAACATGAAAAT C CAG ACATGGTGG 

CGTGCGCCTCTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTT^ 

CGTX3CCACTGCACTCCAGCCTTGGCGACAGAGGGA 
2 0 (nTCTGGTATCTCTAAAAGTCGGGGTACTATTAGA 

AAATGTTAAAACAAACAAAAAAAATCTAAGACTGTCATTGTTCAGATTATATA 

CAGTAGCTATGCGATAAATATTTGTTGAATAATTGAGGAAATAAAAAAATGGAACT 

AACCAGGAAAGGTTTTGCTGTAAATGTTTATAATTAAAAAAGC 

TATCGCAAAATTCCTACTTGATAGGTAGAGATACATTACAAGGTCTTATTAA 

2 5 GCTCGGCATG^TGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTG 

GCCTGACCAACATGGAGAAACCCCATCGCTACTAAAAATACAAAATTAGCCTGGTGTC 
TGC^AGGCTGAC^CAC^AGAATCGCTTGAACCTGGGAGGCAGAGGTTGCAGTTAGCTGAGA 
CAACAAGAGTGAAACTCCGTCTCAAAAAAAAAAAAAATATTGTCTTGCAGACT 
TAATTCCAAAGTATCAGAAAACAAAAAGCATTTACAAAGTAGATTTATCAGGAATAGAGT^ 

3 0 AACAATACAATAGTAACAGTAATAGATACCTAACACGTTGTTAATTGTCT 

TTAATACCATAAATGTATATTATTTTTATATTACAAAAGAAAAATTTCTTTC^ 
ACITAAAACATAGTATTACAGGGCCAGATCTAGTGGCTCGTGCCrATAATC 
TGAGGCCAGGAGTTTAAGACAAGCTAGGGCAACACAGTGAAAGCCT^ 
ATTACAAACAATTACTATATCCTTCATCTAGATTCTCCAGTra^ 

3 5 T CTCTGTCTCTTTTTCTGAACT ACTT AAAAATTG AAG ACA CATC C CT AAACACT ACTT C CTCATAAAAT AAAG AACAT 

TAATT AAATT AATT ACCATT AACTCAAAATG AT AACTTGGT AT ACAG C CCAT ATTTAAATTTCC C CAATTGTC C CAAAAAT AT C CT 

TGTAGTTTATTTTCTTATTCAGGATTCAGTTGTTTCATG 

AATCCCCTCTGCTTCACTTTTTTGTTCCrCATAATACT 

TCACTATTTCCTTATTATATTCAGGTTAAGCATCrTTTTGGCCCAAATACCTC 

4 0 CGTAATGTTCAACTCTATCCCTGTTTGTGATGCTAAATTTGTTAGCTTGAG 

CTCCCrTTGTAATTGATAAGGTCTGTGTGATGGTACTATGAA 

ACTTTT AG CAA CTCTTG ATGGTT CTG ATCTGGAG T AGTTTTT AT ATTGGCGGT CG CAAAAT AG T CATTTTT CTAAATCCGT CATTC 

ATTTTTTCTATAGTAACTAGAATTCATCCTTAAAAGAAACTTCACTTC 

TTCrCTCTCTCATTACTATC<3ACTTACGGATTTAAAATAATTAAATATATTGCAACC 

4 5 CTATTGTCCCATTTTGGACCAATGGC^GTCTTGTAGTTGGCTCCTTTTGACATGACCT 

AAGTATACCAAGGCTCATCATATGCAAGCTCAATTTATAAATTTCCT^ 

GGTTACTTTT AGTGG AG AATGG CAT TT CAAAATCAAG ATGC AGG T ACCAGGT ATACTT ATT GTT ACTC CT AAG CACC C CTTTCAAT 

GGGCAGAGCTAAGAATACATGAATACACACACACACACACACTCCTATGTTCA 

GCATTCTTCTTCAATTTCCCTTGTTCTATATTTGTTTCTC^CITTTrCC 

5 0 CTCATTCAGTTCTATTCC^CAATAAACACAAAATTGTTTCAGAATTACAA 

AAGTTAAAGATTTACTTGTAGTTTGTTTTGTCCAAATTTG^ 

AATTTATAGGTCTATGTTTTAAGAGGAACACTGAAAAGAGTATACAGCAAGAAGAGAA 

AACIAGTTTCCTTCAAACACrTAAAAGCAGTTACACAGAATAAG 

CAGAATATAGATGTCATAAAGAG<3AGGIATTTCAATTCAACrTCAGAAA 

5 5 TCATTAAAGTATTTAGACAGAGTCTGGAATTCCACAAAGGGGT^ 

ATAATTCTATTTTCTGTTCGCACCCAAATAGCTTCTGGAAGCAGCCCTCCATTAAACAATACAGGAG^ 

TGACCTTGATTCATATAGAAAATATTGAACTATGATCAGTTATACATCA 

TGCTTGCTTGTTTTCCAGTAGTTGATCATATCAGTGTGGGATCA 

CATCCATTAAAATAGAATGCAAGATTCGAGATAGAAATGCTGGTAGGAAATAAACTTGTGTCCGGGGAAGA 
60 CAGTAAAGCTCTGAATAGTGTCTCAGTCATCAACTGAAGCAATGCTTGAGCAGTA 

GTGAATACATGAATGCCTTAGGGCX3AAAGGGGATTTTGAAGAAAACCTAAAAGAGCTGTGTTG 
GAATGCAAGATAATGCTGCTTTGATGCCCATATGGCACATTACAACACAAAAGGGAC^ 
AGCTTAGCTCATTATGGGCTATGC^CTGCAGGGACZACTGTTTTGTGTAGAGGG 
GGGGGTTTGGGGGAGGTGCTCAGAAGACAGKSCTGTGTCCAATGGCAAAGGAOT 

6 5 AAGCTGAAAAAAGAAGGTAAGGGCTTGTTCATTAGCATACT 

GGGCAATTTATAAAAGAGGTTTAATTAGACTTACAGT^^ 

GAGGAGCAAGTCAACGTATTACACGGATGGTAGCAGGCAAAGGGAGAGTTTGTGCAGGGAAACT 
CTTGTGAGACTCATTCACTATCATGAGAACAGTGCAGGGAAGATCTACTCCCATAATTC^ 

7 0 GAGAAAGTTGACTGGTGTAGAACGCTTTGGAAGAAGGTTTGTGTGCGGCTC 

TGGTTTTAAAACATC^CTGTAAATTCTTTGA(^CTTCATTGAGAAGTG 

ACTGACTTG CTCATAATAT AAAATGATGCAGCAGAAGTCATGCTGTGTGGCTTTGGAGG CTAGATCAT AAAATG CAATTCAGCTTT 
AACCTTGTTTACTGAAACACTAGCATTTGAAGCCCTGAGCTGCTTCGTAAGAAGTCTGACTA 
AAGCTCAGGCCACATGAAGAGTAGATTTAG^TATTCTGGCTGACTGCCCTATCTG 
75 AAAATATGCCTCCAAAGOLATTTCAOTCJT^ 



1795 



WO 03/008583 PCTAJS01/51291 



GGAGCAGAGATAAGCCATCTTTGCTATGCCrrc 

ATGTTIGATCCTCACAATGATAGTTTGGCAGGTTGGGCTAGCATTATTA 
x - ACAGGGATTTAAGCTTACCTGCTTTGAGATTGTA™ 

CTTATGCTTTCTCTCGTTTCTCATCTGGATACTGTAGGACTTTAGAAT^ 
5 AGGCAAGGTCAGAAAATGTCAAATGAGTAGAACGCTAAGAAGAATCAATATTAA^ 
ATCATGGAAGATCTGAACTACAGACTGATACTGACAAATAAGGCT 

AACTTTT AATGTTAT AAGGAAGTGTT AT AATGAT ATAACATTTTT AG CAG CT AATTTTTT AAAAATG AAAT AGGA CAGTT AG AG AA 
TTAACAAAAATCTTGCCAGTTGTGTTTCTGATGAAAGGGTGATGA 
GTCTGCTTCTAAGTATGTCCTTATAGTAAAGGGTGCTTCAGA 
1 0 CATTTGCC^GCCACTTTTTGATTTAAATTTATTTCCT 
GCATACCATGTTa^T GniTlTn 

ACTGCAACCTCTGCCTCCCAGGTTCAAATGATTCTCCTGCCTCAGGCTCCT 

AGG CT AATTTTTTGT ACTTTT AAT AGAGATG AGG TTTCACT ACGTTGG CCAGG CTGGTCT CAAACTCCTGG CCT CAGGTG ATCCAT 
CCGCCTCAGACTCCCAAAGTGTTGGGATTACAGGCGTGAGC 
. 15 CAATGCAG AAG AATGTT AGT ATTCTCATTCAT CAAG CAG CCT CACACAGCATC CJT1T ACCTGAGAGT AAG CAT CAGCTCCCAGG CT 

TACGGTGACCCTAATCAATATGAAGACACCCTGTTTCAGAAGTCAGTAGGAC^ 

AAGGAGGTCGCTGGTGACTTCCCCCATGATGGGAAGTACTAAAAGG CAGAATC CACTT AG TTTTT CATGACTT CATGTTT AGGTT A 
CTTTGGTGATTCACTCCAGGCCATTGGGAGCCCACAGTAATTGTT^ 
ATTATTTATTCTTTATTAGATCCATGTATTTTTAATAATGAATTGAATT^ 
2 0 CTTCTCTC C CAATTGTTTTCATGCGT ATTTTG CAGAGAATGCTTGG AATATTCTGTT CATGTlAAGGCACAAAG T ACATAAAAT ATT 

TTGAGTGATTTTCTTCAAAAT AAAAATGATTAGTTTTGATGCTT CCTTGTACAGG CCCT ACGGC 

ATGGAAAGTGCTCAACTGATATTTGTTAAATGAATGAATCTGGA 
AAAAAGCTTTCAGACrrCCATTTAGACTCGTTTTAACTGGCrc 
GCTGTTCCTAAATATATTTCCATCAAATTCTGCTTGGCTGTTTCAAC 

2 5 TCATTGCAGTGGAAATAGCAAGTAGTATTGGCTTCCCCGCCTTCCTCCCTCTAGGATT^ 

TGCTCTTCTAGTAAATCCCACATTACAGAACAGGCTGAGGACGGAGGAGACAAGAGGAAAGTA 
CGGAGCG<3AGGCAAAGTGTGAGAGCGCGGGTGGTTAGAAGGGAAAAAACCCTCC^ 
ACGTCACGCACCGGGGTGACGTCAOGACCGTGACACGCGGGTGACGCCGTT 
CCCGCTCCCCCCCGCACCCCCGCCTAGCGTCCTTCCCCC^TCCC 

3 0 GGTGGCCC^GGCGCCCTCCXSCCCXK^CGCCGCCCGGGCCGCTCCTCCCCGC^CCCCCCGCGCCC 

COSCCTCCGCCTCCCCCAGCTCrCCGCCTCCC^ 

TGGCGGCGCTGAGCGGTGGCGGTGGTGGCGGCGCGGAGCCGGGCCAGGCTCTGTTCAACGGGGACAT 

GGCGCCGGCGCCGCGGCCTCTTCGGCTGCGGACCCTGCCATTCCGGAGGAGGTGAGTGCT 

CCGGGCTCGGCGGCrGGCTGGTGTTTATTTTGGAAAGAGGCGGOXSTGGGGG 

3 5 CGCGGGCTGGGAGGTGGGCATCGCCCCCGTGTCCCTCTC CGTCATGCAGCG CCTTCCTACGTAAACACACACAATGG CCCGGGGGG 

TTTCCCTGGCCCCCACCCCAGATCTGGGG ATTGGGG CAG CGGTGGTTGAGCGGGAGGCT ATCAATAGGGGGCGAAACTC AGGGTTG 
GTCCGAGAAGGTCACGATTGGCTGAAGTATCCAGCTCTGCATCT 

GGTT AGTTGCTGGGGCTGAGACAACAG CCCGAGTTACTGTCG CGTGT AATTCTT ACATGGTCGTGGGG ATGATGGGG CTCATCATT 
TCCTCTCTCCTCTCCCGGACTGCCCCOCTTCTC^TCCGCTG 

4 0 ACC CAG CAAATT ATTTTG ATTTAGTCTTTACrrrri'CAATC CT AAATCGCAGTTT C CG ATGCCTTTT CTCXSTCT CTGG TCCTCTGT 

TCCTAATGTTTGTCAGCGCTCTGTCGCTGATTGGTAACCCCCATTCTATTCCCATCTACCGCCCGCTC 
CTGCCTGCCTTCTAACCCCAGCTCCCACTTAAGAGCATT^ 
CACCACTCCCTAACATTGTTGTCTGTTGTTATCTTCACAAATCCTCCT 
TGTCAAGATTCCATGAGGTTTCACTTGGCTCrTTT 

4 5 CCTCITTATTCCTACCCATTTTTATAGGTCTGATTGATTTCrT 

TCCTCTCCAGTCCTTGCTTCTCTCAGCCTGTTrriGCATTAGTCAGCCTCTTAGCACT 
CATAAAATTTATTACAAATATTTGGTATTTTATTACAGAAAATAATACTT^ 
GTGGTTTACAGTTATTGATTAGGTAATGTGACTTACTCTGTTGACTTTGCTCGAAGT^ 
TTTCTCAATTCrCATAACATCTCATTCrCrCTGCAAT^ 

5 0 ATTCAGCATCAGCTTGTTTATGATAATGTTTGTTTTCT^ 

CATTTGATCTGTTCTAAAATTTCTCATTTTTAAGTTTCTTAAA^ 
CrCTCATATCTGTTCCTTTCCCCCAGCTTCTTGATTTCTAAGGAGA 
CCTTGTTTCCAGTTACCATTCATTTTGCCTTGTTTCCTGGCTTTTGGTACT 
CTCCACGTTCCTTCTTATTTATAAACATCTTTGITTCCTTT 

TTTACTCTGTAATAGATGTGTAGTTTTGTTGAATTGTATCTTG 

TAGGGGTTATGAAATTGTAATTCTCTGATTATAAAATTGTTTATCTTGGGAACTTTGCTGCAG 
CTGTAAAGAAGGCTTTTGTGAAGTAAAATCTCTACCCTTCTATTTTAT^ 

GAGTCTGATTTAGTAAGTGAGAGCGAGGGCCATGGATTTCTGTATTTGGCACATGTCTTGAGCAGTTCCCATGTACCA^ 

6 0 GAACCTCTAGGCTAGCTGAATTTAAGTATAAATTGCCAGTAATC 

ATTTTACTTGGTrAAAAAACAAGTGTTTCCTACTGTCCTCTTT 
ACATTTTTTTCTGAACTGATAATAAATGTATTTGGCTGGGAGGGTGACA 
TATTACAGATGATAAAATAAATTCCTCCAAGAAATAATTTTAA^ 
TTTTATTTGAAGTCTTCCTTACATTCACTTAAAGGGATCTTTTACTGCAAATTAC^ 

6 5 AATGACACATTTTATTCTGAAGATTTATTTTACCTAACAGTAAAATGTAG^ 

ATTTCATCTAAAAAAAAAGTCTGATTTCAAAGGGAAAGCAAGTCATTATCAAAAATTAGAAAACTATAAGTACAA 
TCATCAGTAATTTTGCCACTAAGATATTATTACTATAGACATTTTGGTGT 

ATG T AGTTTTGT AT TTT AAAAAACTT AAAGC AAAAATTT CT ACGT ATT ATT AGACATACTG TGATTT ATTT AACT AAT C ATT! T TT 
TGGGGTGTTAGGTTGTTTTTAATTTTTTACTGCCATCAAACATCTTGAACAT 

7 0 GAATGAACAAATTTCACATCCTGTATTTGTAGTATTAATACTTTGTAGGTGCTCAAAATAGAAT^ 

T ATT AAAT ATTT ATCAAATGAATGTACTTGTACTTTTGGCATTAA^ 
TATGGCCATATGGTATGAAATAGTGTATGGTATAAATTAACCATATC 

ATATTT ATT AACTGTTCrTCCTGTGCTGTTTCTGT AG AAGGG AG CTTCTCACAATTG C!ATTAGAATTACAATTTT ATTATGTTCTG 
TTTTOVAGATCTCTGATCGTCAGTCTTAAACTGTTTAATT 
7 5 AT AATG AGGT CCACTGT ATT AAAATACATCTTTG CAAG CCACACCAG GTTGGATTG CAT CATAACC CTG AAAAG TGGT ATT CTCAT 
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TAATGCAGGTGCTTGTGCAGTTTTGGCTATTGCTGTT 

AATAGTTGTTTGTCTTTGGTATTT CT AGTGTTCACTTTCT AT ATT CTTTT CT CTCTC CTTATTT ACTG AACTCCCTTCTTT AGGCA 
TCCACTCACrCCTTTTTCrGTrTAGAATATTATCTGTCAGTCATTTT 

CT AAGAAAGG AATGTTGGACTGGGTGCITG AATCCTTG AATGT AG TAAATGTG AG TG CAAACTTGATTT AATTGT ACATGT ATTTG 
5 GATAATAGGCCAGAAAAATTACATTAGGGTAACAGGCTA^ 

GAATGAATAGGAGGTGGGTCTGGATATAGCAGCTGGAAACCTGTGTTCCATGA^ 

CTGATri"! rGGCATl^'l'CTAAGACCTGATGCCCACCTTGTCAGAGAATGCGATGACTACi"ri iX5TGi J l , C"rTCC'l"lTTCCL ,, l ,, rr , I"rC 
TCCCAATTATAAAATTGTTTTCTCTTTCAGAACTC 

TACCTAGTGAACATTTTGTGTCATAATCCCC'l T G'ri I IATGAAATCCAGTATGGTCTAGTCACCTTACATTTCTGCCTCATATTGT 
1 0 CCTT AAGCCf TTTTl'l GT CAGT AG CTCTT ACT AGATTITGT CTTCATCAG TT AAAGTGTTTT AAGTC CTTT ACTCATTCTGTT 

TCTCTATTTTAACTTACATTGGTTATTCTGTAAAGTC^ 

TACrTGGGAAGCTGAGATGGGAGCATCACTTGAGCCCTGAA^ 

TGGCAACAGAGTGAGACCCTCCCTCAAAACAAACAAACAAACAAACAAAAA^ 

TCAAAATAGGTGGGCCTATTTGGGAGCTrTTCTGTTTTTAA^ 
1 5 CATGTTAGTGTAAGAAGAAAAATATAGCCTCATTTGTTCCATTTCTTTCT 

CGATTTCAATTTTTCCAGCCTAAATCACTAACATACTTAATTA 

ACATTAAGCAATTATTTTTCCCCCCTACCCTCTGAGATTATTTTTGTG^ 

TTCTAATTCCTTCCIGTATCAAGACTATGCATATAGAGGGAACTCAATGCC 

ATATAAAAATTGCTTTGAACTCAATTAACTTTATATCIT 
2 0 TGTAGCTGCAGTTCCTGACAGC^CGTTTGGGACAAATC 

TTTCGAAAACATTCTATCATTGTAAATAATAATTITTTTTT^ 

TGCAGTGGCGCCATCTCAGCTGACTGCAAGCTCT 

GCTGGGACTACAGGCGCCTGCCACCACGCCCAGCTAATTT 

GTCTCAATCTCCTGACCTCGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCT 

2 5 ATAAAAATTCCAAATTGCACTAATGCATATGTGAAACTGTTTTTGTCTGTTTTG 

GGCACATAGGTACTAAAATATTTGGTGAAAGAATTAGTGAATAAAACCTTACTGGATATGAGGTGATCTGA 
TATTCTTTTTTTCTGCCGGTCACAAAATCACCrTATGAAATT 
TGGGGTTTCAAGGACCITCTTCAGAGGTTACCCCAGGGCrCTGTTTAGT 
CCTCGTATCTACCAGTCCGTGACCTGTTAGGAACCTGACrrACACC^ 

3 0 CTCCACCTCCTGTCAGATCAGCGGCAGGAGCGTGAACCCTATCAGAAA 

AGAATCTAATGCCTGATGATCTGAGGTGGAACAATTrCATCCTGAAACCCCCT 

AAATTGGTCACTGGTCCCAAAAAGGTTGGGG AGTGCTG CCCT ATACCAT AACT ATTGAAGTTCTTCATTTATCTG CTTTACATGTT 

AGTTTCCTTGAAAAAAGGGTTTTATGGCrGTAAAAATTTTAGAGCTA 

TCAGGTATCATCTGTGTAGAAAATTTCAACAACTTGTGTCTTAAGGCAGAG^ 

3 5 AGGCATTGTGGGACATACAGTTCCTCTCACAATTCCTCAACAATACTGTAGTAGCTC^GAAGTAGCTA^ 

AATGAGTTTGACTCTGTTCCAGTAGTCATTTTCAGGACACTGAAATTTGAATTTCAT^ 

TTTGATTT'l'r I ' T TCTACC ACTTAAACATGTAAAAAGTATT CTTAG CTTGTGAGCT ATACAAAAGCAGAT AGTGGGC CAGTTCATGG 

GCTGAACTGTTCTGACTCCTGGTCTAAGGGAGACTATATATGTTAT^ 

AGAAAGTTCATAATTGCTTTTGTAAAAACTAGATTGATTTATTTGAGAGGGAGAGAGA^ 

4 0 GGJVCTTGAACTCTGGGCT CAAGCAATCTTCCAGCCTCAGC CTCCCAAGTAGCT AGGACTACAGGCATGCACCAAGCCCAGCTCCTA 

GTTCCTCTTTGCACTCAGTTTGATCTTCTACCCTCAGCCCCTC 

TAAATGGAGTCATATAAAAGTCATATCAGTGGGGCCGGGCATGGTGGCTCACACCTGTAATTTCAGCACrTTGG 

GGCAGATCACCTTGAGGTCAGGAGTTTGAGACTGGCCTGGCCAACATGGTGAAACCCCCT 

AGCATGGTGGTGGGCACCTGCAATCCCAGCTACCTGGGAGGCTGAGGCAGGA 

4 5 GAGCCGAGATCGCGTCATTGCACTCCAGCAATTGTGAAGCAGTGGTTAAGGTT 

TT CACATTT AT CAAATTTCTTTTGAATCACCTTGGCACTTTT ATTG AAAT C AATTGCTT ATGTTTGTG AG TTTTGG ATTCT ACT GT 
TTTATTGATCTGTGTGTGTTTTCTTAGGCTAATACCCCACTACCTTAGTTATGATAGCT 

TC CAG TTGAG AAGATTTTTAGAACTTTG ACTTTTT AATTTCTTGTT AC CAT ACTTTTGGAAGG AG AAG ACTCAT AATTTT ATTCAG 
GTTTTCAAAGGTGTCITCTACCCCCAAATACrrTAAGACriT'CAGT^ 

5 0 GTTCATGAAAAGTGACAC^TAAAAGTTGCAACCAAAATTAATTTC 

TCAATCACGTCCTTTTTACCGTGTTGCATTATGAGGTGT 

ACCAAAAAAAGTCACCGAAAGTGTCAATTAGGTTGTGAAAGATACTAGTTTACAATGTGTTATTCTC^ 

CrTCCATTTGAAACCTAlOTCATGTTCrGTGATATTTCAGAGTTAGTATCT 

TTCITTCrAAGGAAAGGCATATGGTAGTAGTTGGTCATATTTCTACCTTT 

5 5 AACTGC^CrGATAGCTTTTTACTGGTTAGCCOVGGGAATTAT 

TAATTTAAAAACAAAATAACAGGAAACACATGTTTCTAG^ 

CGAAGGTAGAAACTGACTAAGTTTTGACCATGTTGCAAGGAGTATCGTTCCTIGTTTTGTTGGGTC^ 

AAATCCrTGGG TTTTT TT T GTTTGTTTGTTTGTTTGTTTTGGTAGAGATGCTGTGTTG 

AAGCAGTCCTTCCCAGAGTGCTGGGATTACAGGCATGAGCCACCGCACCCAGCATCACCTTGCCG 

6 0 GGAGAGAGTAATACAATATCTTGGCAGACTATTTTTAACTATTGTTCCAAAAI^ 

ATCTTTAAAAGATACTTTAATTAGGCATGCA<^AAGCCCAGCTCCTAGTTCCTCTTTGC^ 
CCTC^CAAACACTGGTCTGATCTCrGTGCCCTTTCC^GAAAGTCAA 
ATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAGGATGGGTGGATCA^ 
CAACATGGAGACACTCCGTCTCTACTAAAAATGCAAAAAATTAGCCGGGCATGGTGGCG 

6 5 GGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCGATATTC 

AGAGTGAGGGAAACTCCGTGTCAAAAAAAAAAAAAAAGATACTTTAATTATATTTAAGTTG^ 

TCTTGCACTAAATATGTAACCTTTAAGTTAATAACACCAGA 

GTTAGGATACTTTGAATTAGTCACTTATTTTGGCATATTTTAAACA^ 

T ATTTTT AAACTGT ATTTTTT ACCTTG T AC CATTT ATTTTG CAAAAT AAATTT CTATTTTGG CTTAT AGATGT ATTTTT AATATT A 

7 0 AAGGGGATTATTGGTGTCAGCCATAAAATTGTAATTAAGTCTTAAGTAGGTATGG 1" 

TCTCGCTCTATCATCCATGCTGGAGTCCAGTGGTGTGATCT^ 
ATTCTCCTGCCTCAGCCTCCAGAGTAGCrcGCACTACAGGCGCATGCCA 
TGGGATTTCACCATGTTAGCTAGGATGGTCTCGATCTCCTGACCTCATGATCCGTCCGCCTC^ 
AGGCATGAGCCACCGTGCCKK3CTGATTATTTTTATAGAGCTCTTGTTAGC 
75 TTCCCATCJVTAAAJLATATCCCATCTTTTCT 
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TACT ATTGATAATT ACT C CT AATTAACT CAGAACTJ.' 1 1 T1X3TTTTACACACATTAATATATACTTCCATGGGAATAGTGTCAGAGA 
ACATCZAAATAGGGAAGAGATTATGATTCAGAGTGGTCTTTATATTCCT 
TTTGCAGGATTGCCTCTAACACTGATGATATTCCACTCATAT^ 
CTCAAATAAGTTTCATTATACATGAGAACTCAGTTGAATGCTTGC 
5 T ATGCACTG AACTT CT ACT AGG C CAT ATG TGTT AAGT AGAT CAT AGGAGTGCT ATG AAAAT AAAGTG AAATGATGAAT ATGT AAAA 
GCCITCACAAAATTATAGTAGTGTTTTCTGTTGATTTT^ 

CTTTGTTCnTl'lTin^TTTTGAATGTTCAATATG CTTAGCACAGGG CTCTAGTTAACACTTTTGGCAGTTCTTAATGTGGGACTG 

ACTGATGATTGTCCTAGAACTGTTTCAGTGTTAACT 

GTGGCAACATACTGATTCATGTGTTCAGTAAGCAATTC^ 
1 0 TTAT AGG AAAACTGGT AATT ATGACTTGTGTTTT CCC 

AGCACTTTGAAAGGCCCTGGCGGGTGGATCATGAGGTCAGGAGATTGAGAC 

AAATATACAAAAAATTAGCrGGGCCTGGTGGCATGTGCCTGTAATCCCAGCT 

CAGGGAGTCGGAGGTTGCAGTGAGCCAAGATCGCGCCACTGCACTCCAGCCT 

AAAAAAAAAAAAAAAAAAAAAAGCrGTGATTAACATTTGCTTTG 
1 5 TTTCTTAGGG CAACAGGTATTCATGTATTCAGTAAATATTTGAGTGC CTACT AT ATGCCAGGTAGTGATCTAGGTGCTTAGTAGT A 

CACTTGAAAACAAAACAAAGGTCTCTACCCTTATGTAGCTC 

AAeCTTAGACATACAGAAAGGAAATTATGTAGTATGTTCA^ 

TGGGATGGGGGTAGGTAGCAATTTAAATAGGGAiGGTCAGGGTAGGCCTCACTTGAGA 

GGAGGAGG AGG CATTT CAGATAAACCAATT AGTTCAAAG ATTCTGTGT CGGG AATG TGCCTTG C CTATTT AAG AAACAGCAGGAGG 
2 0 CCAAAGTTGCTGGGGCAAGGTAGAGACTAGGGGATTAGGGAAG^ 

ACTGAGAAGTAGTCAGATTTTGGATGTTTTGAAAGTAGATTCATCGGC^ 
GGCaSAGGCGGGCAGATCACCCGAGGTGAGGAGTTCGAGACCAGCCT 
AAATTAGGTGAAATATTGAAGGAGATGTTTTGATTGAAGTGATTTTAAGAGAGAAGA 
TCCTGTAAAGGGGAACAGAGAAATGGGGCCAGAGCTAGTGAGGAAAGTGGGGTCAATAA^ 

2 5 GTATGATGATAGGAATGAGCCATTAGAGAGTAAAACGTTTCAGGAGGGAGAGAGAAGAATTG 

AAGAGGGAAAGGGATTTAGTGTATAAATAGGAAGGATTGGCTTTCT 

GAGTATGTGGTTAGAAATGCTGCTAGGAAGGTCGACGTGATTGGTGGAGTCTGTACACGTTCTGTTG 

CAGTGAAGTAAGACTTGAGGTTATCAGCCGAGAATGAGGATTGGGGATATGTGAAAGT^ 

GAACrGGAGACAAATAGTAAAATTAATGGGCAAlTTlTTTTCCTTTAAGAGATGGGGGT 

3 0 AGTGGCACAGTCATAGCTCACTGCA^CCTCCAACTCCTGGGCTCAAGTGATTTTCT 

AGGCTAATGGCAATATTAAGTTATITTATGAGTTGTCTAGACAGCAT^ 
GAGTTAAATGTAACTTGCCC^GGCAATTTAAACACTCAATATGAGTCATTT^ 
GAACATGAATTTATTACrCCTTAATGAAGTACCTGCCACTATCCTGC 
ITlTCTTTTAAAAAAAriTTCTTCATCTTTTCT 

3 5 TTTCTTTCAAACACTCTCCAAATTTGACTTGTCTTTGCTCA 

CCTCCAAACCTTCCAGACTTACCTCACCTACCGTTTATTC 

TGCTTTGTG CTTTTGTAG AATCCCAG AGTTTACCTTGTTTT ACTCACTAT ATCGTATTGTG^ C 

TTTTTCCTCTAGACCTGCACTGTTCAGTCATACTTrCTGTGATG 

GTATCTATGTGGCTACTGTGCACTTGAAATTTGCATAGCAGGACTGAGGAACTAAAT^ 

4 0 ACTAGCCTGATGGCTAGTGGCTGCTTTAATCAGTGCAGTTCTAGACCCCTTCAAAGTCAAGAGTGTGTGG 

CTCGTTGCTTAACAATAATACTGTCTAATACTTAAATGACTGAATTCTTTTTCGCT 
AGAATATGGCAGTTTTGTTCTCTAGATTTCrrGTCTTTAACCATGGCTTTTCAGTT^ 
CTGAGTTCTTCAGTCCTCCGAAGTTAGCGTTTTGGTTGGATC^ 

GGGACGACTGCTTGAAGCCAGGACTTTGAGATCAGCCTGGGCAACATAGCGAGCCCCTGTCTCTAC^ 

4 5 TGGCATCGACATGTAGTCCCAGCTACTTGGGAGGCTGAGGGCGGGAGGAT^ 

TCACACCACTGCACTCCTGCCTGGGTGACAGAGCAGGAG 

AAATTGGGTGTTACAAGATTACCTCAAGGACTGGTCTGAGAACTGGGG 

GTGGGGGTAGGTGGGCAATTTCTGTTTCAACCAAAGCAGTTCTACT^ 

TGGAGGGG AAAAAAGTCTTCT AAACAAT AAC1ACTGTT AATTGAAG AGAC AAAG CATG C ATATGG CAG CACGTGATT AAC CACC AAA 

5 0 GTGGATAACAGATCAAGAAGACATGGGAAGTTGTTATGGGCTAGTGAGGTCTTGA 

AGGAAGACAAAAGGATGTAATAGGCAGTGGGAATAGAATTTGCAAAGAATTGGAGTTGG 
AACAATGAGCTGATTATAGTTGAGCAAATTGTATGAAATAAAATTTGACAA 

GG CTGACAAAAAGTGCACTTTATATTGCATGCAATGAGATGAAATACTCGGAGGTATTTTGGTTGTTACAAAGAT 
ACACTATTAGCATTTAATGAGAAAGGATCAGGGAAGTCAGCCATCCCACA^ 

5 5 AAATCACCACATGACCTAGAATGACTTGCTGGACTTTGATGATC 

CAGTAACTCCATTTTGTGTATGAGGCACTTTTTTGTTGTTGTTGTTGTTGT^ 

CTCTTGTTGCCCAG<3CTGAAATGCAGTGGCACCATCTAGGCTC^ 

ATTCAGTTACCACAATATTGGAGGAATGTTAAAGTTTTTGTTCAGAA 

ATTGATGGCAGTGTCACTTGTGCTGTATTGGTGACGCAAATACTCTTCAGTCT 

60 GTATATATGTACAAAGTACCATCAGTGTGTTATTTATTATAGTAATAGCCTAAAT^ 

TGAATTACATTTATTTTTTATTTTTATTTIT 

AACCITCGCCTCCCXSGGTTCAAGCGArTCTCCTGCCTCAGCC^ 
AATTTTGTATTTTTATTAGAGACGGGGGTTTTCCATGTTGGTCAG 

GGCCTCCCAAAGTGCTGGGATTACAGGCATGAACCACTGTGCCCAGCCTCT 

6 5 AATAGTTGTTGTAAAAAGGAGAAATTGATTTCTGTTTTCAGTATTGGAAGACTGGGT^ 

GAAACAGTATTAAAAAAAAAAAAAAAAAAAGAAAACGTAGAGCAGTTGCAGAGCTGAAAAGATAGT 
TCTAGGAATAAACAAGAATCCAAATAAATAAGTGGATGAAGTAGCTTTTGCCCT 
TTGGTTTTGGTGGACCATTGGGGTGAAJGAGGACAGAAATCAAGTCCTAGAGTTCATCT 
GTTTTAGATGGGACCCCAGTCACTCTATCGTTAGGTTAAGGGTGAAACAACTG 

7 0 CAGGGAAGGTTGAATGGAGCAGGAGGAGGAAAAGGAAATTAAGAAAAATAAACCTT^ 

AGTCATTTGCCTTGAGGGTTTGCTCTGTCTGGGTGGTCCAG<3GACCCTCCATGATA 

TGGTTT CAG ACT AGT ATT ACTT CC ATGG CCTGG CAG ATGG AAAGGCAAATCACGTGTGG AG AAAGG CCTTTCIATT CTAG 
CTTCCAGTAATTTTTCAAGGACAATGAACAGTACATTGTCACAAGTA^ 
TACCTGAGAGATATATAATATGTAATATATATGTAGAATTCTT^ 
7 5 ATACCTAAAATTACGTTTTTACAG AGGAAAAATAATAACCAAGCATGCATCCAATTGGC^CTGTGAATAAGAAGAGAGCAGAAACA 
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GATCTATAAAATCTTCAAATGTTGCAAATATCAGATG^ 
GAACAAAAGACTCTTATAAA<3TGACTTAG<K3CCAGG 
GGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTG^ 
GTGGTGAGGTGCACCTGTAATCCCAGCTGCTCAGGAGGCTGAGGTGGGA 
5 CCGAGATGGCACCACTGCACTCCAGCCTGGCCTATAAGAGTGAAACTGTGTCTCAAAAA 
TGTAATCCCAATACTTTGGGAGACTGAGGTGGGAGGGTCACTTGAGGCCAGGAAT^ 

CTGCAGTGATGTATCATTGTGCCACACTGCACTC 

TGGGCACGGTGGCTTATGCCTGTAATCCCAGCACTTTGGGAGGCT 

GGCTAACACAGTGAAACCCCACCTCTACTAAAAGTACAAAAAATTAGCTGGGC^TGG 
10 GGAGGCTGAGGCAGGAGAATGGCGTGAAACTGGGAGGTGGAGCTTGCAGTGAGCT^ 

GCAGTGCAAGACTCTGTCTCAAAAAAGAAAAGGGGGGGAAAAACC 

AGTGAACACTTTACCAAATATACCTAAGAGATTATGCCTAGCTGAAGAAAGAGl^ 

TTTAGAGTGTAGCACAGAATAAAAAAGAAAATATTGAAGAGAGGTAAAGAGACATGGAA 

CAGAGCTCTGGAAAGAGAGGAGAAAGAATGGTACAGAAGTAATATTTCAAAAGA 
1 5 AGAAACCAGTTGATTGATTT AAGAAGGTT AATGAATTTCT AG CAAT ATAAAT AG AAATCTACACC CAG ACAAATCAT AGGAAAACT 

GCATAAACCCAGATACAAGGAGAAAAGTCTTGAAAGTAGCCAGAGAGAAAAAAAGAT 

GGTTGACTTTTCAAT AGAAAATT ACAT ATATTCT CAAAATAACTG CC^ CT AGAATTCTG T AATT AG CAAAG AATT AT CCCTCT A 
CAATGAGGGTAAAATACTTAGTTGAACAAACTCCATCAGCTCTTTCTAAAGGAAAT^ 
AAGATTCTAGATAAAAGTCTGAGGTGCAAAATGGAATAAAGAGCAAAGAGAGTGGCAAATATGTGGATG 
2 0 ACTGT AT AAAG TACT AG T AAGACCTT AATT AAAAT ATG TGACAAGAAG CTGGGCATGGT ACTTTGAGAGG CTG AGG CGGGCAGATT 
GCCTGAGCCCAGGAATTTGAGACCAGCCTGGGCAACATAGTGAAATCCCGTCT 
GGTGCATGCCCATAGCCCCAGCTACTCAGGTGGCTGAGGTGAGAGGATCGCTTGAG 
GTCGTGCCACTGCACACCAGCCTGGGCAACAGACTGAGACCCTGTCTCAAAAAAAATATATG 
CCTTTGTATTGTCAG AG AG AAAGGTAGAAAGT ATT AATTGACTITGACCrrTG ATAAAT 

2 5 TTTTTTTTGAGACt3GAGTCTCGCTCTGTTGCCC^ 

TTCACGCCATTCTCCTGCCTCAGCCrrCCCAAGTAGCTGGGACTACAGGCGCCCGC 
GTAGAGACGGGGTTTCACCtSTTTTAGCCGGGATGGTCTCGATCTCCTGACCTCGTGATCCG 
GGATTACAGGCGTGAGCCACCGCGCCCGGCCTATGTTTTAATTTCTAAGTTATCTTCTAAAAATGT^ 
CTCAACCAACAGAAGATAACAAATGATTAATAAAAATTAATCCTGAAGAAGTGAAGAA 

3 0 AAAGATGGGTAGATTTAAATCTAAACATATCACCAGCTACATTAAATACAAAATGGATTA 

GTTACACTGAATTTCCAAAAAAATTCAGTTATATGGGGTTTATAAGGAAGAT 
ATCATCACAATAAGACATACCATGCATATTCTAGCAGACAGTATGGTACAGTTAATATCAAAG^ 
TATTGGCAGAAGAGTCACCTCAAATGATAAAATGACCAATTCACTGTGAAGATTTAATA 
TATAGCTTTAGAATATTTATAGCAAAAGTTAAACAAAACTACA^ 

3 5 AGCTCTTTAAGTAACTGGTATAAGAAGCAGACAGGTTAGTTAGGATATAAAATATTTGTA 

TGGGTGT AT AG AACC CATT CT ACCCAACAGTG G CAG G CT ACAC ATTCTTTT C AAG CATG T AG GATTTT GGGGGG AAAATTG ACTGA 
GTAATAATGTTGTAAAACAAGTTTCAACAAATTTCAAAGGATTGAAAC 

CT CT AT CAAT AGGGTAATTTTT AAAG CTT CATGTTAG AAATTG AG C AACT AT T AAT ACTTGG AAAT AATCTGGTCAG GCG C AGTGG 
CTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGATCATOAGGGCTGG^^ 

4 0 GAAACCCCGTCTCTACTAAAAATATAAAAATTAGCTGGGCGTGGTGCT 

GAAGAATTGCTTGAACCCAGOAGGCAGAGGTTGCAGTGAGCCGAGATCGTGCCATTGCACTCCAGTCTG 
CTAAAAAAAAT AAT AATCCATG CAT CAAAGAAG AAAT CACAATGGAAGTT 

ATTGAT CAAATT C CACAACTTGAT AT AT CTT AG ACCATTGAAAGTG AG AG AAT CAAATGTT ATG TCTTT AG AT ACG AAGTTT CTTG 
ACACCCCTCCCTGCCTCCCCCAAAAAAGAAATACCTACCACCTATGAAAGATTCTTGATTGGAAAAAAAAAAAATA^ 

4 5 AGTTTCATTAAACTTCTAGATCTACTGGGAAATTCACAGTCATTTGAT 

TGCCTCGATGTTGATGGCTGCTGGTTCATCCCTGGTGGTTACTGAAGGATA 

GTTTGCGACATCAGTTGACTTCTTTTCACAAAAGATTTCTCTGTACCAGGCG 

TGCTTTTAAAATTAGAGTCAGTCCTCTCAGATCTGCCAGTGCT 

TTTTAACAGTGTTTGCTCACAG CATCTTCAG CAAGAGTAGAGTTC ATGTCAAG AAACCACTTTTTTT ACTCATC CAT 

5 0 CTTCCCATTTGTTCAAATTTCATCATGATATTGTAGCAATTCAGTCACATCTTTAGGTTCCACT 

CTTCTACTACACCTACAGTTCCTTCCTCCATTGAAGTTTTGAACTCAAAGTCATCCATG 
CCTGTTAATGTTGCCAGTTTTACCTCCTCCCATGAATTACAAATGTTCTTTATGGCATCGAGAAAGGTGAAI^ 
TTTCAATTAATGTTGCCCAGATCCATCAGAGTAATAACGATATCTATGGCGGCrTATAGCCCAATGAAAGGTACrr 
AATGTTGAAAGTCACAATTACTTCTTGATCCATGGGTTGCAAGATTTAT^ 

5 5 TCTCGATCAGAGCTCTTGGGTGACCAGGCACATTGTCA 

TCAACAGTGGGCTT AAAAT ATT CAG CAAACTATGTTGTAAAAG AAT ATGCT GT CATC CAGG CTT CATTGTT CCATTT ACAGAG CAC 
AGACAGTAAATTTTGCAGAATTCCAAAGGCCCTAGGATTTTTGGAATGATAAATGAGC^ 
CATTAGCCTCTAATAAGAGTCATC<5TGTCCTAGGCCAGGCGTGGTGGCTCATGCCT 
GGCGGATCACCTGAGGCCGGGAGTTCAAGACCAGCCTGGCCAACATC^AGAAACCCTGTCTCTACTA 

6 0 AGGCATGGTGGCACATGCCTGTAATCCCAGGTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGG 

TGAGCCGAGATCATGCCATTGTACTCCAGCCTAGGCAAAAAGAGCGAAACTCCATCT 

TGTCCTTTGAAGCTTTGAAGCCAACCATTGACTTAGGGAAATGTTGTGGCT 

TTATATCAGCAACTAGGTTGTTTTGCTTGCTTGCTTT^ 

AGGCTGGTCTCAAACTCCrGGCCrCAAAGCAATCCT CCCACCTTGG CCTCrGAAAATGCTGGGATTAT AGGCATGAGCCGTAGTGC 
6 5 CCGGCCTGTTTTGClTTCrTATCATTCATGTATTCACT 

ATCTGGCTGTTTGGTACAAGAGGCCTGCCTTTCAACATGCCTTCTTCACTAAGCT 
AGACATGTGACTCTTC^TTTCACTTGAACACTTAGAAGTCATTGTAGGGTTTTTAA 
GAATAGGAGGCCCAAGAAGGGGTTAGGGCAGTAGCTTGGTCAGTGGAGCAGTGAGAACACACACAACATT^ 

TAATG CAG AAGTTTG AAAT ACT ATGAG AATT ACTGAAATGTGACAGAC^CATGAAGTG AG CAT ATG CTG TTGGAAAAATGG TG CCA 
ACAGATTTGCTCGATGCTC^TTGCCACCAACCTTCAGTTTGTAAAAAACAAACAAAAACCATTATCCGCT 

AAG CACAAT ATGATG AAGCATG CCTGT ACAG TTT AT ACTTCACA CATT CCTTT AAGCAT AAAAAAT A 

AATATTTAGCACCTTTGTGCTGAGTTGTATGCTAC^CACTGGGATATAGAA^ 
AAGTGGTAGAAGTAGATAAATAAAGAATTAAGGACAGTGTACTGTGATGA 
AGCACAATTCAATATGCTTTTA.GGGGATGGA 
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TACAAAATCTTTATAAAGAW3AGTAATTAAACCT 

GATTGG AAG ACTGAGT ATG TTAAAAATGTTAGT ATTT CT CAAATTCCTGTAGG 
TTTTTCTATGGAACCTAACAACCrrGATTCAACAATTTCTGTGGAA 
AATAAGGTAGGAAGACTTCITTAATTCCTTTTTCCACT 
5 ACCTTTGTGAATGTGAGATCTGGCTCCCACCCAAAGCTCACATTCAGATCCACT^ 
TTCCTGAAATACAGGTTAACTACTGGCACTACCAACAAACAAA 
r rTClT G ATCATA l ' r T TTr CATACAAACTTTAATTAGTTTCTATTAATT^ 

G CTT ATAAT AG ATCTTTAATATTATGTCAGTTGGACT ATGG CTTTGTT AGAAAAGGAG CTACTGCTGAGTG CTGTCCA 

AAGAGCAGTTACCCAACTGAGCAGAGAGGGAGGATGACCTTGAGG 
1 0 CTGAGGGACTTATAAAAGGAAAAGGGACCCAGGTGCAGTGGCTCATGGCT 

T CACTTGAGG C CAGG AGTTCGAG ACCAGCCTGGCCAAC^TGGTG AAACCC CGT CTCT ACT AAAAAT AC CAAAATT AG CTGG CTGTG 
ATGGTCCACGCCTGTAATCCCAGGTACTCGGGAGGTTGAGGCCCAA 

AGATTGTGCCATTGCATTCTAG CCTGGGCAACAGAG CAAGACTCAGTCTTAAAAACAAACAAACAAGCAAACAGAAAACAACAGAA 
AGAGGAGATGAATAACAAAATGGCAGGTAATACTAAATAATATGGATC 
1 5 CTGCTCTTGTGGCAGATTAGTTCGGAAAGGATACACTT^ 

TCTGTTATG CATAAAAAT C^TTTCTTGTAGGTTAAAGATT CAAT ACAAAAGGGG AAAC C CTGAAAG CTTTTAG AAAAATAAAGGG 
ACATTATTTAAGGCAGAGAAGAATTTCTTAAGACrCAAAAAACTAGCCATAA 

A CTT'TT 'l ' J. CATT AAG CCCCATTAAGACACTCAAAAGGCAAGGCAAGGAATAGA CTAT 
TGTTGTGTAATTACCGAAGACrTAGCAGTTTGAAATAACATTTCTCT 
2 0 GGATGTTTCTGGCTTAGGGCCTTGAGGCTGTAATCAGTCAATGGCTC 
TGGATGTAGCAGGCCTCAGAGCTGGCTTTCTAGAAGGGAGTAAGAGAA 
AGTGGCACCCCATCACTTCTGCCATATTCTITTTGTrAGAAGCAAG^ 
GCACCACCAGGAGGGTGAGGATCATTGGGAGCGAGTTTAGAGGCT 

T ACATAAAAAATT CCT AT AGATTTCT AAGAC^AAG AC AGAAC ACCAAATTGG AAAAAGGG CAAAAAAT CCTGAG CAGGCATTTGAA 

2 5 TTAAAAAAAAATTTAAATGAT^AATAAACIATATGAAATGACCCTTAATCT 

TGAGAGACCATTTCATATAGATTGGCAGAAATGAAAAAATCTGACAATATTAAATCT 
CTTCCACTGATGGATGGGAATGTAAATTGGGGGGAAAAATCCTTTGGAAAATAATAGGATATCCTATAAC 
TCTTAGGCGTTTGCCCTAGACTAATGGTTCTTCAGTATGCATCAAACT 
CCTTAGAGTTTCTGATTAAGTAGGAATTGGGTGGAGCCTGATAGTTTGCT 

3 0 A£GGGTGGGGGGGTGGGATGTTGGTGCTGCTTGTC 

TGAAT AT ATGAAG TAGGAAGCAT AT AT ATAAG ATTG CT CAT AACAGT ATTATT CAT ATT AAC CCCAAGAT AGAAAAAAAGTT CAAG 
TATCCTCAGTAGTGCAGTAGATATCATTGTGTGTTTACTTATGGAACAAAAAACTAGTT 
ACAAACAGCTCCTGCCTGCATCAATATGTATGAATCTCACAAATACAT^AGTCATACCT^ 
CAACATGATTCTATTTOTAAACAGTTTAAAAACAAATGCAGTTAA 

3 5 AGTAGAGATACTAGAATTATAAGAAAAATCAAGGGAATAATGATGATAAAAGTGAGTGA 

GTGGATGTG ATT AAGGACAAATGGGAG ACTTTT AAG AT AATTG G C AATG CT ATTT CTT ACCTTGGGTT G T G GGGTTTG CTTT ACAG 
TTATGCTTTAATGGAACGATGTTTTG CAAACTTCTCTACAT AG CTTTATAATAAAGGGAAAATTT ATCTCATTT AATTTTCATCTC 
TGATTACTAGTGAGGTTGAGTATGTTTTC^TATGTTAGCCTTTGGGTTTTT^ 
ATGCTATTTGTTTTTAGAACTTATTTTTAGGTATTCTG 

4 0 T C ClTl^TT ATTTGT ATTTTT ATTTATTTTTAT1T CTTITTT AAG AGG C AGG C CTCC CACTCTATTG AGG CCT AGGCT ATGGATAG 

TC lTClTT ATCCATCATGTCAAATATTTCCTrCTAGTC 

CCCACTCTTTTGAGACCCAGGCTGGAGTGCAGTAGTGTC 

AGCCTAAGTAACTGGGACAAATGCACACCACTGTGCCCAACTAATTT 

TGCCCAGGCTGGTCTCGACTCCTGGCTTCAAGTGATTCTCCCACCTTGGCCTCCCAAAGTGCT 

4 5 TTC CTGC C CT AG CCTTTC CTT AT AAAGACAT ATGTTGTTGTTACTGTT^ CAG T CTTTTCATTT ATTT ATT 

TATTTTGATTITGATTTTGATTTTTTT 

GCCAAGGTGAGOU3ATCATGAGGTCAAGAGATTGAGACCATCCTGGCATGGATGAAACAT 
ACAAAAATTAGCTGGACGTGGTGGTGTGTGCCTGTAGTCCCAGATACTCGATAGCCT 

G CAGAGGTTGCATG AGCTG AGATTGTGCCACTG CACTCCAG CCTGGCG ACAGAG CAAG ACTCTG TCT CAAAAAAAAAAAAAAAAAA 

5 0 AAAGAAAAAAGAAATCTTTCCTATCCAGAGTTAAAAGCATATTCTCTATATTGTC 

TTTAGGTATTTAAGCAGTCrAGTATTTGTTATATGAAGATGTCTTG^ 

CAGTCTTTTTCTCACTTATTTGTAAAGCAACCTCTCATTTCrr 

TTGTCTATAACCTGTTTGTCCACATTACTACTTTATATCTTTGGTTTTGG 

ACAT ATG CT CATT CACT CTTCCACGTGAC CATT AG AAGAATTAG ATT GT CAATGT C CTCT AAGT CT AGGT ATTTGGTT AAAAAAAA 

5 5 AAAAAAAGAATTAGGTTGTCAAATTCTGTAAAAAGTACTATTGGGATTTTCATTGTGAT 

GACATATATATTAGAGATTTATTATGATACTGAGTTTTCCAAT^ 

CCCCCCCACCC CTTTTT T i H T I IT 1 1 1 TTTGAGATGGAGTCTCQCTCTGTCACCCAC^CTGGAGCACAGTC^CACAATCTTGG 

GGCTTATTTTTGTATTTTCAGTAGAGACGGAGTTTCACCATATTGGTCAGGCTC^ 

6 0 GCCTCGGCCTCCCAAAGTGCTGGGATTACAGACGTGAGCCACCGCGCCTGGGCCATGTGTTTCT 

CACCATGTTGGCCAC^CTGGTCTCAAACTCTTGACCTCAGGTGATCCACC 
GAGCCACTACTCCCAGCTCTG1 T TT T TTTTCCTTAAAAGGTTTATAGTTTTCTACATAATG 
TTGGTATTATTTGCGAGAATCTTTTTGTGCTGTTGTGAATGAGATGTT^ 
AGAAGAATGAAACTACrACCCCCACCTCTCACCCTATACAAAAATCAACTCAAAATGGATCAA 

6 5 TACAAGGTGAAACCACAGTGTGGTATCATCTGGCCCCAGGTAGGATGGCTGCTATAAAACTGTAGAA 

AAAACCA.CAGTGAGGTATTGTCTCATCCCAGTTAGGGTGGCTGTTATAAAAAAGACAAAA 
AGGAAAGAGAACTCTTGGACGCTCTTGGTGGGAATGTAAACTAGTACAGACACTGTATC^ 
TCCCACTACTGGCCGTTTACCCAAAGGAAGGGAAGTCAGTACGTCGAAGAGACAT^ 
TCACAGTAGCCAAGATATGGAATCAGTTCAACAACAGAAGATTGGGTAAAGACAATGTC 

7 0 TCAGCCGTAAAAAGGAAAAATCCTGTCATTTGCAGCAACATGATTGGAACTGGAGGACATCACG 

AGAAAGTTAAACCCTGCATGTTCTCACTCTTATGTGGAAGCT 

CTCGGGCTGGGAATAGTAGGGGTAGAGTGGAAGCTCTGGGGAGATTTATTCATGGA 

GTTCTAGTCTTCTGTAGTAGTATAGGGTGACTATAGTT^ 

AACACAAAGAAATGACGAATG TTTCTCCTGT CTTT AGTG AG AATG CTT CTGTT ATG CTTGGTGACTT ACAG ACTTTTT AT AGAT AC 
7 5 CCTATATGAAGTTTAAGAATGTTrcrrCa^TTTCT^ 
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TGTTAGATTTTTTTCGTACATTTATTGJVGGTGATCATGTTT^ 
TTCCTA GTT l l ' G AGTAAGCAACrCTlX3TGTCACTGGATACATTTTATTC 
AATTCAGGTTTTTATTTAGGGATTTTACGACTGTGTTTAC^ 
ATCAAGAGCTGATTOSTTTTCTAGAGTTTTCTGTGATTTTG^ 
5 TTCATTAGGCTTTAAGTAACCACTGGGCCCA l'rC'l'TT GACTTTCAAC^ 

ATTTGTTCAGGGCTTTGGTTTGTGAGCItSA (jTr T C ''l 1 1 X ' 1 1 rATTTI n"A AATAGATGAGCTTATCTCATTTAT Al M A TTGTT GATA 
AGGCAGTTCrTCCJVCATTATTAATTATATrrT'rCl TTGGGGGCTTTGTCTCTTATGGGTATGTGTGTGTTTCCCATCTAATAGTTT 
TTGATGATTTTTTATAAATCTTATTCrCTT^ 

TCAGGACT ATTTT AACTCCTTGCT AAT AAAGGT AAGAAAAT CAG TACACTT AT ACCT CCCTTT CC CCAG CT ACCATTAAAAAAATT 
1 0 TTTTTCCATCTCTTCATATGAGATATAAAC 

CATGTTTGAATGGGATTAATATTC^CCTTTAATTCTTTAAAACC^ 

CAGCTIXxATTTTCCT AT CAAGT AG TTCTGTrrr'ri' CTG CCAAG AATGGCTTCClT'rriTCTTClT CAGCCATTT CTTTCrGTTT CC 

ACCGTGGCCTTCATTTCTGAAGTGGTCTTTT^ATT 

CTTTTTTTTGAGACAGGGTCTGGGTCTCrrCACT 
1 5 GCTCAAGCCATCCTCCCATCTCGGCCTCTCAATTAGCTG 

TTGTAGAGGCAGGGTTTCATCATGTTGACCAGGCTGGTCTCAA^ 

CACTGCCTCCTGCCTCATCTCCCATTTCATCTGTrCT^ 

TGATAGATGGGGATATATCAGAGACTTCrrrTCTTTCCTTGACT 

CTCATCCCCCACTCCCIXSTTT^ 
2 0 TTTTTATT ACT CATC CTTTGTTGTCriTTGAGTAGAGTACT^ 

ACAGTCGCTCACACCTGTAATCACAGCACTTIXK^AGGCCGAGGTG 

ACATGGTGAAACCCCGTCT CCACTCAAAATACAAAAAATTAGCCAGGCATGGTGG CGGGCGCCTGTAGCCCTAG CTACT CAGGAGG 
CTCAGGCAGGAGAATGGTGTGAACCCTGGGGGCAGAGCTTGCAGTGAGCTGAGAT 

G CAAG ACT CTGT CT C7WWVAAAAAAAAAAGAATGTTATGGGGAACAGCACX3GAGAATAAG CCTAAGCAGGCATAACTTTTTT CT C 

2 5 AGACATCTTGTCTCAGAAAGCTCGTTCACCAAATCTGTTGTGTTCT 

TTTrCCCCITCCTGCCATTTTGGGCCAAGCCAATCATTCAGTC^ 
GTGAAGTTGGACATCAAATAGATTCCTCTTCGATGATTTGATTTTTAATTTG 
TATCTTATGTTCAATAGTTATTCTCACAAAGGAACTGGTTCCCAACAT 
AAACTGTAACTGCTTTTATTTTATGTGACTTGATCTGATGTATTGT^ 

3 0 GGGAGGTCTGTGTTTCACTTTAAAAACAAAATCTAAACCAGATAAAAACT 

CCCTTTCCTACCTCTCTACCCCTGCCCACGTCTATCAGCCATCCrTGCTGTGTACTAAATGCTCTAGC^ 

TATATTGTGGATACTGAATATTCAAAGTGTTCTCCCAAAGATAC^TTCCATAGTTTACAATTT^ 

TTGTCTAACCTGTTGAGTTTCTAAGCTCAAATGAGGGGGGATACTC 

AAGGGATTCATCTAAAAAGTATCTCAATTGATTTGGCAAGCAGGTGATACCCTGTACTATAT^ 

3 5 CATAATTGGATAITAGCATAGTAATGGTTGAGAAGAACTGTAGTAAAGAAGCTTACCTT^ 

AGATTCTCGCTCTCTCACCCAGGCTGGAGTGCAGTGGCACAATCTCGGCT 
GTTACCCTTCCAAGTAGTTGGGACTACAGGTGTGTGCCACCA 
TCACCATGTTGGCCAGGCTGGTCnTGAACTCCTGACCTCAAATGATCCACCT 
TAGCCACTGCACCTGGCCrTACCTTTGTCTAAATrAGAATTGTATAAACTTC 

4 0 GCTAATATTCTTTGAAACATTTTAGGGAAATGCTGTT^ 

CTGGCCCTTTGTCCTAGAATGCTCrCTTCCrTCATTrTCTCCATAC 
ACCCrrGCTC^AGGCATTTGGAAATATTTTGATTGTCATGCCTGC^ 
CAATGCACTAAAAGTCCTACACAGCAAAGAATTGCCCCACCCAAAATGCCACT 
ATCCTAGGATTCACCCAACTGCAGATCAAAAGTATTAGGAAAACAGTAAAAAGTAATACAATA^ 

4 5 ATAACAACTATTTATATAACATTTACATTATATTAG^ 

ATGCAAATACCCTCCCTTTTTATATAAGAGACTTCrcCATCCAAGGATCATGGTGTTGTAGGGTA 

GTGGAT ATGGAGGAATGACTGTATACTCCTTTGG AG AACACTG AACTCAG CAACTTTTACTCAT CCTTTAGAATTTAGTTCTGGTT 

ATCITTTCCAAAAGCTTTCTCTAAGTACTTTTGTTTTCCCCTCTAC 

ATTTTAGGCACATCTTtlATTATTTGAAGTAACCACTTTGTAGTATAGGAGCCTAT 

5 0 AATCTTTGAGGGCAAAGACATTITGTATCCTTGGTATATGATATATAATTAGTCTTT 

ATAAATAAGTTGTACTTTTACTTTAAATCAGAATTGTGGTATGACTCTTAAACTGTTGCT 

TTGTGGAAAATAATCTCATTTTACTCTCATAATAGAAAATTAAAATATGCT^ 

TAGTGTTTTATTTGGGTTTTGACCnTGCCrTGTACATCAGAAGCACCT 

CAGAGGTTCTGGTTTTTGAGGGTGAGG CCTTGGT AT CTGG ATTTTTAAAAGCTTCCAAGGTGATTTTTAATCTGAAGTT AAGATTC 

5 5 AAAAGTCATTTAGTGAGACACTCATATGTAGCATTTTGGGAAGTCAGTAAACTTAAGAG 

AGTGAAAGGAGTAGGGAGTTTTAGCTTAGAAAAAAAAAATCTAGATGGAGTATGTGAGTTCTCT 
TCTTGCTCACCACTGTATCCTCAACTCCTAGCATC 
TATGTAT^TTAGGTGTTTAAAAGATAGTCATTGTAGAACCACAGA 
TAATTTTATGATTAAAAATAATGAACrcGATATTTAATTCATTT 

6 0 TTCCATTTGATTCTCTGGTCAAATCTATGAAGTTTGTATCATTCAAATAATAGGGCTGTGTCAAGAAACT 

GTAATTTGCCTAAGGTTACAAAGCTAAGTGACAAGAGCTGGAATTCTTAAAGTCTGTACTCn'TAACT 

CTTCACCACAACCCTAAG AGTT AGAT ATGGTTTAGAAAGGTTATG CAG CTTGCCTAAGGGCAC^TTGCrTGAGTGGAAGATTTGGG 
ACTTGG ACCCAGACATTGTG ACACTAGAGTCCATGCTGTTAACTG CCATG CT ATTGG AACCCCCAACCCCCTCG CTTCCT ATTGTA 
TAATGTACAACC^TCTGTTTTCATGAGATTATTAACCACAGTGGACA^ 

6 5 GACTTTTGACniATGGGTTGAATTTGGATTATTGTCTTGAATCT 

TTCTTTACTAC^1TL"1TTTTCTTCA1"1TC1TTT^ ■ 
ACACrcATATCTGTTAGCTTCTGTTATTTTAAGT 
CACATTACCCCTAGTAGATATCTGTTTTATTTATGCTATATGTGTACT 
GTTAGCATGAAT'ITTTTTAAAGCrAAGCTAAAAATGAAGTGAGTTTAAAATT 
70 AATGTTTCAAGTTACAACATACCATTAGGCCAAGGTTCGTTATTATAATAGTGTGT^ 
TTriTlTAAAATAGCTAGTTTCTTGTCAGAAATTAAGTAACCT 
TTAGGGTTTCTCAACATTGGCGCTGTTGATGTTTTGAAATGGATGTA^ 

CCTACGATGTAGAACTTGCAGATATGGAGGGCCACTGTAAAAGTGGCATTTTGGGTGGGGCAATAATACT 
TTAGGAGCTTCCGTGGCCTCTGCTCACAAGATGCCAGTAGTACCCCCA^ 

7 5 GTCTTC^GGGGCAAAJVTTOCTTCTG^ 
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CAC^CTATAGTGCAATGGCGCAATCTCGGCCCACTGCAACCTCCACCT 
TAGCTAGGATTACAGGTGCCCGCCACCGCACCCTTCTAATTTTGTACTTTT 
GTCTCGAACTCCCAACCTCAGGTGATCTGCCCGCCTCAGCTTCCAGAGTGCT^ 
GATCATTGTTTTTATCCTGTATTATGGATGACAAG C AG CTT^ 
5 CAGCAATAATGTTTTCTTCAGTATGAAGTTTGAGATCTGTTTC 

TTAAAACTTGGTAACATAATCCATACGTTTACTTAAATCAATATATGTGAGTCATAG 

GAGCCACTGTC^GTGTTCTGCAGTATGGAATGCCCAGCCTTCAGCAGACCT 

CTAAGTTAGGGTGCCTGTGTAAATATTAAATGCTGAGGCCAGGC^ 

GGTAGACGGATCACAAGGTCAGGAGATCAAGACCATCCTGACTAAAC^CAGTGAAACCCCGTCT 
1 0 GCCAGGCATGGTGGCATGTGCCTGTAGTCCC^GCTACrTGGGAGGCTG 

GCTGAAATCATGCCACTGCACTCTAGCCTGGATGACAAAGCGAGACTCreT 

GCATGCATCTGTAGTCCCAGCTACTTGGGTGGCCAAGGTGGGAGGAT^ 

TCACACTACTGCACTCCAGCCTGGGCACAGAGTGAAACCCTAC^ 

TAGATGAACACATTAACTAAATAATGTAGCCTACCTATATCCCCAAAAAAGAGTCTTCCT 
1 5 CTATTTTGTAAACCACTTGTTTX3ACCATCACATTGTAA 

AG TAATTTT AG CT CT CTG AACTGACCTC CACCTAAC CAACTC AT CAGGTT AACTG ATG TTCTCCATTT CAAAAAAT ATTTG C GGTC 

AGATTAGGTGTGAATCATATTTAAAGGAAGATTGTTGGT^ 

TAAAATGTTTAAGGACAGTGTTTACCATTGCACATGATTCCCTGATTTAACTC 

ACATTGACTATAGGGGCTTCTTTTGTGCCACACCAGTGTTGGAAT^ 
2 0 AGCAGCCTGCTTATCTGGTTGTTTAAACTTGTAGCTGAATT 

TCAAAACTGTGGCTTCCTCCTTGCTACCTTACAGGGTAGTTGTGAG 

TCTATAATTGTAGCATGCTTTTAATGGGATTCCCTGCCTCCAGTACCTCTCTGGC^ 

TAAACATCCTAAAGTATAATGTGTAATCAGATCAAAAGCCATTTATAACAATAAAAACTACATT 

CTTAGATGTGGTCTCTGCAATCAGATGTTCATTAAACAAATTATTTAAAAAATAC^ 

2 5 TGATATGTGCCAAAAGAAAAAATGGGGGGTGATGTGAGAGCTCAGTGAGGATCAAATATAGATTGGGAAGAAG 

GACTTCTTTGAGAAAGTGAGATTTAAACTATCrGAAGAAGCTGTGCTTAGTTT 
AAGGAATAGCATGTGTGAAGGCTCTGCrCTGCrGCTC 

TGAGGCAGG AGGT AGG CTGGGATG AGAT CTTTTCAG ATCTTG C ACT CTG TACT AGGAAG TTT AGGTTG T ATT CTGAGTT ACTGGGG 
AGCCATTAAAGC<3TTTAAGGAAGGGGAGTGTTGTGAATCAGTTA 

3 0 CAGAATGGG AGTTAG AG CAAAAG AAATG AGG AAACCAGTTAG AATG CCTTTG T AAG T AGT AGGT CAG AG AGAAAGGATGAT AATTT 

TGACTAGAGTCATGACAGTAAACAAGGTAACACATGAAGTGATTATXjTCAT 

CAGAATAAGATTTAGAATAAACTTATTCCAAGCCCATCTAATAAAGC^TGGTGTGAGGAACTGAAACAATGGG 

TCGGTCCTTGTCTCTAAAATAAAATCCAAACTCTGTTGAAGATTTTTCAGCCTACT 

ACTCACCTATGCTATAATCATACCAAACTATTACAGTGAGTATCATGAACATTTCC^ 

3 5 TATCGCTGGAATACTCTTACTCCCACATACACTTAACAAAACCCTACTCATTGTTTA^ 

CCAAATTTGCCTTC CAG ATTGAT ATTG CC CATTT AT CITCTTTT AAC CAT AGT AGTT CAT ATTACT ATCACATTAT AGT AT ATT ATT 

CAACAGATGTCTATGTAATTGATTATGTGCCAGGTACTA 

CCGCATTTGTGGAAACTACATTGTAACACAGACCrrTGAACAAAATCT 

AGG AAT AC CTG C AGGTTT CTGAAAC CCT AAT AAGG C CCTTG AGG TCAAAATT ATTTTCAT AATCCT ACT AAG AT ACTATTTG CCTT 

4 0 TTCATTCTCGTTCTTGTXiAGTATGTAGTAGTTATAGGATGTATGATATTCCAACA^ 

ATCTTTTGTTAAGG CAG AC ATTAAAG AAATTGGCAAAAAAAAAATTTTTTTTTTTAAAGCC^ 
AGACCTXSTAATCCCAGCACTCrcGGAGTCTGAGGTGO^ 

AAACCTCGTCTTTACCAAAAATATGGTAGCTGGCTGTGGTAGCACATGCCTGTAATCCGTCCCTGTAATCGGAAGGCTGAGAGGCG 
GGAGAATCGCTTGAACCTGGGAGGCCAAGGTTGAAGTGAGCTGAGATCGTGCCATTGCACTCC^ 

4 5 TGTGTCTGAAAAT AAATG AAT AAATAAACAAACAAATAAAT^ 

TGTTGTTTTGGAAAATGTGGTTATTTTTTTCATTAAAAAATAGATCACTTT^ 
TTAAACATTTCTTCGTTATAATTTTTAATGTGGTAAATACTGATAGCGATAAACTAC^ 
AATTTTTAAGAGTAT7kAAGGGGCCAAGTGTGGTGGCTAA(^CCTGTAATCCCAA 
ATAGCAAGACCCTATCTCTACAAAAAAATTTAAAAATTAGCCAGAC^ 

5 0 GAGGTllAGAGGATCACTTGAGCCCAGAGTTTGAGATTGTGGG^ 

GACCCTATCTCAAAAAAAAAAATTATATTTTTATACACACACACACACACACACACACAC 
TATATATAAGAGATCCTGAAACCAAAATO^GAGTCCCTGTAATGGGGAGCTGATTTAAATTGG 

AT CTTATGTCTCTGTT AAACACTTATGTTCTTCT AG AAATTAGGAGTGTGGT TTTCTCCGTATTTCCAAAAACTGCAAGTC f II TT 
ATATAACGAGTGTCTTGGTTGAATCTAATTTTAAAAAAAATTTTTT^ 

5 5 TCATGAATTTGCATGCCATTCTTAGGCAAGGGCCATGCCTATCTT 

AGCACCAAATTTTCTTATCAAAGTCTACTGTAGATGGCCGTTCTT^ 
CATCTTATTTCTTATTACTGTAGGAAATAGATCTGGATTC^ 
CCTTACCATTATTTATCTGTAACATGGAATACATATTATCTTACTGT 
CATAAGTGACCAGAACAGAGACAATCCATAAAATGGTAATTCITTTTATTTTATA 

6 0 AGTTAGTCAGCAGAGAGGTCTCTGGATAAATCAGGATATAACAATCATCATTCAAGGATATATTCAC 

AAGGCAGAAAGCCAGAATGATCTATTGTGTATGGTTATTCATCTTTAGAAAAAGGAACAGAAAGTGT 

TAAGTAGCTCTATGTTACAATGTAGATGATTTAGGGATGGGGTAGAGTTTTATGTGTCT 

TAAATGTTTGAATAATATTAATTAACTAGAGATTCTTGAAAACACATTTTTAGAAGCCT 

GG CCAAGCCTGAGAAAGCTTACATGG CTAACTGGAAAAATAAAT AAAGGT ACCAT AGAGG AAAAACAAAATTGCCCTGTGGGG AG A 

6 5 ACATGTGGTGTCATATGGTGTGACTAAATAGGATCCAGTAAGATAAGACAAGGTAGAGCATCTTGGGAGTGATTCCA 

GTTAAAATGTTAACTACATTAAAGGTAGTAAACCAGTGAAAG AATCCT CAAGATCCCAGTGCAC^TGTTTC 
GATCAACTGTTTTGGTATTCATAGCAGAAAGCCATAGGAA 
TATTTATTTATTTATTTATTTATTTATTTACTTTATTTTATTTrA 
GCAGTGGTGCAATCTTGGCTCACTGCAGCCTCCGCCTCCCAGGT^ 

7 0 GCAGGCATGCACCACCATGCCCAGTTAATTTTTATTTTTAGTAGAGAC^ 

TCCTGACCTCAGGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGCATTAC^ 

CCTTTTTT AAGACAG AT ACTT AGGG CTGGGTGCTGTG G CT CATG CCTGT AATCCCAGCACTTTGGGAGG C CGAGGCAGG T AGATCA 
CCTAAGGTCTGGTGTTCAAGACCAGCCTGACCAATATGGTGAAACCCA 
GCGTGCGCCTGTAGTCCCAGCTACTTGGGTGGCAATTCAAGGAGAATTGC1TGAACCCCCGG 
7 5 GAT CGTG CCTCTG C CCT CCAAC CTGGGTG ACAGAGCAAG ACT CCATCT CACAAAAAAAAAAAAAAAAAAAAGATACTTTG AT AAAG 
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AAATAATAGTTATTTCTCATTTTATTTCTCATTT 
GCITACATTTTAAATATTAGGTTGGTGa=JlAAGTAATTC 
ACAACTGTAAAGCATTGCAGGAATTATATGTGGAATTATAGGCTTACT^ 
AATGTGGTTTTTTTTACAAATTGAAGGTTGGTGGCA^ 
5 CACTTTGTGTCTCrreTGTCACATITTAGTAATTCTTGCAATATTTG 

CTATTGTAATTGTTTTGGGGTGCCACAAACCACACCCATAATAAGGCAGTGAACT^ 
CTCCACTGACTGGCTGTTCCCCC^CTCnTCTCCAGCCTCCGCATGCCCTG 
CTCTACAATGGCCTCTAAGTATTCAAGTGAAACCAAGAGTCACATGTCT 
AGAAGGCATGTCAAAAAGCTATGCCTTTTGTGCCAGACAGCTA 
1 0 GCTACTCCAGTGAATACACGAATGATAAGAAAGCAAAACAGGCT 

GATCACTTGAGGCTAGGAGTTCGAGACCAGCCTGGCCATCATAGTGAGACGTCGTCT 

AACAGCCTAATTGCTGAGATGGAGAAAGTTTTAGTGA 

GGCTTTAAAGCTTCAAAGGACAGGCTGACTCITACT 

ATTCCAAAAATCCTAGGGCCCTTTGGAATTCTGCAAAATTTACTCTGTCTGTGCT 
1 5 GCACATTCTTTTACAGCATGGTTTACTGAATATTTTAAGCCCACTGTTG 

CATATTACTAATCGTTGATAATATACCTGATCACCCAAGAGCTCTGATGATGTACAAGGAAGATT 

TGTGTGTGTG LTrrri T lX 3TTTTGAG ACAGGGTOTCATTCTTGCCCAGG CTGGTGTGATCATAG CTCTCTGCAGCCTTGAACCCCT 
GAG CTCAAGCAATCCTCCTGCCTCAG CTG CCTGAGTAGGTGGGACTACAGGCATGCACCACCATG CCCAGCTAATTTTTTTTTTTT 
TTTTAATAGAGACAGGGTCTATGTTTCCCAGGCTAGTCTCACAAAC 
2 0 CTTAAC CACT AACGCAGCATT CATTCTG CAG CCCATGGATCAAGG AGTAATTTTGACTTTCAATTCTT CTT ATTT AA 

TTTATTAGGCTATAGCTGCCAT AGAT ATTCCTATGATGGATCTGGG CAACATTAATTGAAAACCTTCTG C AAATTATTCACCTTTC 

TAGATGCC^TTAAGAATGTTTGTGATTCATGAGAGGAGGTCAAAATAGCACCATT 

CTTGTGGATGACTTTGAGGGGTCAAAACTTCAATGGAGGAAGGAAT^ 

GTGGATCCTGAAGATGTGACCGAATTGCTACAATGTCATGGTAAAACTTGAAGAAATGAGAAGTT 

2 5 AAAGTG ATTTTTTGAG ATGGGGTTTACT C CTG CTGAAG ATG CTG TG AACACTG AT AAAATGACAG CAAAAGATTT AGAATTTTACA 

TAAAATTAGTTGATGAAGCACTGGCAGATTTGAGAGGACTGACTCCAAGTT^ 
ACCATATTGCCTGGTTCAGAGAAATCTTTTGTGAAAAAAAGTCAGTCGG 
GCCGGCCGGGCACGCTAGCTTACACCTGTAATCTGAGCACTTTC^^^ 
ACCAGCCTGGACAACATGGTGAAACCCCATCTCTACTAAAAATACAAAACITAG 

3 0 CTACTTGGGAGGCTT3AGGCAGGAGAATTGCTTC 

CCTGGGCGACAGAGCAAGACTCCA.TCTTAAAAAAAAAAAAAAGA^ 
TTGATCCGGCAGCAGCCATCAACACTGAGGCAAGACCCACCAACACAAAAATGATG 
GTTTTTAACACTATTTTAAAATTAAGATATGTGCTTTTTTT 
TATAGCTACACT1TTTTGTTGTTAATTTAAAAATTGTC 

3 5 G AGGCAGGCAGATCACAAGGTCAAGAGAT CG AAACCATCCTGG CCAACATGGTGAAACCCCATCT CTACTAAGAATACAAAAATT A 

GCTGGGCGTGGTGGTGCACGCCTGTAGTCrCAGCTACTCGGGAGGCTGAGGCAGGAGACrCT^ 

CAGTG AGCAG AGATCCCG CCATTGT ACTCCAGCCTGG CAACAG AGCAAAACCCTATCTCAAAAAT AAAATAAAAT AAATTATGGC C 
AGGCGTGGTGGCTCATGCTTGTAATCCTGGCTCrTTGGGAGGCCAAGGCAGGTGGATC^CTTGAAGTCAGGAG 
TGACCAACATGGTGAAACCACX3TCTCTACTAAAAATAC AAAAAAAATT AGCTGGG CATGGTGGTGCATG CCTGTAAT CCCAACTGC 

4 0 TTGGG AGGCTAAGGCAGG AG AATCG CTT AAAC CTGGG AGGTGGAGG TTG CAG TG AG CTG AGATTT G CG C CA CTG CACT CCAGC CTG 

GGTGACAGAGCAAGACTCCGTCTCAAAGGAAAAAAAAATGGATAC^TAATGATTATATATATTTCTC 

CAGTTCT CTTCT AG CT ATTTTGAAAT AT ATG AT AAATTGTT AA CTG T AATTT C C CT ACT AT ACTG TG AAAT ACT AGAACTT ACTG T 
TTCTGTCCAGTTGTATGTTTGTATCCATTAACCAACTTCCCTTTATCCCTTTCTCCCTC 

4 5 TACTCTACTGTCTACCTCCATGAGATCCTTTATGTATTCTGGATATAGATCCT 

ATTCTGTAGGTTTTriTCACTTTCTTGAGAATATTCA 

TTGTTGCTCGTACTTTTGGTGTCATATCTAAGAATCCATTGCTAAATCCAAGGTCATTAA 
AGTTTTATTATTTTAGCTCCTATATCATTTATTCATTTTGAG 

ATAAATTGTCCTTTGAGCATTGCTTTTGCTGTATGCCATCAGTTITGGTATGTGTGTTTTTT 

5 0 TTTCrAATT TT TCT T GTXSATTTC T T T TTT T GACCTTGTATCT 

TCTTTTGTATTAGTGTAAAGGTGATGCTGGAATCATAAAGTGAGITGGAAATTACTTACTCCTT 
TGTAGAGGTGGTCTTATGTCTTCTTTAAATACCTGATAGAATTTGCTACTAGAGATTTATTTTTAGAAA 
TTAATTTCCITAATAGTTACAAGAGTGTTCAGATTATCTGITTTATCTTGTGAATATA 
GTGAGAGTTrra i T rr i' ClT TGGTTTTTGAGGAATTGG 

5 5 TCTTATTAACCrrTTAAATGTCTGTGTGTAGGGGAGTCrGTAGTT 

TTTTTGTCAGTCTTGCTAGAGATTTGTAAATTGTATTGGTCTTTTCAACAAACAAG 
AATCTCATGGATTTCTGCTTTTTATTATTTTTGTGTTG 

AACT GTTTTTCTTTGCTGAT AT AAT AGT ATCAG TTT AAG AG CAGGG CTT GG TGG CACATGCCTGT AATT CC AGCT ATTTGGGAGGC 
TGAGGTGGTAGCATCACTTGAGCCTAGGAGTTTGAAACTAGATTGCGCAACAT^ 

6 0 AACT AAAAAAAAAATAATGGCACAAGCTTAATGATATT AATT CACTGTAGCACT ACT 

GTATTTTCATTCAGTTGAATATTGAACTAGCCTTGTGGTCCT 

TGGATTCTGT ATG CTAAT ATTTT ACT AATGATT ATG AGG AAT ATT AGT ATG T AATTTGGGGCTT C CATTTG A CT CAAAGATT ATTT 

AGAAATGTGCTGTTTAATTTCCAGGTGTTTGGAGATTTTCCTATTAACTTTCTATTT 

AAACGTTTGTGGTTTTTTTTTAAGCTTGTGAAAGTTTGTCTTATGACTCAGAAT 

6 5 TTGACAAGAACATGTATTCAGCTGTTAAGTAGAATGTTATATA 

ATATTCTTACTGATTTTCTGTCTACTAGTTCTATTACTGAAAGGAGTGCTGAAGTCATCAAATA 
TTTGTAATGTTCTGTAAGTTTTTACTTCATGTTCITTGAAGCTCCATTATTAG^ 
AAATTATATTTGAAGTGAATTACTCGTAGACTACATATAGTTGGGTCATTTTAAAAATTCAT^ 
GTATCTATAGACTATTCACATTTAATGTAATTTTGGCATGTTTAGATTTAGGTTTACCAGTT^ 

7 0 CTTCTGTTTTCCATTACTCTGTCTTTCCTGCATT 

TGGCTTAAAAATTCTTAACTCTCCATATAGTTTTAGTGATCACTCCAGAGATTACATTATAAA^ 
TTTAAAATAAATTCATTTCITTAAATGGATTATGGTCCACTTAAAGAAATGT^ 
CTTTTCTCrrATTGTTGGCTGTATGTTACGTTTCTATTAATTC 
AAACATATTTAAC^GAAACTAAGAGGAGAGGGTTAATGTATTTT^ 
7 5 CCAGGTTTCCTTCCGGTATTATTTCCCTTCTGTGTGA 
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CTCITAGTTCCCATTTATCTGAAAATATCTTTGTTTTATGATTGCT 
TAGTTTTTTTCTCTTTCMCACnTTAAAATGTGTAA 
GTGGTGTTCCCCrATATTTTATGGATTGTTTTTCTCTGTT^ 
CCAGCACTTTGGAAGGCCGAGGCGGGTGCATCACCGGAGGTCGGGAGTTCXAGA 
5 TACTAAAAGTACAAAAAACTTAGCGAGGCGTGTTCGCGGGCACCT 

TGAG CC CGGG AGGCGGAGGTTG CATTGAG CTGAGATCACG CCATTGTACTCCAGCCTGGG CGACAGAG CAAGATGCT ATCTCAAAA 
AATAATAGTAATrriTTGTGTTTCATTTTCAAC 

CACCGAC CrT TTTGAATCTGTAGTTTTATGTQ^TTTGTCAGATTTGOGJAGGC 

TCCTATTCTTTGCITTTAGAAGTTCACTGACAC^GATGTTAGATCT^ 
1 0 TCTTGAATTGCTATAAAGAAATGCCTGAGACTAGGTAATTTATAAAGAACAGAGATTCAGTTGGCT 

GGGAAACAT AG CAGATTCTG CTTCTGGGGAGG CCTCAGGAAACTT AAAATCATGGTGAAAGG TG CAGGCATATCT 

CTCATGG CTGG AGCAGAAGCAAGAGATGGGGAGGGTGCCACATACTTTTAAATG AC CAGATCTCAT AAAAACTCACT ATCACAATG 

ACAGCACTAAGGGGGATGGTGTTAAGCC^TGGGAAACCGCTTCCATGATCCAATTT^ 

GGATTACATTTGAACATGAGATTTGGGTGGGGACATAGATTCAAACCATATC^CCT 
1 5 AGCTACATCAATTGGAGACCTGGTTCTCTCCCTGCTCTCAGAATGT^ 

GGGTT ACGTGAGAAAAGTG AGGGGAAAAAACTGAGACACT AT CTTTG AATG TT ATTT CTTTGCCTTT TT TTTTTTTrTTTTJTT ACT 

TTTITTCTrTTAAAAAGT AAACTTTT^ 

CAGATCACAAGGTCAGGAGATCGAGGCCATCCTGGCCAACGTGATGAAACCCCGTCT 
TGGTGGCGCGTGCCTGTAATCCCAGCTACCCGGGAGGCTGAGGCAGGAGAATTGCTTGAATCAG 
2 0 TGAGGTCGCGCCACTGCACTCCAGTCTGGCGACAGTGAGACT 

ACTGATGGTTTCATGATGATTCAGATCCTTGTCCTCTATCCCAATCGTCTACTG^ 
CTATGTGTTCTGTCCAAGTTTTATAGTTGGACTAAATCTTTGAAGTC 
CTGCrOVATTTGTTCTTTTTCCCCTCCTTGTT^ 
GTCAAGCTAAAGATTTGTCAGAGGTTTTGTTCAAATATT^ 

2 5 ATCAGGGAGTATATTCAAACITC^GGCCACCATGTTGCCTGACCTGGCTTT^ 

CATGAACATGCAGAGGCTCAGTCAGTCAAGGATGTGTG 

TTCAACTAGTGGAGTGTGGAAAGTGTATTAAGCCCCCAACTTGCAGTGGAGGTTATCACTTAAAT^ 

GTCAACACCCTCACACACACACAGTCTCTAGCTGAACACATGCC^ 

AGCAGCCTCAAGCAAAAATGCCZAGAGATGGCTGGGCATGGTGGCT 

3 0 GATCACCTGAGGTCTGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCT^ 

GGTGGTGCATGCCTGTAATTCCAGCTACCTGGAAGGCTAGGACAGGAGAATC^ 

G AG ATCGTG C CATTG CACTCCAGC CTG GG CG ACG AGCAGAACTCT GT CT CAAAAAAAAAAAAAAAAAAAAAAAG C CACAG ACTG TG 
ATGTTCTTACTCAGGTTCAGCCTTTAGCTGAAGTCCAGAGCACTGAAATGGTTGTTT^ 

TTTGGGGGAGAGGATTT ATCAATG T ACTCATTTCAT TAGT AGT ATTGAT ATG ATTTTTTAT AT AT AT CTTAG AAT 

3 5 TTTGAAGATACTGTTTTCTTATTTTCTAGCATACAACAATCTGAT^ 

TTTCCTTTTGAGTTTAACTTAGCAGGATTTGTTTAATTTTC 

CTCACCACAGCCTCCACCTCATGGGCTCAAGTGATTCTTCCTCTTTGGCCTCCTGAA 

AGCAGTCCTCCCACCTCAACCTCCCAAAGTGCTGGGCTTACAGGCAT 

4 0 AAGGAACTTTGGTTCrGAAGACTCTTATTTGCCTTTTGGCTCAAGG^ 

TTAATTCTCATAAAATTTCTCTTAGATGTATGTAGGATAATTTAATCTGTATTCT 

TTTCATTTT ACTTCATATTTTTTTTTT AACT^ CATGG ACTTT AG CTTTGTCCATTTTGCT AG TT AACTC CTTT 

4 5 TGCTTCAGCCTCCCAAGTAGCTGGGACTGCAGG 

CATGTTGGTCAGGCTAGTCTCGAACTCCTGACTTCAAGTGATCCACCTGCCGCGGCCTCCCAAAGT^ 
CCACCGTGTCTGGCCTCTCTGCAATTTAGTATTTTTGTITrCTTT 

ATTTTCTGTGCTATGCAGTATATTAGCCAGTAGCTGCAGTGGCTGGGAATGTGGCCTGTGAGAGTG 
TATTTAATGTTAGTTTATATAAAATTAAATAGCCATACATGTCTAATGATTACTAGATTGGACAGCACAGCT 

5 0 TCCTTATACTTTTTGGCTTCAAAATAATTCTGATGCCTTAGTTCATCTC 

TTTTTTTAGGTGTCTAAAGGTTAATAAAGGTTAAAAATATTGGAAGTTAGA 
GAGAAAGTGAATATTGAATATAAAAATCCATTAATTTCTAGTTCTACTAAG 
ACTTCTTCTCAATATGTGTATGTGTGTAGCCTCTGTCCCTGTGGTGGCCAGAATT^ 

AGACAGAAATCCCTGTAAGCTGATT ACCCTTTT CCACCAGGTTTAGATCAG AT ACAAAT ATATGGGGGCAAAGACTGAGTT ACT CT 

5 5 T1T 1 TTTTTTTTTTTTTTTTTTGAGACGG AGTCTTGCTCTGT CCCCCAGGCTGGAGTAGTGGCGCG ATCTCGGCTC ACTG CAAG CT 

CCGCCTCCCGGATTCACGCC1ATTCTCCCGCGTCACCCTCCCGAGTACCTGGTACT 
TTTlTTGTATTTTTTTTTGTATTTrTAGTAGAGACAGGGCTTCACCGTGTTAGCCAG 
TGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACGGGCGTGAGCCACCGCGCCCA 
TGCATCTAAACCAAATTTGTTAAAAGTCTTTrTITTTTTT 

6 0 GTTAGGAGAAGTAAAAGAAAAAAAATTCTTGAAGAAAAGTGAGCC 

TGAAGTTTAGAAGTGCTCTTCGTGGTTCCGTTTCCTCTTTTGCTC 

AGATAGACTAGGTTTTTTTTTT 1"1 1 1 TTT CrAGTTAACCTAAGTCAGAAGGGTGATATCACTAGAGACTTGGAAGTAAAAAGAGCTA 

CCrGCAAAGACTTGAAGTACTAAGATTACCrrAGAATCTGAATGCCCTGTGTTACTCTC 

GATAAGATCTATTGTAAATACTCTAAACTAATAATAACGAGTAAGTA^ 

6 5 TTTGAAATGTTTTAAGAATATTTTTGAAATGTTTCTATTTTT^ 

TTACATGTAAGGAATTATTAACTTaVTTTTTTCATAAAAATAATCAGAGTA 

CCATCACTGT CATACGTTT C ACTT ATG ACAGGTTTGT AATTC TGCT CTGCATCTGG TATTTTCT 

CCTGCACCCrCATGACAAGAAAAAAGCCTTTGATTATTTCTCTTTTCTCT 

TGTGATCTACATGCAATGATGATTACTCTTTCCTTCCTTTC 

TTTATACACTTAATAACTTGATTATATATACTTGAATTTTGTTTTAGT^ 
GTGGCAGTTAATTCCAACATTTGTAATACCAACAATGCAGATAACTGAGTT 
TACATGTGCAGATAGCTACTGrTGTATATCATGCCTGCCATCTGGACAACAGTGATTT^ 
AGAGTCTCAGTCTGCTGCCCAGACTGGAGTGGAGTGGCGTGATTTCGGCT 

7 5 CATGCCTC^GCCTCCTGAGTAGCTGGAATTACGGGTGTCCGCCACCATC 
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TCGCCATGTTGGCCAGGTAGGTCTCGAATTCCrGACCTCAAGTGATCTGCCTACCTTGG 
TGAGTCACTGTGCCTCGTCAGAAAACAGTGATTTTTAAGATGT 

GGGATGGGGGAAAGGTGTCTCTTAGAACATGTAAAAGAACATGTTATTTCTGTCAACTTC 
TTACTTGATTTCAGATTTGTATACCCAGCTCCTGTTTTACATCTCCACTCTGG 
5 TTTTAATTCTGCATCCT ATGCCI~i\iTCCTC C CAACCTCAGTGTTCCTTCTT CATT C J lTl u i^l11ATllTlt^CTCTT ACAT CC1TTGA 

CTTAGGTTATAAACTOTAGAGTCGAGGATGACTCCCTTTTATCATACCT 
TTTATAATATATTTAGCATCTGACACATCTTATCATCTCTTCCACTATCAGTCT 
TCATTTCCCATCTGGTTTCTTGGCATTTACACTATAGTTCATTTC 
CATATTGTTTCCTTTTTTTCAAAAGTAACTTCATTGAGTTCAC 
1 0 TTAGTGTATTCAGTGTGCACAACAACCATC^ 

GCCATTCTCCCCCAATCCTTATATTCCCCCCATTTCTAGGCAATCTAATTTCTGTCT 

AT ATACGTGGAAT CAT ACACTGTG TGGTTTTTTGTGACTG CTT CTTTT ACTT AG TGT AATGTTTC CAAGGTTTGGTCATGTT AT AG 
AATGTATCAGCArmGTTTCrrTTTATTACCAAGTAA 
AATGTTTGGACTGTTTTAACrrTTTGGCTATTGTGAATAAT 
1 5 ACTACTGTTTCCCTAACAGAACGTATTAATTTTCTTTTGTA 

GTCAATGAGCACACTATATTCTGCACTCCACTAACTTTGGCTC 
CTA<jrCTAGGATGGTCTTGAGTGGACAACTTGGTTCTAGTAAACT 

G AGCAAGTTCAATTGT ACAAGAGG ACAAGACTTGCATATTTG CATTTT AAG CTT CT CTTTGGGTCATGTTTG CT AAT ACTACATT A 
CAAGGT AT(^CAAGAGC31AGG AG AAGCCCAGGGTCAGAATGGA CAG AG TT ACGGGT AAAAGATGTG C CT ACAGAGA 

TCTGTTTGAACTAGTTTTCTCTTACATACCT 

ATGAGACCTTCAAGTTTTACCATACTACAAAATGAATAGCCACCCACCCCGATCCT 

TTTATGACCATCTGACATTGTCTGTTTTTITTTTTAGCG 

TAGTTTAATTACACTGTCITCATTGCCTAGAACCCTGGCRCTCA^ 

2 5 TTTTAC CT AAATTGG AATT ATCTT AAAGTTC T AACCTG CCTTC ATG CT AT GG CATG T AT AAAGTGTCAGTCTTTTG ATT AGAAT AC 

CTAATGATGGAATATTTCTTCTACTTCTTATAGAGACATAGACCATGCAATATATGAGTAATTCTACCA 
CTTGGGCCAAGTTTTTTTTAACCTCCCTGTTTCCTCATCT 
AGGTAGGCACTGTATAAGGTTGGTGCAATGTGTATTGCATAATATT 
AAATGGAATTGAAAGCTGGAGCAATCTTATATTATGGAGAAGTTGCAAAA 

3 0 TAAAG CAAATGAAATAGGTG AT Trr TT l T T CATTGAATGCATCAT AACATTAT ATCCTT AAAACACTGTCTT ACAAAGTGCTCTCT 

GAAAAATACTGTTCTTTATTTCAGGGTTTACGAAAAAGTACATAA 

GTACCATCCTAGATGCATGGAGAGAAGATAATTCATTACTTACACTAGGTCATGCCTTAGG 

AACCCCAGTTGAGAAAGAACAAAAATATAACATCTTGACTAAGGTGGCT 

CAC^GAGCCTTCTTGAATGTGATTTCATCAGTCTCCTCTAAGCTT 

3 5 CTT AT AAACT AC CTGCTTCATGCT AGG T AGT AT ACTT AGGTGCT AGGAGT AAAACTG AT CAAGTGGGG TAAGTTC CAAATTCACAG 

AGCGCACAGTCCTAACAC^CrrGTCATGCCTCAGTTGTTTATTT 
TGTGGTGCTCTGGAATGGTGTCCCAGGGTTTTGGAGAACTGAGAGTAACCT 
AGGAGTTGTACCAAAACGCAGGAAACATGATATGTATATTGTTATGGTGCATAAAAATA 
CTACATCTATTGTTTCACATATTTTCCTTCAATTTAAATCCTC 

4 0 GAAATTTGATCTGTTTCACTGAAAACCCATTTGAATAGTAATTGTGAGCAATTGT^ 

ATGACTTTGTGGACAGTTTGACGTTATGGACAGTTATTCTTGATACrrGTTTCCCCT 
GCTTAATCTGCTTTT AATT AGCCAGAAAAAAATGTTTGATCCT CITATO 
TCTAAGTTAGAATAGTTTCTGCTTATTGATTrCATGAATATCACTTCT 

TGTGG AACT ATG ATCAT C AAACTCCT AAATCATCAG C CTTTT C CTT AAAAG ACTT AAAAGGGTT CT AT CTATGT AG ACACTTGT CT 

4 5 GGCTCCTGATTACAGTTTTACCATTCTCAAGTGATACTA 

AAATG ACTG T AAAAATT ACT AAT ACCTTTGG CTCTCT AATTTTTCT CT CTTTT CAT CAT ATGGG CCTG AC AAAATG CCCGCT ACCT 
GCCTGCACTTAGGCAATTGAATGTAGTGCTTTAAATGTATACCAAAACCCTCAACA 

CTT CTTCACTTT CCCAATT AAAAATGCGTTTG CTGG CTGGG CG CAGTGGCTCATGTGTG TAAT C C CAGC ACTTTGGGAGGC CG AGG 
CAGGTGGATCACTTGAGGCCAGGAGTTCAAGACCAGGCTGGCCAACATGGTGAAACCCCG 

5 0 TGGGTGTGGTGGCGTGTGCCTGTAATCTCAGCTACTTGGGAGGCTGAGGC^GGAG 

GTGAACCAACATCtSCACTACTGCTCTCTGGCCTGGGCAACAGAATGAGACT 
CACTCATTGCAAACCTCTCACAGCTCCCACTCCTATCTGTGGAATTCACTTCAAAC^ 
ATCTTTTCACCACCAATTCTACAAACTTGTCTGCATTGGTCCT 
ACAGTCTGATTTCTCTACTTGTGACrCAGATTTCATCTTTCCTGCATCT 

5 5 CCCTTTAGCCACAAACATGCTCTATTATCTCCCAGCCCCAATACATCTGCAAATGTATC 

ATATACAATTCCTACCTTTCCTAATGTCTTCTAGCCACTATTATGTGTTCCTA^ 
TTTTCCTTCTACCTCAGTGTCTGTTTCTTTCTTTTCAGCCATTTTTGGTTTCTCT^ 

TT CTTGGCCT ATACTCTCTTGT AGGTCTG CCTCCCAGTATGCT CCTCATCTCTCCTCCAGGG 
TT CTAAACTTGCATAACCAGCTGC Cn. ' CTl 'GATCATTTAGTAAG CCTCTCCTGTTGGTG ACAT AATGCTTG ATTTCTCTCCCT 

6 0 GGCATCACTGTCCACCCGTTG CACACATCAGAATTTCAGAAT CAGCCTTAATTCTT AGTTTTTTCTCACTCTTC CTCCATGTCTAA 

TCCAGTAGCCCTATCTCCACTGTTTTCACCTTGGTCCAAGCAATCATCTCTTC 
CCCTGCATTCACTCTTGGCCCAAGCAATCATCTCrrTGCTTAACTAT 

CTGCATTCATTCTTGGCGCACCGTAATCCATTCTTCACTAGAGTGATAATTAGAGGTACTATAATATATACTGCTCT 
TTGTCTAGAATTTACTAGCTGAGTGGCCTTGAGCAATATACTTAACACTTATGCCACA^ 

6 5 T ATTT ATCTGTT CT CTG ATTGCT AT AAAGG AAT AC CTGAG ACTGGGT AGTTTATAAAG AAAAG AGGTTT AATTGG CTCACGG TT CT 

GCAGGCTCTAC^GAAAGCATGATACTGGC^TCTGCT 

CAC^GCAGGTGCGTCITACATGGCAGGAGCAAGGGGTAGGGGGAGGTGCTACACA 

ACTCATCAAAAGGACAGCATCAAGAGGATGGTGCTAAACCATTCATGAAAGATCCACCCCC^^ 

CCCACCTCCAACATTGGGGATTACAATTTGACATGAGATTGGGTGGGGAC^ 

7 0 AAATATAAATCAGATTATGTCACACATACCTGTTTTTGAAACTITTAGATTCTCATTGCT 

CC^TGGCTTTCAGGATCCTTCATAACCTGGCATATTGCCTAATTTT^ 
TATAGCCACAGTCACTATCTTTAAC1TTCTAGTTGGAAACATGGCT 
GACCAACTITATTTACGTCAGTTTTGTGTTrGTTTTGTTATATGTAA 
TGAATCA1TTGT1TTATCTTTATAAATGCTGGGACGAGA 
75 GAAGTGGAATTGCTAGGTCAAAAGATAOAAATOTTT^ 
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CAAAAAGGTTTTCATTTGCCCTCATGCTTGACTGGT 

GTGAAAACATTATTCTACTATCTTTTAAJUUlATAACAGTTT^ 

TTTTAATATATTCATAGAGTTGTATAGCCATCACCGCAGTACIATTGTAG 

ATTAGCAGTCATTCTCTCTTCCCACCAATTTGCCCACCCCCCAGCCTTAAGTCAACAAGTAATCTAC^ 
5 TTAATGTTGCTGATGAAAAGTCTGATGCCAAATTAATTCT^ 

T AATTATG TAAACT AGGATTTATCT AGAT ATGAGTCTTTT AATTCAT CCG CCTT AG CCT CACTGAGCCCTTTTACTCAGACCATTG 
GTGTTTTTAACTCCAAATATTTTTAGCATAATCTTTATTTATTCTAri 1' ICl'TCATTTAGAACTCTTACTAAATGTATTTTGAACT 
TCCCAAACATATTCTTTATATCTGTTCAACTTTTTACITGTATC^ 
ACCTTCTGGGACATTTCTTTGTGTTACATATCACTACT^ 
1 0 GlU-XTCri AGTAATAATATTTaunTTTCrCCTTGCCCATTT 
ATTC^TAGTGT ClTG TGA T r ClTOT ' l TAT^^ 

TAAATGGGAAATGACTACAAGTGTCATGTATTCATTGACTGGCTTTCCTCCA 
ACCCCAGTTACCAAAGTAAGGACAGCTTTACTATGTTrATT 
ATCTTCCCTCTTTCCCTCTCGCCCTCC<^CITT^ 
1 5 TACCT ATT ACT CAGTT1TGTTGAATTGTCTT AT AT ATG CTTCAGATTTTTATAAACAAAAAAATT AT AG ACACAGAT ATAG CACCT 
TACAGTCTTACTTTGATACT7TTCTCTC 

GCACATTTCAGATACTTTTGCTGTGGGTGTAGATCCACAAAC^ 
CAACCCTTTAAGGCAGATTCTX3AGTGGCTAACAGTCCAAATTCA 
TGTGTACTCCGCATTCCTGGAAAACCTATACACCCZAGTAACTT^ 
2 0 TAACAGCTACCAGAAAATACCATTTGGCCTTTTGTACCTAACAAACAC^ 
TGCACCCCTCATTTGTATAGTGGACCAGAGTGGGAACCTGACTGT 

CAG AAGGCGC CTTT ATTTT CT ACCAAGGT AC AT CTCCACGGTTTG CAAACTGTTTGCTGG AAT AAAGCCTG TT TC T T TTTT AAGAA 

AGAAAATCirTTTCTGTAGATTGTTGACATTTTAAAA^ 

ATGTATTCATGTTGATAAATATAGCGCTAGCTTTTTGTTTTGACTTAAT^ 

2 5 TTTITTCTCrCTTAGGGAACATTTGAGTTGGTTCT 

' TGAATATAAGTAGCTACATGATAGGATATGCTCATCCTTAGCTTTACTCTAGATATTGCCAGTTT^ 
ACCACATGTAGAATATGATAGTGTTTGTTGCTACATGTACATGATTACCAACTCTTGTATTArrAGGCC 

CACTGATTTTTCAT ATGTATGTATGT ATT1TTTGAGACAAGGT CTTGCTCTGTTG CCCAGGTGGGAGTGCAAT AATGCATTCCTGT 
CTTATTGTAGCCTCAAACTTCTGGGCTOVGGC^ 

3 0 TACCGCATCCAACTAATTTT Cri - rin"lTT CrTTTTTT^ 

TACAGGGATGAG C CATTGT AC CAGG C CTGGT TTTl CATTG CATTT CTTTG ATT ACT AATG AGGT CATTT ATTTT ATGTGTTT ATT A 
GATATTCGAATTCCCTATAGTGAACTACCTATTCATAAGTCTTTTG 

ATAAATACCTGTGCCTCTGTTTCTGAGCCCTCTA 

3 5 CTTGTTTTCCTGGTAACACCAGTCTTACCCTCCTTATC 

TTAGAATCAGGCTGGGCGCAGTGGCTTACGCCTGTAATCC 

CAAGACCATCCTGGCCAATATGGTGAAATCCTGTCTCTACrrAAAAATACAGAAAATTAG 

CCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGA 

aSAGCCTGGTGACAAAGTGAGACTCCATCTCAAAAAAAA 

4 0 AATTATTTTTCXSAATTTTGTTTATTGGACT 

AAGGATATTTATGTAGGTCTTTGTCAAGTGAAGGGGAACACATCTGCATGCA 
GTTTAGTGGAGAAAAATAAAGAAGTATTAAGGCACAC^CAATTCCCTAAGGTCrGT^ 
ACACATCTAGAACTATGTACTGTGGAAAGGGCTCCTGTGTAATAATT 
CCATCAGTACTCTGTGGCTGATCAGCCTTTTCTTGCATGAAACAAACTCAGACCT 
45 CAGGGGTGGAAACCTCTAGAGTCATCTGACTCTGTCTCTCTTCTTAGT 
TCACAAAAGTTTTGGGAATCTTTTGTTAGACTTTTCTTCTA 
GGCTCTGGCTAATGTGTTAGAACACCATGTAATTTTATGTCCTTC 
TTTATAGTTAATAGGTATTAGCCCTGATGGTTTGTAGATTCTCTTGGGTTT 
TTGTCTCTATAATTCTAATTCTTATGCCrCTTTATATAAATATATATTTATATAAA 

5 0 GATTCAATGATTGTTAACATTTTACTGTGTGTATATCTCCT AAGAA 

AAGGTTAAAATTCTATCATCTTACATCTGATATGTGT^ 
CCCTAAACTAAGATCCACTCTGCATGCACTCAATTATTTGTTGA 
CrGTTTTGTTCTCCCATGAAATGATTTATTAAAGAATCCAGGACAGT^ 
CAAC^CTTTGACITTCAAAGAGAAGAAAAATGTCITAGAGAATG 

5 5 CGTATTTCTCTATTCCCTGTGTTTCTTC?TAAACCAGAAGTTAAGTATC 

GGTATGTTTTCCCCTTCATCrTTAATCTGTGGCGAAATACTTTGGCACTATGTAAGTATCCTCT 

TG ATTTTT AG CATCATTTG ATCCTTTCCTG AAC CAATT ATTTT AT ATT AG CTG C 
AAGTGGGAATAACTAATTTTTCCTAAAAAGGCGGGATACATGCATAATTTTrTTCA 

6 0 ATCCCCTACAGTGTGAGCAACTGCTCTCCCACCCTTTTAATTTTTATGGAATTTTGAAG 

ATACCATCTGTTATAGTCAATTACAGTACTrATTCTTT 
CTCTGATGTTGTTCTGATATCTGGCACAAGATGTTCCTAGATCTCTCT^ 
CTGGCCCTTTCCCCCATTAATGTAGTCATTGTGTTGTTGTTACTTTGTGT 
CTTATTAATAAACTCATTAGTAAAACCTAAATG CITATO 

6 5 CATTTTCTATGGCTGCAATAATAAATAACAACAAACTTGGTCA 

GGTCGGAAATCCAGAATGAGTTTGTTTTACTGGGCAAAAATCAAGGTATC^ 

AATCTGATTTCTTGCCTITTCCAGCTCCTAGAGCTGCACTGCTTG 

GCIAGTGTAGCATCTTCAGATCTTTCTCTGCTGAGTTTTTATATCA 

7 0 GTTCCTTTTGCCACATAGGATAACATTTA 

GCCCTCTCATGCCACTCTTTCTTTGATATTACTTCTTGATTCAGTT^ 
AACGTCAGTTCATTATCTTCTTGCTTCTTTTACTGATGTTGAGAAGTATGCCAT 
TTAAGAACTGAACTTTTTTATTTTCACCTGTTTTGGAAAATTTATGGCC^ 
TCCCAGATTAACTCATTTTGTGCTTCGTGTCITTCATATTTTCCAT^ 
75 CTTCTAGTCCTTTAA TlViVlV l T CAQCAG 
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ATCTCTTTATAGTTTCCTCATGCTTCTTCCT 

ACTTTTG TGTTTTGGT CTGCAATTT ATT AGCTTTGATTT CTTTGTCAAACTG ATTAT ATTCG TTTT CTGTTTTTCAGGAATG TTCT 
GAATTTTTCCAACTAAGTGGTOSATGGTCCTGGCCTCC^^ 
ACCACTTCTGTCTCTCCCTTCCCrCCCTCTGTTCCTCCCCCAACCCC^ 
5 GCTAGTCCTGTAGTTTTCCCCTTTGACCT 

TATTT AT AT ATACTATTATAT ATGTTTAT AT ATCAT ATAT ATTAAT^ 

ATGTTTTGTri'GTrit3TriT?rTT AGAGATAGTCTTG CTCTG TTGC CCAAG CTGG AGT AT AGTGGCGTGATCTCAGCTCACTG CACC 

CTCCATCTCCCAAGTTCAAGAGATTCTCCTGCCTCAGCCTCCCAA 

TTGTACITTTAGTAGAGAAAGGGTTTTGCCTTGTTGGCCAG^ 
1 0 . TCCCAAAGTGCTtyXIATTACAGGAATGAGCCATTGTACGTGGCCTAI 1 T I ' nrrr ATGTTATCTTTGTTGA GVrn 'GACATCTAGG 

TTTTGCTAGCTTCITAAAGTAATTTAGAAGATATATAAGTATATATA 

TTAATTACATTTTCTTTAAGCGATGAGGGGAGCC 

GCAGTTTCTCTGCATTTATATTTTAACATGGGCATCTTCT 

TTTGCCATTTGTAAAGTCACTAAAAATGATCAGTTATTTA 
1 5 AACAGGAGAGGAGATCTTTTAGTAACTTTCTTGAAGTTTTTAGTT^ 

^g^^-^-j-p CTTTTrnT C CAAGAAGATGTTTT AACrATGT AC CT ACCGTT AG<rTTTT CAGT AT CTT ATT ctacttttt aat at at 

CTCTATTTTATAAGAGTCATCAAAGTTCTTTCTCTGACTCT^ 

ATTCCAACAAGGGTCATTGTGACTTTTTTTTATTCTT03GATATAAAA 

ACACTT ATGG TCTGGC CC CACT ATAC CT AG C CAATATTATCATTTTL'rriTACTTTTTrTl'fGAGACGAAGTCTTG CT CTTGTCAC 
20 CCAGGCTGGAGTGCAGTGGCGCAGTCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTCCAGTGA 
GTAGCTGGGATTACAGGCACACGCCACCACGCCCGGCTAATTTTTGTATTTTTAGT^ 
GGTCTCGAACTCCTGACCTCAAGTGATCCACCCGCCTCATCCTCCCAAAGTGCTGGGATTACAG^ 

CAATCTT AACCGTTTTTT ATACCCCAGCAAGAAAATTTTATTCT AATCAG CTCATTCTCTTCATTTCT CATCTCTGATTATTTTCC 
CCCACTTTCTCTCTCCCATCCTTTGAGAGAAATAGTTCTTTATGCTAACTGCATATGA 

2 5 GGAGTGTGGGTAATATTGTCCTGAATTTACTGGACCTAAGAAAGAATCACTGAACTAGGATC 

CTGAAAC CAGCTT AAATCTTCAG CTTT AATTG CAT ACCT ATT ATGACT AAG ACA.CCAAAATAAAT CATG AGGGG ACAAAAAAAG AA 
AATGTTTCTGATCTTAAAGTAGCTAATAGACATGGAAAACATAATCAACTT^ 
CCGAGTTGATCAGTt*AACACATCATAGAGGGATGCTCTAAGGAAAGATCCATCCTG<3ACAATG 
AAATACAAAAATTAGCCAGGTGTGCTGCTGCGTGCCTGTAGTCCCAACTACTTGGG 

3 0 GGCTGAAGCTCCAGTAAACCAAGATTGTGCCACTGCACTCC^ 

ATTTTTTTTTAAACAAATATATGTTTGCTTATTGTATGCTAGG 

AAACCTATGC CCTCCTGGACTT CATT CTGTTGGGGAGTGGG AAG ACAC ACAC CAAT AACAAAATG AACACGT AAAAT ATATAT AG T 
AAGGTGGTCACATATGCTACGGAACAAATAAAATTAACAA^ 

AGCCTTTCTAAATGAGAAGGTAAGAGAATGCTTCATTGATAAGGTGACATTGGGATAAAGAGATGAATGA^ 

3 5 AACTTTATAGAAATCTT AGGGG AAGAGCATT C CAGG CAG AAGG AATAG T AAATTC ACAAGT C CTG CACTGGGAGT AT ACTTGG CAC 

GTTCAGTTAGGCCAAATGTGATTGGAATGGGATGAGTGACGG<1AGAACAATAAGAAATGAAGC^ 
TGATGGCTTTGCTGAT1T1TAGTGGG<SAAGCCAGTGGAGGGTTTTGAGCAAGA 
TTTGAGATGGGGTTCACTCTGTTGCCTAGGTTGGGGTGGCGCGATCACAGCT 
CTCCCACCTCAGCCTCTGGGACCACAGCCACACACCACCACGTCTGCCTAT l^ ' l 

4 0 CCTACTATGTTGCGATCCTCCCGTCTCAGCCTCCCAAAGTGCT^ 

TTTTTG AAGAAT AGCTACTGTGCCATTCTAGG CATGAAATAGGAG AGG CCATTTCAGAAACTCTTGTT AATAAACTAACAGAG AAA 

TGAAGGTGGCTTAGACCAGGGTGGTAAAGTTAGAGAAGGTGACTAATATCTACA^^ 

TTGTCAGATGTGAGAAAAGAGTCAAGGATGATGCTGAAGTTTTTGG 

CT AG<3 AAAAAC AAGTTTTG TGGGGGTGATGGGGG AG ATCAG AAG TT AG TCTTTG AAC ATG TT AAG TTTG AAATG C CT ATT AGGT AT 

4 5 ACTAGTAGACATATTCAGCAGCCAGTTAGGAATCTGGTGTTGAGAGGAAAAGGTAGA 

TTCATAGTTGCCATATAAAGATACAAGATTG<lATGAGATCACCrrAGGOAGTAAGTGT 
TTTGGAGATATGAAAAATTAGCAAACWCATCATGAAAGAAAAC^TTGAAATACCT 
TGCTTCTCAGACTTTAATATGCATATGAATCATTTTC 
TGTTTGTTTGTTTGGTTTTTGGTTTTTTTTTTT^ 
50 GCCGCAGTGTAGTGGTGCACTCATGGCTCACCGCAGCCTCAACCTCCT 

CTGX3GACTACAGGCATGCACCACCATGCCCGGCTAATTTTTGTATTTTTTATGGAGATGGGGTTTTGCT 

TGTGAACTCCTGGGCTCAAGCAATCCACCTGCCTCGGCCTCCCA.GAGTGTTGGG 

TTCTGCTGTTCTCACAGTCTTTCAGAAGATGCTGCTGCTTCTGGTCTC 

GAGTCATGAAAAACAATTGAAAGACCTTAACATTTGTAGAAGAATAAAGATTAATATCCAGAATAAAGACT^ 

5 5 TGGAAAAAACAAACAACTCAATAGAATAAAAGCGAGTGCAGTATAGGGAA^^ 

GTGCTCAACTTTGTAAGAAATACAAATTAAAACATGATTGCCT^ 

GGGTGTTGGTGAGATTGTGGGGAAGAGTGGACTTATACCCTGCAGATAGGGACTTGACTTGATACATT^ 

TTKK3CATATTTATTAAATTTTAAAATAGGTATATCCTTC 

TTTACTCTGAAAAGCAGTTCAGAACrTATTTGTTGGAGTATTGT^ 

6 0 GAAATTATTAAATCAAGAGATTTGGAACTCTTTTTTTTCAT^ 

TATAACTTTGAATAGTTTAAAAC^GCCATAATTTCTACTGTGTGTTATAT^ 
GCAGATTTAGATCATTCAACTTAGAAACAGAATCTTCCGTATAAC 
GTTTTTCTTTTTTATTAAGAAAAAAAACCrGATTACTTA 
AACTrCTGGATTCTAGTCTGGCrGGCItKn , GATG<jGTAAGG 

6 5 TGCTCTTAATTTTTCCCCCTTTTCTCTCAA 

GAATGGTG CAT CTT AGT AG TATTTGAGGTGG AAAAGAACTTTG C CTTT CTTTT AT AAAGTGG AAAAT AATTATTTTAAAAGAGGAA 
GT AG AC AAGG AGAACCAGTTCTT AAG CAGAT CAAT CAGGG AG CAT ACAG AT AAAACTT GAGGAT CTGGAAATTCT CTT AAAATTGT 
CTATGCCCACCTAACCCCTGGATACCACTGAAGTTTM 

CCATTAGCXAGTAAAATATATGCATATTGACGTGGAGAGACTTCTGAGACATTACTCCAT^ 

7 0 TATATATAGTATTTAATTATATGTCTACATTTCATATATGTATGTCTTA^ 

TGCrCACCAAACCATAAACAGAACrTAACCTTTTT 
TCTGAATATTCATGTATTCTTTGAATCTTTTATAGTTGAGAATACG 
AATAAAAGAGAAAAACCACTGTACTAAATGATCTCTAAGCTCJVAA^ 
GTT AAAAT ATTTCTAATT CAT AT ACT AAGTGATTTTTATTT AAATTC^ 
7 5 ATTAACTTCACACACCAACATATAGAGGCCCTATTGC^*C^ 
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CTTTTGAGTATCATATCTAGTAATTTTGAAAAGAAAAAAAATG^ 
AGGTGGGATTAGGAATAGATTTGGGAAGAGAGAATTAGCTTTC 

ATAAACATGTATTTTATAATTTAGAAAACATCAGAACTAAGGGGGAAGTGATATAAATT^ 

TATTTTTATTTTTATTTTATTTATTT AlT ' ri '' l TTI ' I GA GACGGAGTCTCGCTCTGTCGCCCAGGCTGG AGTGCAGTGACACAATCT 
5 CAGCTCACTACAACCTCCGCCTCCCAGCTTCAGACGATTGTCCT 

CACGTCCACCTAATTTTTTCTATTTTTAGTAGAGATGGGGTTT 

CCCACTTGGCCTCCCAAAGTGCTAGGATTAC^GGTGTGAGCCACTGTGCCT 

ACACAGTTTTGGAGCTCAGCAAACCAGTTACCATATTGGAAAATGGGGT 

AAAATTGGTTTCTGCTTCCTTATAGGTAAAATTTGTAATTTTT 
1 0 GAGGTTTGTGTCAAGCCTTGAGCATAATTGTAAATTTATATTCATTA^ 

GCTTTCTTAGCATTAGTTTTCTTAAGTAGTCAGTCAGACrrCC^ 

GCCCACACTGCTCACTACTACCCTACACACCCTTTCITTATAAAAGCTGGCTCAGCT 

AACTCTAAATTAGAAATGATCAGGTTGCCATTTGGCTCCAAATACCAGAG^ 

TTTGTAACTTAGTAATTTAATTACTTTTTGAAGGGCCAGTTAAGTCTrA 
1 5 TAAACTTGGAAGTTTGGAAAT AACAG CT AAT ATGCTGTTTT CTTT CTCCTCTT C1TCATGTGGCTCAT C CCAGATCACACTC CT AG 

TTAATATCAAAGCTAGGACTAGAATTCTTATTTCTCATTACTAGTTAGTGTACTTC 

ATTATTAGGAGGCraATO^AGGTGTCATAAG^AAGCAAGTACTGCT 

TATAACCAGCAGTTTTTCTTCCAACCATTTCTATGTCTTGGAGACTT^ 

TATGTCCCCACTATAGCAGTGATTCACTGCCCCCTCCTCACATAACTCTTCAGGCTTT^ 
2 0 CAAACCTTAGAGCTTCTTATTCTGTGCCCAGTACTGTGTTAAftTGTATTTTATTC 

AG AT ACT AGGT ACTGGAT ATTTTTGTT ATTC CCATTTT ACAAGTG A^ T AACTT AACT AAG GTGACACTG 

CTAGTTAGGAAAGAAGGCAGATATCTAAGTC^GAAATCTGTGTT 

GCTCTTTCCCCACCCTCACCCCCCAGACAGCTTTTCTTATTTGGG 

TTTATGGACCTTTTATTAGATACAGTCTTTTTGGTTGGT^ 

2 5 TCTGAGTAGGACAGCATACATCAGACCATAACTCACAGGTTATATTATTAGCTT 

TGGTATACAATGTGTGTGTATTATGTGTATGTAGTATGCAAAGGTATGCATATACACATGTACATACACAC^ 
GTATAATATAATAATTACGATGAGCCATGGAGTCATACAGACTCCATGTTTGAATCCCAACT 

TAT C CAG C LTT L"l 1ATACCCCACCTCATTAGTAAAGTGGAAATAATATCACATCTTGGGTAAGTTAAGGATTAATTGAGTTTATAA 
TTATGAAATAACATAGTTCCTGATTCAAATATTAAGAATTCTATAAATAGTTG^ 

3 0 AGCAATTGTCTGTCTTTTTTTTTTTCCCCCTGAGATGGAATCT^ 

CTG CAACCTCCACCTCCCAGGTTCAAGCAATTTTCCTACCTCAGCCTCCCAAGT AGC C C 

AGCTAATTTTrGTGTTTTTAGTAGAGAOiGGGTTTCAGCATGTTGGCC^^ 

CCCTTGGCCTCTCACAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCTGGCCGGGAA 

TCTCCG CCTTTTT AAGCTTATATTTT AATGAGGGGATGGATATAT AGACAATGAAC^ CACTTA 

3 5 TCACCATTTTATTTTTTT ATTTTTTGTTG AGATGGAGTCT CATGCAGTAGCACGATCTCAG CTCAC 

TGCAGCCTCCTCTCCTGGGTTCAAGTGATTCTCCTGCGTCAGCCTCTCAAGTAGCTGGGATTACA 
CTAATTTTTGTATTTTTAGTGGAGACAGGTTTCACCATGTTGGCCA 

TCGGCCTCCCTAAGTGCTAGGATTATAGGCGTGAGCCACCGTGCCTGGCCATCACCATTTTAT 
GCCCAAGAGAAAAGACCTGTACATACTGATATTTAAAGGCCCATC^GTG 

4 0 GTTCTGCAGGTTTCTCATCTGCCATTTAAGATTCCTTTACACT^ 

TTTTAACCATTTTTAAGTATATGGTTCTGTGGCATTAAGTCCATTCAT^ 

TTTT ATCATC CAAAACTG AAACT CT CT AG CCTTTAAACACTAAAT CT CT ATTT CCTTCTTTCTCT AGTT CG TAG CAAC CACCATTC 

TACTTTCTATCTCrATGATCTATGAATTGGACTACT 

GGCTTATTTCATTTAGCTTAATGTCTTTGAGGTTTATCCA^ 

4 5 AGGCTGAATAATAATCCATTCATTGTATGTATTTACCATATTTTGAT^ 

CGTTTTGGCTATTATCAATAATGCTGTGAACATGGGTGTAOSAGTATCTGAGTTCC 
AGAATTGCAGGATCACATGGTGATTTTATGATTATTTTTTCT 
TCCTACTGGCATCGCATGGGGTTCrAATTrTTTACATCCT^ 
GCTGAGCGCGGTGGCTCACACCTGTAATGCC^GCACTTTGG 

5 0 CTGACT AACATGATG AAAT C CTGT CT CT ACT AAAAAATACAAAAATTTCAGGTG TG TG T AGTGGTGCG C ACCTGT AATC CCAG CAA 

TTCAGGAGGCTGAGGCAGGAGAATCGCTrGAATCCAGGAGAAGAGGTTGCAG 
GGGACAGAATAAGACTCCGTCTCAAACAAAACAAAACAAGGCAAAACAAAAA 
TTATGGTGCTGATTTGCATTTTCCTGATGACTAACAATGTTGAGCCTTTTATTATC 
AGAAATGTCTGATTTAAGTTCITGGTCCATTTTT^ 

5 5 CTAGATATCAGTTCITATTGGATATTTAATTTGCA 

AGT C CAAATTATCT ATTTTT CTTTTTGTTGCTCTTGCCTTTG ATATCATAT CCAAG AAAT C ATCACCAAAT CC AATG CCATG ACAT 
TCTTCTCCCAAATGCTCTTGCATTTGGGTCrrTTGATCC^ 
CTTTTGCATATGGATACCGAGTTTTCTCAGCATC!ATTTGTGGAAAAGA 
TATATATGCAAGAGTTTATTTCTGGGCTTTCTATTCCATTC^TCTATATGTCT 

6 0 ACTAGCTTTGTCTAGTAAGTTTTGAAATCAGGAAATGTGAGTT 

TCCCTTGAGATTCC T Tl' T l'TT T l'lTTGAGATGGAGTCrrCACITTG 
CAGCCGACCTCCCAGGTTGAAGTGATTCTCGTGCCTCAGCCTCCTGA^ 
TTGTATTTTTAGTGGAGACGGAGTTTCACCATGTTGACCAGGCCAATCTCGAACTCCT 
TCCCAAAGTGGTGGGATTACAGGTGCAAGCCACTGTGCTCAGCCTCCATGTGAATTTTATAA^ 
6 5 GGTCATCAGGATTTTGATAGGGATTGGATTGAATCTCTAGATTGCTTTGGT^ 

TATGAGTGGACAGTTAAGAATAACCAGATATTTAAAGAAAGCTGTCAACAGGAAAG 

G CAAT ACT AAAGAAGCAAAAG T AACAG AG AACAAGAAAGACCT CHTGGAAATTAAAAATACAAT AGCCAG AAATCT ATTTTTATTT 

TTATTTTTTTTTAATTTTCTTTTTAATTCT 
70 AGCGCACACTTGGACAAGGGAGGGGAAGGGGTTCTTATCCCATATGCATGTGGCT 

GGTTAGACTGCACAC^CTAAACTAATTCCGATTAATAACTAGAATTTT^ 

TCTTGCTCTGTCACCCAGGCGGGAGTGCAGTGGTGCGATCTCGGCTCACT 

CCTCAGCCTCCCGAGTAGCTGGGACTACAGGCGCCCGCCACCACGCCCATCTAAT^ 

TTCACCGTGTTAGCCAAGATGGTCTCGATCTCCrGACCTCXn' 
75 G AGC CACTGTG C C CAG C CCAGAAATTTTT AAAAATAT ATGT AT AT ACAC ACAT AT AGGT ATGT AC R .TGTGTAT AT ATGT ATGTGTT 
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GTATATATATAAATAAATGTTGAAAAATAAATATTGGAAGATAAAT^ 

TAGAGGAAACATGAGAAAAATTAAGGAATAAATCCCAATCCATAAAAAT^^ 

AGAATAAGTT T C T GGAACAAATAGX& A Tr iT CATTTCCT 

ACACAGGTCAGTTGTCAGATACTTGGTTGAATCGCTATAGTCCCTTTAGCTT^ 
5 TCCCTCATACAACTCCTCAAAGCAGAAAGCAGAGAACAGAGAAGTTCTCTTC 

CAAAAATCCTCTATAACTCTTCCTTTTATATCTCATTGACTAGAATC 

TTTCCTGAAGTAAGAGTCTAATAAGTCAAAAATTTC 

GTGCTATGATTATTTTATTCTATAACATTTCAACAGAAACCCCTT^ 

TGCTTCrrCTCTTCATTGTAGCAAAACAGAGAAATTGGTTACACC^ 
1 0 ATTTATTTAAAAGCCATCTTTTCAGTTCTAGATGTTCTCCTA 

CTACAGTGAAATATGAGATACAGTACTCTAAAGAATTTCAAATTTG 

TCCAAAGGAAGAAAAGATAAATTTACTCTGAAAAGTTGTTCCCCAGAGTTTCTT^ 

CCAACAAATTATTGGAATGCCTTCTATGTGTAAGATATAATGCTAGA 

TATTGGAGCCCAGAGATGAAGGGATACTGTCTCTGTAATCACCCTACAGCTGAGTCCTAA 
1 5 TACAGT C CC CTGTAG ACTG ATGAAGACAGTAACAG CT CCTGACATTT ACTGAGTATTTA CT ATGTATCAGG T ACT AAT CAT AT ATT 

AGTTCATTTAATCTTCAGCCTACCCTTTGAGTTACACTTAACTAT^ 

GGATTTAATAATATGGCCAAGGTTACACAACTTGTAATTGGTAGCCAAGG 

GTTTAATCACCGTATCATTCTTACACTrGGCAGCCTTTCTG^ 

CAAACCCTTCAATGAAATTAATACTGGAAGCTGGATGOT 

2 0 TGTAAGATTTCATTGTATCTTGTGAGAGTAAGAATAATTAGACTAAATTTAATTAA 

GCTGCTTTTTCTCATTACATTGAAATTAAATAGATCCT 
ACGATTAACTTTTTAAAACTGTGTAATTCAGTGGTTTTTACTATAT^ 

AAAGAAACCCTTTATGCATTAGTAGTCATTCCCTGCAGCCCCTGACAACCACTGATCTACTTTTTC 
GAAATATTTCATATAAATAGAATTATATTATGTGGCCT^ 
25 ATGTAGTAGTATGTATCAGTACTTCATTTCTTGTTATGGCTGAATATTTCATCATACACAT 
TCAAOTGATGAACATTTGGGCTACTTATATGTTTTTGCCATTATC 
GATATATTTTCATTTTTCCTGGCTGTATGCCrrAGGAATAGAATTGCTGGGTC^ 
ACTGCCAGACIAGTTrTTCAAAGTTGCTGCACT 

CAACCCITG TT ATT ATCT ATTTGTTT1TTT AATT AAM C CATCTT AG T AGGTGTG AATTGGT ATTTC ATTTGGT TTTG ATTTG CAG 

3 0 TTTCCTAATGACTAATAATATAGAACATCTTTTCATGTTCTT 

CCTTTGCCCACITAAAAAAACTGGGTTTGTCCTTT^ 

TAATCCCAGC^CTTTGGGAGGCTGAGGTGGATGGATCACTTGAGTCCAGGAGTTCGA^ 

GTCTCTACAAAAAATACAAAAATTATCCAGGCGTGGTGGTGCATGCCTGTAGTCCC^ 

GCTTGAGCCTGGGAGGCGGAAGTTGCAGTGAGCCAAGATC 

3 5 GCAACCrCTACCTCCTAGGCTCAAGCCATCCT 

ATAA1TTTTGTATTTTTTGTAGAGATGGGGATTTGCCATGTTGCCCAGGCTGGTCT 
CTTGGCCTCCCAAAGTGCTGGGATTACAGGTATGAGCCACCGCACCTG<3CCAGGAG 
GACTTGCAAATATTTTCTCCCATTCTGTGGTATTTTT^ 
TGGAATTCCAATTTATCTATTTTAAAGTTTAAATGTGGATGGA^ 

4 0 ATCATACATGAGAGACCATTGCCTAATCAAAGGTCAGGATCATTTACTCCTGTGTTTT^ 

TATAGCCAAAACAGGTTTAGTTGCTTGCTGCCTGCAGAGTCCAArrAGTA^ 
AAAGTTAGCTTAAAGGAAGAAGACGTACAGGCTTCCTCCCTTAAGGGTACTGCTTCCCTG 

AGAAGGTGCCTACACGGGGG CAGAAATGAGCGGGTGGAAGATCTG CATATTCCCTTCGGTGCCTTCTTTCTCAGGCAGTCAAGTTG 
GTGGCTTCATGGGGAAAAATACCTCAGAGGTGGCTGAAAACTCT^ 

4 5 GGGAACTTCCTGACXjGGTGAGAGTTCCACTCAGGATTGTCTAAGCAC^ 

AAAGGAGATAAAAC^CCATAATTGCATTTCTTTTATTCITTTA 
GGAGTGTAGTGGCATGATCTCGGCTCATTGCAGCCTCTACCTCCTGGGCT 
GGACTATATGTGCATGCCACCATGCCCAGCCAAGTTTCATATTT^ 
AAACTCCTC^GCrCAAGTGATCTGCCTGCCTTGGCCTCCCAAAGTACT 

5 0 TGCATTTCTAAAGAGCTAAGTAGGAAGTGGGGAGGAGGAGGAAAGAAAAAAA 

CAATTATATAATAGATATGTGACCCATTTTGTTTTCTTT1TAATTTTTGTAAA 
CTCCTGGCCTCGAGTGATCCTCCAGCX3TTGACCTCCCAAAGAGCTGGGATTGCAGGCGTC 

GAGACCATTCTTTCTGCATTGAAT^GTCTTGCTGGCTTTGTAAAAAAAA 

5 5 GTTCTATTGAGAAATAATTCACATAOTATATTATTCACTTTTTTAAAGTGTGTAAT^ 

ACAACTGTCACCACTATCTGATTCTAAAATATTTTTATCATTCCCACAAAGAAACCCT^ 
CCCTGACAACCACTGATCTACTTTTTGCCTTTATGGATTTACTA^ 
CCAAAATTATTGCGCTGTCTTGATTACTATAGTTTTATAGTTAAGTTTTC 
TTTTATAAGATTGTTTTGATAATTTCAAGTCCGTTGTATT^ 

6 0 TTGAGATTTTGAGAAGGATTGCATTGAATCTGTAGATCAACTTC^GAAGCGTTGTCA 

TITTTTCTT T T' n GAGATGGAGTCrTGCTCTCGTTGCCAGGCrrGGAGTGCAGTGGCGTC 

CCAGGTTATCAATTATTACTAATACAAATATTTATTGGATTTTTGCTATG 

TCTCATTTAATCCTCAC 

65 HUMAN SEQUENCE - mRNA 

CGCCTCCCGGCCCCCTCCCCGCCCGACAGCGGCCGCTCGGGCCCCGGCTCTCGGTTATAAGATGGCGGCGCIGAGCGGTGGCGGTG 

GrcGACCCTGCCATTCCGGAGGAGGTGTGGAATATCAAACAAATGATTAAGTTGACACAGGAACATAT 
ATTTGGTGGGGAGCATAATCCACCATCAATATATCTGGAGGCCTATGAA 

7 0 AACAACAX3TTATTGGAATCTCTGGGGAACGGAACTGATTTTTCT 

TCTTCTAGCCTTTCAGTGCTACCTTCATCTCTTTCAGTTTTTCAAAATCC 
AAAACCTATCGTTAGAGTCTTCCTGCCCAACAAACAGAGGACAGTGGTACCT 
AGAAAGCACTGATGATGAGAGGTCTAATCCCAGAGTGCTGTGCTGTTTACAGAATTCA 
GACACTGATATTTCCTGGCTTACTGGAGAAGAATTGCATGTGGAAGTGTTGGAGAATGTTC 
75 AAAAACGTTTTTCACCTTAGCATTTTG 
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TTCACCAGCGTTCTAGTACAGAAGTTCCAC^ 

CACCACCCIAATACCACAGGAAGAGGCXSTCCTTAGC^ 

TATTGGGCCCCAAATTCTGACCACTCCGTCTC 

GAAATCAATTTGGGCAAOSAGACCGATCCTCATCAGCTCCCAATGTGCATATAAACACA^ 
5 ATTAGAGACCAAGGATTTCGTGGTGATGGAGGATCAACCACA^ 

TAACXTTCAAAGCCTTACAGAAATCTCCAGGACCTCAGCGAGAAAGGAAGTC^ 
CACTTGGTAGACGGGACTOSAGTGATGATTGGGAGATTCCTGATGGGCA^ 

GG AACAGTCTACAAGGGAAAGTGG CATGGTGATGTGG CAGTGAAAATGTTG AATG TG ACAG CACCT ACACCTCAGCAGTT ACAAG C 

CTTCAAAAATGAAGTAGGAGTACTCAGGAAAACACGACATGTGAATATCCT 
1 0 CTATTGTTACCCAGTGGTGTGAGGGCTCCAGCTTGTATCACCATCrCC^ 

GATATTGCACGACAGACTGCACAGGGCATGOATTACTT^ 

TCTTCATGAAGACCTCACAGTAAAAATAGGTGATTTTGGTCTAGCTACA^ 

AGTTGTCTGGATCCATTTTGTGGATGGCACCAGAAGTCATCAGAATC 

GCATTTGGGATTGTTCTGTATGAATTGATGACrrGGACAGTTACCTTAT^ 
1 5 GGGACGAGGATACCTGTCTCC!AGATCTCAGTAAGGTACX3GAGTAACTC 

AAAAGAAAAGAGATGAjGAGACCACTCrTTTCCCCAAATTCTCGCCTCTATO 

AGTGCATCAX3AACCCTCCTTGAATCGGGCTGGTTTCCAAACAGAGG 

CCAGGCAGGGGGATATGGTGCGTTTCCTGTCCACTGAAACLAAATGAGTGAGAGAGT^ 

TGAACATATGTTTGCITATATGTTAAATTGAATAAAATACTCT 
20 AAAAAAAAAAAAACCC 

HUMAN SEQUENCE - CODING 

ATGGCGGCGCIX1AGCGGTGGCGGTGGTGGCGGCGCGGAGCCGGGCCAGGCTCT 

2 5 AACATATAGAGGCCCTATTGGACAAATTTGGTGGGGAGCATAATCCACCATCAATA 

AAGCTAGATCCACTCCAACAAAGAGAACAACAJGTTATTGGAATCT 
AATGGATACCGTTAGATCTTCTTCCTCTTCTAGCCTT^ 
CACGGAGCAACCCCAAGTCACCACAAAAACCTATCGTTAGAGTCT 
GGAGTTACAGTCCGAGACAGTCTAAAGAAAGCACTGATGATGAGAGGTCT 

3 0 TGGAGAGAAGAAACCAATTGGTTGGGACACTGATATTTCCTGGCT 

CACTTACAACACACAACTTTGTACGAAAAACGTTTTTC^ 

CGCTGTCAAACATGTGGTTATAAATTTCACCAGCGTTGTAGTACAGAAGTTCCACTG 
GCTGTTTGTCTCGAAGTTCTTTGAACACCACCCAA 

CCCCTTCCGCACCCGCCTCG^SACTCTATTGGGCCCCAAATTCTCACCAGTCCGTCTCCT^ 

3 5 TTCCGACCAGCAGATGAAGATCATCGAAATCAATTTGGGCAACX5AGACCGATCCTCATCAGCTCCG 

CTGCCTCATTACCTGG CTCACTAACTAACGTGAAAGCCTT ACAG AAATCT CGAGAAAGGAAGTC ATCTTCATCC 

TCAGAAGACAGGAATCGAATGAAAACACTTGGTAGACGGGACTCGAGTGA 

ACAAAGAATTGGATCTGGATCATTTGGAACAGTCTACAAGGGAAAGTGGCAT^ 

4 0 CACCTACACCTCAGCAGTTACAAGCCTTCAAAAATGAAGTAGGAGTACTCAGGAAAA 

GGCTATTCCACAAAGCCACAACTGGCrATTGTTACCCAGTGGTGTGAGGGCTC 

CAAATTTG AGATX1ATCAAACTT AT AGAT ATTGCACG ACAGACTG CAC^ CATGG ATT ACTT ACACG CCAAGT CAATCAT C CACA 
GAGACCTCAAGAGTAATAATATATTTCTTCATGAAGACCTCACAGTAAAAATAG^ 
TGGAGTGG<5TCCCATCAGTTTGAACAGTTGTCTGGATCCATTTTGTG 
4 5 ATACAGCTTTCAGTCAGATGTATATGCATTTGGGAT^ 

ACAGGGACCAGATAATTTTTATGGTGGGACGAGGATACCTGTCTCCAGATCT 




CTTGTGCTTCTCCAAAAACACCCATCCAGGCAGGGGGATATGGTGCGTTTCCTGTCCACTGA 

50 
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Table 87 

MOUSE NOMENCLATURE 
ICSGNM Tle3 
Celera mCG21830 

5 

HUMAN NOMENCLATURE 
H6NC TLE3 
Celera hCG3 879G 

10 MOUSE SEQUENCE - GENOMIC 

TGTTCCTGTGGCTTTCAGGAACCTAAGTCACX3TCT 

CrCTTTATTTTTGTTTATTACTrTAAAGTAATTT^ 

TGTTGATCAAATTAGATTCAAATTCACAGAGATCAGCTTGTCT 

AGGCITTCTCTGTTTTAATTGCATCCATTrATTGACT 
1 5 GAAGCCAGGAGGTGU^GAGCATATGTTAGGGGATGTATCCT 

TGTAAGGAGTAGAAGGCAGGATACTGCTGAGAGGGGTTGTCCAGTGGTAATGGGCACC^ 

TAACCCTGATCTTCCACAAAAGTCACTCCTGCCTAGGTCT^ 

TATGGAATCCATGGCTTAACCTCTGTAGACTGCCGAGCCATTTC^ 

CAAAGATAGAAGAATTGGAAGTTCAAGG C CAGGCTTGGCT CT AAGAG ATC CCG TCT CAAAACAAAACAAAACIAAAACAAAACAAAA 
2 0 CAAAGCIAAAACCATGGGGCTGGAGAATX3ACTCAGTAGTTAAAAG 
CTCACAGCAGGCT^GCTCATAGCTACCTCTAAACCCA 

CAAGTGGCTTACATACCTCGAATGTGCACACTCACAATGTAAGACTGATGATAAAA^ 

CTCCATAATAGTTCTGGAGGGAGCGGTACCCCCATTTCTCTGTCAATAGTTCTAGGAGTC 

GGGCTGTTACACAGGAGTCGGTACTGCrrGAATGCAAGCAAACAGACrr 

2 5 GCACCCAGGACACATCGCCACTCGTGGAACTACAGCCCTGCTCATACCAAG^ 

GCTAGGAAGTTCAGAAACCCAAGCCCCAGGAATCCACCTGGCCCCCCMAAGAA 

CTACTTCCCCATCTTCTCCCTGCACTATCATCTTTTAAAGGTC 
ACGGAAAGAATCAGTTTCAATGCCAGAAGACCrGACTTTC^^ 

3 0 AGCCAAATATAAATCCCTCCCTCCTTATGTTGCGTTT 

GTTACCTAGCTTTAG CACT ATTAGGGGTAGG AGCCATGTG ACACCCTG CTGTGGTGGCTTTTCITGGCACTGT AGTATACTCAGAA 
GCCTTCACTCTGACTACCTGAGTTCCATTCCTAGGACCCACAGGGTGGAAGG 
CCACATACACACTAACAGAGAAATGGTGAGAGAAATTAAACAAArTATTATTATTAT 
TGCACCCCTATTCTTGAGTTATGACAACAAAAAAAGTCTCTGGATATTGCCT 

3 5 AGAACTAGTAGGTACGCAAACTTGATATGCAATCAAACTGTCACTGTGTATG 

GGACAGAAAGAGGGTAGAACITGAAGGGGAGGGAAGCTGTCAGGTAAATACCCTCTCrr 
CATCACAGAGCATGCACACGGGGCCACAGTTCTGTACCAGCCTTTACCCAGGCTTCATGC^ 
' TGCCCCACGCCACTGGCATGGGTAATTCTTCTCCACCCTTGCTAAGATGGATGCGTTAATACCAG 

AAG C CTCAGCTTTGAAT CTTT AAAAAGGAAAG AGAAACGG C ACAAATCTCTT AATT AGGACGC CC AATCACTTCG AT AAAT ATTT A 

4 0 CCCGGTTCCAGCATGAGGCATGGAAACCCGGCGCCATTAGTCAATAGAGGTC 

TCAGCCTAGCTTGGGAGGAGGTGTOUSGAGAATGTTTCCAGTCT 
TACTGCTGCACACCTGCCCCtSCACCAAGTCCCAGGTGGACATG^ 

GCTTTCTGGACAGAGGTAGTGAAGCTACCGAGCAGATCCAGTATTCTGGGACTCTCCAGAG 
GTATTATTATCTGGAACGCTACTCTCAGTCAACCAAGGTCACAGCACTTCCCAAGCAG 

4 5 TGCCCAGTATTTCCTGCATTGAGTTCTAACACTGCTGGCTCTACACATAGCC^ 

CATTCTGGTAAGCTCTTGGTTGGACrrAACACCCAGACACCATTCTTCCCATTTT 

CT CTT CACTCCTGATCT AAAT AAAT AGCCTTTGGGGACAT CCT CCACTGG AT GAGAAG TC CAAG AT AGGTG TGGGATGAGGATGAG 
GGTTGGTITGTTTGTTTATTGGTGTTTTGTTTTGA 

GGCCTAAGAGAACGGCTCTAAAGAAGGAAGGCTACCrTGGAAGGCK3CAGGAGAGG 

5 0 AGAGAACTTITGTCTACACTCCTGGGCTGAGCTCGAACCCAATCTTTCCAAAAG^ 

ATCAGACTACTTTATACATCTTATGAACTCGCK3GCANNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNblNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

5 5 NNNNNNNNNNCCCCCCCACACACACACGCAGAGGAAATG^ 

CATAGCTAACTGTTATTGATCACTTAATGTGCCAGGGCTATACCAAGTAGTACTTGAACAATCTCCATTGATCTGG 
GTCCAAGGTGGGCAGGGCCATTATTCTCTTCACTCAACAGAAGAGGAAAT AGAGG CTTAG AAAG AGTTCACAAGTG CCCCAAAGTC 
ACAAAG CTTGACCCAGCTAGAGAGTTGGAAG AGGGTTT AAG AGACTGTGGGAAGT C CCTT CAC CAC CAGGAGCCACAGCCCTCTTT 
GCTGAAGTGAGCTAGGCTCCTCTTACTGCCCTGACTGGCAGGGCAAGTGTCATTAAGTT^ 

6 0 CTCAAGTCCATCAACCCCCACAGTGACCTTGAGGAAGTCATTGT 

ATGACATCAGTGGCTCTACCCTCATTCITTGGGTGAGGATTAAACTTGTTAATATTTGCCCA 

GAGTGAGCACTATGTTAAGTTTGTTAAACAAATAACAAACTCCCCTTCTGGCCCCATAGTGAG 

TGGGATTAGTGATGGATGGGGTTGGAGX3TGGGGGGAGGTTGTGTGTCTGTGTGGGATGAA 

AGAAGG GG ACCACAGAATC CACCAAAG AGAGGTGG CTGG C CTTGAGG AAGAAGACTCTCriTlTT G GGCCAG CTCCAACAAGAGG C 
S 5 TTCCCAAAAATCTGGGCCAGTAGCTGGTGGGGGTTGGGAATGACAGGAGGGGAGGTTTACAACCCCACCCT 
TTCCTCAAAGGATAGGCCACAGTCTGGTGGCTAGCCAGAAGCCATTTTTCTTAATTC 
TGTTTTGGATTTATTTTGAAGGAACCAGAGAAGCATCTCTGTCCCCACCTCAAGTTCCCA 
GCTGGGGGCCTTCCGCTGATGOiGATCCATCCCGGACITTCCAGCTCACCGCGTGA 
TCCTCTCCCACGTGGGGTGGACTCCGAGGTACITAATTAAGCAGCC^ 

7 0 TTCTTGTAACATAAACAGTTGAAACGCTTCCAATTAGAACTGG 

TT AAAATAATTG T CAAG CGCTTCGG ACAGTGTGTTGTGTG CAT CTTTCTCT AG CAT AAAAT AAG CTCATTT CCATTTCTT ACAAAA 
TTTGTGAGGTGGCATGGGGGTGGGGTTCTGAGCAGTAGTGAGTTTTC 
TCCCTTTCAGGCTCTGGGTTCTGGTTACAGAAGAAGC^GAAGAAGGAAAAAAAAAAGC 
GAGGCTCCTCAACCATACCTTCAGAGAGGGTTAGAATGAATGAATGAGTGAATGACT 
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TAGATTCGGCTTGCACAAACAAAGGAGTTTGGCCGTGGATCTGGCTGTCCTGGATTCCAATTCT 

TACTAAGCTTTGTTTGTGTATCAGCCCCTCTTCTACTCTATA 

TGGGTAACCATGGCCGCITTGTACCTATTAAACGCTCAAGG^ 

TCTGCCGTGGGGGCGGCGCTCGGCAAAACCGCATCCATCACGCTCATCATCCGTGTGGC^ 
5 AGGAGAACTCAATGTGAAGTCTGCCTTGCACTAGCTCACGCTGACTAGTGGG^ 

CGGAGACCAGAG ACAGAGCTGCGGCG CTTTTCTGT ACACAGTCCGCAT AG ACAGG AC CTGAAAGCC CAAGG CATCGGCGG CCG CCA 

GGAGGAGGGCGCCGTCCX3ACCCAGTCCGGJVGGGAGACGGAG 

CATTGTGTTTAATAAATCTGGGCGAAAAGCACACCTC 

CGACCCCCAGTTTAAGGGGTCTTAAAACCCCTGCACTrTGATTAGCAGCTTC^^ 
1 0 ACCTCACCTCTCCTCGACACCCATCCCCCCCCAGCACT 

CCGTGGAGTTTGGGTTTTGA rr ' lTC ' r I CC TTTTTTTT CTT TC T TTCT TTCCTTTTTC 1"! CTTCTTTTTTTTTTCTTCAGGGTTTGT 

GTGTGTTTTTACCrTTTATrATTTAAATTGATACGTTTCATTACTA^ 

TTGATATTTAGCCTCTGCTrACTTTTTTATGCATGGCT 
1 5 CCATTCAAAGCTAATTAAGCAGCCATTCCAAGCACTATTTGCAAGCGGGAGG^ 

CCCCTCGTGGTTTTGACTGACAGCTCTGGCTAAGGACr^ 

GGTCTGTGATTGGGTTTTAGACTTGCTCGGGGGCGTGGCCGGCGCGCCCCAC 

AGGAAGCGGGTGCGCCCACCCGTGACGTAAGGGGGAGGGCAGAGAAGGGGGAGGGGAGAGGG^ 

AGACTAATGGATGCTTTGGCCAGATGCTCGCATCTGTCGA 
2 0 ACG ATTT ATGTTTTT ATTT AGG AGAAT AAAT AAATGAAG AGGGGAG C^ CTG AGAGG CT C CATCTC 

GGGGCGGCCGCTCACGTGGAGGTCCCCGCGCGCTCCCAGCTACTCATTAAGATGC^ 

AAATATAGATTCCGGGGGCGCGCC^SCCGGCTCGCGCGCGCCCGC^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 
• GGTC-GGCAGTGTGCCCTGAACGTGACCCTTGTCTGCGAGCAGATGGGCCCCAGAGTCGA^ 

2 5 CTGGTG ACCAGGGACGCTCCAGGCTCGCAGTCGCTCTTCCACGGGTGGCTG CCGATGCATGTGCCCAGGCAGGCCATCAGG CCAAG 

AAACGGGGTCGGAACTCTGGGTTCGTAGCCAGGAGGGCCCCAACTCC 

CTGAAGG AACCCG CC CTTGGCAAGGCTGCGGCCAGG ACGTAGCTGTCCCCAAAGGGCIAAGAGCCCCAAA 

CTGAGCCTGTTGTCCGCAGACTGCOIGGCCCTTCCATCTGCGCGGCCCGCCAA 

CCGCCTTGCATCAGAGGCGCTGTTTATTTGGCTTTCTGAAAAGAAGGGCG 

3 0 GGAGAGGGGGCAGAAACGGATAAAAGAAAGAGAAACTOUriTTGGAG 

TTTTATCJGGTCCTAAATCTTTTGAAGTGAAAACATTAAACTGACA 

TGCAACCCTAATGAGATTAGAGGGGGGCAATCGTTCGCaSCCTCrrGCAGAATCAACTGATC 
GGATTGTACAGCTTGTCCTTTTTTCCCCCCCTTTTC 

T AGAT CGTGGG AAAAAAAG AAG AAAAAAAACCT AG CAG C C C AGG ACT AT AAT AT AT AT CT ATTT ATT AT ATGTCTTT CGG TT AATT 
3 5 CCTGATGTGAGGAGCGCTGTTTTTCACTCCCAAGCGAAAGTCACCAGCGGGTAGGGCCT 

GACACAAGAGGGCATTGTGACGTCGCGCCTGGCGGCGCCCAGTCGCAGTCCGAGTCCCGTGGCCCC^ 
CATTGGCCAACGCTAGGCGAAGCCGCCACAT^ATTTTGTTACTTTT 

CGTGCTCGGCCCCCTTCGCCTGTCTGTGTGTGGTATCCGCAGGTCCGAGGCACTTTTTTTTT^ 
TGTTGAGTGCGTGTGCTTGTGTGTTTTGGGGGTTTGTCGGGGCGCGGAGAG 
40 AGCTCCCGGGCTCGGGCTACX3TCTCCCCCTCCGGTGCTCTCCAGCCrGCGCGCTCCTAGGCGGCAGGAC 

AG AGACGCGGCCG AT CCACTTGAAAGCAGAGGACCCCTTCGCCGAGGCTTC CTATGCCGGCTGGAGTTAGGATGCCACGGCTCATT 

ATTTGCACCGACTGAGCCCGGAGTTCCTCCTCGACCCTGGGAAGGGC^ 

CTCTCCGAGGGGGGTGGGGGGGCGTCGAAGCX3TCCTCGCTGGGCTCTAGCCCTCACCCAC 

45 NNNNNNNNNKNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNWNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNKNNNNNNNNNNNNNNNN^ 
5 0 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CT AGG CTTATTT AAG CATCT ATCATGGCAGGGTAAACG AGGCAGTTT ATCT AG CACCTCCATTTTG CTTGTTGCTACCTTTATGGT 

GCGCGCGCGTGTGTGCGCTCIAC^CAAAGGGGCTGGGTGGGTGGG^ 

GGCGGCCTGCGTTGGAGGGGGACGCTGGCTTCCGCCCCC^CCCCTCTCCTAAGTCTTCCAACT 

AG AGCCCTTTTGGCTTTGGGGGTGGTCGGGATGCCCGCTTCCGGGCCCCGT CGGGAG AGCGTTG AAAAACTTG CTGGG AAGGTTCT 

5 5 GGGCTGTGGGGTGGTGGCCTTGCCGGAGGCTCCCGGCGGGGAGGGGGCCAGGGACCC^ 

GTGGCCGAGTCCTGTGACAGGATCAAAGACGAATTCCAGTTCCTGCAAGCTC^GTAT 
GGGCTGGCGAGGAGGGCtKSCCTTGCGCGCTCCTGCTACCCCCCCCCCGCGGGGCCGAGGGGGAC 
GCTGGGCCTCCGCCTTTTGTTTCCCCTCACCGAGCG^ 
60 CCTGTCGAGGGTCGGGGTCTTGCGCCTTCGTCCGGGCCCGCCCCCGCGT 
TGGAGTTGGTACrGCTAGGCTCCGGGTGGTGGGCAGCCACCCTCA^ 
GGCAAGTCAAGTGGTCCTCCTAGGCCCTAGGGTCCAG^ 

CTCACAGCCTCAAAGTGGAGTATGACAAGCTGGCTAACGAGAAGACGGAGATGCAGCGCCATTATGTGATGGTG 
CIAGCAGGGCGGAGGGAGGGAGGGCAGGGCAC^GCAGGGCACCCAC<5CTTTGGGA 

6 5 GACAGCCCCCTGGAGGGCTCCCTTCTTCAGCrTCCTCTTCCCTGCCA 

GT AAAGAGCATTTCAGTTGTGTTTTGAAGGTTTCCCAG AGGG CTGGTTG CCTTAAACTGGTT AGAAACATAAG AGTCAAATGAGCC 
GGTGTCAGGGTTGTATAAAAATCCCTCTTGGAGCCCTCCTGGGAGTGTTTGGAA 
TGGGACAGGAGTGCCTACCTTTCTTCCCCAGAGTTTTTCCAGATTGGAGGAGCCTTTT^ 
TTTTAAAGCCCTCTGCCTCCCGTGTCCCCTCCTCAAGTATTTCCTCAGTTGGGACTCre 

7 0 ATATCTTTTAACCTTGAGTTCAAGTTGACATCTTTCGTTTGG 

TCCACATAAGAGACCTTCCACCCAGTGCTAACCAACTCCCAGAC^ 

ACCTTGTATAAACAGCACATATGGTACAGGGGGTGCCTGTGGGAGGGGGCACATATATT^ 
GAATGCTTAGCCACTTCCCCTATCAGCGTTGACACCTTTGTGGG 

TAAAGCCTTGTGAGGGGGAGGGGTAAAATGAGAAGTGTCCTGGGGTTGCCTGTTAATGACCC^ 
7 5 TT TT TTTTTTAATTTTGTAGTACTATGAGATGTCCTATGGCT 
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ATAAATAATATATATTTAAATTGTAAG&TGTGTGArri'CT^ 

TATTCCCACTTGGTCTTGAAGGATAAATGTTCAGTGAGTGGTCTGAATGAAGGCCT 

TGGGCrriT AG CATTTGTGAACTTAGGCCTGGTCTGGGAT CTGGG TGG ATAAAG AC CCAAAG ACCT AAGGGAAAGTTGTCCAGTTG 
TCTCCCAGGAGCTGTGTGACACCCGGAGGTAGAGAGTTGGCTGTCTGCT 
5 AT AGG CAGAGTTC CTTGC CCCTCITCTTTCTTTTC CTTGCAG AGAG AG ACA^ CCTTGAT AAG ATTACA 

CTTCAAGGGCCTTTAAAAGCCAGGCTGAGCrGTTGAAA 
GAAATGCAATCCTGTGCCTTGCGTTTAGGAGGAGTTTCT 
GCTGAAGCCCAGAGGTGAAGTAACTTGCCCAGGGGCCAGCCTCACCAGGC 
ATCCTGAAGTCAAACCAGTTTAGCTAGTGTGGTATCCTTTGATGGATATGTAAACT 
1 0 T AC CTCAAACC CAGCAAATACAAACACTCTTT ATCTCGAG CATATCAGTAGCTAGTCCTGAGCTAGGCCAGGCTGTGGGAGAGGCC 

TGTTTGTCTATTTGTGTTCCAAGGAGAGGCrGCCTATAATTTG^ 
CAGGCCGGTTG 1TCT TT T TAACAATTGGTGTT 
AATGGATTTTCTGCTCT AAATT AACAGT CATTT ACAAG^ 

GGG CAGCTGAC CCTG AGGGGGAAGTGTG CAGATTC CTGTC CAG CACCAG TGG C*l"l' C1"1"1TG AAT CT AG '1 1 1C 1 i rTCCCTTCCTGT 
1 5 AGGGAGCCCCGCCATTGGGTGAAGGGTAGCCAGAGAGGGGCCAGCATTTAGTG^ 
GAATTGGTTCCCTAGGCGTGCACACATGGAGGCTACCACATTTG 
ATGTGAAGAAGGTTCTGGGGAAGATAGGCAGAAATGGCTTCTGC 
CTTTGGGGGGGGTATTTATTCACAGTGCAAGGTACTTCCAC^ 

TGG GCGTGTTTT AAAGAGGGT AATTG CACCCAGGTTCAAAATTT AT AGCAGT AGTTGTTGTT AT AAGAG CTT CTAATT AAGGTG T A 
2 0 ATATGTCCTTGCATGTCX^GGGCTGCATGGGTGGTGTTGCCCGCGTCTGAATACAT 
CGAC^GAGCAAAGCTATTCTGAGCCCCGGGGATGCTGTGAACrCAGAAGAGCT 
CTGTGCAGAGAGCGGCTGTGTTCATCTGGGATATCTGGC 
GACATTGTCGTTGATCCTCTCAGCCAGGGGTGGGAGAGAGGGACC 
GAAGCCCTCATGGCAAGACCrAGGGGCTTATTTCAGAGCCTCTG 

2 5 CCTTCCCGGCITGAATGCAGGCTGAGCAGTCTCTGGGTTGGGTGAACTCACCCTGTGGCTGTG 

GGAGAAACCCrGACTGGGAAACAGGGCAGTACACCCCIGACCTAATGGCGTGCTAGC 
AGAAAACCCTGCCACTAATTGTAGTGTGGGTCTCGGGTGCATTTGGCG 
ACCTGTTTAGCGGTTGACTGGTGTTTTATCTGCACAGftC^ 
CTAGACTTGACTGTGTGATTAC^TCriXKrcrGGAACAAACAGG 

3 0 GGATGTACTTTTATATAATAGTCTACAGAGCTAGCACCTTGGGAAAAGTCTGTCTAGTCA 

ACACACA(^CACACACACACACACACACGCACIT<^CC^CCACCA 




ATCTCAGAGAAGTAATAAATGGGTCCTCCTACTTCITCTGCCTCCCrC CAAAG ATT ACAGAATCACAGG CC 

3 5 TGGCC1GTGACTTGAAGAGGTCACCACCTCCC CCAGG CTCCTGCTTCC AGG CIAG AGTGTCAAGG ATGATTCAGCGAGGGCCAGGAG 

GCGATGATGGCTGACCGACCGGCCCATCAGTTCCCGTCTTGTTCCrc 

CTTAGAAACAGACTXrCAGAGCAACTGTCATGCAGTGGCTTTGA 

CCTCTGTTGGTGAGCACTGATGTTTGGCAAGCIGACTTC^GGCATTTT^ 

TATTTTCAGTAACCCAAGCAAGTCCCAG AG CCCCAG AGCTGGAAG AG CG AGCGCTTTGG CTTCTCATCCAAACCTTTG AAGTCATG 

4 0 GATGAGGAAGTAGGCTCAGAGAGACTGGAGCTGGCATCCCCAGACTGCCAC^ 

ACTTGCCCTCCCTACCCCAGGACTCCTCTGGAGCQVGCCCCTTTCOT 

GTC CCATGCTATTTA1T ATCC AGCCAG^CT CIAGGCTGCAAACTTGGCTC C^CCAAAGTC CAAAGTACCCCATTTGGGGTTGTATAA 
CCGTGTCAGCrGAATGCTCTGTGTCCACAGACCTGTCCCCATTTGCCAGCTATTTGCACAACTAATGATTGT 
GACAGAGGGCAGG TT CACAGATG AGG CG AAGG TGGT AAT AAAT AAAT AAATAAAT AAAT AAAT C C CTCAC CGTTTT ACCCTGAC CA 

4 5 C AAGG TGGCCAT CGG CACAGGGCTGAGGACTG AAGG C Q1A.CCA.T AGG CAACAGTTTTCT G^ 

G ACTT CT JL'TG CCGGGAG CT CCTT CATTCCC CCACACCGTGGG TT CTTTGACTTT AG CTG CCTT G TTGCTCT ACCTTGG AATTGTGG 

GGGTTGTGTGATCTCCCCAGCAGGGCACC^TCTTTGCTGTACCTTGGGAATTACTGAG 

CCAGCTAGAGGGTGACCTAACCCTCTCATCTCAGGCTTC^GTGGATCCCTTG 

5 0 CTTTCTGTACTCCTCCCGAGTCTTTTCCTTCTTGTACACCTGAC 

CTGGGAAGCCCTACTCCCAATTCTACTATTCCATGCGCTCTTCTGAGCAGGGTT 

CTTTAGAATTCTCCCTCCACCCAGG AAGAATCTCATTTACTTAGGG ACT CCTTTACTTTTTGGT CCTCAATGCCACATTGTTGAAG 
GGGATTGTTGAAGGGGGAGAGAGTTGGCCAAGAAATATAA 

TCCAGGAACACCAGGAG CCAGGAGGAGATAAACAGCTCTCCGCAGGGCTCGCCCGTG ATCTGGC CTCGTGGATCTG AG ATGAACCA 

5 5 CTTCATCTCAGAGCACTGGAGGATGCCCCTTTGCCAAGCACACAGTGTGT^ 

TTGGGTTGTGGGCGGCTGGK3TGGCCCTACACTGCACAGAGCCCTGGGAAGGAAGTGGG 
CTGCAGCCTCTCTCTXnGGCTGAGCATACAGTCTCCTCATCAGCAAA^ 

ATAAGTAATTGCGATGCGGTGAGGTAAGCGCTAATGTGGCCX3CAGCGACACTTGCTTCACTCTTCCTC 

6 0 AGAGAGGATGTCCCTGGAGGAGGACTGTTACTTGAGTGGGGCCTTG 

CAGCCAGAGCTCCTCCCAGGTGACAACAGCTCCTGGGACGGCAGTGAAGTACATTTAGGA 
GGGAGGGCAGGGGCCGCACTTC^AACTGAGAAGGTGGTGATATTGTACAGCACA^ 
TCACGACTCGCTGGCTCCCTCTCTCCCCATATGGCTCTCCAGGGCTGGCT 
NNNCCCCAAGTGGCATTGTCTCCTTTCTCATACTGGTAATTCTGTAATC 

6 5 C ATCAGCCCGG CT CTCCACTTTG CT ATT AATG AT AG CAG C CAGGAAAAT AGCTTTCAACATG CC ATTGACAGTCTGGCTT AAG AGC 

GCACTTGAGGCAGTGCTAGACAGCTGAAAGCACCCATAATTGTCTACCAGAGTGAG 

TG CTT AAACKX3 AC AAT ATCTTGTTTTCAACAAATG CT CTCTG AG ACACATCCAAG ATG CTTT CTTC CTTC CT CTTGTATTCCTC CT 
CTTGGCTCTTTCTCTCTCTGGCCTTTCAGAATAAAGGTTC 

GCTGAAAGAACTT AT AGAAGAGGAGTGATCTACTCTGAGAGCAGTGG CAGAAATCCACT AGGCTCG CCCAAGGCATCATCCACTCC 

7 0 CAACCCAAAGGCCAGTTGGCCTTTCCCTGCTGACACCAGCACTGGAGCAGAGACAAG 

TTTGTTGGCTGGGCATGGTGCATGTC^GCCAGAGACTAGGGGATTCGAGCT 
AGGACATGGAGAATAGGTAAGCCTTGGAGCTAGACGGACTGCC 

CAGAGCCCTTGGGTCTAAGGAGCCTGTCTGGAAAACCAGGAGGGCGGTGTATGGGACT 
ACTTGAGGGGAGCCAGAGGAGGTCGCTGATAACTCCAGAGATTTTACTC^ACAAGAGGTCCCCT 
7 5 CGGTGGCAAACTACATACCCAAGTGCCTAGCTATGGGAAGGGGAGGAGTGAGGTCCAAGAGCCAGAGAAACC 
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TCCTCACTTCTCTCCATAGCCAAATTTCTCTT^ 

TGACAGTCTGAAGTTCTGATATTGCTATTACTACATCCT 

ACCCAGGTGATCGTGCAAATGTCAGAGACGAGAGTAACAT^ 

ATTTTTCTCCGGAAATACTGGGGCGTTTGCTTC CAGAACCCGAGGATCAGACACCAGAAGAGCTGGGTGTGGGTGGAGGTGACCTG 

GCTGACTGAGCCTTGCTGTCTGCCTGGCAGTGAGGTGACAACACTTCATCCT 

TCTCTTCCTGATTGCCTGCTGACCAAGGTTATGGTTGTAAAAGGGATGA 

CTGTTTGTTTTCCTTGTTATCTCCT 

1 0 GTTCCTAGGGCGTGGCCTTTGTTTATCTTGGCTTGACAACCTC^ 

GACCCTCTTTAAACTTGTTTATACTGGAAGGGACCATCTTCCCACAGATC 

ATGGTCCTGTCTCCAGGACTGTGCAGTCCAGCCTGGGTCAGGCGGGAAACC CCATGTGAGTGAAAGAGAGAAGAGG 
GGCCAAGGTCACTACCACTGCCCAAGACCATAAAGTCTTTCTTTACTCCGGGT^ 
CACATGTTGCTTCAGGAGCOIGGGAAGAGGAGGGCA 
1 5 AGCCCCTTCTTCTGCTTTCTCTCAAGGAGGAGACCGCAGCCCAATTTAGCTT^ 
AGGAATTGAGCTCTTAACAGTGCTTGCCTCTGCCTC 

CAGGGTTCTCCTGTGTGGGTTTGACACTACCTTTTATGTCATAGCCTAATG 
ACCCCATTCTTTCACTTGAATCTCACCCCACTTCrrCAAGAGACCCC^ 
GGCCATACAGGATTAGGGAGGGTTTCTGACAACCCTGTT 
2 0 AGAGAATCCAGGTTCTCTAGAGGGAGAGAAAGAACCAGGAGCTTGGGGGGGGTGG 
TGCTTTAGAGTCTTAAAAATATGGGTGTCTGTACCT^ 

TTGTGAGTGGTAGGATGGGTAATATGACCCTACCTGCTAGGGTAGCCAGAGTTTTCT 

CAACGTGGTCGACATTAGCTACGTGGACTGCTGACCAATTGAAACGCGTGGCCCTGTGG 

ATGCAAATCCAGTTGCACX3AGGTCCTGTGTGGCCCCTGGCTTTCTAGACAT 

2 5 GAG TGGATG C C CAGG ACGTTAG CACACAGACGG CCT CACATTGGGACAGAT AAG AG CCTCAAAATTT C CAAGG C CTCAG AGAGGGA 

GAAGGGACTCAAGAGGAGAGGAAGGGGCCGGGCTGCTrAGGTTGCTTAGGCTCT 
TCAGCCCTGCAGTGTCTCTGATTGGCCCAGTACCTTGAAGCAGGTGCCCCTGCTGTTG^ 
AAGGTGGCCCACTGGTAGGGTCAGATCAGGACCACATCTGTTTGACCTGGCCTGTAT 
TCCAGTGCCCACIAAGTCrTTATATAAAAT CriTT AAGA^ 

3 0 CTGGGG CT CTT ACCCACTGGTTGGTTGTCAGCAG CACTGTTGTGGTC CTGACAAATGTTTG CAG ACATTG C AAACATTTGCAATGT 

CCTGTCCTCCAAGAACCAGAATTCGCTAGCAGTGCCTGAGGAGACCTTTGGTATAAGGG 
GTGGTTACCTTCGTCTrGATCTGTGGAGAGCGATCCCTCGCCCT 

TGTCTGTCAGAGCAGTCGGTTTGCGAGTGAGGAGATGGTGAGACAAGTGCCTGAACTTGATT^ 
CGTACTTGAGGAAAGGAGACCCGCTGACCCTCTGAAT1TTAATGTAAGCAGGGGACGTGCT 

3 5 AGGGCTCTGTTTCTCCCTACTCGGTCCXSCTGTGTGCTACCTTGGTTTCTC^ 

TTTCATTTGTAACCCTGACCTGGTGAGGAGACCCCCAAAATTTAATGAG 
TGAGGGGCAAGGGGAAGATTTGCATAACGCTCCAAGGGGGGGNNN^ 
NNNNNNNNNNinWNNNNNNNNNNNNNN^ 
NNNNNNlWNlttWNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNNNKNNNNNNNNNNNNNNNNOT 

GGTGCTTCCATCCCCCTGCCCTCTGCCACCCTCCCCCCCCCCGAAlTTCrAAGAGCCCTC^ 
TCGTCAGTGCTGCGTAAAGGCCCAGTGGGAGGACTGCCCCAGGACGGGTGCAGGCTCCTGAC^ 

4 5 AACC C CTCCCACT CTCACTTTG CCTGGTGTGT AG ACTT CTTAACTT ACCAGAACAAGGAAGGC CCATTG CTTTCA C CT 

CCCTGTAA 

GACAGAGATCTTAGGGGTCATGCX5GAGACACAGGTGCTTCCCT 
GAGGGCGTAAATATTTAATGGTGTTGTAGTTATATATAA 

5 0 TATCACTAAGCAATCTGTGATGTAAGCCAGCACCCCCTCCCTTCCCTCTGCCCCCA^ 

AAATGGGTCATTTATCCCTTTCTTAGTC CTCAG CGGGGGTTTG T AG CATGG AGGGAATGGGGC CTG GG AACAC CAC C CTGG CTTTT 

GTTCCTTCCITGTTGCTCCCTCrrCCCCTGATTTTCTTTCC^ 

GCCAAGGGCX3CCATCTTATTCIAGTAATTGAGGGGCTATCAATTTA 

TGCCCAGACATTCTGGAATCTTCTAATACTTGCITGCT 

5 5 TCCGTCTACACATAGCTACTGGCCCCAAACTCTTCTCT^ 

GTAGGTAAAACCTGATTGAAATTGGGGGTGGGAGGCTGTGATCCCACAGCTC 
GGATATGGAGAGACAC^AGTCTGACACTCCTGAGATCACAGCGGTAGGCT 
GCTTTGATGTGTACCTCACACTGCAGGTAGGCAGCTGTCTGTCCCTTGTCTTTCTCAC^ 
ATTTGATGACTGTCCGTGACACTGCCCTGATCCTCACAGAAGAAACCCCT 

6 0 GCAGGCAGACATGCTCCTGCTAGTTTAGGTGAAGGAATC 

TACATCGTTACTGCTTGTATCAGCrGATGTCCrGTTGCT 

TGTTTTCATTTGGTAAGATGCCTGCACTGCCAACATGAGGACCTGAATTT 

TAGCAGAGATGGACAGAGGCAGGCTGACCAACCGCCTGCTGTTCACTGGCCAGCCA 

GTGAGAGACCXTTGTCT C AAAAG AAGGTGTG AATGAT CAAGG AAAGACAC CT CTGAG TTG AC CT CTGG CCT ACATGTG CAC ACGGCA 

6 5 CATGGACACAAGCCTACACACAAAACCTGC^TATCCC^ 

TTCATGTGTAGGTTAGGGCCCAGCTCGTGGGATAGCATGT 

GAGAGTCITACrrGGCATGCCCTGAGGTTTGTCTCCTGTCAGGTTGATAATTCTGGCCACC^ 

ACAGTTGTATGCAGTCTAGGTATTGGGCAGATTGAATGGCTCGTGGAAAGATTC 

AGGCTTGCCCACTGTCCCATCX3CTCATCCAGCTGT 

7 0 AG AG CTGGGTGAGG AAGTGGGGAATTGG AGGT CACAT AG CAAGTCAGG CTGTTCAGACGTGTGTGG C CGTCT AAAGTCTG AATTTG 

GCCCCTGAGACCAAAATGAGCTGCCCGGTGTGCCCCTTCCCTTTTCCAG^ 
GCrTAAACTCGGCGGAGACATCTGGGAGAAGGAGCTGGCTCATTTCTTCT 
TTTCTTCCC CGCCT CCCTTTTGGTCITGCT 

TTAAAGCT AACTAG CC^TGAGATGGTCTAGTGGAGTGACTGTCAG AGACACTGTGTACTGTGTCAAGCATGGT CAGCTAAAGGAGG 
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TCTCCCTTAACCTCAGGGTCAGGGATAGTCTGAAGCTCGAGCAGATCGCT 

AC CCTCAGCCTTTGG CCTGATGTTGGGAG ATCTTG CTC CGGAG CTCTGACAACACAGGGAACA 

GAGTGCTCTTTGGGAAATGTGTCCCCCCCCCCCATTT^ 

ACACCCAGTTGGTATCACAGATACTGAATGTGTCAGGACTCCGGCCCTTCCCAGAAATCACT 
5 ACCAGCAACACTGCCACCCrTGAACTGTACTTAAAAAGCACTTTGAAAGGCC^ 
TTOUJUSGAAGCCATCCTCCCCTCCCACCACCTCTCAGCT 
TCAGGAGGCGGGGAGGATGGGAGCCAGGCTTGCTTAACCCTGGCT 
CTAGTGCACGCATTAGCTCTAGTGTGGTGTGTGACCTGTCTTAATTGTTC 
CCCCTCCCITTCTTTTTTGGAGCGGGTGCTGAGAGGT^ 
1 0 GCTGTCAATCATT AGCCTCGACGGTCGATrrrCl'TTCAAAATCATl^l 1 CTCTG AG CCOTCTCAACCTACTGAAGTTTGG CTTCCTGG 

CCGGAAGTTATTTAGTGGAGTCGCTTAATGGGTGGGTGTCAGTTTCCCTGTGTGG 
GGCTGTGAGGGCTAACACAGAGAGCGCTGGTGGCAGCGGGCAA^ 

GT AG CTGGTTCCC CTTC CTCATG T AACACAGTGGTTCTCAACCTTC CT AACG CT ACTGCCCITTAAT ACAGTTC CT 

TGAGCCCCAACCATAAACTTATTTTCATTGCTAC^ 
1 5 GTGGCTCATGCCTTTAACCCCAGAAACCCTGCTCGAGAGGCAGAGGCAGGCAGAT 

TAGAGAAACCCTGTCTTGACAACAGAAAAACCAAAACCAACCAAACJ^ 

AGGAACCTACAGAGTGAGAACCACTGCTCTAATACAACCCCCAAGAGAT^ 

AGTGGAGTACACAGCAGCCTGTGCACAAAGATGGGAACGGCT 

AACTCCTGCCAGGAGGGAGGGCAGGTAACAGAAACAAA^ 
2 0 GTTCTCCAGATGTAGTGTGCAATCCCCCGCCTCCCAAGTGCCCTTGGTGACCTGTCAGAATCCTTTCT 

TCATGCTTGAGCCCAGATGTCATTACCATGGACTTCTAGAATGTTCTTCCTGGGA CT 

CTTTGTGGTGCTGGAAGGTGGATTAAGTAGGGAGTGACGGCTTCT 

AAGAGAATCCCTTTTCTTAGTGATAGAG<3ATGCrGGGCTCCGGTAAG^ 

TTCCTATAGGTCGGTGAAGTAGAACACAGTGCCTTTTAATTTTCTCTAACC^ 

2 5 ATGTAGGCTCTGTCTGAAACCCTTGAGAGCTGTAGTAGGTCAACT 

ATGTGTGGGGAATTGATAGGGCAAGGAGAGATCCAGTGAGCTGGCTTGTGCTTGCCCT 
AAGGGCCTAAGCCATTGTCCATCCCCACCTCCCTTC^ 

CACATAGGGAATCGTGGCCATACGTACTGCAAC^TGGAGTCTACATATACATGCACACAAACACAG 
AGTCACTAACACCCTTGCTGTGTAAGCATGGGGACCTGAGTCTAGAATTCCAGTGCCT 

3 0 CCTACGACCCAGGACTGGGGCTCGCAGAGTCAGGAGGATCCTAGGAGCTCATTGGCTTCTCAGTOT 

GATTTACTAGCAGACTCTCTCTGTAAATAAGATGGAAGGTAAAAGAGGAAGGCACCAGA 
TGCACATATATGTGCGCACAGGCAGCATACGGGCACCACACAGG^^ 
ATATTTATTGAGCACGTAGTGTTTTCGAGTCATTGCTACTCTGATTGTTTG 
GCACCTGGGCTGATGATCCAGCCTTGGTGTTTTGCCCT 

3 5 ATGCAGGCAGGCTCGGTGATTTCAGAGATCTGGTGGAGTACAGCTCT 

AGTGAGAGGGGTCCTTGTTTTGGGGGGTACACTTACTTCTTCCTGTTT 

GAGCCCTTACTTAGAGCAAGAATGGCCGACATCCCTGAAAAAGGCCAGAGTCTCTGTGGTTAAGA 

CCCAGOSGAACAAACAATTTGTGACCAGTGAGGTTGAGA 

ATGTGGGCCCCACCAGCTCTTAGCTGGAATGTTACCTGCCCT^ 

4 0 GTCTGG AGTCTGAGK3AGCAGTCC CTCACAGTAAAG AATTGTTTTGCTACGAAAGCCT 

CTGTTTCTGAGGAGACCCAGC7IX5TTCCCTCATGGCCCTGACCTACC 
TCATTTGGCCCCACAGGATACAGGGCTGCTTAAGCCCATCAG^ 

CTTTAATGCCCAGTGGCCTCCAGACTCTTACACTGGCCCCAGGAATTCCCCAAGGCCAGAG 
TTGTTTTTATCTGAGGCACACrAGATTTCGCATGCAGAGTCATCTGACATGCAGATGATC^ 

4 5 GGTTTTGTCAGAAAGATTGCCTAGCTGATGAAAGCTTTGAAGGCTG 

ATAAGAAAGCCAATTCCAAGAGTTGTCTGTCTGCGTCCCAGTCTTTCCTGCTC 

TGTGATCTGAGTCACGCGTGGTCCCACTGGAAATTCTTGGTATTT^ 

CCGTCTGATGCTTTGTGACCGCTTGTATGAGGATGCAGGAAGGGGCTGGAGAGATGGCT 

TTGGGTT CCATTCTTAGACTAGACCCTAAGGATCGAACACTCAC CAT ACTGCAGGGGTTAGAAGCAACCGTCAAAGTAGAGGACX3A 

5 0 GGGCCACATGCCACTGCATATGAGGCTGGCCCTGGGAGAGCCCT 

TTGGAGCATTAGTGGGCAGCGCAGTTCTCTCAGCACTGGA 
ATATAGTAGGGATTCTTTCTAGAGGATGCGAGCCATTGACCCGAGTCCCCTGTG 
GCCCATGTGTAAGGGGCTGCAGTGTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGT 
AGAGAGAGAGAGAGAGAGAGAGAGAGAATGTCTGTCTTTCTGTCT^ 

5 5 TTGCAGACAGAGATTGCGAAGAGACTGAACACAATCCTAGCCCAGATCA 

TTCCTCTTCTGCCTAGGX3CAGCCCAGTGTGGCTGTTTCCTGGGGCCCAAGC 
A1TCAGAGGCCTTCTCTGCTGACCTGGGGGCAG 

GCCACACCTGAGGAGGGAGGCTGAGCCCTGTGACCAGTGTCCTTCAT^ 
TATACCTAGTTCAAGTAAGGCTGGTTGGTTGGTGC 

6 0 GAAAGTCTCGTCACTTTCTAAGTACCCCTTGGGAGACAGCCCCC^ 

TGAGGAAGGTATCTCCAAAGAGGTGCCAGAGGCGATGTGCAGACCT^ 

GGCCAGGGCCTACCCTACATTTATCCCCTAAGGTGGCTCGG 

CAGGACTGACTAAATACATCTGAGAATTAGATATGGCTCATGCT^ 

ACAGACTTT AGTTTTT CAAGGGCAATTTTAGGTTCGTT AGG CAAG CTTG AGTGG AAAGT ACAAAGTTTCC CCATGG C CTCTC CACA 

6 5 CCGGGCTGGAGTGACTGTTAACCJUVCGCTCATTGTGTTC 

GTTAGACACTTAGCTCTCTCTTCTGGTAGCCATAGATC 

ATTCACCCTAGCCACCATAGCTTCCrGTCCTGAAACATGGTGTCrGTCCTATGTGATT 

AGTATGTAATGCCCGACTAGGTTCTTAGTGCACAGAGCTAAGGT^ 

GAACTCATCTCTGCCTTCCTTAGAAAGTAGTACTTGCTCGTTTAAAGA 

7 0 TAAGTCCTGGGCCTGTTGAGCAGGCCTNNNNNNNNNNNNNNN^ 

GTATTGAGGACCTTGATCATG CTTCATTTCCCAACAG CATCAGCJU3CAAGTGGCGCAGGCTGTGGAAC GCGCCA AGCAGGTCACCA 

TGACGGAGTTGAACGCCATCATCGGG^TACGTGGACTCCCCAATCTGCCTCTCACCGTC 

ACCATGACCAGAGATGAACTCTGACCrTAGGCt3CTCTCAAAGAATGGCAGCCC 

7 5 ATCCCCAGCATAAGCTACTGACCAGTTCC^ 
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GGGTCAGAACCATCTCTGCTGTTTATTGATGTGCACCGTGCTGTA 

G CATGG CTG AG CACC CACACACATCCTTCAGAGTTTCAG CTGTGTACCTGGATGG AACCTT CT AG AAG C CCCACCTCATC CC CCAA 
CAAGGGAGTACTrGATCCTGCAGTGCTCTGTAGGGCCCT 

CCGGCTGTGGGCGTGGCCTCCGGATTGTCCTCAGAGTGTGTGGTCACTGAGTTCT^ 



GTGTGGGGGAAGGGCAAGAAGCTTGGTGCTGGGAGGCAGCACTCTGGGGAGA 

GGTGATCATC CAG CT CAGGGTTCTCTTT AGGAAAT ATTTTTTTT AATTTG CTTTCTGAAAATTTTGTCAT ACAT ACAGTGT AGTTT 
GATCCTATCCACCCCTACACCCACTACCCCCTTCTGAAAGAGTATTC 

TG TT ACAGAG CACATGT ATGTG CTGGAAAAGG ATGAT ACATATGAGCCGTT CTCTC CTCCCACCACGCCGGACC C CATAAT CAAAC 
1 0 TCAAAATGTCAAGCTTGGCGGGAAGTATTTTCACGTTCT 

GCITCTTCTGGGTCCCTTCTTCCCCCAAGTCTCCCTTTGAC 

GGACATTTATTGGAACAAAC^GCCTGACACTGTTGAAACTTG^ 

AGCAGCAGa^TCTTGCrCrrcTACCTACTGGGTGTGA 

CTGTGCACrCCAAATGTCTCCAGATGTTGCCrGTATCTGC^^ 
1 5 CCAGTATTTCACAGAGATTAGCAGTCAGGCCGGCCTGAGAGCATCCCTC^ 

TAGAAGGGGGCCTGAGCXrCOlGGTTGTCrrCAGAAGTCTTTCT 

AGGGGCTACAG CCHXaGTCCXJlCCTCTTACTCTTTGGAATTAAAATCCACATG 

AAAAGTAAAATCC<X3CAGCAGCGCAGCTC^ATTGGCTC 

CCCTAGGCAGATATTGGAATTCAATTAAAGCCCACGAGTGGGGCCTC 
2 0 GTGAATGCGGCCX3TCGCCCIX3TCATTAGGC!AGGCA 

ACAGCTGGGGGACCCAGACTGGGGAGCTGTGC^ 

AGCCACCCTCCTGAGATAAGGGCACAGGCAGGGGCAGTGCTCCAGTC^ 

GCTCC1AGTTGGCTGATCX3TGGCCTCCTGACACAGGGCTGGAGTCT 

TGGATGTAACCAGAAGCATGTCACGGGGTTGAAATATTGTGGGAG^ 

2 5 CTGTCCCCTAGTTCCCAGTTTGTCCTTAGGGAACTCATGTCCCTTTGGGTAAG^ 

TAGCTTTGTACATAavGGATTGTGTGGTGGCCTGGGAGGTGGTGGACTCTTG 

GTGGGTGGTATGTGGAGTGTTGACTTGGTGGTTTTCTTAAGAGGGGAGAAGAGAC 

CCAGACTGTGCAGAGATCrGTGAAGCGCTCTC^ 

3 o cSgattg^gaWtco 

GGATAGCAAAGATCATGGTGACCTCTAAAAACTGCTGCC 
GGCTGTGTGGAAGACTGGGAGGTQIAGTTACCTGACTTCCCAGGACCCT 
TGACTCTAGAATTGAAGCAGTTAGACAACTCCCCTAAGTTTGACCGAA^ 
AGAGGGATGAAGCATGAGGTGTGTAATCCCGTTACACACGCTCCTCCAGCAAGCT 

3 5 GCCAGTACCAGACCGTGCGCACTGGGCACTCAGGGATGGAGGGCCCAGGAAACCCCAaS 

TGAAATGGCCAAGGATGGAGAGATGATGTTCAGGAGTGTGCTCGGTTGTGGGCT 
GAGGGAGAATCACTGAACTTCCCTTCTTCTCATACATGGAGAAGAGCCAGTG 
CTCGGACTTGTGGTATTGTGACATTTCTCTGCCATTGAGAAAGCTTTCCAGTT^ 
GTTTGGGTACCTTGCTCTCTGAAGCCACCAGAGGACAGAAACACGGCCCGAGTGTGACTGCATC 

4 0 GACACAGGCCATGCTCCTTCCTACCT AC CTGTGAATCACTTCATTGCCAG CTGCTCCTGGCCCCATCAGGGAG AGCATTTGGCTTC 

GCTCCGAATTGCTGCCAAGACCATCGCTGGCAGAACCTCCATCCAGAG^ 

CX3AAGGCAGTGAGTGTTTTCTGGCAGTGCAGAGCCAGGTG 
GCCTCTTGCAGAGGATACAGACCCrGAGGAGCACrGTGATAGTGTTGGAAACT 

4 5 TCACAAGTTTCTGCTGTGGATTTACTAGTGGATATTTGGGGA 

CCrCTAACTAACTGGGCCTCCAGGGCTGTGTTGCCACCTTTGGATCAAGCT 
CCTTCTCGAGCGTTTCACTTACTCACTAAGTGTGGGTACAC^GATG^ 
GCTCTGTCTTACTGCAACTCAGCAGTTGCTCAACGACAATC 
GCTTTTACATGGAATAATGTGAGTTCTGCACAGCCCAAGGGTG^ 

5 0 CAGAGAGGTCGGGTGACTTGTCCAGAGTCTCAGCGCTGATAAAGGAGAAGCTGA^ 

CAGAGTCTGGGTTCirGAAACTGGGAAAGGAAATTTCCTTCT 
GGCAGGGTGGGAGAGGAGACCCACACACAGTCTCACATG 
ATCTTGACTTACCAGTTCCTCTGTACCATACAGTGCAGATTAAGT^ 
ACCAGCAGACTTGAAAGGGCTTCCCAGCAAGGAGTTTAAAAACAACAGGGCCC^^ 

5 5 AAGCTGTGGGGTCCAGCCATGCCATGGAATACTAAATACTCATTC^UUU^GAG 

OX^TCATATAACAACAGTGTGGACAACAAGAGCTCATTTGTAGAAAACTGC 
AGATGGCCTCACAGCCGGCXK5CGTCCTGCGAGGTGGAATGAATC^ 
TATTCTTACAGTGAGCCTGTGTTATTTTTGTAATAGGATGTTAAAAATTCAT 
TCTCCCATCGGAGAGGAATCCAATCTTAGCTGAGAGGACTGGAGGTCCCGTGGT^ 

6 0 CTGTGACACAGGGAGAACCACAGTCACATCGCTAAGATCTGTG 

TGTCATTTCAGGTCCTCTGTGGTCCAGGGCAATGGCCGTGGGTGCGAACTT^ 
AGAGGTGTAAGACTGCGAGAAGAGCCTTAACTTGGAGGGAGAGCTAGCTAGGTCAGCA^ 

GTGGGC AG CTTTTTCAT C CTTGG CCAGATCT CTGT AGG C ATT C CC CAGG TG CCGTTG ACCTTG AG AG CTG CTTG CT CTTTCACAG T 
GATGGGCACGAGCTCCGTCAGCCTATGGCTATCCTTTCTTGGCT^ 
6 5 TTCTGTAGATGATGCTAGTGGTTGACAGATAAGAGTCCAGATCCCATTTAGAGTAA 
TCTATCAGTTGGCTGTCTGTAAAATGGGCACACTCACTGTTGCCT^ 

CAGTGCCCCAGCCAGTCCCAAGAAATACTTCCATCCCTGAGAAATAGTTGATACC^ 
TGGCAATCTTATGAATAACAGGGCATTCTGTCTTGGCTGCTGGGGAAGCATG^ 
' 7 0 AGAATCTGGTGGGGAGCCTTGACGTTAGCAATGGGAAGAGGTGCCAGGCAGCAGGrrCCTGGGTTC 
ATGGATGTTGCTCTGTGCCTGGGAGTACTGAGGTCGTTAGAACA 
GCCACGAAGCCCATTGAGCX^TGGCTGTGTGCCC^^ 

ATG CAG AT AAGGAAATTT ATGG CTT AG AGGGTTTGGAAGTCC CTGT CC (TAGGCAT ACAGAAT C AGACT AGAATC CTGCTCTGTCTG 
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CAGGTTCCAGCTTTGGGTGGGAGCXrrCTGGCTC 

CTATTGAAAGGAAG CCCT AGTGACCG TGTGGGTTTT AAATGCATG CATT ATTT ATCTTGGGATTGGAAACAGGCATGGGAAACTCT 
GTCCTGTTGCTGGGGTGGGGGTGGGX3ATGATGTTTGAAGACATCAGTCCA 

GATGAGCATGTGGTCGAGGGGGAAGGTTGGAGGX5GGGTGTTTTCCCTGTCTC CAG 
5 AGGTGACCTACCCTCTGCCCTTCCCGGCCTGTCrTGGCTTAC CAGCAGCAACTCCAGG CCCAG CACCTC 

TCCCATGCCACGCATGGTCCCCCGGTCCAGCTGCCACCCCACCCGTCAGGC 

CAGCTCTGGGTTGCTGGCACTTGGTGCCCTGGGAAGTCAAGCTC^ 

ACAGAGGTGTGGCTCTGGCrcGGCTGGGCTGGGCTGGGCTGGGTG 

ACTGTCACCTGAAGCCTGGTGTTAGCTAAGACTTGGAGCCGTCTGTTGAGTCCAGT 
1 0 CCCCGGTGACAGTCTGTTTATCT ACAGTAGCCCrc 

TTCAGGGTGCTTTCGTGATTGCACTGGGTGC CT T T C 1 1 AAAAGAGCTTGAGTTGGGGCCTCGTTGTAGCCTTCTGTTGCrATCT^ 

CTGGATCTCTGAGTCTCCTCACTGGGAAACCTrCCAAGAGGCAGCACrc 

GCATGCCCGAACCCTGAGTTCATCCCTTGGCACCATATAAAACAGGGGTGGTAGTGCCT 

AGGCAGAAAGGGAGAAAGTTCATGGTAATTCTCATCTGTGTAGTGAGTTCAAGGCTGGCCC^ 
1 5 CT AGTGGAC CATCTGT AAGTGT AGGGTCT CAGGAAT ACGC CAACAG AG AAGT AACTAGAG AGACGAGGAATGGG AACT CTTGCCCC 

AGCCCCCTGAATAGCACCACTGGGGCTCAGGAGGATATCAGAGGGAACAAG 

AGGTTGTACTTTTGGTTGGCTITTAGGTATTTACTGG 

CGCCATCCAAGCCTTAGTCCGTGCTGTATCCTAGGAGCCAGTGGCCAACTGTTTTG 
AGAGCTGCTGATTAGCTGGACTGGTGTGACCAGAGGCCTATGCTTCTGGGGGC^ 
2 0 AAGCAAACCAGTAGAAGTCAGTTCCAAATTGCAATTTTTAA^ 

CTAGAAAGTACGGCGAAAAGOJUVCCAAGATTCTTTTGACACACCAGGAAG^ 
ACTCGTGGGAGK^GGCACCATTAAGATGCCATATACGGGGCGTGCCTTCACTGCACT 
GAGGTGGCTTTGATAAACCACAAAGGTCAACTGTGATTTATTAAAGTGACCT 
GACCCTCCTCClTrCCTGGCCTTCCTGCCTCTCAGAGAGAGAGTCCAGCA 

2 5 GCGGGAGGAAGATGGGCCTGCCTAGAGGCCTGTTCCTAGAATTACAGGCCTGGCA 

TTTCCCGTTCTGCTCTGTGGGTGGAAGCTGGGGTTTTTATTTATCATTTGTT^ 

GACTGGTATGTGGGTGACGGTGTCTCTACCCATATTCAGTACTAGGGGTAGTTACTGTCTATCATAATATAGGAGTTC 
TGTTGAGTGACCTTTAATTAGGGAAGCTGTTTAACT 

GTCAACTGAAG CCATTAGCCCACCAACCT ACATCTTCCAAGGGGCTCTTTGG CTGCGTTCCCGATG CTG 

30 CCTAGGTTGATGATGGTTCATCAGCCCCGTAGGCTGGATGCCTACTTCCGGCATAATCCCGA^ 
GGTTTCTCCCTGCTCTGCACTTTGACTCTGTTGAAAAAAGTGCTAGCT 
CTCAGAGATTTGCTGTAAATCTTTCTGATCACACCGAGGCT 

GGAGTCTGGATAAAAAAAAAAAATCCTCCAGOGATTAGCAGAGCT CG CAG AGCTTGT ATCCGGGACCATCTT 

AATCTG CTAGGATGGCTCTCCTTATTACCTCATGTTAAGT AGGGCTTGGAATGTC^ CAGCAGCC 

3 5 TCCTCGGGAAGCTCTCTCTGCTTGACCCGCCTTCCTTTCCAATCCCrC 

CTGTCTCITCTTGCCCTGCCTGCCTGCCTGCTCAGAAATCCAGGCCC 
GAAAGGATTGATTAAAGCTCAAGTCCATTTTGATTGCTTTGGTGTTG 

CT ACATAGAAGCTGGACT AGGTATGTCCACCTG CGTTGTCTC CCAACCCCACCCCCAAC CTCCACCCCTCC CCAGCTTCAATC 

T CTCAGCCTT ATG AGGTCTTGTGTCTTGACCTTGGAG CCCCTGTCTCCTCCTGCATGTC ATGGGGATTACCCreCCAGACGGCCTC 

4 0 TCAGCTTTGGATAGXaCAGACCCCCTGTAGCAGGGCTCATT^ 

ATTACCTACrTCTTCATACCrArrCGCriTTTGGCTTT 

GTGCZAG(XCCCCCCCAATAATGCTCCTGGCATCCAAGCTAGG CTG ACTCAACATGGGAG CTGGGGAATG AACAGGGATG AATCCCA 

TGACATCCTAGGGCTGGGCAGCTCAGGGAGGGCACCAGTGGGGG^^ 

TGGATGGTGGAGATTGAAATAATCAGCCTTAGGAAAACAGCTTGT 

4 5 GAGAGGGGAGGGGGCCGGGTGGACAGGCTGGAAGCTGTCCC^GAG 

TTCCTTGCCTAGCTGAAGCAATAAAAAATACTTTAGTTTITTGGCTO 
CCTGACATGAOTCTCCGGTAGCGGCGAGGGTTTTCAGCCTGACTGCATGAGAAGA^ 
GGAGAAACACGAGCAGCCACCCCGTCCCCTCCCTCCTCTCCATCACCCGCCrrGCGCA 
TTTCTGCTTAGCACGGG<3TtMCCGCCACCATCGGAG^ 
50 ACTAATTGAATTGCGGCTGTGAGCGCATCGCCGCCGCTCGCCCGGCTGCGGGGAAGGTTGA^ 
TTTTTTTCTTCAAGGCTTGATAACTrCTGCCAT^ 
TTAGGAAAGGAAATTAAGTTCrGCGGCTGGCTCCCTCCCAC^ 

CCTTGCCAGTGAACCCCATCCCTGGGCCAGCATTGGCCTCTAGTAAAATGGGGAAAGGAGAC^ 
TAGTGTCCCTrTTTGTAAGCACATCGTCrTGCT 

5 5 TGCGTClTTAGCCItjTGTGGACTCCAGCATACATCITTAGCATC 

TTCTCTGACTCATCCTATCACTCCTGACATTAATGTCAGTCT^ 
GATAACTGGATCGAGGAAGGATTATTTCGCTCATGGTTrGAGGGTO 

GAGTGTGGGGTGACTGGTTACATCCTAGCCACAGTTAGGAAGAAGCTGGCCTGTATCAGAACTCCCCT 
GTGTCTGCGAGCCAGAATCCATGTCCAGAAGGTTCCAAGGACAACCCCT 

6 0 CTTCACATTCAAACAGTGGTGCTTGAGGTAAGCCATTG 

GCCAGAGGTTCCCCCTCCCTGTGCAC^CCAC^VGGACACCCT^ 

TCTCAGGCTTGTCATCCCTAGGGATTCCCCrCCCTCCCCCrGGAGCAGAGGAAGAACTGAGAAAA 
CATGAAACCTAG&GTCCTCTGCTGGGAGACCTTAGCITGCT 
TTGCAGGCTTTGTGCCTITTGTGACACTGGCTTGTCTTCCGTTT 
6 5 ATCTCTGCTTAGGGCTTTATCTGCAGCTCAGCACCTTTrCTCATTCG 
CTATGGCTCAGKSTCTGACITGAAAAGACACATCTT^ 

CT CAGG CCAGGAAACAGAAAT AG CAGT AC CTTG CTTCTAAAAATTCAT ATCTTGGGT AAGTTC CCAGGG ACCCAAGGTG TCCTGGG 




7 0 CTGAAAGCCTCCGGCCCAGTGAGAAGCACCGGGGCTCTGCAGACTACAGCA 

AGCCTCAGCAGATACGTACGTAGGGGTCGGCGAAGTGGCTTCGCCGGGTAGGGTTGTGACCTCCCTTTGG 

TGAGTTCTGAGTCTGAGAGGCTGGTAGTTGTGTCRGGGCCCAGAGAGAGGG 

TTGGGAGAGAAATGX5ACAGGGGCATTACCCAAGCCT 

CCATGCTCCAAGGGAGTGTCCTGGGGTACTCCAATTGTGAGCCCCACCGTCT 
7 5 ACAGTTTGTAACAACCTGATGGCGTGACCTTAACCTGTG^ 
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CAGGGGTGAGATTTCAGTCTAGAGGAGGGGCTACTGCCAAGCCATCTTTGTCCCATCTC 

ACAGAGCTCAGTTGTGCTACAGAGTGAAGCCAGTCACCTGCCCCAAAGTGATGG 

GACTATCGTAATGAGGCTGCTTAGACACGTCAGAGCCCCAG 

CATAGAGGGCAGGGGACCCAGCAGCCGAGAAGCATGAGCITTGGGGTTTGGT^ 

GCAGATCAGGGGTCCACATGCTCTTACTGGCITCTCTCT 

TGTCTCTAATGAGGTAAGGGGAGGCCCCAGTTGCAGGTGGGGTGATTTGGGA 

GCTGGAG CCGGG CAGTGGTGCATCAGGGGAAGCAGGGAGACAAGT AG CTT CATGTGTGGG AACTGAAAGGAAGCAGCAGTCTTCGT 
GTCCCATAGTTGTCTAGGGTGGCCCTATCTTCCCTCTCCAGGGTTGGGGGAGT^ 
TGTGAACCTTGCTGGTTAGCATTCTGGACTTGTATGTGACCTGGGGGA^ 
ATTGTTCAGAGCTAATCCCCTCCCACCCGTGCCCATTACCTTCC^ 



GTCTGAAGAAAGATGCCCCCACCAGCCCAGCCTCCGTGGCTTCCT 

TGCTGCCTGCAGGGTGGGAATTGGGTGTGGAAAGAACGTGGCTATTAGGATTGGCT 

TGTTTTGGAGCCTTGGTTTCTGCATCTTTAAATT 

TGGCTTCATGTGGGACCAGGGCTAGGCATTTTGCCCAACTCTTGGG^ 

TT GC CIT CTCAGAATGACAAAT CTTCCACACCTGGGCTCAAG TC CAACACACCAAC C CAAG AAATG ATGCCCCAACT CCAGGCAC 

CAGCACCACCCCGGGGCTCCGGTCAATGCCGGGCAAACCTCCAGGCATGGACCCGATAGGTATAATGGGTAGGC^ 

GGGTCAATCTGATGGGGGAGGGGAGGTTGGTCCCTGGCTAGATAGAGCCA^ 

GCTT CACAGGTCTG CCTGG TGAT CTTTCTTGTCAGGTTT AAGGAT ATGGAAGC CTG CACCTTTGACATTGAT A 

GAGGGTCTTTAGCATTGCTATGTCAGAGTCTACGAGGCCT 

TAGATCTGTGTTCACTGTCTGTTAATGGTTTTGTGAGCTC 

CATG ACTCCAATCTTGGC CT CAGAAGTCAGGGGT C CAG TTGGGT AG CTTCAG ATCGT AAGAGGGCT ATGG CCT CAG CTTTGT CT AT 
CACCCATCACCCTCACCAGCTGCTATCCAGCACCCTTTGCCATGAT^ 

GCCTATGCTGGCCTACACAACATCCCATCCCAGATGAGCGCCGCCGCAGCCGCTGCAGCCGCCGCCTATGGCCGATCGC 
GAGCTTTGGAGCTGTACGTAGACCTTTTGGCTCCTTTTGGTGGGGGCGGGG 

AGTACATATCCCCGCATGCTGAGGGGATCCTCGGTTGTTGTCTTAAGGTGTCTATTGCTGCGATGAAACACCATGACC 
GTTGGAGAGGAAAAGGTTTAGCTCACTCAGTTCCATAGAACAGTTTATCCCCAAAGCAGTGAG 

CACTTG AAGGCAGGAGATG ATGCAGAGG CCATGGAGG AGTGCTGCTT ACTGGATTGCTCTCATGG CTTTG CTCAGCCTGCTTCCTT 

ATGGAACCCAGGACCATCAGCCCAAGGATGCTACCACCCATAATAGGCTGAGCCCTCCTCTACCCA 

CTACAATTGGATCTTCTGGAGACATTTTCTCAATTGA^ 

TAGCCAGGATATCTGTGGACACCTGACAGCTGCTCTGGGAAGAGC^CCTGTTTAGAGAGAGTTTCCTTATGGGGTG 

GTCCCAGTAAGAGCTTAGTGACATCTGGTCAGGTAACAGGGTTTTGGGAGGT 
AGAGGGGACTCATTTTTCAGGACAG1TTTTCTCTACTCTTCTATACTTGGGGTGG 
AAGGAGAGAGGTGGGGCAGGGCCTACTCTGTCTGGCAGTGCCGAGGGGATACAGGCTGAGGTGG 
TGTTACTGTCCTGAGAGGACACTTGGCCATCCTGGCATCTGCC^ 

GTTCTGGAGAAGTCAGAGGGTGCAGCAGGCCTAGAGGACAGTGGCCATGGGGAACAGGGTGAGGCTATGTGA 
AGCCACTCTCCTTTCTTTATAGGTTGGTTTTGACCCTCACCCCCCAATGAGGGCCACAGGCCTGCCTTCC 

CTGGAGGG AAACCGTAAGCACTTGGCT ATTCTTTTTGCATGCTTCACGG CC CTGAGC 

ACAGGGTCTTGCTGGATACAATGTCCTCTAAAAGGTATCTGTCCCTCCTGAGACACAAAGGCATGGTCACCACCCAAGGTAAGTC^ 
GTCCCAGAAACCCCCTCCATCAGGGGAACTTAGGGACTCTGAGGTCCAAGTTGATGGCTCTGG^ 

GAGTACAGAG AACAATGTGGGCCCTGGGGATTCCTTCAGGTTCCACCTGTTGCAAAGG CCACAGCCCCTACCCTGCCT CT AGTTCC 
CCTGCTCTTCAATGG AGCACAGT ATTGC CCTGATCTTTGTCTGCATGG CTT ATTGGGAAATT AAAGTTTGAACT ATGTGTACCACA 
T AG AAAC CTG CTCTG T ACTG T AATAG AT CT CATGC CACCATGGCCACGG AG AAGGG C ACT CACAGT AATGTGG ACAGACAAAG CTG 
AGTCrCTGTGGCTCAGATCCCCTTGTCGTTTTTGCCT 

CAGGTCGTGGCATTTGGTAGGGTCCTTTCCCAGATTGGCATGTCTGGAGACTTTATAATCT 
GGGGGGTGGGGGCATTCCACAAGATTCCATGGAGCACCTAAGCTGATTTGGA 
CTGATGGAGCCATCCCAGTACCAAGGGGCTCCTGGACTTAAGG 
TGTGTATACATTGATTTGGATGGGGCCTGTGGGACATGTTAGGTGTAGACAGCAGG 

TCCTTCCATGTG AGTGCTGATGGGCAGATG CAACCTGTGCCCTTCCCCCATG ATGCACTAGCAGG CCCTGGCATTCCCAGGCATGC 
CCGGCAGATCAATACGCTCAGCCATGGAGAGGTGGTATGTGCTGTGACCATCAGCAACCCCACACGAGACGTCTA 
AGGGCTGTGTGAAGATATGGGACATCAGCCAGCCGGGCAGCAAGAGTCCCATCTCCCAGCTGGACTGCCT 
CACACAAACTTATTCTGTAGCATGTAGGAGGTAGAGGAAAACAGGACAGAC^ 

GCAGCT CTGGGTATGTTGACTAACCTCTCTGGCCCTAGTTGGGGCATCTG CGAATGGAGATGACTCTT AGG CTCCAGATTCTAG AG 
AAAAGCATGAGAAATTGGATGGGTGTAGAATTGAGAGTGGAGAAGTCTCCAGGAG 

GCTCGTGCAAGCTTCTCCCCGATGGGCGGANNNJn^ 

CATCAAGGCTGAGCTGACGTCCTCTGCTCCAGCCTGTTATGCCCTGGCCATCAGTCCTGATGCCAAAGTCTGTTTTTCCT 
GCGACGGGAACATTGCGGTTTGGGATCTGCACAACCAGAC^ 
TACTCACCCAGCCCATGATGTTGGTCCTTTTGGAGAAGTAAAGGGTGGTCA^ 
TTCAGCCTAGGAGCTTGGGGGAAGATGTGGTGGAAGGCAGAGCTTTGAACAGTTCT 

GAGAGTGGAGAAATGCAGC!ATCGCTTAATGCTGAGGCCCAGGTGG AGCAGAG CAGCTGAG CTTTGTGTGTCCTGTCTCTG AG ATGT 

CCTGTGTAGCTTGACTGGGTTCTGTGTGACCTGGGTGGTGTGGTGTCITCCTCTGAGTCA 

GTTCCTGCCCGAAGTCACATCCTTAGTGTTACCATGACACCCTTGGGCATGTTTT^ 

TAGGGACAGCAAAGACTGGGTTGAGGAGTGTGTCCCAAAGGCAGGTCCTAAACATGATAGAAAATCCCT 

TCCTCTCCATTACCTTCTTTAGAGCAGAGTGGGCTCCCCIAGGGGTCCCACCT 

GGAAACTCTCCTTGGTAGCACCCCrcAGTGGACTAGCAGA 

CAC CCTT^TTC CTTG CTTG G C CTT CAT AGG CAGTTCCAGGGC CACACAGATGGGG C CAGCTGT AT AGACAT CTCT CATGATGG CAC 
TAAGCTGTGGACCGGGGGCCTGGACAACACCGTGCGCTCCTGGGACCT 



GACTGCCAAGGGACCAGCCCGAGCCAATCCGGGAAGGTTGCTTGTAAGCAGCAGCAGTC 
AAATGGGCACACACCCAACTCATTCTAAGTCCACAGCTTGCTCCCTGTGCTCTGCAGTTTGCGCATC 
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TGGGCACCTGACTCAAGCCTCCTGTGAACCTCGCCCAT^ 

TCAGTACCATAATGTGGAGTATAGTCGTTGCCAGTTTCTTAGGGAGTGAGCTGAG 
CTCCTGCTCAACCTCCAAAGCCAGCTTCCTCAATGGGATACAAATGACATCCT 

AGCGGA 1 T ' l t> TT TGTAAGTGGTGAGGGAGGAGGGAGGAGGCTCTAACCCT ACCCAAGGGGCTG CTCCAGCCCTGCGGGT AGAGGGA 
5 CACTGAACTGTCTGTCCTTGCAGATCTTCTCCCTGGGTTACTC CCGTGGGCATGGAGAG CAGCAAT 

GTGGAGGTCCTGCACCACACTAAGCCCGACAAATACCAGCTGCACCTGCACGAGAGCTGCG 

TGGTAAGCCCTGAAGCCCTAGCCTGGCATGAGAGCCAGCATGGGGAGCCTGG^ 

TCTTGAGTCAGCCAAGCTTATGCTCCTTTAATGTCCTOSAGGAGAA^ 

TGTCACTCCAGGACACTAAACTCTCAGGACAGTAGGCTTTCCATGTTGACCTGGA 
1 0 CTTAGCCTCAGAGAAACTGTCTGGGAAGAGTACGTGGCTGTGGGGGATGT^ 

CCTCTCTGACTATTGGTCTTTGGCTGTTTCK^TGCTCrCT 

CrTTGGCAGGGAT CITlTl XSGGAAACATGCTCAACACAGTTAATTC 

AAGGCATCCTCTAGGTATCTGTGTGCCATGCTTGTAACTGCTCTTCAGGAGT 

TGTCTGAGGATGTTTCCACGCGGGGCAGACACAGCTGGGTrGGGTCCTGCTGC^ 
1 5 TGCTTGCCTTGCAGGCAAGTGGTTTGTGAGCACrGGGAAAGAC^ 

TCCAGGTACACTCCTGATTTTCCTCCCCACTCCTCCCGAGT03GGATGCGGCCA 

ACAGGAGGAGTCTGGAGCTAGGAGCAGACCTACTAACCCCTACCCCCACTAGCCCTTACT 

GTGAGGACCAGATAAAGACACACCCTCTGTTGCTAAAGGCrTGTGAATCCT^ 

GTGGCACCTGGTGCGTAAAATAGTGAGCTTGGCTGGCTCTGGGCAGCT 
2 0 TGAGGTCAGGATGGCCACCCTAAGCCATCAGTGTGATGAGGGGGCTAGAATGGACA 

CTCTCCATTTCAGACACACTCAATCCTCTCTCCCTGGTGCAACTTATAATCACT 

CCTGCAAACCCAGGACCAGTCTTTCCAGCTGGCCTGGCTGAATGAAGGGTCT 
CCATCAGTAAACTTAGCAGAGATGCTTATGCTAAGTGTCCTGTTCTCATCCT^ 

2 5 GGAGCTGTTCCAGATGAAGAGGCAGGAGGAAGGGACTCCAAGCCG 

GAGCCGGAGAGTAGGATGCGATTTCXSTTCAGAGCCCTGAAGTGCCAGGGAGAATTCCTGTCrc 
TCCTGTTTCTCCATGTCTTCCTCCCCCAGTCAAAAGAATCCTCATCTGTCTTGA^ 
GTAACAGGCTCTGGTGACAAGAAGGCC^CAGTTTACGAGGTCATCTACTGAACAAGGACT 
AGACACCCACGTGGCCCTGACAGGGAGATGCTGGCCAGGCCCCGAGGACGGCAGAAGGAGGGA^ 

3 0 AGAAGGTGGAGAGGACGCACGCCCCTTCGTGGATTGATGTGTCTGACAGACTTGAAGG 

GGCAGATGCTTCAATTAGATTTCTCTGGAGGCTTCTGCTTCCGTTTCTCC 
TTCTCTGACCTGTCCCCGCCCCACTCTGGGATGGGGGGCGGGGAGATGCAGGAGTAGACATTGTT 
GTTCTGATTGTGCAGTGCACAAGGTTTGTGAACAGTGTAAATAAGAGACTGGTTGCTGACTGGCCAGTGAGG 
CACCATTGCTCGCCGTGGATTTTGCCTGCTGTATTTGTAACCCACT 

GGGGATGGACAAGCCATCCTGGCGCACTGGGACTGAA^ 
AGATGGACAGACAAGATGATGGACAGTACTGGGATGGGCCAGCCTG 
GCTCTGCCCCATCTGTCTCTTGCCTTGCCCACACACCTGACCTCCATCTCTITCTGTCC^ 
GCCCCCATCTTTCTCTTTGGTGCACACTCAATCATTGTGATGAGGAATGGAAGT^ 

4 0 CGCAG C CT AG AAAGG ATTC CAT ATGGGGGTGGGGGT AGGGGGG AG ATG AAG CG ACTCAGCTCAGGG AG CT ACCAGAGAAAAACAGC 

AACTGGTGTGGGTGCTTTTTTTTTTTTAATTTGAATAAAAATAAT^ 

TGCTCTCAGCCCTTT CTGCTCCCTGAAGCGGGGAAATGGTGACCCCCCC CCCCACCAGTCTGAGAAGATGGGCGGGACAGCGGGCT 
GGGGGCTGGGGCMCAGTCATCTGCAGATATCACCCCCTAGAGTACCAATGTTTGTGGTATTAGGTTT 

4 5 TTTATATTGCATTTITATGGATTATTTTATAATGTGGGA 

AGAGCTCCAGCTGAGCGGCTGCCCGGAACAGGATGTATGTTGGGACATGG^ 
GTCGTCGTGCCTGTCTAGAGTCTGTCCATCTGTCTCTCGGTTTCAGATGGTCACT 
GGAGGGTTAGGGTGGAAGAGAGTTATTCCTGAAGAAAAAAGAAAAATTGAAAAG 
GTTACTATTTAAAAAGGTTGGTTTGATTGTTGTGTTTTAATTGCCTTTT 

5 0 GTCCATGGTTGGTCCAGTTGCATTCTGGAACCTGGGAAGCATCrCACCAGAAAGTCCTCCTCCCTT^ 

CCTCCCTCCTCCCACCCCATCAAAAGTCTGTAAGGGCTGTTGGGCTTTTTCTAGTCCAGAGCATACTACACCAC 

ATGCACCAGGCAGAAGTAGACTGGGAACCCCTGGCTAAGATGGGTCTCATGGGCAGCACCATGCTCATATATGG 

GCTATTTGAG CAAAATTAGGAAATTTCAAGTGAAT ATCATGT AAGTAGTTCAAGTAAG CGTC ATGGTCCTCGGG ACCCAGAAAGTC 

TGCA UT1T1T1TCT TGTAAGCTTCATTGATCT 

5 5 AGCAAGGC AG ACCACGGG CAAGGCCAGGTG AATGAT C CCAAGAAG G CTGGAAAAG CAG AAAAGACTG AAAAGT AGTTT C CTGGC CT 

TCGTATACCTACCCATCGCTGAAATAAGCCTCTCAGAGGACCAGGATTTTAATGGTG 
TCGGTCACACCGGTAACCTGCTGTCCCTTCATGGAAGGAGA^ 

TGGATAGAACTGCTGTCCACCTGGATCTTGACAGAAGGGCTGCAGGGAGGAGCCAGC 
CTGACCAGCATGCCCTCCAGACCACTCTCTTCTCCA 

6 0 TCCTTGTGGGTGGGGGAGGGGACAGAAGGCCACTGGTAACTCCCAATAGTTTCTGACAGCT 

GGCCCCAGAGAGAAGGTGACTCTCCAGCCACCATATTTGCTTCCCAGATACCCCCATCT 

AGTATGGCAGGGCTAAAGGGTCTCTTCACAAGGGGTTATAAAATAACCACT 

GAAAACATTTTTTAGTACTAACAATATAAAATATTCT^ 

AT AATGG CAT ATG C CTG AAATTG CAGCACTT AGAATGG AAT GGT AAGTGG AT CT CTG AGTT CAAGG CC AG C CTGGTCT ACTG AGTG 

6 5 AGCATATTCTATACTTAGGATCCATTTACAACTATTGCACCAGCT 

GGCTATCTAGTAAGTTTCAAGTCAGCTTGATCTACAAAGTAAGAC 
CTGAAAAAATACAAAATGACCTTTGGTGTGTGAAAGGACATTCTAAATTGAGTOT 
TTTTTAACACCCAAGTCTAAGCCATTTCAACCTATCACGGGTC^ 
GCTTGATTGGAAGGAAGACTCTAATGTAGGGAGTAAGGCTACCTGGGCCT^ 

7 0 GGCAGATAGCCCAGAGCAGTCATGGGACTATTCTAGAACTCGCCTGTTAGTCTTGCT 

TACTCACATCCATCTACTCTTCACrCATTTATCTAGCATT^ 

ATCCATCCATCCATCCATCCAATGTGATAGACTCTCATAAGTTCTCAGGCCTGGTTCT 
CAATCCTTTCACAGAGCTCTCAGTCATTGAGAGACAATGACAGAAAC 
ACCTGGGCAAGAGACTCAGAGCAAAAGGGCGCCCAGGAGGAAGAGCTGAGCATACCTGAAGG 
7 5 GTGGGGAGGAATGCTG CAG AGAG AAGGAACAG CATGTTCTAAGGT ACCCAACCCAATCTAGTTCTG ACCCCAT CTTTCTTCCTGCC 
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CCAGCCCAGCTTGTCAAGCTTCAGTTCTTGCT 

AG AAGATGCAGGGGACTAGGGAAAAG ATAG AC CAAGGGT ACAG CT CAGGAGC AGAAGGGTCAATGC CCATGCTGAACTCT AGACCT 

CTGGGTTCCCAGTGTGATTGCCATTAAAAGACCTTAAGCCAACCATTGCCAGGC 

GCCACTGTATTCACATTTCCCATTCCTCTGCTTCCTTTAACT^ 

AGACAGACAAATAGATGTATGGGCAGGTGGATGTGCTGATGGATGTGAGGATG^ 

AGAGAATGGATAGATAGGTAAGTAAATGGATAGATGTGTGGGTGGAGG^ 

AATGTGT AGATGG AAGGATGGGT AGGT ATGAATAGAT ATAT AG AT GGGTACATGGTTGG CT AG ATCAT AAATAGATGGA CAAATTG 
ATTGGG AATTTGG AT ACATGGTT AT AT AG T AGATTTCTGCTTCATGGGGG CATGCTCAGAGT CTCATAAAGAACTGATCTCTT AGT 
GGGAAGAGATGAGAAATCTTGGGGTGACAAGTAAGGGAAGAAAACCCAGAGAC^ 



ACCATCTCAAAG GTAGAATGAAAGAGG AAAAGG C CATG CAG CATAGGCCATCT CTCTGCTTCCCAGCAGAGCACATTGCT CTCTAT 

(^TGACC^CTC^TCCACAGTCCAGAAATGACAGCTGGCAACT^ 

CTGGCTCGGGCATTACGTCATAGTTATGGAAGGGCT 

GAGACrTTGGGGAGTTGGTGACTACATCAAGCCAGGTGA 

CTAACTAATACACCAGCACACGTGTAAAAAGTTTGGCAT^ 

GATCCTC^GATGCTTGCTGGCrrAGCrrAGTGTAACAGTGAGCT^ 

GCAGTTAAGAAAGCTATCCAGATGCTGATCGCTTTTCrACA 

ATGCAAAACX^AGGGAGATCAACATGCTGGACTAATTCTATTTT 

ACTCAGAAGGCAGAATGAGGAGTCCCCAAGCTTAGACTAACA 

CTAAGCATAAGAAATGGAGGTTTTATCTAAATGAGGAAAATACCTAGT^ 

AAGAAATGGAGGTTTTAAAAAGGCITAAGGGGAACTGGTAGACCAC^ 

AG ACAT AACTG AT AAAGGGTT ATG ACACAGAAC CATATAACCT^CCTTTGGAAAACAAG CAAGGTTT GGAAAACACCAGGTG ACTC 
AGGCTTATCAATAAAACTAAACAGATAAAGAGCATACAAAGGTCAA(ZACrCACT 
GATTTTATTTCTATTTTACATGCATGGGTGTTTGCCTGAAGATATGT^ 
AGTATGTCAGATCCTCTGAAACTAGAGTTATAGATGGTTGTGGGCCTTCATGGGGATGTTC 

GAGCAGGCAATGTTCTTAACAGCTGAGCC ATCTCTGCCAGTT CCTGTGACT T'l TC ' r'l'GACATATCAAAGT AG AAAGCATAGTGTGT 

GGCTATACTAAGCCTATAGCCAAGTATTAAAGTTGACAAAG^ 

TTATGATTTAAAATTTTTATTAGCATATATCAGTCATACAAAGTAATAGGTTTCA 

ATATTTACCCCTCAAATTAACCAACGTAATAAGGaVAAATGAAGTGATA 

ATTATTTGCAGACATCTTGTCTGCATATAGAGAGAATGTACAACAATGAAAA 

AATATTCATTAATCAGTAGCACTCGATTCATAAGCAGAAACCAATGG 

ACT ACC C ATGG ATTT ATCT ACATAAG AT ATTT ATGGAGGAAAAACCACAC C ATC CT ATCAAAG AT^ 

AAGCTAAACTAAGTGAGTGATTTAAAAAAACCAATCTCACTTAA 

AAGTOVGAGCAGGTTGTTCrrTGTGTAATCTGACACTAGAATATATATGAA 

GACAAACAGGG AATGGACAGAGGATGGTCCG ATAACAT AG AG CAATGGAAAGCATG CT ATCAGTATGGTGTAG ACAGATCAACAGA 

ACCCACTCITATGAGGCCCATAAATCGACCATATATAGGGGGAAOT 

TCCTTTGGCAAATATTTCCTGGACAGTCGGCTACCTT^ 

GGTCACCAAGCTGCCTTGCCACATTTACATTGCCAACATGGCTCTCrATGTC^ 
GGCTGTGGTTAACAGCTGTTTTAGCrTTTAGCATAG 

AAGCTAACAAAAAAAG CCACCAC C AACAAC AAAACAAAAG CC ACC ACCT CCACCACCT CCAC CAC CACCACC AC CACCAC CAC CAC 

CACCACCAACAACAACAACAACAACAAAAACCCCTAACAACTTC 

TTCAAAGTCCTTGGATCTGTTGACCCTGGCCTGTGGAGGAACAAAACACGTAGGCA 

C^TGATGAAACAGGAAGCCCATGCTGTAGTAACAAGAAGGGGGAGGATACGATCCCAGAGACC 

TCCCACAGTTGCTCCACCTCTCCATAGTCTACTCAGAAGTTGAGTT^ 

GGTCCGGTCACTTTCCATGGCTGGCAGCCAAGACCCCCTGCATG 

AGAGCC^TTGAAOVGAGCCTAGGAAACAAAGCTATCATGA 

GATAGATAGATAGATAGATAGATGATAGATAGATGATAGATAGATAGATGATAGATAGATAGATAGATAGATGATTGATAGATAGA 

TAGATAGATAGATAGATAGATAGATAGATAGATAGATGATAGATAGCTAGATAGATGATAGATAGATAGATAGATAGATAGATAGA 

TAGATGATAGATAGATAGATAGATGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAAATTTGATGTTAAA 

ATTCAAATTGTCTTTACAACAAAAAATGCCCAACAAGAAATAAGAAAACAGTAA 

AAAACAAGGGCACCrGGATTGTATGACAAATTCAGATTCTtCCGA^ 

NNNNNNNNNNNNNNNKNNNNNNNOT 

NNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

JJNNNNNNNNNNNNNNNNNNNNN^^ 



NNNNNI^NNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNN^ 

NJJNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNN^ 

ACATTAGAATGTAGTTTAGAAATTTGTGTAGTTGATCTGGTATGTTTTT^ 

GGAAATAGGAG^TCTGGGAAGATCAAACTTCGTGAG^AGTTGCCGCCACAAT 

GGCATTCCCAGGAGTAAAGAACCCAGCTCCTGTAGCTCACTGTGTCACCTGGGAA^ 

CCTTACCATCCACCAACACATTCTGC^AGCCTACGCGAGCCATTCCT 

AGCTGTCTCATGCAGAAACGATGTGGGTGAGGTTTGTTTAGCITTT^ 

GT AATGACTC CATGAAG CTGTCACTTG C CCTCAGGAAATCACACCTG AG CAC CCCAG CTCATC CACTCACG ATG T ATCG CTTCC AC 
TGGATGCTTATCTGGGAAGCCGCAGACAGTTCIATCACTGAA 



75 
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CCTTTTGGCTTTGGGGGTG 
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gtcgggatgcccgcttccgggccccgto^agagc 

gaggctcccggcggggagggggccagggacccagaggaggcacggggccgcgtggagc 
atgtgtccttgtgggctccaggcaccccatcaacccgggcajgccgggatt^ 
agacgaattccagttcctgcaagctcagtatcacagccrcaaagtggagtatgaca^ 
5 gccattatgtgatgtactatgagatgtcctatggcttgaatattgaaatgcaca 

at c ct ag cccagat catgc cttttrrg tcacaggag catcagcagcaagtgg cg caggctgtgg aacg cg ccaag caggtcaccat 
gacx3gagttgaacgccatcatcgggcagcagcaa 
caccccacccgtcaggcctccagcctcctgggattcccccagtgacaggaagcagct 
agtcaagctcacttggcggtgaaggatgagaagaaccaccatgaactgg 
1 0 gtcaccctctgaaagcctccggck:cagtgagaagcaccggc^^ 

agaaagacagcctcagcagatacgatagcgatggggacaagagtgacgacct 
ccccgggtgagcccagcacactcccctcctgaaaatgggct 

CTCCXjTGGCTTCCTCCAGCAGCACACCTTCCTCCAAGACCAAAGACCTTG 
CCAACACACCAACGCCAAGAAATGATGCCCCAACTCCAGGCACCAGCA 
1 5 GGCATGGACCCGATAGGTATAATGGCCTCGGCCCTGCGAACACCCATCACCCTCACCAGCTCCT 
GAGCC^CCACGAGATGAATGGCTCCCTCACCAGCCCAAGCGCC^ 
CAGCCGCTGCAGCCGCCGCCTATGGCCGATCGCCAATGGTGAGCTTTGGAGCT 
ACAGGCCTGCCTTCCAGTCTCGCCTCCATTCCTGGAGGGAAACCGGCATACT 

TGTGCCCTTCCCCCATGATGCACTAGCAG^CCCTGGCATTCCCAGGCATG CCCGG CAG ATCAATACGCTCAGCCATGGAGAGGTGG 
2 0 TATGTGCTGTGACCATCAGCAACCCCACACGACACGTCTACACAGGCGGCAAGG<3CTGT^ 

GG CAG CAAGAGTCC CATCT CC CAG CTGGACTGC CTGAACAGGG ACAACTACAT C CGCTCGTGCAAG CTT CTCC CCG ATGG CGG CAC 

GCTCATTGTGGGTGGTGAGGCCAGCACGCTCACCATCTGGGACCTGGCCTCACCCAC^^ 

CGGCTCCAGCCTGT^ATGCCCTGGCCATCAGTCCTGATGCCAAAGTCTGTT^^ 

GATCTGCACAACC!AGACCCTGGTCAGGCAGTTCCAGGGCCACAC^ 
2 5 GCTGTGGACCGGGGGCCTGGACAACACCGTCOTCTCCTG^ 

AGATCTTCTCCCTGGGTTACTGCCCCACrGGGGAGTGGCT^ 

AAGCCCGACAAATACCAG CTGCACCTGCACGAGAGCTCCGTGCTG CAAGTGGTTTGTGAGCAC 
TGGGAAAGACAACCTTCTCAATGCCTGGAGGACGCCrTATGGAGCCAGCATCT^ 
GTGACATTTCAGCGGATGACAAATATATTGTAACAGGCTCTG^ 
30 ACTCTAACAGGCCTGTCAAACTCTGGG AGAGACACCCACGTGGCCCTGACAGGG AGATGCTGGCCAGGCC CGAGGACGG CAG AAGG 

AGGGAGCAGCATTGTGGGGCTGCCCTCCAGAAGGTGGAGAGGACGC 
CTTTCTCCTCTCCCCCTAACCAGTGCTGGCAGATGCT 
CC 

35 MOUSE SEQUENCE - CODING 

ATGTGTCCTTCTGGGCTCCAGGCACCCCATCAACCCGGGCAGCCGGGATTTA^ 
AGACGAATTCCAGTTCCTGCAAGCTCAGTATGACAGCCTCAAAGTGGAGTA 
GCCATTATGTGATGTACTATGAGATGTCCTATGGCTTGAATATTGAAATGCACAAGCAGAC^ 
ATCCT^GCCCAGATCATGCCITTTTTGTCACAGGAGCATCAGCA 

4 0 GACGGAGTTGAACG CCATCAT CGGGCAGCAG CAACTCCAGGCCCAGCACCTCTCCCA 

CACCCCACCCGTCAGGCCTCCAGCCTCCTGGGATTCCCCCAGTGACAGGAAGCAGCTCT 
AGTGAAGCTCACTTGGCXGTGAAGGATCAGAAGAACCACCA^ 

GT CACCCTCTG AAAGCCTCCGGGCCIAGTGAG AAGCACCGGGGCTCTGCAG ACTACAGC A CGGAAGGCGGAAG 
AGAAAGACAGCCTCAGCAGATACGATAGCGATGGGGACAAGAGTG 
45 CCCCGGGTGAGCCCAGCACACTCCCCTCCTGAAAATGGGCTGGACAAAGCCCGTGGTCTGAAGAAAGATGCCCCC^ 
CTCCGTGGCTTCCTCCAGCAGCACACCTTCCTCCAAGACCAAAGACCTTC^ 
CCAACACACCAACGCCAAGAAATGATGCCCCAACTC(^GGCACC^GC^CC^^ 
GGCATtK5ACCCGATAGGTATAATGGCCTCGGCCCTGCGAACACCCATC^ 

GAGC CACCACG AG AT G AATGG CT C CCTCAC CAG C CCAAGCG C CT ATG CTGG CCT ACACAACATC CC AT CC CAGATG AG CG CCG CCG 
50 CAGCCGCTGCAGCCGCCGCCTATGGCCC^TCGCCAATGGTGA 

ACAGGCCTGCCTTCCAGTCTCGCCTCCATTCCTGGAGGGAAACCC^CATACTCCTTCCATGTG 

TGTGCCCTTCCCCCATGATGCACTAGCAGGCCCTGG CATTCCCAGGCATG CCCGGCAGATCAAT ACGCTCAGCCATGGAG AGGTGG 
TATGTGCTGTGACCATCAGCAACCCCACACGACACGTCTACACAGGCGGCAAGGGCTGTGTGA 

GG CIAGCAAG AGTCCCATCTCCCAGCTGGACTGCCTGAACAGGG AC AACT ACATCCGCTCGTGCAAG CTTCTCCCCGATGG CGG CAC 

5 5 GCTCATTG TGGGTGGTGAGG C CAGC ACG CTCAC CAT CTGGGACCTGG C CTCACC CACACC C CG CAT C AAGG CTG AG CTG ACGT CCT 

CGG CTC C AG CCTGTT ATG CCCTGGC C AT CAGTCCTG ATG C CAAAG TCTG TTTTT C CTG CTG C AG CG ACGGG AACATTG CG GTTTGG 
GATCTGC ACAACOlGACCCTGGTCAGG CAGTTCCAGGGCCAC ACAG ATGGGGCCAG CTGT AT AGAC ATCTCTCATGATGG CACTAA 
GCTGTGGACCGGGGGCCTGGACAACACCGTGCGCTCCTGGGACCTACGTGAAGG 

AGATCTTCTCCCTGGGTT ACTGCCCCIACTGGGGAGTGG CTGGCCGTXKK5 CATGG AG AGCAGCAATCTCGAGGTCCTGCACCACACT 
60 AAGCCCGACAAATACCAGCTGCACCTGCACGAGAGCTGCGTGCTGTCCCTCAAGTTCGCCTATTGTGGCAAGTGG 
TGGGAAAGACAACCITCTCAATGCCTCGAGGACGCCTTATGGAGCCAGCAT 
GTGACATTTCAGCGGATGACAAATATATTGTAACAGGCTCTGGTGAC^ 

HUMAN SEQUENCE - GENOMIC 

6 5 AAG AGTCTCCCTTTTCTAGTAGGCCATTTCACAGG AGCAAATG AGATCTG AAATT AGA CCAAAG CACG AGG AAA 

CACAAGATCCTCGGAATGAGGCATCACCAGGACACCTAAACTATAAAACTVGATAAAAATTAAG^ 
AGCGATCCCTGTGACCACTCTCITTACrTCCTCATCTTCrCAGCACTCCCATCA^ 
C^TTTGAAAGCTTGACTGGTTTACAGAAAGAATCAATGCCAATGCCAGAAGGCCT 
TGTGTGGTGGTXKIACAAACTACCTAACCTTCCTTGGCCCTCAGCT^ 

7 0 TCCGTCTTAACTAGAAACCCACAGAC^CGACAATCCTTTAAGCA 

ACCTTGCTTGATACTATAGATTITTTGTGG 

TTTTTTTOVGTTCCACTCCAACATCAAAATCTATAGTCATCTATGTGCCTl 
CCCCATTTCTCTCTACTTCCAGAACTCCCAGGTCCACCTC 
OVACCITCTrCCAACCCACCCCTCrCTGGTTGCCCCATCTCCCT 
7 5 CAAAAAATACCCCATCACCAGTCCACTCCTCCCTGCCAAACTCCATACAACCTAACTGCATT^ 
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AGGGATGTGCCAACAGCAA Cr r ri XSTGACTATTTTCCAATGTTACGCAGGCACCTAATGGCC 
GAGCTGCCAGAGGATCCCAGGACTTCTCTAGCATTAGCTCrTC^ 

AGAG CAATCTTT AAAATGG CTCAAAT ACTGCAGAAGG CAGGAACCT C 
ACTGAGAGTAGAGGAGAGAGGATAGGCTTTGGAGACAGAGGGGCCTGGGCT 
5 ACTTTCCTTCCTGTATCTCAGTTTACTCATCTGCAAAATAG 

T AAGTTTT AGGGTC CATGTGCAT AACATG C7VGGTTTGTT ACAT ATGT ATACATGTGC CATGTTGGTGTG CTTC 

TCATTTAAC^TTAGX3TATTTCTCCTAATGCTATCCCTCCCCCCTCCCCCCACCCCACAA 

CCTGTGTCCATGTGTTCTCATTGTTCAATTCCCACCTATGAGTGA 

TG AGAATGATGGTTTCCAG CTGACAATGACACT ATTT ATT AAG ATT ACATG AGATAGT AT ATGTCAGG ATC^ 

CAAAGGCACTGCTACCCTGAGTTGTGAAAACCAAAAATATCT 
TTAAGAGCCACTGGTGTAAGAGAACACGCTTGATACAAAATCAGGCTCTCACTA 
AAATTCATTTATTAAAAAGATACAAAGCrGATAACAGATCTGAGAGTGAAA^ 
GTTCCCrCCCCATTCCCCATCCCAACTCAGAACCTCCATCAAAGACCAGGTATGCCGGGCACGAA 
1 5 TGACCCCAGAACTCCCCTCCACTGGCGTGGATAATTCTTACAGCGTGGGCCT 
AAAATATTTCTTAATCTCCTAAGCCTCAGCTTTGAATCCTTAAAAAGGAACGA 

CAATCACTTCGATAAATATTTACCCGGTTCGGGCATGAGG CATGGAAACCCTGTGC CATTAGTCAATAGAGGTCAGCAAAGTGCAC 
CTTGGGAACAGTGGGATGCAATTCAGCTCAACTTTGGAGCGGGTC 
GGTtMCAAACACATTCCATGCCTACCCTCCACACCTCCCCCCCCACAAAG 
2 0 CGCCTGGTAGTCTGGCITCCAAAGCTTTCTGGAGAGAGGCAGTGGC 

AAGGGCCACCCTCTATAGGCAGCTTCCCTAATAGTCCCAGTCCCTCCCCACCCCC^ 

GG CT CT ATTT AACTCTGAGGGCCT AGAT ACAGCCTGGAAT ACAACCTGG AACTGGGGAC^ CTTGTGTATGTTTTCGAAG 
GCAAGGGCTTTCCTTAGATTTTGGAGGTAAGCTTTCTGCAGATTC^^ 

2 5 GAGGTAAAAATGGAGACCCCAGGAGTTTCGGGGTCTTGCCCAAGATCTTGCAGATCrrT 

GCCACACAGTCCGGATXaAGTGGATCCCTCTCTCCTGGCTGTA 

GGATAGGGGCAGGGTCITAGAGAAGCAAGCTCTTAGCTCCGTTCCCCAGGGAATAAAAAC^ 

AAGACCAAGGAAGCAGAAACCACCCAGGTGCGGGGTGAGGGGGAAGGGAAGGGTTTCGATAGAAGTATTTCT 

CTGCCCACCACAACTTCTGTCXATCCTTTTGAGCTGAGCTCAAGCTC 

3 0 AACTGTTCTCCCCCACTTCATACTTTATAGGCTTACTTTTTGTGGCT 

CAGACAGATTTCTCrCTGCCTCTATCCACTCCCACCT 
CTCCrcCACACTCCAAGGGTCCTTGGGCTTCAGGGATGC 

3 5 GAAAATGTTGGAGGAGTTGAATCACCCAGATAGTTGTACATTATC^ 

GCCAGGTTCTGTACTAAGTACACTACATGCATTATCTCCATCGATTCTCGGGACAATCCT 

CTTTACAGATG AGG AAATGGAGG CTT AG CAAGATT AAAT AAG TC CCC CAAGGTC ACAT AG CTTGAACC AG TCAG AGAATGGG AAG A 

CGAGTTGGGAGACCCTGGGAAGTCCTCTCCTTCCACTAGCCTTGGTCCTCTrCCCT 

AGGAGCTCCCTCKX3GGCCCTCCAGATTCCTGTGGTCCCTCTTGCAGCCCTGGCCTG 

4 0 CAGCAGGGCGTAGTAGGTAAGGGCAGGCACACTGGTGCCCAATTGCCTGGGTCCTGTCCC^ 

GTCCATCAGTCACTGATGTGACCTTGGGCACATCATTGTACCACTGTGTATCAATTTC 
TGGCACCCACrrCATTTCT T GTGTGAGTATTAAATGTGTTCATATTTGT 

TATGT AAG AGTTTGTT AAAT AAAT AAAAATTTC C CTTTCTTTTC CT CT ATGTCGAGGTG CTTG CTTGTGGAGG AAGG T CTGGG ATG 
CCCGGTGAGTGGAGGGGTGGGGGTAGGGGGGACAGAATTACTCTAGTCATGAA 

GCAGTAGAGGAAGGGATGACAGGGAGAGGTTTGCAACCCCACCCACGGCACTGTGATC 
GTGGCTGGCCTC^GCCATACCCTTTAATTCCTAAGGGGGTAAAGCATATACT 
GACTCAGAGAAGTGGCTCCTCGGACTGGCAGACAGTTGCTCACCAGCCA 
CGCTGCTCTGTAAATCCyVGGCCGGGCTTTCTGTCTCACC 

5 0 CCACGTGGGGTGGACTCCGAGGTACTCAATTAAGCAGCCAATGAAAAGT^ 

AACATAAACAGTTGAAACGTTTCCAATTAGAACTGGAGACAGAGACGTTTC 

AATTG T CAAG C CTTTGGACGGGGTGTTG TAT ACAT GTITT1T ATTTT CTTTT CT CTTT CTCCAG <^T AAAAGACATGT ATTTTTCT 
CCTTGTTTTTCCTAATTTGTGGGGTGGO^TGTTGGGGCGGGGTGGGGGGGT 

TGGCCTGGTTCCTTACCTCCCTGCCCCCTTCCTGCTCAGC^TTCAGGCAGCCGGTGAGAGCGGGGAGTGACGCTGG^ 

5 5 AGTGGGAGCTACAGGCAGGAGAGGCCTGrrCTCCCAAAGCCAGGCTCTCAGAAAGGGCTO 

TTCTTCAGCAAAAACTTGCACGTATCCAGCTTGCGTAAGCAAAGGGGTCGCT 

AATCCTGACCCTCTGCCC!AGCTCGGGGCZAAATGATTGAGCTTTCTCAGCCT 

AACTGCTTCCCTCC^GAGTGAGGTAAGGGAGAGGAGAGCGCTGCCTGGGTAGCAGTGCCCT 

CtSCTGTTCTTCCTGACTGCGTAGCGTGGTTCTAC CGC CCGGCTCTC CGGAGG AGGCGGCGCTCTGCAAAACCGCGTCGGGG ACACG 

6 0 CAGCCGCGAACTCCTCCAGCGTGCCGGGCTCGGCCCTGGCGCTGGGGCTCACAGGTGAACT 

GCTCTGGCTCGTGTGGTCACCGC^KKjGCGTGGACGTCGGGGCAGGAGGGACACCCXKKS 

tac^caatcttcggaagacccgacctgggggccccgcggcttctc^^ 
agactgagagggaggcccggcgaggcgccgcgtccctggcatcttgaaggcattgtgtt^ 

GCTCTCATCCCCCTCTCCTCTCAGCTCCCAAATCGAGACCTGCTGTAGGA 
65 TGAGTAGC^GCTTCAAAGAGCTCCCTTTAAAGTTCGCCCCCTCCCCCGCCCCCACGTCCCCTTTCACA 
ACTCCTTCTAGACTTCTGGAGGACGACGACAACAAAACACTAAA 

CAG AACGTGGG CTGTGG AAGTGACCTGTTT CAGGCGTG CG CAGG AGT(Xl C<^CACCGT AGAGTTTGGG l H riTXn v lTIT CTfTCTTT 

TTTrrccTcrrTrcTTTTCCTTcr^ 

TACGTTTCATTACTAGGGAGAAAAATAAAATATTCCTTTGATACACAAAATCAAATTC 

7 0 CATX^CTCCTTTATACCACATGGTAGTGACAGATTGTlTGAGCrG 

GCACTATTTGCAAGCGGGAGGCTCrAGAAGCCTCGGCATTTGTCAGC 
GCGGACTGACAGGGCCCAGCAGCTCCGCCAATCAGCAACGAAC^GCCCCT^ 

GAGGCGT^TCG^CXSTCCCCCACGTGGGGCTGGAGCG AGAAATGGG CGGGG CGAGTAAG AAGCGGGTGCGCCCACCCGTGACGTAA 
GGGGGAGGGCAGAGAAGGGGGAGGGGGAAAGGAAGCGCCAAAGAGGAGCCA^ 
7 5 CATCTGTCGAGCCGGGCTTCTCCCTCCCCCGTCTTCXKjTGCtSCGCCGAAACA 
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. GAAGAGGGGAGCG CGGTTCGGGG CTGGCGGCCGG AG CGGGCGCX3GGGAGGTTCAGTTCTG AGGCGGGTGCTCACGTGT AACTCCCC 
GCGCTCCGGCCTCCGAGCGCCCGCTAGCTCTCAGCrCCTCAT^^ 
TAGATCCGGGGACGCX5CX^CCGGCTCGCGCGCGGGCTCGCGGACTTGCCGG 
GGGCGGTGCGaSCTTTGAGGTCCACTCCCTGCTTGCTCGT 
5 GCGGGCAAGCTGGCCCCGGAGTCGAGTGGCCTGTGCGACCAAGCGGGGGGTCCGGGG 
AGCGTCCGCGTCCCTCGGAGTGTCCAGGCTAGCGACCGAGACACCTACAGGCTTCC^ 

TGGAGTGAGGAGCCCGCGCGGCGAGGCTGCGTCGAGGGTGA^ 

TGAGCCTGTTGTCCACGGGCAGCCGGGCCCTTCCATCTGCGCrrGCCC 
1 0 CGCCGCACATCAAAGGAGCTGTTTATTTGGCTTTCTGAA 

GCGGGCAGAAGAAGGGGTGAAGAAAAGGGGACATCCTGAGTTCCGAGCAAA 

TTTTATCAGTCCTAAATCTTTTGAAGTGAAAACATrAAAGTGACAAAGTGA 

CAGACCTAATGAGATTAGAGGGAGGAATCGTTCGCCGCCGCTGCAGAATAAACT 

TTGTACAGCTTGTCTTATTTTCTTTTGTTGTTATTCCTCTCT 
1 5 CCGTGGGAAAAAAAAGGAGGAAAAAAACTCGAGCAGCCCAGGACTATAATATATATCTATTTATTATATGTCTTT 

TG ATCTGAGG AG CGCreTTTTTCACTCCCATG CXIAAAGTCAC CAG CGGGT AGGGCCTGATTGCT ACTTTTCATG CAT AG TTTT AGA 

CAC>AAGAGGGCATTGTGACGTCGCGCCCAGCGGCACCCAGTCGTAGGCCGAG 

TTGGCCAGCGCCGGACGAAGCTGCCACATTATTTTGTTACTTTTCAGTCC 

TGCTCXSTCCCCTTCGCCTGTCTGTGTGTGGTATCCGTAGGTCCGGGGCACTTl'rT T T Tl GGTGTGTGTGTGTGTGCGTGCGCG CGT 
2 0 GTGTGTGTGTTGGGTGTGTGTGTTTTGGGGGTTTGCCGGGG 



CTCGGTGGGGACGCGGTGCTCAAGCTGGGAGCAGCGAGAACCCTTTGCGGAG^CTTC 
CGTTTGATTTGCACCTACTGAGCCCGGAGTTCCT^ 

TAGCAGACGGAACAGCAGGCTCTCCTCTCCGAGGGGGGGCGTCGAAGCGCCCX3CGCCGGGCTCCCGCCCT 

2 5 TGCGGCCCCAAGCCACCACCTCGTAGCCCGCCTCCTTCCTCTTCTCGC 

GCGCGCCCCGGATGCTTGGGGCTGCCCCGGCGCGCCCCCCTAGAGGCGCGACCCGCTGAGTGACCC 

CCAGGAGGGGCCCCCGGCCTCGGGACCCCGCGCCGGACCTGCTGGGGAGCCGACGATGTCCGCGCAGTCCCAGGCATTGACG 
CGACGCAGCCCAGGGAAGGCGGCCGGCCGCTGCCCGCCGAGCCCTAGGCGCACAAAG 

GCCGACGACTTTGCCGCCTGCT CCGCGGCTCTTTGTCTCCACTTGGGG CGGGCG CCCG ACTCTGGGATTTCGCTGCGAG AACGAGC 
30 TGGGGGGGCCGGGGGCGAGCTCTCGGTTTCCCGGCCGCCCCCCGCTCGGGCT 

GCTCTCGGCGCTTCCACGCTCTCGGAATCACGACCCCTCCCTGCCATGTATCCGCAGG 

GGTTTATTTAAGCATCT ATCATGGCAGGGTAAACG AGGCAGTTTATCTGGCACCT CCATTTT CTTTATGGTGTG 

GGGGGGCTGCTTGTGTTCCAGGGGGAGCTGTTCACGCGGTCCCCGGCCCCGCTCTGA^ 

3 5 GCCCCCGCCTTACTGGGGAGAGGGAGGCCCITGGTGAAAGGCATGGGAGCCCCGCTGCCGA 

CCGCCCCGTCGGAGAGCTGAGAAACTTGCTGGGAAGGTTCTGGGCTGA 

CGGGGCCGCTCTGAGCGCGGTCTCATGCCCCCGCCCCCTCCCCTGTCTTGTCTTCGCGGGCT 

GCCGGGATTTAAATTCACGGTGGCTGAGTCTTGTGACAGGATCAA^ 

GCCGGTGGGGGCCGCCCCGGGGCTGGCGGGGAGGGGTGGGTCCCTGCX5CGGCTCCTGCCATCCCCGGCGGGGACGGAGGGGGCT 

4 0 CGACCTAGGGGCGGCGGGGCAGCCCGGGCCTCCGCCTTTTGTTTCCCCTTTCCGAGCG 

CAGCCCGGCTCCCAGCGCCGCCCGTGGGGTCCCCGAATGAGGGTCTAGGTCGCGGCGAGCGGGTTCACTCCGGGTTCTGGGTGTCG 

GGAGCCAGCGTCAGGGTCCTACTCCCCTGCTGCCGTGATAAGTCCTTTGGCTCGCGGGAAGTT^ 

CCCACGGCGGGGACCOU^TCCCCCCGCGTCCCCAG^ 

GTGGCTCTAGAGGCCCTTGGGTCGGGGCCTCAGAGGAGGCTGTCCAGGCGCCACGCCTGAGCCGGCTCCT 

4 5 CAAAGTGGAGT ACGAC AAGCTGG CAAACG AGAAG ACGGAGATGCAGCGCCATTATGTG ATGGTGAGATGCGGCCG CGGGCCTGGGG 

GGCGGTTGGGGCGCCGACCCCGGCCGGGGGCGCGGCGTGGATCTGGGGAGATGAGCGCCAGGTCCCCACCACTC 
GCGGCCCTGAGAGAGGGCATAATCTGGTCTTGCCCTCCCCTTCCCCAGCTTCCATTGCCGGGGCC^ 
GGGAGGGGCAGAAAGAGAAACCCCGATTCCAGGCCGGAGACTTCTGTACTATCTTTATG^ 
CCCAGGGGCTGATTGCCATTGCCCTATCTTGGCTAGTAAGATAGGATTGGACCAGCCCATGTGA 

5 0 TCTTAGTGACCTCCTGGGAGTGTTTGGAAACTTTTCTGCTTTGCAGCTGGGATCT 

AGGGTGTTGTGAGGAGTGGAGAGGCCTTGTAAAGCTCTCTCTGCCTCCACTCCCCTCCT 

CCACGCTGGGGCAGTAGGCAGACTGTCTTCCTTTTGCCCAGCAAGTT^ 

CTTTCCACATAAGAGGCCTTCGACACAGTGCTAAATAACTCTCAGAGGACAAGAGGGTC 

CCTGAACCTTGTATAAACAGGATATAGGGCACATGTGGTGCTTGTGGGAGGGAAGGCCCAGCTCCCCTTACTACCACATAAT 

5 5 CGTTCTGTTGAGATGAGAGGAACGCTCGCCCTTGTCAGCTCTTAAACCGCT 

TT AG CAAATG TAT CG T AAAAGT ATG ATTTGG CCAAAACCTTT C CLAAG AAAAAACTGATTTTGAAAAATG AC CAG ATTT CTGTGTTG 
T CTGTT AAT AG C CATTCTTTTTACTrTTT AGT ACT ATG AG ATG TC CT ATGG CTTG AACATTG AAATG CAC AAGCAGGT AAGTG CTTT 
TCTCTTTTAAAAATATATTTCATTTTITAAAAACTTATATATG 
TTCXIATTCCTCCCCCTTCTCCTTGAAGAGTGAATGGATCCAGTCCTTGAA^ 

6 0 (ZATGTTGAGCTTTTAGCATTTGAGCTTGTGAAATTAGTCCrGG 

ACCTCCAGGATCTGCrTTACAAACTCCTGAAAGCCC^GAGGTGGCTGGCTGCT^ 
AATOCACK5AGGATT TTTTTTTT C T TTCT 

AGAACCITAAAAGCCAGGCTGAGCTGTCGAAATTTAAGGGAACGAAGC 
AACTTCAGTGCCTTGCATTGACAAGGTGGCITTCTCATCCCTG 

6 5 AGCTGAAGCTCCTGGGTGAAGTAACTTGCCCAGGGGCAGCCTAGCCAGGACTTTCGAGATGGC^ 

ATCCTGAAGTCAAACCAGTTTAGCTAGTGTGGTGTCGTTTG^ 
ATACCTCAAACCGAGCAAATACAAACACCCTTTATCTCAGTGCATATCAGTAACTA 

GCCTGTTTG TGT ATTTG TGTTCTG AGG AAAGG CTGCCT AT AATTTGG CTTGG CCAACGTTT AGAG TTG ACTTGG CACG TAT AAATG 
GAGCIAGG CAGGCCGGTTGTTCTTTTTAACAATTGGTGTTC^ 

7 0 TG ACTTTGAATGG ATTTTCTGCTCTAAATTAACAGTCGT CTCTTTAAGGAGGCAGG ACCCAGGG CAGG 

TGAGAAGGGCAGGTGACCCTGGAAAGTGGGCAGATTCCTACCCACTCGGGTGGCTTCCT^ 



GCTTGTGGCTTCCTGAGGGATTAGTTAGTCCCTTAGGAAG^ 

TAGACAGGGCTTTATACGTAAAGATATTTCTCCTTGGAAAAATAGAATAGAGACAGGTTCrc 
75 AAGAATGCTTGAAATGGAACTGGGGGCCCAGGTGTTTC 
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CTGTCTTAATTTTITTTTTCTCAAATA^ 

TTATTGGAAGAGCTTGGTAATTAAGGTGTAATGTCTCGTTGCATGTT 
ATGGTGCTTTGGCACTAAGAATCCACCTTXX5CTTC 
AAATATCTTAACGCATTTGGTGGTGGGGTTTATATAAAGAGCAGCTGTC 
5 CTTAAGGGGAGAATGCCTGGGGTTTGGCTACGTTGTCTCATTTAATCCTCT 
CCTGTTTATAGGTGAGGAAACTGAGGCTTCAGGGAGGCCAGGAGGG 
GGGCACTTTACCTTGCTTGCCACTCACCTTTGCCCTCTAGC 

GCAAGCTT ATGGCAGTCTCTGAGTTG<MTGAAATCACCCTGTGG CTG TGG AGGAAC AAAGAGGCAAGGAGGTGAAG AAACCTGTGC 
CGTGAAATGCAACCCTGATCTAATGGAGCTGGCCACTCCCTAGGTTTGCAAGG 

TTTTATCGGCACACACAATCAAGTGACTGGAGT^ 

TACATCTGCCTGGAACAAACAGG CTGGATT CTTCATCTTTAAGCTGGGG TGGGGGTGGGGACTGTCTG T ACTTTT AT AT AAT AGAT 
GATCTATAGAGCTAGCACCTTGGGAAGTGTGGCAGTCCTCAGCCTC 
ATTTTTTTTAAAAGCTCTCCTTTGCTGGCTCTATGGGAA^ 
1 5 TTGACAGAGGGCAGCrGATGAGGTGCGTCCAGCCCA ri T IT 
CTGTTTCTCTGTCTCTCTCTTTTTCC^ 

CCGTAATTATGGTAAAGOVAAGATTACAGAATCACAGGCCTX5GCCC 
TTCCAGGCAGAATGTCAAGGGTGGTTCAGAGAGTCTGGGAAGGC^ 
AGAGGCTTTCCAGTGTCAGCCACAAATAACAAAGCC^GCrTAA 
2 0 ACTGGCTCTCTGAGAAGCTGGGTCAGGCAGCCTTTT 

TTGATTTrcGGGCAGTGTCGTGCCGACACAATACAGTCACCTTrr^ 
TCTTGGAACTCGAAGAGCCCTTTGGCTTCTCATGCAAACTTI^ 

ATGTACTCAGTCATACAGTCCC^CGTTAGTGGTAAAGCTGCrACTGCTGCTTCCCITCCAA 

AAGAAATGAAGAGTGCTC C T 1 1 1 CTAGTG TATTCTGTGT CTTT AATT ACT ATTGITTCTTT CTGTGTT ATTATCCAAACAGGCTAA 

2 5 GG CTACAAAACTGGCCCCACCAAAGTCCAAAGTGCCCCATTT CGGGTTGTATAATCATATCACTGAACCCTTATTGTCCACAGTCC 

TGTTTTGAATTGCCAGCTATTTGCACAACTAATGATTTGATGTGTGTAATAATAATACATAATGCATA^ 
AAGATTCACAGACCGGGAAAGGTGGTATATATAACCTGTGAACTTTTTACCCTAGCCATGGGGTGCC^ 
AGGAAATGGAAAATCCCCATTTX3TCACTCCCCTCCCCTCCCCGCTCrGGATCTTCTCATCT 
ATCTTGAGGCACACAGAAGTTTTGTGACCACGGTGAAGGTGCAAGGTACAGGTGG 

3 0 AAGGACTCTGCCAAGGAGTTGAGCCTATCTTGATAGGAGAATGTGGGACCACTTGC^ 

AGGGAAGTGCCTATTTGTGTGGCATCTGAGTTGAGCTGTC 

TCCCACCCGGCCCCATGCCCrGGTATCTGTCTTGACTTTTCCTGCTATATTGATCTTCCCT 

CCCCACCCCTCAAGACTGCCITTGGACCCCTTTTCTCTAGTTGCCAGTGGGAGTGGTAG 

3 5 TTTTCCTTCTTGTGCATTTGTCATTGAAATGACCCTGGGAGATGAGCAAATGGCCCC^ 

ACTCCTC CTT C CCACCTAGG CTGTTTGTTTC CATTTTGTTGAAGG AGTGAG T CAG T CAAGG AAT AAATC TCC I'lTlGCnTCTACG 

TGGGAGAATGTCATTCTGTCTCATTGTTTGTACAAAATACTGGGACACCAGGGGC^^ 

GTTTTCCCCTCTGATCTGGCACTCATTCCTCATCT 

4 0 AGTATGTAATAGGTGCTCAGTAAACATCTTGGTTTTTCTT 

CCCTGTGAGTAGGAAGTGGGGATTTTAGTTGTGGGCTGGCCGCCCGCTCTGCAGGCITCCT 
CATCAGTGAAATGAGGGGGTTGGATTCATTCCCAAAGCAGTTATTAGGGGACTGG^ 

ACAGCCGAATAAGACACCATTCCCTCTG CGGGGGAGTCGGATGTCTAAAGAAGTAATTG CAATG CAGTGAGGT AAGTGCTAATGTG 
GCCCAAGTGTCATAGGAGGAGCACCCAGCACAGAACA^CTTATTCTT CTTGGGTG AGAG CAG ACTTCACTG AGGGCTTTGCTTGGC 

4 5 GCGAGTCTGAAAGGGAGGGTAGGGTTTATGAGGCAGAACTCGGCAGC 

GTGAGATATGTTCTAGAAGCCACACGTCTTCGGAATGATGTGGGGCGGGGACTGCACTT 

TG CAGCAGAGCGATCTCCAAGGCCCCTCTGCCCTTGCAGTCG CGGCTCCCCACCCGGTGGCTCCCTG CATCCCTGCATGGCTGGCT 

CTCCAGGGCTGGCCTGAGTCCATCTGCCGCGGGGTGGGGGTGGGGGCGGGGGCGGGGG 

TGCCTGCTTTCCTACCGTAATATGCTGCTTTGAACTTTATTCCCCAAAGTGGCATT 

5 0 TGACTGGTTATGACCCGCCCCCTCCCCCATATTTAATC 

AATAGCTTTCAACATGCCATTGACAGTCTGGTTTAAGAGAGCACTTGA^ 
CCAGAGTGAGAACTGCGTGCTCAAGGGCCT^GTTTTTATATGCTTAAAGGGACAATATCT 

TCACAGCCAAGATGCTTTCTTCCTTTTTCCTTTTCCTT1 CTT CT'ri'l'GGCTl'TCTCTTCTCCTTCAAAATACATATTTAAAAAAAA 
TTCTTTATAAAATAATTGTTCCCTTCAAGTTGGGGTTAAGGAGAAACTTAAGGAAGGGGGAAACG 

5 5 GAATATTTGGTGGGTTTGTTGTTTTGAAGGATC^ 

GG ACCTTG CGTTT CTCAAGG AGGT AATGGG AGAG T CTG AATT CGCAT CAG CATTG ATTTTG AATT CTTTTCTTTG CT AAGG ATGTT 
GCACAATCGGTGGGCTGCAGAGCTGTCACTGTGAATGTTTAGGCAGAGGCCACCTTTrGG 
GTGTTGGAGAGTCAGGCCATGACCGCTAAGGAATCAGATrCTAGGAGGCTGTTTGCT 
GGAAATCAACAGGGTGGGTGGGGCAAGGACTTGTATTCTTAAGTACAGGTTCACACC^ 

6 0 CTAGTCCACCCGG CAGATTTTAT CTGAGAACAAAT ACTTTT CTCC C CTT CACAC CAG CCG ACTT CATT ATTG CTTTGAAAATG CAG 

TCCTTTGGAAGAGGTTTCCITTGCCTCTCrcTC 
TTCCCATCTGTGGGGCCTGAGAGGCCAGGGAAACCCTGTTGGGGCCT 
CTTGGTAAAGGGTAGGAGCATGGATGGGCrTGGCTCGTGTGTCTTAGGTGGT^ 
CAGTGTTCTGGCTGCTGCCACCACCAAATAATCCCAGGGGA^ 

6 5 ATATTATAGATTATAATGCTACGGCCTCTTAGAATTTCTGTAGCACTTGATTGCAAAGC^ 

AAAATAATGGGTATTTGGGTCCAGGACAGACAGACACCAGGATAAAAGCATCAGAGCATACCCT 
ACCnKSTGTACTAGAGGGCTGTGTTCTCAAATTGGGGTTGGGGAGGTGAGGCCT 
CAGGACCATTTTGTATTTGTTTTCACTAAGGGTCCTGAACCCTGAGCAGTGCC 
GGGGACTCAAGGAAGGAAGGTAATATTCCAGCTCA^GGTTCTCTCT 

7 0 GCTTTCTTAGTTTTCTGACAGAGGCCTGTAGGCT^ 

AAAAGTGTCTTTAATCTATCTTAGGC^TCTTTTTCTTT^ 

GTGGGGGTG CGAAG AGATAACTGGG AGGAAGAGATGAC AAAG CAGAGATG CTGG AAG AG ACAAG TGCTG AGTGGGT G AATTGGT AAA 
CTCGGCTTCCCATGCCTTGGCTGTTTGGGCATGGAG 
TCACAGTTCTTCTACrGCTTTTTTTGTTTGTTAATTCT 
7 5 GGAAGGCTGACCCTTTCCTCAAGTGGAGACCCAAATGCAGAAGAGCT 
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TTGGGTACAGGAGAGGGCTCTCCCTTCCrGGTCAGCCATGAAGACITG 
AAGGTTGGAGGAGGCGGCCTTTCACCCACTTCCCAAATCTTTCCCTTGGGCTA^ 

AAACT(^TTAATTCTGAT AAT ACTGGCAGGGG CATG CTTTGGAACCGGTGATGCTTTGTGATTT CTGCTGGGG AAAATGAGAAGAG 
GAAGGCAGTTCAGTGCAGACTAGTG<SACAAGGGATGGGJ^TCCATTCT 
5 ACCTTCCTGGCCTCTGTCCAGGAAGAAGTAGGACACTCAGAGGGCTCAGA 

CTAACTGGTCGGGTGGGATAGAGGAAGGAGGAGCAGGGGAGACAGAGGAAGTGAGCCTGT^ 

CCAGTTCTTATTTTGCTTTGCCTAACAACAGAAGTCCT 

ATCCCTCAATCG CTGGGTTTTCAT ACTTT AT CC CAG AGAAATGGGGGATCT C 
1 0 CTGGTTTAGAGCTCGAAAGACATCCCCCTCTCCTGCAGGGTCCTCCTGCT 

TGTCTGTGGOVGGCACCCACAGGCCTGGTCTGCAGGAACTTGAGAAGTG 

CACTATTCTTTTACCTGGTTTTTTACCTO 

GGGCAAACTCCCTTTTTAAACTCCCAGGATGTTGAATC 

GTTTTTCTTGGCCAGTGGATTCCCTTTCTCCCTCAGAAGAT 
15 CAGAAGCTCGGAGATTTGTCTTCATGGCrAAGAGGTT^ 

TGGTGTGTGACITTGAGCACACTGTTTGACCTCTG 

TGGTGTCCAGTAGAACTTTCTTCAGTGATGGAAATGCTGTATGTCTGCACT 
TAAGCACTTGAAATATCGCTACTGCGACTGAGGAACTGAGTT^ 
TGGCTTCATAAATTGTACAGTTAGAGTTATAGGGAGTC 
2 0 AAGGCCGAAGCAAGGTATTCTCACAAGGGGACAGAGTCCCCAAATTGTCAAGGCCTCCGACAT^ 

CAT ATGTTGT AAG ATTT CT CAGAG CC CTTT C CCATATGTG ACTTG TTGG CTCT ACGG T CCTG CAGTG T AGG CAC CGTGGG CGCCAC 
GCATCAGATAGCAGCACrGACGGTTAAAGAGGTTGAGTAACTTGCT 

CTACCCCCAGACCACCATACTACAGGCATCAGTTCCAGTGCTGGGAAG 

2 5 GCATCAGCCACATGTGGATCAGGGATTGCCACCCTGGGCGCTGTTTACATTT^ 

CTX3TGTTTAGGATGTTTAACAGCATTTCTAATCTCTACCCACTAGTTC 

TCTAGACATTGCCAGAATCCCTTGGGGGAAACAATTACTATTGGTTGAGAGGC 

GGGAGATTTGCTTACCACCTGGTCTTCCTGTGTC 

.GTAAAATGAGAGGTCAAATGGTGTGGCTTGGGAGGAGGTCAGTTTGAGAAGGAGCAT^^ 

3 0 ATCTTGATTTTTATAGCCTAAGAAATGCCATCACTCCTGCCTTCCITA 

CCOTCATGCCACTTGGGTTTTCCAAAGAGTCCOTGCCCTTAAGAAGTC 

GGAGACCCCCAAAATTTAATGAGCTGTCTCTGCAAGGGTAATGTGACTTTTACAGCCTG 

TGTTCCAAGGGGGATTAGGTGGCTGGGGGTTGCCATCGCAACCCCCACAGGGATGA^ 

3 5 GCCAATGGAGTCTGGGAAGGTGGGG<1AGGGGGCAGGGGGTTATCTGAGGGGAGGGAGGAGGGGGAA 

GCGGTGTGCCTGGGATGAATGGGCAGTGTTGGGGGAAGGTATAATCTGAAAAAGC^ 

TTGAGAGCCATCTCCCCCCACAAGTAGGGGTGGGAGGCAAGCAGAGriTAGTC 

ATTTAAAGATTATTCGATCTTGTTAACAGGAAGCCCGGTAGAT 

GCCAGGAGGGACTTGAGCTGTCATCTGGGCCCTGTGGTGTCTGCCTCCAAGTCTCTGTTGG 

4 0 AGGGCACTCACACAGCACCTAATTCATTTTTGAGAGCTCCGATTATTA 

GGCCCAAGTGCAGGATGGATCTGGGCTAGGAATGCAGGACTCTAGACTGTATCTT 
TGCATGCTAGCTTGGCCTAACC^TCCCCAGTTCCCTTACACTGTAGACCCCTGGAAGAAAAGAGGAGGT 
AGTGTGCAGAAGGGTCGTCCCAACCCCAGCAACCCCAGAGTCCCITGCCGTTGATC^ 
CCTCCTCCCTGCCCTGGCTCCTCACTGTCATCCT 

CCTCCC^CATAAAGAATTCAGCAGGCTTCCTCGGAGGGCATAAATATTTAA^ 
TC^GTGCTAAAAGGGCCTTTAAGAAGGCTCACCTGTCATTGAGTGATCTGTGATGTGAACC 
GGGGCTACATTTCCCAAGGCTGCTGCATAGATCATTTGTTGAC 
CCATGGCGTGGATGGAGCAGGGCCTGAGAACCCCACCCCAGCTTTTGTTGGCT 

5 0 TTGATAAAAGACCCACTTGTTTTATTTGAGTGTTCT 

TCACATCATTGAGAGGCTATCCATTTAGTATTGCCTAGGAAATGCAATGGGATATTT^ 
TGAATCTGGGTGATTGGCGCCTGGCTGCTGGGAGGCTGCTGGGAAGGTTGGGGCCAGGGCCACCT 
CCAGCCTTTCCCTGCAGCTCTGGGATTTAAATCAAGATGAAGCAGACTCT 
AAACCTGATTAAAATTTAAGAACCTCAGAGCTGACTGAGATGACACCCCCAAACACACAC^ 

5 5 CAGAGGCCTCCTGGGATt3AAAATCGTCACTTCCCCATGGCTTTGCAATGTAATTCR.CCTTT 

TCATCTTTCACAC^CATCCGCAAACTGGTCTCAGACTCACAGTACCATGCT 
AATCCTCATTTCACAAATGTAACTAGAAGCCrAGTAGCATGGGATGGCCGCTTGCC^ 
GAATGCAACAACTTGGAGAACCCTGGCCTGGTTGTCTTAGCCTCACCAACC^ 
GGGGCTTGACACTTAGAAGTATGATATCATTCTTCTTGCCTAGAGGCTCTTCCACA^ 

6 0 TGAATGGCTCGTT3CTCTC AAGACTGGTCCTTGGGACCCAGCC CTGCCCAGCAAC CAGCCAGCATCCATCCACAT 

CTCAAGGTGGCTTGXjCTCTTCTCTGCTGGTCTTTTGTAAATTCATGTC 
AATTCGGGAATATTTAGGGTTCTAAAACTCTAAATTAGGGCTCCTGGATCTTATACGT^ 
CAACGAAATTAATGCCTAGAGAAGGGAAGTGATTTGCCCAAAGCCACACAACTAAATGGAGG 
CATCTCTGCGTGGCGTTGGAGCATAACACACTCITTC 

6 5 GAATTGTGCATTCAGAGTCCACAGGTACACACGACAGGGCAGTTTAAAACTAACTAGCAGACAGA 

CG ACGG TGTGTTGGACACAGTCACCT AAATT AAAAAT AAGGG C^AGG ACTGG CTGGGTG CTGCAAG C CCAGGAG CAGG AGGTTG CT 
GGGGGGTACAGATCAAGAGCAGGCTCCGTTTCCCTTAATTTAGTGTCTTGGGTCAAG 
CGTGAATACTAGACTGTt3GTTAGAGAAATAGACTTTrAGAATGCTCAA^ 
TTTAATGAGGAGATGATCATGCAAATGGGAGCTACTAAGGC^ 

7 0 ATCTTGTTXTCTGTCATTTATTTCCCTAACAATCTTC 

ATTCCGGATGTAGCAGGACCTCTGTCCCCATTTCTCTGAACTGCTTCT 
ATGTTTTTCCACCAACAAAGCGCCTGCnTTGAACC^ 
TCCAAAGTGCTTTCCAAAGGGAGCCATCTTCCCACCTTCCCA 
CTTTCTtKSGACCGAGATGGAGGGAAGGATGGAGGTCGG 
7 5 CCAAAAGGAGTTTTCCAACATACGATCGAGTTCCAGTT^ 
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TTTTTTTTTCCTTTTTTGGAACTGATGTTTAGAGCTTC 

CTGTCAATTATTAACCTTAATGGTG GTTTTT C TTr CAAAAATATTTCT 

AGGAAGTCATTTAGTGAAATCACTTAATGGCTG^ 

GGGGTTATTGTGGATGCACACATGGOGAGTTGGGTGGGTGATATCAGCGGTC^ 
5 CTTTCCATTTCCAGGAACTTCCCCTCCCCCAAGOGCCACCCTCCAAAAAATAAACATT^ 
CTGTTCATGCTTTGACTCTGCAGAGTGCACGCAGCCCATTCTCAG^ 

gcttccactttgtggggagaccccaagagggtact 
tgtccctttcccaaggggggacaggagagccaacctccaaggctccttcct 
agggtccxkztaacaacaatcttgggtacaacagagagtgaaacgaagattt^^ 
1 0 ccacctgacccccgtccctatcacagctcatcacctctgniwn 

atgccactaccccaggcttgtggaaggttctgcatctgtgtggcattgt^^ 

ggaggcaggaactactttaggagggtggaggggtgagaatgagagaggacttgccct 

ccggatggctacataaatcctgggagatccgttgtcctcataaccagagtgagctgggctccagaccagccctatg 
1 5 gtctgtgggaagcctttggccaottctttgctgaa 

gaccctgacccttggcctgtttaccagtggaaggaggccatagccagaacccttt^ 

gactgtgagagttaatagttgctgaggaggccyvgaatgactggggatgttctgac 

gtagtggco^gggacccatctaggtggcttgcgattggcctggcttggaggtgc 

ctggccaccaagtgtatgtcaacactccatccaccacggcccctcccacagatgct 
2 0 caaggggtcactgcaatatgaagtctcaataaatgcagagatgccagtgatact 

ctctgtgccagacactgctctattgttgtttaagataacaggccrtggacctgtgagcttcct 

gtttatttttgtttgctttttgttttttacatatctttct 

tgccggtttggagtgagctgcagcctatgcttgtttctagaactcctc 

cctactatcagttgatttaggcgaggaaaaaataagcagagtgaggaaggaccct 

2 5 TAACAATTTATATAATGATTATTTAGTAATCCTCC 

G ATGCTTTT AAAAGCC CTGCAGTG CACAGAACGGG CT CCACACAG CA.GTGAATTATCTGACCTCAGATGTCAATAG CATGGTGTTT 
GAGAAACCCTGATGTATGGTGTCCTGGCTAGAGTGCCCCACAC^TGTGTCAATGTGGAGTCCCAGTGGGT 
GGAAGTGGGGGCCATTGTTGACACAGTTGGCCTGGGATATGAGCGACAGTCCrriTA 
CAGTGGTGCTGGCTGGGAAACACTGACTCTGTTCCTGAGGAGCTAGAGTGTTCTAGGCA^ 

3 0 CTCTCCCTGCCAGGAACCCCCACTGTAGCGGACAGGCTCCGTAGGTTGGCGAGCCCTGCCACCGTGGCTGGCAGCC 

GGGTGGAGGGTGTTAGCCCAAGGAATTCTCCCGAGGTCCAGGGTCTATTTATGCGCCTT 
GGATTTCTCATCCAGAATCATTTGACATGCAGATGATCACAGATTTTTAAAGGCT 

ATGTG CTG ATG AAAG CTTTGAAG CCAG CTTTGGGACCTCAGT CCTGGCGGG CT CAGG ATT CAG AATG AG AT AAG AAAAG CAATTCT 

3 5 GAGAGTCGTCTGTCAGTGGTTCTAGGCCTTTCTATCTGGACTCAGTTAAC 

TACATAGGATGCTGGTGGAAACTGTTGACTTTCTAAATGCAGTCTTTTCT 
TTTCTAAGATTGACTTCTGTGAGGGATTAGGGATCTAATGAATTACATCT 

AAAAAATGAAGTGTTGGCTGGGCATGGTGGCTCCTGTCTGTAGTCCCAGCCCTTTGGGAGGCCA^ 
OVGGAGTTTGAAACCAGCCTGGACAACGTGGCGAAACCCAGTCT 

4 0 TAGTCACAGATATTTGGGAGGCTGAGGCAGGAGAATCACCTGA 

TACACCAGCCTGGGTGACAGTGAGACCCTGTCTTAATGGGAAAGAAAAAAAAAGAAAGAAGGCTTAATGGTAGGAT 
TAGCMATATGTGGGGGAGGGATTGATCTTGGTGAGGGAGAGGCTAGTCCCA 
TTAGTCCCATTTATCTGAGAGCTCATGTCCTATTGGCTVCTGGGCACCAG 
ACTCTTACAGTGGGTTTTGCTGGGTTGTTGGGGGTGATTTTTATTTCGTTT^ 

4 5 CTATATTCTGAACCAGACATGTAGACTACCTCTGATGGAAATrGGAGCTCCCTAA 

AAGAGATTTCTTCATGCTGTGGATTTAGCTAGAGCCTCCTGGTAGGAGGC 
GGGCrGTGGTCTGGGGGGAGTTTGCATTTCCACTTGCCTATGACATTT 

CCTTGCCTTTGCAGACAGAGATTGCGAAGAGACTGAACACAATTTTAGCACAGATCATGCCTTT 
TCTTTTATTATTTCTTCCTTTTCTTTCCAAATCAGCCCAGTGGCT 

5 0 GCCTCAGCCCCTGTCCATTCATACTGACTGAGCTGACAACCTGGAGATTTCCAGAG 

CATATCAGAGGGACCATTTGCT CCTTCTTCTCAT AGAATTGTTTCTCT C 
GACCTAACCATCAGTGGGCAGTGGGTGCCTCTTTCCC 

AGTCCCCTTGCTGGGAGCATGACCCAGGTGGTCATGTCACTTTCTACGCCTACCTTCCA^ 

GACTAGTGGGTGATTTGAGATAAATGCCTGCAAAGAAGCCCCCTGGTGCCGACCAGACA 
5 5 TCTGTCCTCTGATTTCACTCCAGCCAGGGCAGGCCCTGCCCTCTACTCCCACTCAG 

CrrCCTAATGGTGACCTGAGTAAATGATTGAATTAATATATAAGAATCAGATATGGCAATGCCCTCGG 

GAGAAAGCTGTGTATGTGGCTGGTACTACCTCAAGGCCAAAGGGAATTAATGCAACC^ 

GAGCAGTTTTAGGTTCACAGAGATTTCCCATATCCTCCCCACTA 

ATCTACATTGACACATCATCACCCAAAGTCCGTAGTTTACATTAGGGTTC^ 
60 GAAACATACTTTGTTTTCTCTCTATTTAGACAGGTATGGATAAGGTATATGTAGACA^ 

ATTGCAGATATCCTG ACAATTCACCACT AACTACTATAACTG CATCT CCTAAAAACGGATACCATCCTTCATGATTACAGT ATTGT 

TACCATGTTTAGAGAATTAATAATCATCCTGTAATATCATCTGA 

TGTGGTTGACTAATAGCCATCTTTTGGCATGCTCTTTGTTTTAGAACTTAAAAGGAAGTT^ 
TTGAAAAAGGTCTAAAAGAGAAAGTAAAATCGCCTGTCACCCTCCCCCCT^ 
€ 5 TCTGCTCTGGGCCTCTCCGAGAGGCCCTGCCAGTGCTGCACAGCCTGTGACCTCCT 
CACCCT(TITCCCTCCAGCACCAGCAGCAGGTGGCGCAGGCAGTGGA 

ATCGGGGTACGTGGACTCCCCAATCTGCCTCTTACCGTGTGTATAGCCCTTTTATTCCTCT 

CTGACCTTAGCTGGCCTCAAAGAATGGCGGCCCCAGGGCTCCGGCTTCTACAAGC^ 

TTCCCCCTTGGCTCCCAGTTGCCTCCTAGTCCTC^ 
7 0 TCATCITCCACTGGCTTTTGGTGTCCTTGCCATTGAGGAGGGTGGGG^ 

CCTGGCCACCTCCTCACACTTACC CCGGGGCCTGTCTGCCTCCTCCCTGG CAGAAGTCTCCCCTTCTTrGGCCACCTGGG ATCAGA 

ACAGTCTCTGCTGTACTGTTGATGTGCATCGTGCTGTGGAATTTGGGTTTACGT 

AGCATTCATCATGTACGCTTGTGAACTCCAGCTGGGGAGCTGAGACTGGTTTTCCTGAC^ 

ACAGGAAGACATACCCAAGAGTGCTCTGTGAGGCCCACTTCCGGACTGGCCCCT^ 
7 5 TGTATCTTTAATGCATTTCTTAGGTTCAAGTGGCATCTTTCTCCTG 
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CACCTGAGGAAGAAGACTCTTTTTAGTGTGACT 

TGAAGAGGGTGGACAGGATCACTGCITCCCAGTCCACGGAGGAGCTTCTGGACG^ 

TTTTCC^TAGGTCIX^GGATGGGACTGOAAATCTGTGTT 

AGTAGACTGATTCAACTCTAACATTCTG^GAACCTGGTTG^ 
5 TGGAACTGAAGGCCCCACAATTTTCCTCreCTTGGTAGGA 

TCTCTGTTTTAGTCATGGTTCTCCCC^ 

GGAGAACGTACACCGGCTCATCTCTTTGGGCA^ 

GGACACCGGCCCTTCCATCACACTGTCAGGACCCGTCATGGITGCA^ 

GGGCTGAGCCCATAGGGACACTTCTCAGTGCTGCTTGGGCAGGTGG 
1 0 TGTGGAATGGGCAGAGAGGAGTCTTGAAACTGGAACGTGAA 

GATATTTTAATGTTTAAAGTATCATCAGAGATCATCTAGCCCGGGGATCGTCAAC 

TTCTTTGCCATCCTGTGCATTGTGGGATGTTCAGCAGCATCCTTGGCCT 

TATTCATAGGAGGCAAAATGGCCCTTGGTTAAGAACCTCTGATCTAGCTCAA 

ATGTTTAGGAGGAAAGCCAGGCCAGAGGCCCAGGTGGCTGACTCGCAGCCCA 
1 5 GGTGGCCTGTGTCATCTGTCTCAGTGACCAGCTCTCAGCTTGGTGGTGGTTCT 

ATACCTTGTGAATTCTCTTTTCCGACATTTCTCCTCTTCCAGCT 

GCC^CTCCTCCTACCCCTGCGGTGGCCAC^GCAGGGTCCCCTCTCACCCTCTG 

AAATAATTTTTTTTAAGTTTTAAGAAAGGAGTAAAACTGGGGGAGAAG 

CAGGAAGCTACAAGTTAGGTCGAGGCAGCTCGAGGCAGACATTGGAATTCAAT^ 
2 0 TTCAAGGTAATTGCTTCTCCAGCTCCAAGTGAATGCAGCCATCGCCCTGTC^ 

AGAGCCTCACAAGGCAGGAATTACXX3ACAGCCGAGTCGTGCT 

GCTAATCT AGTCAGGGCGTTCTTTATCTCCTC CTCGAGGAGAGCAGAG AGCGAG CCTGCAGCTGAG CTGCCTCTGGGAAGCAAGGT 
GAAG AAGG AT AACGGGGTGCAGAGCCATGGGCTG AG CAAAAGGG CAGG CAT C CTGG CCCCAG CAGG AAAGGGCAGG AG AGAGAGGG 
TCCGAAGAGGCTCTTGGCAGCTTTCAGTGGCTACCCCCACTGG^ 

2 5 CTGGTCTTGGACTTAGAGGAGGGCTTTGGGCAAG 

GCTCTCTTGCTCATTGGATTTGAAAGATTCTGAGGTTGTGGCTCGG 

GGCAGGGAGGTAGCTGGTGGCATATTTATCATGAGGATTAGAGTATTCCTGGGGTCTTCC^ 

AGGTCAGGGTGG CATGTGTC AAAAGGGG AAGGAAAGACAGAC AG AGGG AACCAGGCTCTCAAGCCAAGCCT CCCCTGCT 
GGACCACAGGGAACTCAAGTGTCTACCGAGGGAAGAGAAGTGGAAGATTGTCTCTCTTCTA 

3 0 CAGTAGCCCTGTTTCTGTTACATGCTTCTTGGATCCAATTAACAG^ 

AAATTTGACACTTGGTCAGCAACCTGCAGAAGGCAGC^ 
CATTTCTGGAACCCTCTTGTTACATTGCrATATCATGGCCCAAGATC 

ACTG ACAATTTGGTCIT CATTCC CTTGGAAATT AACAG TAGGGG AGATGT CAC AG AG CATTGACTTGATTTTTTTTTT AAGTGGG A 
GGGAGGAGAGGAAGAGGGACGTTGTAGCTTGTCACAAAATACACAGGATATT^ 

3 5 TGGATGAGCTGCATTCAGTTGACTTGAATTTTGTGTGAGGGGAAAGGATGGC^ 

GGCAGCCATGGTGGAAGGAGGTAACGCTGAGTTTGAGTCTGGGCTCTGGTTC 
CCTGTTCACCATCTCTGAAGTGGGGATCACAAGGGTTGTTGTGAGCACTAAAAGCTAT^ 
GAGCTATGGTt^CCACTGAGTGATGTGAAAGAGAAGGAGCAGTCGGAGGCCT 
TGTTCATTTTACGCTGTCAGGACCCTGTCTGACTCCrCTGGTGGGAGCCCTTGGCGGG 

4 0 TCACTACGTTCAACCCAAATCATGAACCTGAGAAACGCAGCATTAAATCACCT AGCGAG 

ATTACACGTACTCTGCAAGCAAGATCTTCCCTTCAACCTCTCCT 

TTGGTTCCAGGCTCCTAGCGCCTCTTTGGGGTTAGGGATGCAAGAGGGAATTATTGACT 
CCCATATTGGAAGGAATGGTATAAGCTGGAGACAGGCGGTCTCTGATTGTGGAATTGATTCCATTTCTCT 

4 5 CCCAGTTGAAAACAGTATCAGCCTCCCACTCAGACATCACT 

GG AT AGGAATAC AG CCTGAGGAT AG CTGTGC CCGCCTG C CTG CAG AGG TGCTG AGCTGTG AT AGGG ACGGG GGT CAGG TGGGG C CT 
TGCCCACTGGCCCCCACACCTGTCAGGGAAAAATCTTCGTTTCACTCT 
TGACAAGCCCTGGCATCTGCTCTTGCACTGAGGAAGGAGCCTTGGAGGTTGGCT 
CGGTCCTTGCTGTGCCTTCAAACCACCCCAAGGAGACCT 

5 0 ATGAGGAACGCTTCAGTTTTGCATGGGAGATTCCTTTrCT 

CCAGCCTCTCAGCCnrGGAAGCTTAACAGCTTTGAAAGCCA 

GTCGGTTCACTGGTTCATGTTTTGGGCAAGCAGGACACCTCAT^ 

GTCTAATTGGGCCTTGGAAGGAAGGCATGGAGCTGGGAGGCGGTTGGCACCT 

TTCCAAAATTTTCACT AATTT ACC CACAAGACATGT ACTGAGCyU^CATGCCAGGCACTGTTAGAGGTG CTAGG ATGG AGAAG ACA 
AAGATTGAGCACCTGCTCCATTTTTACCTGTAGTCACACATGTGATCCT 

CTCATAGCTGATAAAGGATAGAGCTGGAAGTCTGCCTTATTCTGCCTGGCTCCATTGTCTGTGCCG 
AAATCCATTCTGACCTGAAGTGGGAGGGCATTCAGAGGATATGGCCTTTGAACTGGGCAAA^ 

GAAGAGACCTTTCCrGGTAAG ATAAG AGTCAGGGTGCGTGG AG CTATTCAACTCCATCTTGCCTTGGGTACAAATGTT CAGTGCAC 

6 0 ATTTTCTCTAGAAACAGACrrGTTCATGAAAGGAGT^ 

TCCTCTTCTTTGTCTCriTCCTCAAGGGATCTTATTGGCTCT 
TCTGGGATTTrACTAAATAGTTATCACCAGCAGAATTAACGGGCCTACCA^ 

CATTTTAAAACCAGGTATCCAGATAAAGTTGTATTGTCCGACCACACCAGGGAATACTAGGTAGCCATTAA 
TTCTTCAGAACATTCCTAAATTAAAATCAAGCTGTATAGCAGTATGGTTCAC 

6 5 TGTAGATTTTCATAAAGAAAAACCTAGGCAGATTTAACAGTAGAATCrTCCGAGGGTAGG^ 

TAATGTCTACTTATTTTTAGTATTTTCTTAAAATGGGCATGTATTAGTTTTGTAATAAGATAT 
AGAAAAAGTGGAGGAGCTCAGAGTAAGCGCCCATCCTAGAGGAGCT 
CTCTGTGATGCTGGCTCCAGGGGCCCCCATAGCTTTG 
C^GTGCCC^TGAACTGCCTAGCAACAGACGTCACAGAAGTG 

7 0 GAACTGACAAAGGGAGCCCTCTGGGXSCCAGGAGTGCCATGGCrT 

TTGCAGCTTAGCTGGGGGAGAGTAGGGCCCnTTATCCATAA 

CTGTGTTCIGTTATCTGTTCGCGGTTGCCAGAAAACCATCTCATA 

CTTCTCAGATCTATTCTCTATGGCACTTTCCA 

GGTGGTGGTTGACAGATTTACTAGTGGTGGGATTTGGCCAATTACTAGCT 
7 5 ACAACCCCTCCCTTGCCAGGTGGTTGGAAGGTGCCCAGGCATTCATTCAGG 
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GTTGAGAAGAAAGATGTGTGCATAGGTGG^^ 

GGAGAGGAAAGAGAAGCCCAAGCCCCTGAGGCTAGTGTTGAGT^ 

TGGGGAGGAGACAGGGAGAATTGCATTGTTGTTGATAATACTGACTTGCTGCA 

TTAAACTCTTTGTAAATGTTAGCTACTGCCATGAGGCTTAGTAGCTGTGTC 
5 CrcrcGGGTCTGCTGCTTTCCCCCTTTACAGATAAGGAAATTTGA 

AGGAAGTGCCAGAGCTGGGATTTCAATCCTGTTCTGTCTAGTTCCAAAGCCTCCACT iCCACCACTGA 

TX5TCCGGTGACGTTAGCTCTCATTCCACTGGCCCCCAGCTCATCTGGCTCTGGGGC 

TCrcTAAGGCAGGCTTGGAGTTCCTAAGAAATAAGGAGCCTTCCCCAG 

TGAAGGCATGTATGCATGAGCAAACACTGCCAGGACTGTGAGCTTAAAAATGGATCCCTCTCT 
1 0 TGO^TTTTTTAAATGCATGCATTATTTAT 

TGGGGGGTG CTTGAAGACAT AAATCTTT ACATT ACAGTCT ATT AACAGACTTC CT^ CGAG ACATT CATGTGAACTGAG CTGGG 

GGCAAAGGTTTTGGGGGTGCTTCCCCCTGCCACCCACCCAGTGAAGGAGC^ 

TCAGGTGGCCACGCCTCCCCTGCCCTCCCCAACTCAGCTGTACTCACCACCGACTTCA 

ACCTCTCCCATGCCAC^CACGGCCCCCCGGTCCAGTTGCCACCCCACCCGTCAGGTCTCCAGCCTCCA^ 
1 5 GGGAGCAGCTCCGGGCrTGCTGGCACTGGGCGCCCTGGGCAGCC^ 

CGATCACJlGAGGTGCXiKSCTGGGCTGGGGCAGAG 

ACGCAGCAAGTGCTGTCGTCCTAAGACCAGGAGTTACCAAGGATTTGG 
ATCCGCATCCCATGCACCTTTTCCTACTCCTTGGTGTGGACCTGGTATTCTCTG 
GCTCCCATCTTCCTACAACACCCTTGCCTGTCCTTCAX5GGGCCAGCCC 
2 0 GGGCTTTTTCAAGTTCCTGTAAGAGTTTAAGAGTTGGGATGCCAA^ 
AAAATGTCTTGTGCTCACCACGATCATTAGTTCCCT 

ATGCAGAAACCTGGGAAAGGTGCACAGTCAGTt5CTCAGGTAAAGGTTTGTTAGCT 

CAGCAAGAGATCACATAGACTTGGAGTCAGAACTTCTTCTCTGACCCTCT 

CGAGCTTGGCCTCAGGAGTGGTTCATTGGATCAGAAAAGAGAAAAGCCTC 

2 5 ATCCACTCTGCCCTTAGATACGGTATTTACTGGGGCACGTAGCCAACACCT^ 

CCTTGGCCCAGGCCACAGGGTGGTAGCCGGTCC^GGGAACTGGAGGCTACATTTTTT^ 

GAGGAAGAGGGGCCTCTCCATAGCAGACAGGAGAACTCGGGAGCTAGTGCTTA 

GGGCATCCTGACAAAGAATCACAGAATGGTGTTAATTTCAAAAACAAAATTAAACCACT 

TT AACAAT AAAGTTCTT AACTTCC CAGGT CCT ACTGGT ATTTT AT^ CTCAAG CTT GTG ATCTTAAGACCCCT AG 

3 0 CCTGCTAAAGAGTAAACTTGGTGCCCTCTGGCATGACAGGGAGAA 

TCCCTTGCCCACCCTGGGGACTAGAGGGGTGGGGGCAGCATTGGAATTTCGGTGGCAAA 

GGCCCTGCCTTGGCTGATCCACGGAGACAGCAATCCAGGCCTTCTGCT 

GCTGGACTTAGACTGACCTTTTTCCTTTCTTTTTTGTCTTTTTGTT^ 

3 5 G CTCCCAG AATCGCGGG CCTGAG ACCAGTCAGTCGGTATCC AAAGCTGG CTTCTGGGTTCCCGCTTCTGCTCTGTGGGTTGTAACT 

AGC^CTGT^ 

GGTCTGCCATTTGTCAAATGAAGAGCTTGGGCTAAATGAGCTCTAACTCCGTTTCATATGAGA 
GACCAGCCCTGGATCTTCCATTGCGAGGGGCTTTGT^ 
40 GGGCCAAGGTTTTTACCACCCCCCTGCTAGGATGCCTCCrGTATGATCT^ 

TCCCTGCTCTGGGCTTTCACTCGATTTAAGAAAGTGCTAGCAGGCGAGCAT^ 

GATTTGTTGTAAATCTTTCTGATCAGACTGAGGCTCCGAGGTTCTGTT^ 

TGGATAAAAAAAATCCTCCAGCGATTAGCAAAGCTATTTTATTCCTCATCGCTGTC 

ATGGCTTTGGTTATTACCTCX5TGTTAAGT AGGACTTGGAAGGTCAGG CAGTTAACTAAC CATCT AGACAGCAGCTTCCTTGCGCAG 

4 5 CTCTCTCCAATCC^GCCCTCCrrCCCTTC 

CGTCCGTGGCCTCTTGACCCGCCTACATGCTCAGAAATCCAGGACCACATTTGGGCT 
TCATTGAAGCTGAAAATCCATGTTCACTGCCTCGGTACTGATCCTCTAACAGTGGC^ 
GGCGGAGGCCTGGATTGGACATTTTAAGGCTCAGGTCTGGTCCCAG 
CTCAGTTTCCTTATCCATACCATGGGAATAATTCCTGCTTGACCAAC 

5 0 GAGGATCCAGATTACCTGGCTAGCAGGGACCCACTTCAGTGGAATTGGAGGGGGAGTGGGAAGCC 

AGTCTTTACCTCATCCTCATCCCTATTGGCTTCTGGCTTTGTGCCCCATCTGTG 
CCCCCAAGAGCTGGGCTGTTTACCATGCTCGCTGAGGTCCATGCATGGCACACATGGGCCT 
AAGATG CTGGG CAGAGGCAGCAGTGGGCAGGGCCGAGGCTGTGCGCATGGCATGTGGro 
CTGAAATAATCAGGCITTGGAAACCAGGTTGTCAACAGATG 
55 G ATGGGGGG ACGGGCTGG AAGCTGTCCCAGAGGGGTTAGTCTGG AGTCCAATTAGTGGTAG CTTCATTCATATTTCCTTGCCTAG C 

CGAAGCAAT AAAAAAT ACTT AGTTTTTTGGC CGC CTGCCAG CCAGAGGCGGG CTGGAT CGATGC CG CT AAATG C CTG AC ATG ACT C 

A^CACGAGC^CO; 

CACAGGGTGGCCACCACCGTCAGAGG AAATG AAATGAAATGACGGTGATTTTTCreCCGGCGATTGCAGCAGCrAATTGAATTGTG 

6 0 GCTGTGCG CGCATCG CCACCGCTCCTX3CACX3GCTGTGGGGAAGGTTGAGCTGCAGTTCC^ 

CTTGATAACTCTGCCATTTTW^TTTGCTTCGGGCAGA 

AGTTCTGAGGCTGGTTGCCGCACTATTCCCACCGAATACTGCCCAGCCCCGTTCCCT^ 

TGGCCTCTGGGAAAGTGGAGAAGAGAAGGGAAGCCTGATTCCACT^ 

GGTCTCGATATGACCTCCTGTGTTGCATGGGGACTGGGACCTGGGATGGCCT 

6 5 TGAGCTCCTGGAGGGCAGGGATTGTATCTCATTCAT^ 

GAAATTGGTCGGTGGGTGAAGGTAAAGrTTGGACTCTTAGAAAG 
ATTAGTCTCTGGGCACAGAGGTTTGCTC4GCAGGTGCCCTT 
ACATGGTGATTTCTGACCACTATCAACCCCTAGGACTACTCC^ 
TCCTCATTTGCTTTATAGTTGAGGGAGAACTCACGAAATTTTGGGGT^ 

7 0 TTCACCAACCTCTTGGGGTCTCCACrTGGTGATGGCTACAGAGGCAGTGAC^ 

GGGAAAGGCTTCATGACGTGGC^TTTTTGTTTTTTTTACAACCCTGG 
GGGGTGTTCACJVCAGAACX3CITGGCTGCTCATCT 
ATTCCTGTCGC4GCTCAGCAAATGAAGTATTGCTICATGGCCCCrGCCT 
CACTTATCCCTCCTTTGAGCtAAATGAGTGTGGCAAGCGCGAGATGGGTG^ 
7 5 GTAGTCCCACAGGCCAGAAGGCAGACGGCTTACTCCAGGGTTAATTATTTAAT 
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GCGCCCAGGGCATCTCCCCGGGGACTCAGTCTCAGGGGAG 
GGGCAGAGAGGAGTTCZAGCTTLTC'ITCCCTCCCCTGACCT 

CGCAGCAGAAAGTTTT AAC CCCGAACC CT CTTTTTCCITCC^ AAT AACTCTGTGTCACCCT CGG AAAG CCT 
AGCACCGGGGCTCTGCGGACTACAGCATGGAAGCCZAAGAAGCXK5AAGGCGGAG 
5 AGGCGCCATGGCACCACTCGGGTTTCTGTTGGGTAGGGCAGGGG^ 
TGGCAGTGAGTCAGAGAAGCACAGGTTGGCGATAGGGGA 

ACACGCTATGAGCATGCGAGCCACTGTGGGGCGGAGGTGGATGGGGAGGTCXrrCCT 

G CCAACAACCCCATGCTG CTGTCATGGTG CTG CCGGAATCCCACTTGG CTCT AAGGGGCTGCTC CCAGTG ACCCCCAAAATG TGG A 
CCCTATAGTCCTCTCAGTTGGCCTGAGGCATGCACTCACTTGGCCACCCA 

GGGAGGATATGGTGCAGGGCCTCAGCGGTCTTCCACCCAGCCCTCI^ 

AC C CAAAGGACACT CAGCTGTGTTTCTGTGAGAGG AGGCAAGTT ACCATC C CACACAG ACGGGTGGG AATGACCAGGTGAGAG CAG 
CACACTGACTCTGAAGG ATGGTGG ATGTAAAACCAG CTGGGAAGAAGG 1" 1 LTiGG CTT CAGAG AG AAG CAGAAGGG CAGG CTGTGGG 
GGCAGGGGCTTAGAACTGCTATGATGGAGCGGGTGTCAGTGCGGCTGCCCAGTGGGTACAGGAGTG^ 
15 TAGGGGACAGCCTCACGGGGAGACCtSTGTTGAGCTGGAGTGGAGCCTGTTGAAG 
GGTTTGGCCTGGCrrCAGGTCCTGGCCAGTAGAAGGACAGGGTGGTTTGGGC^ 
GACCAGTTGGCTTCTCTCTTGGTTGCAGGACAGTGATGGGGA 
CAGGCCCGGTCGCAAGGGGGCTCTTGGTTTGGCATGGTTTGGAATTATT 
TTACTGGCAGCCTGAGCAGGGAGACAAGGCCTCCAGTCTTGGGGTCTC^ 

2 0 GTGTCTGGGCATTTATGGGCCTGGAGGGCCTCCTGCAAT^ 

CTATTCCCCGTGATCCTCCGTCCCCCCAGGGTCTCAGCCTCAGGGGCCTGAGGAGGAGCT 

GGTTAAGAGCTAGTTTTTCCTCACTGTGGGGCCCCCAGTCrTTCAG 
ACTCCCAGAGCAGGGAAGCCTCGAGGCTGGCTGTTGCTTTCAGTGCA^ 

GGTGGCCTCTTCCAGfrAGCACACCTTCCTCCAAGACCAAAGACCTTGGTCAT 
GAGCGGCAGGGACTGGAGCTGGGGTGGTGCCGATTGGCAGG 

TTACCTTCTGTGCCTGCTGGG CCTGTGCTAATGAGTTAACGTTT CTGAGCCTCGGTTGCT ACGTCTGTAAAATGGGAGTGATCACC 
CCAGTCCCTGCCAAACAAGGTTGGGCAAGGATTGAGAGGCTGTTCGGCAGAAGCTrC^ 

3 0 GGACCGGCACCAGTTGTGCCCGGCrCCTAGGAGAATGGAAGCCTCATG^ 

AATCCTCCACCCCTGGGCTCAAGTCCAACACACCAACCCCAAGGAACGACGCCCCAACT 

AGGTCGATGCCGGGTAAACCTCCGGGCATGGACCCGATAGGTATAATGGGTAGGCACCATGGGCTGGGTTAA 

GGGGGTCTX3CTCTTGGCCAGGGCAGGGGTGACACACACGCA 

3 5 GCTCCCAAAATCTGTTCCCACCCCTGTCATTAGTCCAGTG AGGCTCT AAG C AAG CCCATTTGGGCCTCCATATCCTGCTCTGTAAA 

AAGAGGTGGGTGCATCTGCCAAGCTCTAAGCTTCTTTGGCTCTIXjAAGGCA 
GCCTGGAG CAGG CAG CTT AAGAATGCAGGGAGGGTTATTGCTGGGTGAGGCGGCCAT 
GrrCCCTGAOSCCCACCCCTCGCAGCCTCXSGCTCTGCGCACGCCCATCTCCATCACCAGCrCCTATGC^ 
AGCCACCATGAGATGAACGGCTCCCTCACCAGTCCTGGCGCCT^ 

4 0 CGCTGCTGCAGCCGCTGCCTATGGCCGATCGCCAATGGTGAGCTTTGGAGCTGTA 

GGGCGTGGTGTGGGGAGCCAGTGGTTGGACGGGTGCTCCCCACATGCTAA^ 
CCrTCTGGGAAGGTGTATCGGAAACAATTGAGACTTGGCTG^ 
CTC^CCTGTGAGCCGCCCTGTGACCTC^TGCAAGGCAGCCTCCCTTT^ 
CTGGGGGGTAG<*AATCCCAGGGCGAGTCAGGTGATGTCCATGGAGCAGGG^ 

4 5 TTGTGGTGAATAAATAATAAGAGTTAGGGGTCGAGCGCAGTGGCTCACGCCTGTAATCCCAGCACT 

GATCACCTGGGG T CAGG AAATCG AG ACCAG CCTGG CCAG C CAACG TGGTG AAAC CTC AT CT CT ACT AAAAAT ACAAAAATT AG CCG 

GGCGTC^TGGCAC^CGCTATTCrGGGAGGTGAAGGCAGGAGACTCACTTGAACCT 

ACCACCGCGCTCCAGCTTGGGCXjGCAGAGCGAGACTCCATCTC^AAAAA^ 

GGGGCCCTTTTTC AGGAGACAGTTTTC CTTTTTACCTCCTGG ACCTGGGGTGGC CTCCTCT CCACTGGCAGGCCCACTAGGACCAC 

5 0 AGGAGAGGAAAGAAGTAGGGCAGGG CCAGCTCTGCCTTGCAG AACAGGGAGG ATGCTGGTT CACAGTGGGGTCTGCACTGTCACCC 

CTTCCCTGCCACGGGCTGTCCTGGGCCrGTGTCTCCTCTGAATCAATGGGCTCT 

TGGCAGAGCCGTGCAGGTAGGACCCCGGGTGGGTGGTGGAGGAGGCAGGGGTCGGGGTGTGTGGCA 

TCTTGCCTTGCTTTACAGGTTGGTTTTGACCCTCACCCCCCGAT^ 

AG GAAAACCGT AAG CTTTGGCTATACTTACTG CATGCTTTGTAGGGTGGGCAGG AGAGAGCACTCTACCCCCAATTCCAACAACGC 

5 5 CCCTCATCCCCTGACCAAGGGTTTGCCCTGGTGCACCTCCCnTTG^AAAGGTATCT 

AACCCCAGGAAACTGTAGCCAAGCCACACTTCTGAGCACCCACAGAAGCTCCCTCCGACAGAGG 
ATGGCCACTGGGCTGGGAGAGAAGGAACACTCCTCACCGAGTCACCTGGAGCTGTGGTGGGCCTTGCCGACTC 
CTGCTGGGAAGGCCCCAGTGCCGCTGGTCX3CAGGCCCTTCCCT 
CCTTCACTTCCTTGGGTTGCTAGGAAACTAAAAGATTGAGCCTACAGATAATC 

6 0 CCACGAGAATCTAGATGCTGATTGAGAGGGCACCCACCAGAGCCTGAAACAGCAGGC^ 

CATGGACTTTCCCATrTGCTATTGCAGGGGCAAGCAGATCCCTTCCCT 

CCAGGCCCTTTCCAGGGTTGGCATGTTGGGGGAATGTATGGATCGTGGG AAAGGG CATTCCAGG ACACCTGTGGGTTT CTGGTCTG 
ATTTGAAGCAAGGATATATAACTGGGTGTAAAAATCAAAGCTGGTCTC 

TGGGGTTACTGGGTTCGAATGGTTTCAGAG CAG AGGGGGCT AG AGG CAGGGCAGC CAGTCGGGGG CAGTG CAGATGAC CTGGGCCA 

6 5 GGCCTATGGGAACTCTGAGGGGCCGGGGTGGATGGCAGCAGTGACC^TTTCTCACCTTTTCAG^ 

CTGATGGGCAGATGCAGCCCGTGCCCTTCCCCC^CGACGCCCTGGCAGGCCCCGGCATCCCGAGGC^ 
CTCAGCCACGGGGAGGTGGTGTGTGCCGTGACCATCAGCAACCCCAOSAG^ 
CTGGGACAT CAG CCAG C CAGG CAG CAAG AG CCCCATCTCC CAG CTGGACTGCCTGGTC 
AGCTGGTGTAGGAGGATGGGGCAGCCTC^GCCACAGGGGAGAGCCCCAGGGCCCGGCT 

7 0 GCAAGCTGACTAACCTCTCTGAGCCTCAGTTGTGCCTTCTGTC 

GGGTTGTGAATTTGAGGAATGGGGACAGCCTGGCAGCTGCTGGGTATAGGGGGAGGCTGTAAGCA 
GTTGCTGGACAGGACTCTCCTGGTGCCCAGGAGGGGAAGTAGGCAGCACTCAGCCCTGCCCCTTCTCCCT 
TACATCCGCTCCTGCAAGCTGCTCCCTGATGGGCGCACGCTCATCGTGGGCC^ 
CTCGCCCACGCCCCGCATCAAGGCCGAGCTGACX3TCCrCGGCTCCCGCCTGTTATG 
7 5 GCTTCTCCTGCTGCAGCGrATGGGAACATTGCTGTCTGGGAC 
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TAGTGCCTCGTCCTGGGGAAG TT1 ' 1 1 ACCCATCCTGCCTGGTCTTGTTTGCTGTGA 

GCCAG AACCGTTGGGATGAGGGATGCACAGGTGAAAAGACAGCTCCGCCTGCCTTATG CG AGGGACAG AGGAGGG AGGG CCATTGG 
GCCCAGGAGTTGGAGAGGGCTCTGGGAAGAAAGTGGAGCTGGAGCATCT 
GAGTCTGGGAGCAGAGCCCCAGAGACCAGATTAGGGAGACCCGACTGGGA^ 
5 AAAACrcCTGGCTTTGAATCCTGCCTCCACTATGACCTAGTGCA 
AGTGACAGTGGAGCATCTTCCTCTGACTGTGCCX5CAGCGCAA 

TGTGTGTGTT ATTCAGCGTGTT CATGGATGGC CT CAG CC CCAT AGTTCATG CAGGATGAAGAGATAGATT AAGAGGGCAAATTGAG 

GGCGTGTGGAGTGCGAGTTAAGGAGCTGTGTCCTAAAGGCAGTGGGGCCCT 

CCCAGGGCCTTCCTGCCTTCCTCACCTTCCTGAGCCTCCAGC^ 
1 0 AGGCTAGTGCAAGGTCCCTTTTTCACCTCCCCCCCC 

GAGGCCTGGAACTOGAAAGAGGCCGGGGCTAGCTTTGCTGCAA 

AGTTTTCTCTTCTGCTCAGACTGCCCCTGGGATGGCCGCC^ 

CTTTCGTAG*5CAGTTCCAGGGCCACACAGATGGGGCC^ 

TGGACAACACGGTGCGCTCCTGGGACCTGCGGGAGGGCCGACAGCTACA^ 
1 5 CTCCTCCCTTCCCAAAGAGCAGTGTGTTCCCTCCCTCOK3GGCCCCACT 

CACGGCCAAGAGACTTGGGACCAACCTGAGCCATCTGGGAAGGCTTCCTGGAGGAAGCAGC^ 

GGGTTGGACACATTTGGCCOVACTGTGGAAATTCGATTCATTCrCTGCT 

AGATGGAAATAATAGTAG CACCCTCCT AGGTCTGATGTGGGCACCTGAAATGATCCCTCT AAAAGGCGT AG CCATAAACCCGACTC 
ACAG AAAG CATTCAGT AAATGC CAGGGATT ATT ATTT CTT ATTTGCATT CCT CACTTT AAT ATTTT AAT CTT AAATTTTT AAT ATT 
2 0 TT AGlTlTrT TT T TT T AACTCZAClTlTlTCTr AAGGAAAGATGAAAGTCT CAG AG ACAGAGT AACTGAATG CAAAATGT CTGTGCT 
TTCTCACCTATAAAATGGGGATGGAAAATGACCTCCCATACTCCTGGGTTAT^ 
ACCTGAGCAGCACCTTTGGGAGCCGCCCTCCTOT 
TCTTCTCGCTGGGCTACTGCCCCACTGGGGAGTGGCTGGCTGTGGGCA 
CCTGACAAGTACCAGCTGCACCTGCACGAGAGCTGCGTGCTCTCCCTCAAGT^ 

2 5 CCTGATGGAGTGGGGTGGGGAACCCAGGGCAGCCTGGCCCAACAGGACAGAAGAGGGGCCCA 

TGGAACCAGGCCTCCTGACCCCTGAGCTGGTGCCCCTrCAACACCCTTAAGTC 
ATGTCACCATGAGrTATTGAACTCTCAGGACACTAGGCCAGAGCCAGTTCTT 
TGCAGGCAGTTCCTCTCCATGGTTACAACCACTTGCTTTGGC^ 
CTGTGCAGTCTTGAACATCTCACCCTCTCTGAACTCTAGTGGATTT 

3 0 GGGAGCATAAGGGCTTTAACACTTGTAGAGCCCTGGCCGAGTTCCTGCTTGG 

GGTCTCCAGACTCCCAGGGCTTGGATATTTTCAGTATTTC 
TGGCTTTAACAAGAATCTTTTGAGGCAGCGTTATCACTGCAATGAACT^ 

CTCCCCCTGGATTTCAGAGAAAGGCCTTCTCCAGGTGTCCGCATGTCCCATTTGCAAAATGCCTCT 
AGTTAGCAGTTCTCACTGCTTTTCTATGGCAGCTGGCCATGTCTAGGGC^ 

3 5 CATGCCTTTGCCTCCCACCTGCAGGCAAGTGGTTCGTGAGCACTGG^ 

AGG ATAGCAGCAGGAAGAACAGGAGGAGCCTGGAGCCAGGATCAGAC CTCCCCTGGTCCCCTCC CCACCCCTACCTGATAGGGG AG 
CAAGATTGGGTTCTCCCAAGGACCTGGGGACrrCAGCAGGACACACAAT 

TTCTAAGCCGGGGCGCTTGTCGGATTrGG AGAGTGAGGAAT AGCATAAGTGTTTTTC CTGGATGCTG AGCACCTG CGGG 

4 0 AGGTGCGCTGGGCACCTGGGCACCCGGCAGCCTrGCCCAGAGCAGTGTTGCCT 

GCACCAGCAGGGACATGGACAGGAAAGAGAGGAGGAAAGGAAGAGGCTTTITATAA 

ACTGATTGG CACACACACGTG TG CACACACACACGTG CACTCTTTCCCAT ACACG AT CAAACTCACACTCATTT ACTCACG CAC AC 
TTGGTCTCCCCCACACACTrTCTTCCTCrCTCTCT 

TGCCGCTTTCTGGACCCAGGCTGAAATGTCTGGTGTGCAGCGTGTCAAACCTCATTGTTAGG^ 
4 5 AAACATCAGGTGGCAATCCTTTTCCTAACrCATCAGGCTGTGGGCCTGA^ 



GTTACTGAGCTTTTCCTTATGGAAAAGCAGGTGGGAGGGGAGCTCT 



TGAG CACTGGAAAATGAATTCCTGTC 



CCCTGCACCCCACGCCTTCCTGCTGCTTCTCCTCGTCTTCCTCCTCCAGTCT 
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TTATGAGGTCATCTACTAAACAAGAACTCCAGCAGGGCTGTCAAACT 

GGCGAGGGGCCCCGAGG ATGG CGGAGG ATGGGCCG CAGGCAGCCG AGCGTTCAGGGCTGCGCTCCGGCCGG CTG AGAGGGCACGTG 

CCCCGTCACAGTCTGGACTCCTGGGCCTGGATTGATGTGTCT 

TGCTGGCAGTTGCTACAAATAGATTTATTGGAGGCTTATGGCTCCGGTTCCC 

CCCTTITCTTTTGCCCTGTCCCTCACCTrGGGTCGGGGGTC 

TTTGCCGATTGTGCAGTCCACAAGAT1TGTGAAAAATGTAAATAACAGACTCCT 

TTCCCACCGGAAACTCKlAGTGTCTATrTCGCCTGCTGTATTTGTAAT 

ATAAACAGATGCTCTCACCTGGGAAGAGGAGACAGGGAGGGGAACCAATTGAAGAAAGA 

GCAACCAGGTTGGCCGTAAGGTGCCTGCTGGAATGCGTGTGCCTCCACACGGGTCTGGGCAT 



CTGCCTCTTGCCCTCTCTGTGTCTCTCTTTCTTGGTCTCTCCCTCTCTCCT 
TTATTGTTGTGAGCAATGGAAGTTCAAAGGAACTCCCTCTCCAGCTCTTCT 

TAGAAG ACAGCAT AG CAACTCAGCTCAGGGAGCTACCAGAGAAAAAT AG CAACTG ATGTGGGTGCTTTTTTTTTTTTrTTAATTTG 

AATAAAAAGAATTAGAAGTGATGTCCTTTTATAAAATGCCTTCTCCCC 

GGAAAGTGTATAAACCTACAGGGTTGTGAGTCTTGAAAAGAGGATCCCCCTCACCCCCACCCT 

G TGGG AGAGGGGGACACAG AT CCTG G CACACTGTGGAT ATTT CTTGCAGATTG CAGTCT CTTGTGG CCCAAACAGGTT AGGTAG AC 
TATCGCCTCTGGCAGGTGCCACCTTTTGGTACCAACATGT^ 

GCATATCGAAGATAATTTTTATACTGCATTTTTATC^^ 

CAGGGAAAGCCACCCACCTTCAGTGAGGTTGCTCCCCAGCT^ 

CCTCTCCAAGGCTGGGCGCX5TGGGGCGTCCAGAGTCTCTCTGGGTCT 

GAAGGG AGGG TGAGGG TAG AAG AAAG TT ATT C C CG AAG AAAAAAAGAATG AAAAGTCATTGT ACTGAACTG iiITT AT ATTTTT A^ 

AAGTTACTATTTAAAGGTTGGTTTGATTGTTGCGTCTTTAATTGCCT 

AAACAGCGAGGCCACTGTTGCTGTCACATTCTGGAGCCT^ 
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CCCATCCAAGCGCCAGGGAGGGTGGGGGGTAGGAGCTAAGACrTCCT 

AAGACCACAGGTTAACACCAAAGTATTGTACAGCTGTCAATGCAAAACri"!"! 1 1 GGAAAAAAATT AAACG T AATG AACTTGAAGCG 

AAACTGCAAAGTGTCTGTGAACATCTGXSACCTTGGAAACCTGCACAATTTTAT^ 

ATCTTCAATCTCTAGGTCACAAAGGTCTGTTTTTGGATGAT^ 

5 cattgtccccagggagggagtcgggggctgggcgggaagaggagaatgctggtgcc^ 

TTTCATGCACCCATTAGCAAAAGCTCTCJlGGGCACTGGAGATTAACA 

ACAAACAATTGCCTGCTGTCATTTCATACAAGAAAGGGAATCTCACATTTAAA 

CACAGTCTTTATGACTAGAAGGGACAACTGGC 

TGGAGGAGACAATGCCAGCAGGTAGAAGGGACCAGACAGGCTGCAGTGG^ 

TGGCCAGAATAAGATTGTGTGTTTGTCCAGJU3CATGAGGTTACCCA 
CAGTTTGGCAGAAGAGGAAAATGTCCATCAGGGAATACT 
TTGAGGAGAGACTGAAACTCTGCAAGTCTGTTC^CTGAGATTTCCT 
CTGCCACAGAAGCTCACAGTTCAAGGTCGGGGAGATGATACAGCCCCTTAC^ 
15 • AATGCCGGCACACTGGGCCrCCCCTGCACTGAGCCTCTCTGCACGTGCTTC 
TTTCCACCCAACCCTGGCTGGCGGCACGAACCA^ 
GAATCTGCATTTTAACATGCTGCCGCGATTCCTG 

TGGTAAACCTATTCCCATCCTTAATCCCCCCATGTTGCCCCCTTTTTACTTAAC^ 
GGCCTTTATCACCCrcCAAGTGTTCCTTTCCTCTTATC 
2 0 ACCTATTTG CTTGTCTCTTCCCACCTAGACTAGTTTCCCTGG AGGG CAGTGCGC AGGACCTCATCAGCCCAGGGCAGTGCTAGTG A 

ATGATGAGCTTTCACCAGTTTGAGATTATCCTAATTCACAGTGTCTGGCGT 
CCAGAGAAGTGAAATGATGTCTGGAGTGTTTTC^TTCTATTGTGCTGTTT^ 
GGGCCACAGTTGTCAGAGGCCAGGTTTAATGTTTTAGTTG 
TAATCCCAGCACTTTGGGAGGCCAAGGTGGGCAGATCACCTGA 

2 5 ATCTCTTCTAAAAATAAGAGAATTAGCTGGGCTTGGTGGTGGGCGCCTC 

GCTTGAACC CGGGAGATGG ATGTTGCAGTG AGCCCAG ATCATGCCATTGT ACTCCAGTCT 

AAfiAAAAAAACAAAAAAAAAAGAACAAATATTTGGGTCATCnTTGGCAATAGCT 

TCGCAGCCCCGAGGTGGCCCCAACCCQIGCGTGGGCAGTAATTGT^ 

TGGACX5CTGGAGCCAGAAGGATCCTTAGCAAGACTGCTAAATGTTAGGTACTGGCCATCAA 

3 0 TTTTTTTTCCCJVATAACTTATTCAGTGTTT 

GTATGATTAATGGAGCATTTTTAAACATTATACCAAAGTTGGACCTTTTATATTTGA^ 
AGATGGGGCAATTAAGGCCCAGGGAGGGCAAGTCACTTAACATGA 

CCCCTAAAC CC1TAGCACACTCAGGATGGAAAGCT AGGAGGG AACTGCCCAGCCTACA CTTGGAGGATTAAT A 

TCCAGGGAATGAGTCTTCCTGCTCCTCTACCAACAGTACTGGTAAGGAAATTACTC^ 

3 5 GAAACCAG CCTCCTGCCTCAAG AACTTCCTAGAACTCACCCACCCAC CCATCTTGCCCATCCTTCT ATCCATCT ATCCACCCATCC 

ATTACCCACCCACACATTTATCCGTATACCCCTTCATTCACTCTTCCATCCATCTATCTAGCCATCCT 

CCAACATTTATATAGTAAGAATCTAATATGTGCCTGGCATGGTTCT 

CTTAGAGCTCTCAGCCAACTGGGAGAAAACAGAAACTGACAATTCCCACT 

ACACATAGCTGAGGCATTGACTTGAGTATGGAAGGGCAGAGCAAGAGGGTCCTGGAAGGACCAAGAGCT 

4 0 ACCATGGGCATGCCAATGAAAGGTATGAAGAAAAGTGTTCCAGGTGGAAGGAA 

ATGGTCCCAACAATGCATCCCTACCTAACCCATCTAGTTCTTGCTCCTCCCTCCAGCTCCATCCA 

TG CTGCAACTTCCCGACCCCTCACCCCCACC AGTAGCCTCCAGG CCAGAAG CTGGGCTAGG ATTCTGGCTG AGC AGGCTGGTG AAA 
CCGGAGGAGGGAG CTGGACAGAG ACCACAATTCCAGCCTGAACCAGGAGAGGCAG AGG AATGGGGAGAAGATGG AAGGA^ CA 
ATTCAGTGTCTGAGAATGCCCATGATAATGCCCCAGGGCTCTGAGTC^ 

4 5 TTGGATCATGGTGCATGATCAGTTCCTTACCTGGGTAATATACCTATTCTAAGCT 

CrGCACATCTGGCCCCATCATTGCCCATCTTGGAGCCCCTCACTGGCCCTCTTCACA^ 
TGGGCCCTGTATCTCTAAAGTCATCCCTCCCCACTTATCTTGCTTCCCrTCAC^ 

AAGCAATGCTCTCC CATTTTAGGTTTTG CATGTGCTTCrCTTTCAGCTTGAATGTGGTG CZATCCTTCTTTG AATTTCTGCTCATTC 
TTCAGGAGTAGGCCTCAATGTCACTTCTTCCAGGAAGCCTTCCCTGACTGCCCACTTCCCTACTCC 

5 0 TCTCCCAACTTTAGGTAGCACATACCAAGCACAATGTTAGTATGATTACAGATGGTTCT 

TTTATTGGGTATTAAATGCATTTTTTTTTTTGAGACGGAGTCTTGCT 

CACrGCAAACTCCGCCTCCCGGGTTCAAGAGACTCTCCTGCCTCAGCCTCCGGAGTAGCT 

CGGGCTAATTTTTGTGTTTTTTAGTAGAGATGGGGTTAC^CCATCTT 

TGCCTCAGCCTCCCAAAGTCCTGGGATTGCAGGCATTAAATGCATTrTTGACTTAGAA 
55 ACATAACCCCATTGTAAGTCAAGGACCATCTGTATGGTTTAATGATATCCTGTCTCCTGATGCCCACCTC 

GTGKX5CT CTC AG AG AACAGCG ACTTTGT CTT AT ATCTC C AG T G CTT AGTGCATTGT AG GT ACTGAGG AAAATGTT ACTTG AATG AA 

TCAGTGAGTGTGTGATTAAGATAATACACTAATATTTCTAGAGTGCTAGTCCATGTAAAT 

TGTTTAATACTCATAACTGACCTATGAGGTCATTGCTATTATTATACTCATT^ 

AGGTCACACAGCTTGTAAGGTTT^TATGACACTGCTGCTGAATGGTTGGATGGATGG 
60 TGAGTAGATGGATGCATGAAAGGATGGATGGATGGATGGATGGTTGGTGGATGGATGGATGGATGGATGGATGGATGGATGGATGA 

ATGGATGGATGGATAAATGGGTGGATGGATGGATGGATGGATGGGTAGATGGA 

GAATGGATGGTTGAATGGATCGATGAATGGATAGGTGGATGGATGAGTGGATGAGAGGATTGGCGGGTGAGTGGAGGATAGGTGGT 

GGACTGCCrGCCTCATGAGGACTGGAGTAGAAGCTTGGAAAGAACTGA 

AGTATGGGAAGAAGAGCCAAGAGACAGAAGCTCCCCTGGCTAAATCAGACCTTT 
65 CAGAAAACCCCACAGTGTGCTCAACCTGACCCGGACCCMTTCCCCTCTCAGACAGGTCAGAA 

TGCTGAGGCAAGAGAGGGAGTTAGGATGGGGCTATGGCCrTGTCCCTACAGGGTATCA 

TGGTGGCAACAGGTTAGAAGATGAAAAGATCGGGCCTAGAATTTCCAATTCAGG 

TTGACCATGTGAACACAGACATATTGTATAAAATTAAAATAAAACAATT 

GTGGTAAATAAAAGACTTTAAAGGTAGAGTGGAGATCCCTAGCT 
7 0 ATTACAAGGAGGAATGGGAAGTTTTATC^TACCACTAAGAGGAACTAGTAGATCAAG 

T ATG AACAACATAATTAACAAGCTTGAGACATAGAACCCT ACAAG CAACT ATTGGAG CCCG C AAATT ATTTTCTAGAACAC CTGG A 

AGTCTTAG AAAGTTTTACCAGTAAAAAGAAG CAGATAGAG CATTTTACAAAATTCAACACCCATTCATGACTCCCTTAAAGTTACA 

AACCAGG AAT CAAAGAAAACTTT AAT CTG ACAATGGAT AG CT ATATCAGACCTG CAG CCAAAT AAT AG AT AACAGT AAAGTTTCAG 

AAGCATCATATTAAAGTCATAAAAAACAATCAATGAGCCAGGACAGCAGTGTGTACCTGTTGT 
7 5 CAGGAGG AT CAC CTGAG CC CAGGAGTTGG AGACT AG CCTGGG C AACAT AG CAAG AT CCCATCTCT AAAAAACTTTTTTT AAACACA 
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AGAATATTCATTATTACTGTCTCTGTTCAACGTTGTGCTG 
AATAAAAGTATCATGATTGAAGTCAAAGAAACAAAGTTGTA^ 

GTTT AAT AAGGG AGTTT AG TCAAGTTGCTGGATC CAAGATCAA^ CAAT AGCATTCCT ATTCACAAG CAGCAAC CAA 
TTAGAAAATGTAAAGAAAAATGCATTTATAATAGTAAAAGCTACCTATGAGTCAATCCA 
5 TAAAAATACAAAATTTTATTGAAGATTTAAATGTAAAAAGACCTCTGTATATCACAAA 
CATTTAAGAAGCTGTATTTTACAACTATTTATAAATACAATGTTAAACAAA 

TGAT ACTGAT ATTT AT ATGAAGAGT AAGGGACCAAG AAT AG CAAAT ACAATTTTG AAGAG AAATG AGGGAGGAGAACTTCG CCTGC 
CAGATATCAAGACTTACTGT AAAGCTACAGGAAGCAGGAAAGTG TGCT^^ CATGTG CAG ACAAAT AGACCAATGGAGCTGA 
ATAG AGAGTCCATAAGTAGACTACTATATAGGTGAGAATGTGACTACCACAT AGATGG CATTATGCATCAGTGGAGAAAAG CACTA 
1 0 TTCAGCAAT AAGTT CTGGAG CAACAGGCCAT CTTCATGGTGGGAAAATGG AT CCCT AC CTCACACAGTACACAAAAAT AAATT CTC 
AAAGGATTAAGGTTTTTAAGCTTATAGTACAGAATATTGGAG 
CAGTAAGCAAAAGCTATAATGAAATAAAGATCAATTTGATGATATT 

AACAAAACATGAAAAAACAT AAACACAATG CCTACAACTG^ TGGTGTCTGGAAT AG AT AATGAATT ATTCCAAATC 
AGTACATTCAAAGCCAACAACTCGAGAGGAAAATGGGTAGAGGACATAAATA 
1 5 CAAACACGTGAAAATACCTCATCTCACCAGCTATCAGGGAAATG 

TAGACTGGCAAG rrrr CGAGTTTGATCATATGAAGCrrCCATGAGGGTGGT 

AATTGGTTCAACCACTTTGTGGTTCCCAAACAACTATT^ CAG TTTGG CAATCATGAAG 

GAGGAAAAGCACAACCCTCTGACCCACTAAATCTACTTCC^^ 

TGT ACAGAAT CTTCATTG CCTCCTTGTTTGAGCAAAAAT AATT AG AAACAAC (H^ CATT AATGG ATGAATGG ATTTT AAAAG AT 
2 0 CTATGCATTTTCCTATAATAAAATTGTACATAGGCATTAAAATGC^TAAGCT 

AACAATGTTGTGTGAAAAGAGCAAGTTGTGGAAGGAAATACATGATATGATACTATTTATATA 
CTATATTTGAGCATATATATACATATATATATGAGTAGAAAGAACACCCTACACCACCAG 
GAAACGGAAAGGTCAGCGGGAGTTCAGCTATACCTATCACATTTTAT^ 
CATTTTAAAAATCTGCGTGGTGGCTrTAGAAGGGTra 

2 5 ATTTTTTTTTAAAAAGGCAGAAAAGTAAAGACTTAAGAACrrCGCCCCCGTTA 

GGGTGCCCZATGCCCAGTGGCAGATAGCIAAAGCCCCGGGCrCACCACATCACCT 

CTTTTCATCCTTTATGTGATG CTTGAAGGCAGGTGAATAATCAATCCCTGGGTCCTCTCCAG CTTGGATGTTTTGTGGTTCTAATG 
ATTTTGTGAGTGTGCAGGGGCAGTGCTCTCATGC^GAGACAG CATGG ATTAGATTTGTTTAG CTTTTCATTATGAAAAATGCCAAA 
CAGAGAAGTAGAGAG AACAGAAT AATG AATCTC CATG TG CCCAT C AAC CAG CTTCAACAGTT AT CAACTCAGGG CT AATCTT CTTT 

3 0 TATCTATACCCCACCCATCTCTACCAGTCACCACCACTGGAGTATTATGTAGGAAA 

TATTTTACTATTTCCAAAACX3TAAGACCTGTTTTGAAAATGTAA 

AATATCCAATCTCTATTCAAAGATCCCCAATTGTCTCATAAGTCTTTTTACAAAGTGTATT 

TCGACACAGAGCAAGCAAAATTGATATGTCTTTTAGATTCCTGC^ 

ATATTTATTTACTGGAAAAAAAACCAAATTTGTCCTTAGAGCTTC^ 

3 5 CTTGGAGCTTGATTAGATCCAGGCTTGATT^ 

CTTCTCCTTTTGTAIX3TTAGCAGTCACCGATGATCATTGTGTAGATTCA 
GCAACATTTGCITGAATATTGTTATAAAAAGAAACTGCCCCAC^ 

GCAAGATG AATGCTTG ATT CATTTG CTTT ATTTG CCAATTTTCAAAAT AAT AAGTT AATTCTTCAGTATC CAG CAATTGAC CAA 
TGAGGTATTCTTTTTTAAAACATTATGAATGTGTGAATTTAAA 

4 0 TCTGAAATCATCCTACTTGGGGATCCTCTTGAAGATGCCTCCTGAGTCTTTTTAATAC 

GCTITCTGGTAAAAGACAATGTCCCTGGCTCATCTrGTACATTT^ 
TGGTTCTGTTTAGTGGAAAGTGGTATTTGGAGACCATAATCTGGGTGCTTGGGGTGCT^ 
GGGCCTTTTC^GTAGCAGAGCTAAGAAATGTGTATGTGCATATTTATATATTTGTTAATATATA 
GGCTTCTAATTCAAATTCAGGGTTACAGG 

45 

HUMAN SEQUENCE - mRNA 

GAATCACGACCCCTCCCTGCCATGTATCCGCAC^CAGACATCCC^CTC^^ 
GGCTGAGTCTrTGTGACAGGATCAAAGACGAATTCCAGTTCCT 

CAAACGAG AAG ACGG AGATGCAGCGCCATTATGTGATGTACTATG AGATGT CCTATGGCTTGA CAGACA 

5 0 GAGATTGCGAAGAGACTGAACACAATTTTAGCACAGATCATGCCTTTCCTGTC^ 

GGAGCGCGCCAAGCAGGTC^CCATGACGGAGCTGAACGCCATCATCGGGCA^ 

C^CACGGCCCCCCGGTCCAGTTGCCACCCC^CCCGTCAGGTCTCCAGCCT^ 

CTGCTGGCAGTGGGCGCCCTGGGCAGCCAGGCCCATCTC^ 

AGAATCCAGTGCGAATAACTCTGTGTCACCCTCC^AAAGCCTCCGC^CCAGTGAGAAGCACCGGGGCTCTGCGGACTACAGCATGG 

5 5 AAGCCAAGAAGCGGAAGGTGGAGGAGAAGGACAGCTTGAGCCGATACGA 

GTTTCCAATGAGGACCCCGCAACGCCCCGGGTCAGCCCGGCACACTC^ 

AAAAGATGCCCCCACCAGCCCTGCCTCGGTGG CCTCTTCC AGTAGCAGACCTTCCT CCAAGACCAAAGACCTTGGTCAT AACGACA 
AATCCTCC^CCCCTC^GCTC^CTCCAACAC^CCAACCCCAAGGA^ 
AGGTCGATGCCGGGTAAACCTCCXXJGCATC^ACCCGATAGGTATAATGGCCT 
60 CITATGCGGCGCCCTTCGCC^TGATGAGCCACCATGAGATGAACGGCTCCCTCACCAGTCCTGGCGCCTACGCCGGCCT 
TCCCACCCCAGATGAGCGCCGCCGCCGCTGCTCCAGCCGCTGCCTATGGCCGATCGCCAATGGTGA 
GACCCTCACCCCCCGATGCGGGCCACAGGCCTCCCCTCAAGCCTGGCCT^ 
GAGTGCTGATGGGCAGATGCAGCCCGTGCCCTTCCCCCAaSAC^C^ 
ACACACTCAGCCACGGGGGGGTGGTGTGTGCCGTGACCATCAGC^^ 

6 5 AAGATCTGGGACATCAGCCAGCCAGGCAGCAAGAGCCCCATCTCCCAGCTGGACT 

CAAGCTGCTICCCTGATGGGCGCACGCTCATCGTGGGCGGC^ 

GCATCAAGGCCGAGCTC^CGTCCTCGGCTCCCGCCTGTTATGCCCTGGCCATTAGCCCTGACG 

AGCGATGGGAACATTGCTGTCTGGGACCTGCACAAC CAGACC CTGGTCAGG CAGTTCCAGGGCCACACAG ATGGGGCCAG CTGCAT 
AGACATCTCCC^TGATGGCACCAAACTGTG43ACAGGGGGCCT 

7 0 TACAGCAGCATGACTTCACTTCCCAGATCTTCTCGCTGGGCT 

AACGTGGAGXjTGCTGCACCACACCAAGCCTCACAAGTACCAGCrrGCACCTGCACGAG 
CTGCGGCAAGTGGTTCGTGAGCACTGGGAAAGATAACCTTCTCAAC 

TATGAGGTCATCTACTAAACAAGAACTCCAGCAGG 

75 
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HUMAN SEQUENCE - CODING 
ATGTATCCGCAGGGGAGACATCCGGCTCCCCATC 
CAAAGAOTAATTCCAGTTCCTGCAAGCTCAGTATCACAGC 
AGCGCCATTATGTGATGTACTATGAGATGTCCTATGGCrTGAACATTGA 
5 ACAATTTTAGCACAGATCATGCCTTTCCTGTCAC^ 

CATGACGGAGCTGAACX3CCATCATCGGGCAGCAGCAGCT 
TGCCACCCCACCCGTCAGGTCTCCAGCCTCCAGGAATCCCCCCAGTGAC^ 

GG CAG CCAGGCCCATCTG ACGGTGAAGG ATGAGAAG AACCACCATGAACTC CACAGAGAGAG AGAATCCAGTG CGAAT AACTC 

TGTGTCACCCTCGGAAAGCCTCCGGGCCAGTGAGAAGCACCGGGGCT 
X 0 AGGAGAAGGACAGCTTGAGCCGATACGAGAGTGATGGA 

ACGCCCCGGGTCAGCCCGGCACaCTCCCCTCCTGA 

TGCCTCGGTGGCCTCTTCCAGTAGCACACCTrcCTCC^ 

AGTCCAACACACCAACCCOVAGGAACX^CXSCCa^ 

CCXXXSC^TGGACCXGATAGGTATAATGGCCTCGGCT 
1 5 GATGAGCCACCRTGAGATGAACGGCTCCCTCACCAGTCCTGGC^ 

CCX5CCGCTGCTGCAGCCGCTGCCTATGGCCG^ 

GCGACAGGCCTCCCC!TCAAGCCTGGCCTCCATTCCTGGAGGAAAACCAGCGTA 
GCCCGTGCCCTTOXCC^CGACGCCCrGGCAGGCrc 
TGGTGTGTGCCGTGACCATCAGCIAACCCCAGCAGGCACGTCTACACAG 
2 0 CCAGGCAGCAAGAGCCCCATCTCCCAGCTGGACTGCCTGAACAG 
CACGCTCATCGTGGGCGGCGAGGGCAGCACGCTCACCATCT 

CCTCGGCTCCCGCCTGTTATGCCCTGGCC^TTAGCCCTGACGCCAAAGTCTGCTTCTCCT 
TGGGACCTCCACAACCAGACCCTGGTCAGGCAGTT^ 
CAAACTGTGGACAGGGGGCCrGGACAACACXKSTGCGCTCCTGGGAC 
2 5 CCCAGATCTTCTCGCTGGGCrrACrGCCCCACTGGGGAGTGG 

ACCAAGCCTCACAAGTACCAGCTGCACCTGCACGAGAGCTGCGTGCTCTCCCTCAAGTTCGCCT 

CACTGGGAAAGATAACCTTCTCAACGCCTGGAGGACGCCTTATGGAGCCAGC 

GTTGTGACATTTCAGCGGATGACAAATACATTGT 
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Table 88 

MOUSE NOMENCLATURE 
ICSGNM Nek2 
Celera mCG14118 

5 

HUMAN NOMENCLATURE 
HGNC NEK2 
Celera hCG23474 

10 MOUSE SEQUENCE - GENOMIC 

TATCACCTAACCATCTCTGCCCCAGCAAACACTGGATCAAATCGA 

TTCGGAGGGGACTAAATATATGAGTCCC^GAAAGGGCAGAGCTAGGCAGGAGCG 

GGATTGCACGGGAGGGCTGGCACAAGTCACTGACCCTGCGACA 

TCAGGX3CTTTCCTGGAATGCAAACTTCTAGTGTGCAATGTGGGGCIAGTCCCAGG 
1 5 ATGTCACCTGGTGAGGTGAGATCTTGAAACAGATTTCTGGAAAAGCGTCAAG 

GTCTGGAGAAGCACTTGCACACACTC^GGCAGCrACACCAAGACTG^ 

TCGTGGGGGTGTGCTTTCAAAGCACCAAGGACCCCATGGACTAAATTTATGGT^ 

AACTGGATGGGAAGAAAGGGGGATCAGAGAGCAGT(TTGAAGACAGGGTGTGGGGAGGCT^ 

GGCCATTACAGTCAGCCTCTGTGTATTGTGGGTCAGATATTTATATTACAAACCT 
2 0 CTGCTCCTCTTCACTTTCTTTCCTTGCAAGGCTGGGA^ 

ATCTCCCCGGGGCTTTTCAGCCACTTTATTTCCTTGGCGATTATTTGGGGCT 

GGCCTGCAGAGAATCITACIACAGTGCCATGCATGGTCT 

TCCACCTTGCATGTCCTATGAGCATGAGCCCAACAGGTTGGGCAATCTTTC 

GGCACACACTGTCTGGGGTACAGCCATACTCITGGCCAGCT 

2 5 TAGGATCCCAAGGTGCTGTAATAAATC^TATCCCTGAGGGTGGAGTCAGGGAAGCTrTTTGATCTATAAAT 

TGCCTTCATCTTTTTCATAGGGATTTCGGCCTGGGTAAGCTGAGTGCC^ 

CTGCTTGTTATCCTTGlTGAATCrrGGGCCAGGCCGGGGAGCACCACGTTCTGTTCT 

AGGAGCCCCTTGGGCCTTCCTGTATGGGAACTAGCTGTTACAAACACATTC 

CGTAAAATGATGTGTTAGAGAGCCACGTGATCTTCAGCTCACAGTCTACATGGAGATTGTGGGTAGATTCCCTCCACT^ 

3 0 CCCATTCATGTGTCTGTTGAATTTGGGTGCTCAGCACTGAAGTCAGGGCCCCATGCAGGCCATTGCCT 

AATTCTGTGGAGTTCAGTTCTCTTTCTCCACCTTTGCATCTGTCrCAGGCTTGG^ 
CGAGCACCCTGCATTCTTTGTTTCTGAGGAGTTTTACACTTATAGGAAAGCTGTGATC^ 

GTGAGTTCTGAGGGGGCTTCCTGCTGGGTGGGGCCCACTACGGTCCATCAGTTTGGri 1 1 IGTCTGACTTCCCTTTTCAGTGGTAA 
AGAGTCACATTGAAACGGGCTATGAGGTTATAGCirCTCCAACCGACCCTGGTCCTCACCT 

3 5 GGCCCATCTGCAGCCTCTTCTAACTGCTAAGCAATCTCTCCAGACCCTAAACATATTGTTTCTAGCCTGCT 

TGTAGTCCCAAATTTGGAAGACTGAGGAGAAGCCGTCTTGAACTACTTCGTGAGTTCIA^ 
GTATCACAAATTTTACGTGTGCTTTTGAAGAGTGGCACTGAGCACTC 
ATCATTTCTGGCTTCTTTGTGCTTTGGX3TTTGAACTGTGAGTTAGAGAG 
CTTTTCCCTTCTTCT T CTTCr T CT r CT T Cr TC TTCT^ 

4 0 CTCTTCTTCTTCTTCTTCTTCTTCTTTTAATTTTGT 

CCTGGAACTCACTCTGTAGACCAGGCTGGCCTCAAACTCAGAAATCTGCCTGCCTCTGATTCCCAAGTGCTGG 

CGCCACCACGCCAAGCTCTACATCTTTCTTAGAAGAGAGGAAA 

GACCGTGAAGGAAAGAGCAGGTCTTGAGGCTGATAGAGAAGTCTCCT^ 

ACAAGCTGAGAGTGGTGTGTGGGGAGACTAGAGAGCAGTGGGGAGAGAAGAGAAAGTGGAAGGCTAGGGGA^ 

4 5 CTGACAGACACTGGCATTGGGGAAGTACCGACACCTGTCTCAGTGGGGAAAGAGATCCGTGCCTGGCCT 

CTAGAGTGCCCAGGTTCAATTCCCAGCACCCACTTAGTGGTTCACAACTCGTTTCAAAGGAGCCA^ 
ACACACACACACACACACGCAGACAAAACACCCACATACAAAACAACAAAGACAATTTTTGTTTG 
AGCTCTGGCTGTCCTGAAACTCGATATGTAGACCAGGCTGGCrrTCAAACCTACAGAG^ 
GTGTGCCACCATGCCAGCTAAACAAATCIAAAACAAACrrAGAGAGGGGCCACAGCCGCC 

5 0 AAATG£AAACGCAGAAAGCTCATAGCATTTAAAGCTGGAAG 

CCAGAGATGGGTTGGGCCACACAGCTCCTAATTCCCAGGATTGCAGTCCCTGTTCCTGCAA 

TCTGGACATTCCTGCAAACACATTAG CTGCAGCCTACGGTCTGGGCTGCTAG AACAAG ACG CCTGGT ACCAGGTGATTC^VTGCATT 

TATTCCTCGCTGTTACAGGGCTGAGAAATCTGATGCTGGCAGAAACACTCGAGGACTGCCCTC 

ACAGGATGCAAGAGGCCAAGCCCTTCATGCCCn'AGGAGAACAAAGCCAACCACAGG^ 

5 5 G CCCAGC CCTGCACCTTCACCTCTItKKKKSTGGGGGATGGGTGGCte 

CTTTAG CATG T AG CG TGTGAATTTGGGG TGG AG AGACGCAAT ACTCAG AC CAT AG CAGCCC CTGT CATCTGTGATT AG CTTTCATG 
TTCTTATCCAATTGAAATGATTAATACTTAG AGACAGGGCCTGG AG^ 
GGATCAGGGTGTGGTTCCTAGCACCAAAGTTGAGTGGTGACAGCTGACTGTAA 

G C CTCAAAGGG CACC CTCAAACATGTG CACACACAC ACAC ACACACACA CACAGAC ACAC AATTTGT AAAAAT A'ri"rriTAAAGTT 

6 0 ATAGTACTATAAATAACATAAAAACATGCATGATTTGTTGAAAAGTCTAAGTTT 

CAA1TTGTATTCTTGTGTACTGTTATGCCCTGAGTTCACGGG 

TGGCG<^CGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCGGArrTCTGAGTTCGAGGCCAGCCTC 

CTGGGACAGCCGGGGCTACACAGAGAAACCCTGTCTCAAAAAAACAAAAACAAAACC^ 

CAAAGTCCACTGCAACTCGCAAAAGAGTCTTGTATTTTCAAGTTTGAACTC^ 

6 5 GCAAGCGATCCTGAGTCCAGAAAACATTGGGTTTATATATAGTATAGGAATAGTGTC^TATrCACAG 

GGAATAGTACTGAAG CGGGAAGGGGGCTAATGGGTGTCTATTCAAGGACTGTCTGTGACCrGACCTCTGTTTAC CTTTTACTTTTT 

CCATGGTCTCCCrrGAGCTTGTTATTTCTCTAAGGTCTCAAGGACAGGC 

TTTGGACAC^CAGAGGTGCACCATTCTAACTCTAGGGGAGGGGGAGGGAGCCAGCC 

AACCAGCTGT CTG TG T ATCATG CTTGCGG CAGG CTGG AAAAATG ATGGG AGCTTCTGG CAGAGGGG TTGG AGGAAACGC AGGGG CA 

7 0 AACTGTCTGCACAC!AACAACACTGAAGTCTCTCTAGTATCCTTAATTTG 

TGAAACATTGTGGTACTCrGACC^CACACAGGCTTCCCAGC 
GGCTGCAGTCTGTGCTtTlTGGCAGCATCCACAGAGTCCCTGCCACT 
ACTGAACATATACTGTX3CATTACACAGTTTTGTTTTGTTTT^ 
CCCTGCITGCrTCAGCCTATAGGTTCTTTATTTATTTCT 
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CACGACCTTTAAGAGCCACTGAGCC^TCTCACCAGCCCTTTTACAGG T T TTTTTTTT T TTTT 1 TT ITT TTT TTTTT T TTTT AATGT 
TTGGAATTGAAATAGAATCCCATGGCrTCCCCCTCCCTGGACCCCCrTCCTTC T T TTTTTTTTAT 

TATTATTGTTACACACATATATGTGTGTGTGCGC^^ 
GTTCCAAGGACAACCACTTGGCATTAGACAAACAATCCCTGGGAGAG 
5 TTTGTCTAGAGGAAAAACCCTGACAACCTTTCCTTATTCCATG 
TTATGCAGCCATTTCTAGAGCGGCTGTTCACAGCGGATGCCTT 
TTTTCCTGAACCATAGACAAAGGTGCTGTGTGTCCXSTGT^ 
TCCAGTTGTGGTTTTCCGTGATGGGCTCCAT^ 
AGGTAAGCTGAGCACCTACTCTAAATTTTTGAGACAGTGT 
1 0 TCTCTTTGTAGAAATCITTACTAATAACTTCAAAACATTGTCATAAACC 
TCACTCACAAAATGTCTGACCCAACATCCTCCCAGGCCAGT 

GCGTACCGCCCTCTAGTGGGAAGCITGTGTCAGTACACAGACGCTCCCGGAGG CGAGCCTGG AGATTCTTCAGTGAACAGAG CTAA 
CGATGCTAGCCTTTAGCTAAGAGAAGGAAGACAAAAAAGTAGC 

TGGGCCCAGGGCACAGTGCAAAAGGTCCCTCCCTTGAGATGG CACACACTGTTAATGGGGACTGACAG CAG AAGCAG ATTGGGCAA 
1 5 GAATAGTGTGAGCCAGGTCCTGCAGGAGCCAGATGGCTGCCCAGAGCAGGGTAGGCT 
CCACCTCCCATCCCCACCCGATGACCCCATACTGAGG^ 
GGAACCCCCAAAGCCAACCATTTCCCACITTGGTAATTACTGATT 

ATTGTGGCTG AACACT CTCACCT AGGG CCTCTCTCT CT C CCCCG AGAT ACAG CTGGCATGGCTGAAT ACTT 

CAGGGATCATTTGGAATCTCTGTGTAAGTCTTAATAACAGGCCTTACTGC^ 
2 0 TTCTCTATTGCAGGACTTACACTGGCCACAAGCAGTTTGATGTTCAG 

CACTGTTCTCTGAACATCTTGGATGAGGCCCCTCAGAGCAGAGGTGACAAGGTTC 
CTGGATAAACATAATAGAGGGAGCATGCTGTTGCCTTGAGTAAA^ 
TTCACAGTTCATCAGCCTTGGAAAGCAGAGACTTTACAGCCTGGCATGGTGTTG 
GGCOVGCCTGGTOTACAGAGTGAGTTCCAGGACAGCCAGGGCTATACTGA 

2 5 AAAAGAAACTTGATCCTTCTGGTCACTTTGCACCTGCAGCCAG^ 

CTCTTTTTTATGTAGTCCTAGACTGGTGGTTTGCATGAGAATGGTCCCCAGAGGCT 
TAGGTCTGTTCTGGGAAGTTATTAGGAAGTGTAGCCTTGTTGGAGGTGTG^ 
GCCAGGAAGAGTCTCCCTCTCTCTCCCTTCTCTCTGTGCTCTGTGGT^ 
GCTATCACGCCTGCCTGCCACCATGTTTCCTGACATGAGGATAATAAAGGACTCACT 

3 0 GCTTTCTTTTGTAAGTTGGCTTGGCCATAGTG 

CACTTTTAGGGTGTGTCTTTTCCATTO^TTAATTTAACATAAACCCT 

TCrACAAGTGTACCCAAAGGCTTGACCCCCTAGGTGATTCTACATCCTGTCAATCAACAGTAACCAAGGTGGG^ 

TAGGGATGCAAGTGGAACTTCTAACAACTGGAAGGACAAGTACCAA^ 

CGAAGAGTAAGAGCTCTTTGTCTAGAAAGAAGATGAAAGGTTGTGTGTGAAGC^ 

3 5 ACACCCATCACATACACACCACACGCCTATACACCACATATCACAAACACACCAC^ 

CACTCTACACATCATAAACAACACACACAGCACACACACACACAACATACT 
CCACACATATCACATACGACACACACACAAACATACCACACACATCAC^ 
TTGTAGCrCCTTATATAATTTCTGAGACCTAAATTAATCCTACCCCTTTGAGGTGTGA 
TCTCCTGGCn'GGCAATTCCAATTAGCTTATACTAAATTCAGCGC^CAAAATA 

4 0 TATTAATAGGACATTGGGTTCTTTTTCAATCACAGCATGAACAGATATTTT 

CTTTCTGGTAATGCACATTACCCCCCACCTGCTCTGGATGGAAGCCTTTAGCT 
TGGCTATGTGAGCTCAAGTCAGCAAACTGCATGCAGCTCCCGGATTGGCrATTAG 
ACACTTTTTTGGTAGATGTCTTAGAAGTACAATTTGGCAGTGTG^ 
TGGATACGTTTCTATGTTCAACAGGTCTCTCAGTTTAATGTTCAGAGATCTAGACT 

4 5 , ACTCTGCGTTATT ACTATGAATATCCTTT ATTCTTATTATTTTTT AATTAGTTCTC CCTAGTTTCTCT CTC 

CAAGCTCACAGAGATCTGCCTGAGCX3CGCCACCAAGCCGGGCTGGAACAGGGATTCT 
CTCGGTGCACTCACCGCACCCTCAGGGTCATCAAATCCTCAACACACAAGGG 
AAAGGGTGGGGTCAAAGTGGAAAGGTCAAAAGGCCAGTGTCTCCAGTC CAGAGC CTCGGACATC 

5 0 AAGGAATGGAATTACCGAGGCCCCAACAGGCTTTGCGA 

TAGACGCAGACACCAACGCCGACGCGGTTGCCTAGCGACACGCATTTTCCCAGGA 

(^GAGTTGAGATGCACCTTCGCTGTGGACCAATAGAATGCGAGGCCGAAGCCCGCGACGGTTGACCCGGTGGCCGG 
ACTCAGGCAGGCAGACGAGGGGCTCGGGTTTCGGCGCGGTTGGCCTGCAGCGGGCGAGCCTGGGCT 

GTTGCCAGGGTGGCCGCTCGCCATGCCGTCCCGGGTGGAGGACT ACGAGGTG CTGCACAGCATCGGCACCGGCTCCT ACGGCCG CT 

5 5 GTCAGAAGATTCGGAGGAAGAGCGACGGCAAGGTGAGGCACCGGGCAGCCCTTCCACGGC 

CACGCTGGGGAGGGGCAGGGCGGCCACCTGAACTCTCGTGAAGGACCGGGCT 

GGCAGCCACGGCTGGGAACCX3GTGGAGGCGCGAATCCGCGGCGAGCAGAAAGACGAGCTCACGTGACCTGCT 

CACGCCGTCCCGGACTCAAGCGCTGGGGATGGTACCGCCCACATTTCGGGTGGGTCTTCCCACT 

CTCACTGACACACCCATTGAGTTGTTAGTGTTTTGTTTTTCAACTT^ 

6 0 ATTTCAGATCCTGGTGTGGAAAGAGCTTGACTATGGCTCCATGACGGAGGTC 

TTCGGGAGCTGAAACATCCAAACATCGTCOTTTACTATGATCGCATTATTGACCGAACCAACA 
TACTGTGAGGGAGGGGACCTGGCTAGTGTCATTTCAAAGGGGACCAAGGATAGGTAAGGTGAA 
TAATAATTAACITCCATGTTTGTACGCTCTGAATGACTTGCCCCAGATTGAAGATGG 
TTCTCCAGACAGTACTTGGAAGAAGAGTTTGTCCTTCGAG 

6 5 CGATGGTGGCCACACTGTGCTTCACCGGGACCTGAAGCCAGCCAATGTCTTCCTGGACAGCAA^ 

TTGGACTAGCTAGAATATTAAATCACGACACGAGTTTTGCAAAAACGTTTGTT 
TTAGAATTCAACTTTTTTTTTTCCCTGAGACAGGATTTCTCTG 

TCCTTGAACTCAAAGTGATCCTCCTGCCTCTGCCTCCCAAATGCCGGGATTAAGGGTGTGAGCTACCACACCT 
ACTTTTTAATATCTTAGGTGAAGCATTCAGATGCAAAGAACTGAGGCTTGG^ 

7 0 ACAATTTAAGTTTGAGTITGACTTTGTTCCC 

CTGAGTTACACCCCAAAGTCTTTTATATGTATTATGTATTTAAAAATGTT^ 
GTGAGTGCTATGTGTGTGGTGAGACCAGAAGAGGGAGTGGACTTACCCTATCCCACACACT 
GAGCCGTCTGACCAGATGCTGGAACTTGAACTCAGGACCCCCGGGAAGAGCAGCAGGTC 
TTTTCAAAGAGACTTCCTGGCAAACAGTTCTCTGTATTCAACTAGAGCTTT 
7 5 TATGGGTGTTTGGTCTACATGTATGTAAGTGTX3CCACCTGTGTGCCAAATGCCCAAGGG 
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GAG CTAGAGTT ATGGATGGATTGAGAG ACTGG AAAC CAT AT CXIAAG TCTTCTGT AAG AGCAACAAG TG CTGTTAAC CACTAAGCCA 

TCTTCAGTGTCCCCCATTCCCCAATGTCCCCCATCCCCC^GAGTTTCACTGTAG 

CTGCCGCTGCTGGGATTAGAGGTGTGAACCATCVITGCT 

TT ACAAAT AATGTTT AAGG CAACAATGTTTC CAAGGGGAATAGTGTTTGTTTG ITTlX5TTGTVriTTGTlTTITCAAGACAGGG TT 
5 TCTCTGTGTAACCCTGGCTGTCCTGGAACTCACTCrcTAGACCACGCT 

AGTGCTGGGATTAAAGG CGTGAG CCACCACTGCTCAG CTGGGAATAGTGTTT AAAGG CAACAAT ATTTCCGAAGATGTCAGTCCTA 
AAGTGGTGACTTCAGACTCCCTGGGAGTGTTTATTTTAT^ 
CACX3CTGTCTCAGCAGGCAGCTGAGGTGAACTGAGATGCTCAGG 
TGGTGCCTCCACATTTAGAGTGAGTCTCCCCAGCTGA^ 
1 0 TGATTTTAGATCCTGTTAGGTGGACAGATT^ 

AGTGTTTTAGTX3TTAATTTCTGCATGTCCTGTCCTTGCT 

GCACCCTAGGTGCATAGCCTTGTATGAATTAGTCCTCATTTCT^ 

GGTTTTTAATCTGTCATTAGATGTCAAGGTCAGGGCCTC 

CAGAAG C AAAG CACAAG T AAAT AC CAACCAG CCCTCAAGTCTGTAAGGGG CAG CACACACAGACACACACACACACACAC^ 
1 5 ATCCTCCGTCCACTGTTATAAAAGCACAGCAGTATCrAACACTCTCCITACTTCT 
TTCAGGCGCTAGGGCAGTGGAGGGCAGTGGGGAAGGAGGCAG^ 

TT AAGCGTTCTGTGTGCAACATG ACAC CATGG AAAAC CG CCCAGCTCTGAAAATAGT CACAAG TTCACT AAGTTTT AT ACTGTCAC 

CAGGTCTATTAGTTTGATTATTTATCTGGAAAGTTCTAGAAATCT 

GTACCCAAGCTTCCTAGAACrTGTATAGCTGAGAACAACrTTGGATl" 

2 0 ATTTTTCCTGAATGTATGTACACCATGTGTATTCAGTGCCT^ 

TGTAATGAGTGGCCGTGTGGGTGCTGGGAGAGAAGCCTGGGCCCTCTGCAGGAG 
TCTAGCCCGCCATCGTTTTCTGGTCCTCATGCTCTATCTCCCAAGTG 
ATCCAGCCCTGGGCTTTGTGCCTGCTAGGCAGGCCCTCTGCTGACC^ 
TC^CCTCCTTGGTATTTTCTTACCTTCTCCTrTAGGAA 
25 GGCTGCCTGCTGTATGAGCTGTGTGCACTAATGTGAGTGCGTGGAGACAGGCCT 
TAGCAACAACCAAACACCTGGCTTGTACAAATTTAGAATGGAACATGTGGG 
TGCAGTGGCvAGGGCAGTTTATAGACCAGGCCAACAAAGCAGATGC^ 
GGGCTCTGAGTTTGACACTTTCCTCCTGCGATTCTAA^ 
TCACACTGGAGACAGTTTGGGCTGTCCTTCCAGATCTAGCAGGC^ 

3 0 CCCCGAGTCAGAGCTGTTCAACCCTAAAATACAGCAGCACCGCCTTGCC^ 

CCATAGCCAGGGAAGAGTCCCAGATGAGGAAGTAGAGTGTGAGACAGAGGCC^ 
TGACATTTTTATAACTTCTCCCCTTCTCTCTTTCT 
AGGCCTCCCTTTACAGCTITTCAACCAAAAAGAGCTAGCTGGG 
TGATGGCJTTGAATGACCTCATCACTCGGATGCTGAATTTAAAGGTAATG 

3 5 GACTGAGAC^CGTATTCTGTGTAAAGACCAATGAAACCTAAAGGAGCATCGGTT^ 

CGGATGCTCACAGGCATGGAGACATGCCCCCTCGGCAACCGTGAGCCTCAC^ 
GGATGGTTAGTGGTATGHSAAGGACTGAGGAAAGGCCA^ 
TCTCCCCTCACCATGAGGGATGATGGATTATGAAGGGAAGGTCAACAGGGA 
GTGGCCTTTGTCAGCTGAGGAAAGAGTGAATCCGGGAGCTGTGGGTTTGTGGAGGATGC^ 

4 0 TGGAAGGTCATGAG<3ATGTCCTTCTTGGTATTTAGAGA CAGCCCAGTG ATGCCCAGTG ATGCC C 

AGTATGCATGGAGGAGGGAGGAGTTGTGCCCGCCTGAGCCAGCTCC^ 
C^TCATGGTGCAGACCCAGCACCTTGTGACCAGTGCACTTG CT^ 

TGTTTACCCTGCTAATGTAGCTGCITTTCTTAAGATGAATCTGATCTGCCAGCCACACTG 
AGTGGAAGAAATTCTGGAGAGCCCTTT^TAGCAGACATGGTTCCA 

4 5 GAGGCGAGCCTTCGAAGCTGCCGGACTCCAGCCCTGTGCTGA 

GCACTCAGAGCTCGGGAGGACATTCTGGAGCGTGAGTG^ 
GGGTCTAAACAGAAGAGCGTATATTCACAGGCTGTTAGAAAGAGCCTTTATTC 
GGATCTTATGAATGCCAGTGAGCTACATCCCAGCATTTTTCTCT 
CTCCTGCTCCAGCGGCTTGTGTTACCAAGCCTGAC^ACTTT^ 

5 0 GGCTCAAATGACCCTTACCTAAGACCAACAGAAAACATTTATTTCCCATGAG 

ACACATGCAGATACACCCAGGCATC^TAACTVTCATGACAACTCTATCACATATACCCT 
TCATTAGAAATATTCCATTAGAAAACCATCCCMCAACAACAAAAAGTAAGC^ 
ACGGTGAGCACTTAAAATAGTCGGTGCTCCAGACAGAGCrrCCGCCATGTCTTACGGAAGT 
TTAAATGAGTTTTTGAGCTTCACGATGAACTCAGAATGTTCTTAGATAGCAAGGCTGCrA^ 

5 5 AGTGAGTTGCAGACAGTAACTTACTTC^TTGTAATCTTTG^ 

ATT CTTCAG TTT ATGT CAAAGGATCCGATTATTCCAAATG AAACTT CAG CTTTGG CAAAAAC C CAAAAAGG ATG AAAAT AAAATTT 

ACACTTGCACATGCTTGCAGACAAGACATTTCATT^ 

GTTCATTCAGTCAGAGCTGAAGCTCTTCGTGAGACAGCACAAGAGGAGTTGACT 

6 0 CTTCTCCACAGTGCCCCAGCTACACAACTCTGGATCAAGTGTTTGC^ 

TTAATAATGGTTCTCAAGTGCTTTAAACTGCCTTCTCCAGCCCACCACCCAC^ 
GGGGCAGTGGACCTGTTTAGCAATTCTTITTTGGAGCAAAT 

GGGCTCTT AATGCACTGGCAGACCGTCAGTCTGGTCCTGTTGGGATAG CAG CAGG ATCCAG CAGCAGTGGCACCGAGCAGAG ACGT 
GTGCCCTTGCATAGACTGCCCTGAGAATTTCTGATCTGGTGAGA 

6 5 GT ACTGT C GCAAAATGT AG CAC CATG CTTTCAAGACAAAACTACATTT ATTTGT AATTAGGAGT AAAT ATTTC^ 

AGTATATTITTATGACTAATCACTGAGCrTTAATTATGTTCAATTTGTAACAA 
TTTGTAACAGTAGCAAAGTTATAGTTAGAAAGTAGCAACAAAAATAGTGT^ 
AAAGTACCCCAGCATTGGGAAGGCTGAGAGCACTGCAGTGCAGGC^CCT 
AT<K3CCAGGCTGCAGTAGATAGGTTTATGGTCAGAGCTCCTTACT 

7 0 GTGAACTTTGTATTCGAGAGAGACTTGCAGAGGACAAACTGGCC^ 

CACAGGCTCCTATGTCI^CTGGTGGCCCAGGTATC 
TGGCCCCTGAGTGGGTTAGAGCCCCAGATTTCAGGATGAC^ 

GTG ACTG AGTC CCTGG C CG AGAGACCTGAT AGGAATGG CAATTT CATTTCAG CTT CACTCAGC ATAC CAAAAG CGGTGG ATGG TAT 
TTAAATATTTGAGTTTTGAAATGGAAACATTTTCAAAA 
7 5 GTTTGAATAGCTACAGCCCCCATAGACTCATGTGTTTGAATGTTTGGCCTATAGAGAGT 
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GGAGGAAGTGTGTCACTGTGGGGGGCAGATTT^ 
AGATGGAGAACTCTCAGCTCCTCCTCCAGTACCCTGCCTGCCTC 

TCTCTATACCTGTAAGCCAGCCCCAAGTTAAATACCCnTTGTAAGCGTCGCCGTTGTCA 
TAAGACAGTTATGTGACCAAAACTATATTTCTCTATATTTTCAAGCAAAAATCCTGGTATTT^ 
5 GTAGGAACTGTGGCTTAACAGCCATGGTAGTATAAGTCTGTTATCCCAGCACTTGATTTGCT 
ACATAGTGAGTTCAAGAGAGTCGAAAAGAAAGGGGATGGCACTTG^ 
TCTTAGTCACTGCTCTATTGCTGTGAGGATGCACCATGACCCCGGCAATTCTTAAAAGAG 
AATTTCAGAGGTTCAGTCCATGACCATCATGGTGGGGAGCACGGTGGCGGGGGCAGTGCTGATC 

CATCTGCTGGGATTT CCTCCAGCAAGG CCATACCTCTTCCT C CrTTTCAATCAGTGCCAATGCCTGGTG ACTGAG CATT CAAAT AT 
1 0 ATGAGCCTATGAGGCCATTATTTAAGCCACCACCCTGTTCTTCCTGAGGA 

TAGCTACTGGGTAAATGTAAAATCATGTACTATGAAATCAACTACTGCAATATACACGAG 
CTAATCTGGATGCTGGTCAGTCCTITTTAAATCA 

GCCCGGCACACACACACACACACACACACACACACACACACACACACACACAGCCCTCCT 
CCTGATCACTTGTCCCAGAGAGCATCTCACAGGAATCAGTGGATAGGTCAGCTCTTTGATCTGTGTC 

1 5 ACTCAGATCCTTTT CTTTCT CTTTGTAAAG AACTTG ATCTTC CATCCTCAGCCATGAAGAAGAAGGTT CATTT CCACGGGGAAAG C 

AAAGAGAACACCGCAAGGAGTGAGAATTCTGAGAGCTACCTTGCCAAGTCCAAGTGCAGGGACCTGAAGAAG 
CCAGCTGCGGGCTCAAG CCCTGGCTGATATTGAAAAAAAC^ CA 
AGGCATGGAGCTGGGTCAGTGTT^IATACTGACAACCCACTAGAGATTGGTATTC 
GGCAGGAACCTTTGTGTGTGGTGAGCTCGTGGCATTGCTTGTGGTCTGCAAATGGATGTGTGTC 

2 0 TGAAAAGCAAGCTGTCTTTGCTGGTTGGTTGGGCTTTTGATCCTGTC 
TACTATAGATCTTAGGAGAAAATAATGTCAGGGAAA^ 

CCTTCACTGTGGTTTCCAAGTG AGG CCTCAGCGGGTCCATGACCTCACTGCTAACGGGATGCGTATGGCAGGGCCCACAGGGTTG C 

ATGTCAGAGGTGTTGTAATGTTACAGGGAGAGACCCAGCCTCTGGCCTGACCTCCTGATG 

CCAGGACGCTCACTGTGCCAGTTAACATCAGATTAGAAGTGGAGGGGAGGGGAGGGCGAGGGCT 

2 5 GGGAAGCGCGTGCCATGGCCCTCAGCACGCTGCTGTCCTGCATCCTAGGCAGGCACTGGGGACAG 

GCCCCCTCCTGCTCTGTGCAGGGCCTGTCCCCGTCTGTCTGTATCTGACCTCTAGGTCAGTTGATGACA 
TACCT ACAAGTATAAG CCAAAAGGT CGGAAATGCCTCAGAGTCACACATGTGGTCCCCAAACATTTAATTTCTGA 
TTCAGAAAGTTGGTCTATTTAAGGTGACTGGGGAAAAGGCAGGTTTCCTTTGCT 

AGAGAACAGTGTTCAGTGTGCGTTATTGAGGGCCCTGTATGCCTTCACCTGTGTGAGGGCCAGATTGCTGTTTG 

3 0 TGAAGACCTCAGCTCAGCAGAAAGGAGTTCATTTGCCCTAACAAGTACTGT^ 

TTCGTTTCATGGCTAAACACACAGACTCCTTGCATTTTAAAATTCTGTGCCTTTACTTCCTGGAGCT^ 

TGTGACTrcCCGAGGTTGGCTTTrcCAATGCTTTATGTAAACTCTG CTGTTGGCACTTG CAGCTGTAGAAGCTGTCGGAGGG ATT A 

CTTACTTACGGTTATAGCATCTGGGGTCTCTTGCAGCTGCAATGCCTGGG 

GAATGTTAAAATATGGTGAGCCTCTTCTCTGTGAAGGAAGGGCCTGACAATCTGCCA 

3 5 TGGTTTTCCTGCCTTTCCCTTCTCCTTTGGTCTATAGTGATACTGACAGCAATACAAAGTA 

GTATGAGGTCTTAAATGTTCTGGAATTTTAGAAAAAACTTAATTGGATGCAATTG 
TTTTCCAATATAATATAAATCTATTTTGTGGAAATTTATCACTGTGATC 
TTCACGTGTGTGTGTGCAGGTGCAGGTAGCATAGCACCTAGAG^ 
TTTGATAAGGATTCTCCCTGGTACAGGATACAAOVGTTTTGCCAAGCTTGCT 

4 0 CG CCTC CCT AATGCTG AGGTT ACAAG CAC ATTTTAT ATTACCT AACTTTTTTTTTT AAAG ATTT ATTT ATT ATTAT ATCT 

ACTGTCTTCAGATGTACCAGAAGAGGGCGTCAGATCrCATCACGGATGGTTGT^ 

GG ACCTTCGGAAGAGCAGTAGGTGCTCTT AACCACTGAGC CATCTCTCCAGCCCCJ^ CACCTAATTTTTTATATGGTTCTGGG CCC 

TGAATTAAGCGCGCCCCCCCCCCCCCCCCATGCTTGCATAGCAAGTACCTTTCTGGCTGAGTCATGTCCTTAGCCCATGATC 

TTGTTTTTATTCCGGATGTCATTTGTGTTCCATTTTTCTACAGTTATGT 

4 5 ATCATGGAGGCTAGCACTGCAGAGACGATCGAGGAGCACTAGGCAATGTTGAGAAACATTC 

CAGGGCITCCCTATGTAGCCCTGGCTGTCCCGGTTCTCAGTCTGTATAGACCAGGCTGGCCTCAAAGTT 

GTGTTACTTGCTGTTAAATGTCCTTAGAAACATTGGGGAACTGTTAGCATGA 
GTTTTAAGTGGAAATGCTTTGGGCTTGATCCTGTGTGGTTGGACTGCACAGCT 
50 AAGCAAGGCAGGAGAGACTGCACTCAGCCTTCAACAATGTGTGCACGTG 

GACTTACTCTCCCTTCCCCAGCCCTGTGCAGTGCAATATCAGGACTCAGGTCTAGC 

ACATGGATTGTGACTAGCCCTGTTCTGAGATGCCTAACACCTGTTCCCAGGCCCCTGCTGAGTGTCCTTATAA 

ATAACCGAAGGTGTAATACCTTCCACGACAGCAGTCTGCAGTCTCAGTCACCCATAAAACTTCATTTTC 

GACGTGGCTATATCTATCTCAGACTAACCTTAATTCATGTACTGTCAATGGACAGTCATTGAGGGCCTACCATGTTAAAACCT 

5 5 ATTAACCCATTACCCTGGAGAGCCCGAAGTCTCCGGCATTAGATCTC 

AGAGCTAGGAAATAGTTGTGTGTGAACCCTTTTTGGAAAAGGGGACATTGGGG 
CTGTGGGGTAGGGCAGGAGGAAGGGGGCTTGCAGGGCAGGGAGAAGAAGATCCTTGA 
GTCCTGCCACTGAGGCGTCGACGCAAGCCAGTTCCGGTCCATCGGGACCGTATGCTGGACCAGAGAGGAA^ 
GGCTGCTCTGAGAGGCGATTCTTCTCATTAGGCTTTCAGCAGGTTCACCAGCGCCT 

6 0 CGCCAAGGCGCCGCGAGTCCAGCAGCTGGGAGGCCATGGCCCAGCTGAGCACCACCAGCT 

GCGAAGAGGTGTCAGAAAAACCGAGGCACACAGGTCGCAGATGAGTGGCACAG 

TGAAGGATCAGGCTCCCAGGAAGGAAGCGCCCTGCCTGCTCCTCTGCCTCCCAGCATAGCAGCGACCTCTTTCCCACAGTGGG 
TGAAGCTTTCAATTCAAAGGGTGAAGAGATGTTTATTCTG 

CCCCAAAAACCATGTTCCAGCGAGCTGACAATTTCATGGAGTTTTTATAGTAACAAAGTCAGGTGCA 

6 5 CCTGCTATAGCCCTCAGATGCCCTCTGAGAGTCTCAGAGTTGGGGCTGGAAGAAAATAGAGCT 

ACACAGAAGTGGGCAAGATGTGGCTGTCTACTGGGTTAAAATGGCCCAAGACCATTTTGCTC^ 
CACAACCCCTGCCTTTCTGATCACATTCAGCCTGGTOkn'AGCAGGCTACCACAGTGTGA 

AAGGTG AACGC C CAC CCTG AGCATTG C CTGTTG CTTGTGGTT C CAG CG CTGT AGTGG CTG AG G CAAG AT AATT AGG AT AT AAGGTT 
ATCCGCAGCTCCGTAGCCAGTGTGAGGCCAGCCTGGGCTACACAGTGTGAAGCCAGCCTGGGCTA 

7 0 AACAAAAAACAAAATAAACCTTTAAAAATCGGTAACAATAAA 

ACTTCGGGAGGCAGAGGCAGGCGGATTTCTGAGTTCTAGGCCAGCCTGGTCTACAGAGTGAGTTCCA 

AG AAACCCTGT CTCG AAAAAC CAAAAAT AAAT AAT AAAAT AAAAAG CAGAAGAGTTTTT AAAAAAAAAACAAAAAAACAAAAAAAC 
CCCAACCCAAGAC!AACTGGGCAGTTTAGCAAGATGGGCCAGAGTGGTGTGATAGAGTCAGGTGGC 

AGG TG ACTGGG CAGGTTT AGG TAGGG T AG AGTAGATTTAG CTG CTGGTTTT AAAGCAAGACCCCGGG AGGATGGGT CTAG AATG AT 
7 5 TTTTCTTCCTCCAGCTTTCCTCGATGTTGTCTGCCATATGGA 
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TCCCTGAAGTCTGTCCCACTTACAGACAGGTCITGTTGTGAGA 
AAGAGAATGCGAGGTGTACCACGAGTGCTCAGTGACITGGAAAATGAGGACCrc 
AAATCAGTCTCACCCCAGGGAGGTTCATGCTGGCTGATGGCCTGCAAATGGACCTAGG 
TTATCTTrCCCAAGTGCCAGGGTTCCTCTACCACCAGCTGAATGTAGTCAGAGC 
5 CACCTCTGACCCATGCATCCCTCTGCrTCCTCTCAGCAGCCAATTGACCTTTTG 
GACTCTGCAAGTGGAGATGCCCTGGGGGTGGGGGGCACCCAGCAGCT 
TCCTCACCGGTAAACAGAACATTAGTCCTGGAAAAGGTGAACTGTGTTTAA 
TGGCCCTGAGGAGGATCCAAAAGGGCTTGGGTTATAATGGATGGGCTCTAATAAAA 

ATGAGTGAAAC CAAGAAT AAATCCAACATAATTG CAG T AATT ACACCCGGAGC CAAGCCTTGAGGGAG CAGAGCATT CACTGAGGT 
1 0 AAGGCAGGAGAGACATCTCAGGGTGTGGCATAGGCCTTTGCTGAACCT 

TGTTAAAATATAGATTCAATTCAACAGATGCAGGGTGGGCCAGATAATCTGTCCTAACTGCTCT 

GTAATTTCCCAACTGTACAGGAGGGGCTGAGAGCCCACCAGGCTCAGGGAGCTC 

AGACCCCTATTGGCTGCTCTCTCTGCCCCAGACTGAAGTCCCATGTGATCAC^ 

GCTTTTCATGCCCAGATCCCATTTATCACTCTAGTATATGGGAG 
1 5 CAGGAGAACTAGAGAGCCCTCCTGCTCTCTGTATCTGCCTGGAAGGGGTTTCTGGGGC 

TATGGGTCAGGGGCCAAGTGGGCCCCAACCATGTTTCCAGGATCGATC^ 

ACTCACCCT CCATG CCAGCAACAG ATGT CCTCAAGG TGCCCAGGACAG ACTGGT CT AGGG C AGC C CATGG ACCAGCCAGACT CAGA 
GCATGGAGCTGTGAGCCAGCCAGGGGCATAGTTCACCTATGGCAACTTTCGTGATGACACCTACT 
TGGCTCATTAAAGTTGAACITTGGAGTTAACT 
2 0 TAGAGATGGTGGGAACAGCTGTCATTGCrrCCCTGGGTAGAACAGATACGTTTC^ 

CACAGCCACTGCACGAAGCTCAGCCCCACGCTGCCTCAATGGATCAAAGATGCAAG^ 

GCACTGGGGGTTCCCTTAGAAAGGAGAAAACAGCAGTATCCAGGAGAGGCAGAAACATTCCAGCT 

GATGGGCTAAGGGAAGCAGGGATGTGGCTCAGTGGTTGA 

CACACACACACACTTCAAGTTCATGTCCCCAGTGACCTAACTTCCTTCCACTCC^ 

2 5 GTTTCTCTGGGTAGCCCTGGCTGTGCTGGAAGTTGTTCTGTAG 

TCCCCAATGCTGGGATTAAAGGCATCCCACCCACCACCCTTTAATAATACC!ATGATCTGGG 
AAGAGCACTGACTGCTCTTCCAGAGGTCCTGAGTTCAATTCCCAGCAACCACAT^ 
GCCCTCTTCTGGTGTGTCTGAAGACAGCTACAGTGTACTCACAAAAAGTAAATAAATAATCA 
TAATAATAATACAATGAGCAGGGAGGAAGCCTTTAGCACATGAGCCTTAA 

3 0 AGAGTGTTCAAAGCACrrGTAATTTCTCAATGTGCTGTTCTGTTCTTGGC 

TGGGGG CTGT AGAG C ATCCTGGGG AGGG CT AGCTG C C CTGGGT ATTGGGG ATG CTGTTTTCAGGG AC CT ATATTCTTTTG CT CTCT 
TCTCCCGGCCCCTCCCCCAATACAAATTGTAAGCX3ACGCACACATGCATGTTGACATGT 
GACAGGAGGTATCGGGTGCTGGAAACGGGACAGCTTGGAACGCTCTGGAGTCTCTTGT^ 
CAGGTGTCCTGCTGTAGATTTGCTGTGGCAGTGGTTAGCTCTTACT^ 

3 5 CACACAGAGCCAGAGAGCCCGGCACCTGAAAAGCAGTCACT 

TTATTAATGTTTACGATCCCATAGAAGCTGGTCTCTGTGAACCACAAGCCTAACACTTC^ 

GCCAAGTT CACTGGGTG CAG CTGGCGG CATGGG AG CTG AT AATGCT CT CG AAGAAAC C CAG ATGTCriTG 1TGCCTTTCAG AG CTG 
GG C CT CAT CACG CC CCTTT C C CGGTGTGAGTCTT C C CTT C CCTTCTTGTTG CTT AG CTG CGGT CGGCAGAG CCTTTT C ACACACAC 

4 0 AGTCTCAGTAGCTCCCTAACAAGGCTCCAACAGGCCTAGACGAATTCTGTTCTC^TACAGAAAAT 

CACTGACACCCAAGAACTCCAGGGATCCCAGTGTTAAGGCAGGACGCT^ 

TTCCTTGAAAGGGTCAGTGGGTGGATTTTTCTGGTAGCCTACAGACTTCATCAGACTCTTCAC^ 

CTTTCAAATCAGCACAAAGACTTACC^TGCCTACCGATGGTGCCACCTGCT 

GGTGTGGCCACCCTGGGGAC(TTrccrGCTCrTGACCATGGAAGACTCC^ 

4 5 CCACACTGTCTTTGGGGAGCACAGGGG^ 

GAATTAACCACTCTGAAATGAGAAGTAGCCATACTTTCATAGCACGCCTGC^CCATGCT 
TTCCCCTGACAGTGACAACTCAGACGAGTGAAGATGTAGGGAGTGTGAAGC 

CATCATGGTGT ATT CATT AAT ACTGAAAT ATC CAG TTTGTCTTGTCTG TGT CTG CATGTGTGTGTGTGTG TGTG C AC ACG TTTGTG 
TATGTGTGTGTGCAAGAGTGTGCGTGTGCCTGCTCGTGTGTGTGTGTCTT^ 

GCCATTCAGCAGTAGAGTGCTCrCCTAGCACACATACAGACTTGG^ 
GGCTGCTTGGAGCAGGCCCCTTTTCAGGCTCTGCCTGGGGCT 

TGACAGTGTTACCATTCCAGGGCTCGAGGAAACTGTGAGAGTCCAAAGTCrrCCGGTGTCACGA 

AGCCAGAGC CCAGCTTCCCAGAAACCTCCCAAACAATCAGCGA CTGGCAG CCAGCTTGGG 

5 5 GCTGGATGCCAGGCCTX5 AGTGG AATGX3ACCCAGG AGACTTCTG CCCCCGGTGGCCT ATG AGGTCAGCTCCTCTCCTCAG CAGATGC 

TACACAAAGTGTAGGGACAGGTAGCTAGATCACITGAGGCTAGCCAAGGCT 
CAGCCTGGGGCAGGTCCCCAGCACCTTGGCCTCCGAGGACGCCCAGTAC^ 
, TCAC!ACAAGGCACCTTATATGGCCATGTGGGGCAGCGAGTGACCTCTCTACATGCCGCCCT 
OkGGGAGCATGTGAGGACAGCCTCTCAGGTGGGAAGACAGTGGAGGAA 

6 0 TCTCAGGTGTAGGTGCCTAGTCTCTCATCITCCCCATAGGGGCTGAGGTTTCCTCTAGTCT 

CAACCAGTTAAACCCTGGCTGGGACGTTTCGAGCTGAAGGTAGTAAGACTCGAAGCCACATGTATGA^ 

GCCCGGCTCTCTGTCATTATGGGACAGTTGTCACGGATCTGCCACTGAAACCC 

CCCCTGTCTTGCTTCTTTAAGCACATTCCCAGGGCCTCTCT 

GAGCAGGGGTGGCATATGACTTGTCCACAACACAGACTCTCCGAGCCCACACCCACCT 

6 5 GTTTAAGAGGAAGAGCTTGTGGGGCTAGGGCTCAGTTGGTAGAGTGTTTGCTTAACAAACA 

CAGGTGTCCATGTGGAACACGTGTCTGTAATCCCAG CACG C AGGAGGCTG AGGCAGGAGGATTGGAAGT CTG TAGG AGTG ACGAT A 
ACrrAAGGACTCCTCGCCATTCTCTTGCTTCCCTGGGCACGACCn3AGCT^ 
CTTTCTGAGCCAGTTTTCCAAGCCACTCTCATCTCACCACGGAGACCCGAACT 
ACGTATTATTGGGATTTTTAGAATTCTTCAATGAGTTGTC^CAC^AA 

CACTCCGCGCAACAGGGCTTT CCGAGAAGG AAGGCTCACCT 
AGATCCTGATGTCTCCTCAACCCrGGAGATTGAAGCAGCCCGGGAACAGCCTGC^ 

ATCTACTGG CTTTGTTTTTT ATG CTG AAT CAGTTTTG AAACATTCTG CCTTGGTTTTTTT C CT ATT ACTTTGAT AAAACCCCACGA 
CCAAAAGCAACnTGGGGGAAGAAACGGTTTATTTCATTTACAACT 

7 5 TCAAGGCAGAGACTGAAGCAGAGGCCGTGGAGGGGCACTGCTTAGCGGCTTGTTCACGATAGCTCGTTTGCCTGCT 
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CCCCGGACCCCAGCCCAGTGGTGGTACCCTCTCACAGTGATCTGGGCTTTCCATAT^ 

GGGAGTAATTTTCrCAATTGATGTCCCCrCITCCCAAATGACTCT 

GGAAAAGAAAAGAAAAGACCCCCAGCTCACCTCTTTCTCTCrrCCCAGTAA 

5 MOUSE SEQUENCE - mRKA 

CGGGTTTCGGCGCGGTTGGCCTGCAGCGGGCGAGCCTGGGCTCCGAGCGGCGCGGT^ 

GCCGTCCCGGGTGGAGGACTACGAGGTGCTGCACAGCATCGGCACCGGCTCCTACGGCCGCTGTC 

ACGGCAAGATCCTGGTGTGGAAAGAGCTTGACTATGGCTCCATGACGGAGGTGGAGAAGC^ 

CTTCGGGAG CTG AAACAT C CAAACATCGTCCGTTACT ATGATCG CATT ATTGACCGAAC CAACACAACC CTG T ACAT CGTAATGGA 
1 0 ATACTGTGAGGGAGGGGACCTGGCTAGTGTCATTTCAAAGGGGACCAAGGA 
TC^TGACTCAGTTGACGCTGGCCCTGAAAGAGTGTCACAGA^ 

GCCAATGTCTTC CTGG ACAGCAAACACAATGTCAAG CTGGGGGACTTTGG ACT AG CT AGAAT ATT AAAT CACG ACACG AGTTTTGC 

AAAAACX3TTTGTTGGCACACCCTATTA<IATGTCrCCTGAACAGATC 

TGGCCrGCCrCCTGTATGAGCIGTGTGCACTA^ 
1 5 GGGAGGTTCAGGCGCATCCCCTACCGCTACTCTGATGGCTTGAATGAC 

AC CTCTCAGTGGAAGAAATTCTGGAG AGC CCTTTGAT AG CAG ACATGGTTG C AG AAG AGCAAAGGAGAAAT CTGGAG AGGAG AGGAC 

GGCGCTCAGGCGAGCCITCGAAGCTGCCGGACTCCAGCCCTGTGCT 

GAGCAAGCACTCAGAGCTCGGGAGGACATTCTGGAGCAGAAGGAACGTGA^ 

GGCCAGAGCCGAGAGCCTGATGAAGAACTACAGCCTGCTGAAGGAGCACAG^ 
2 0 TTCCATCCTCAGCCATGAAGAAGAAGGTTCATTTCCACGGGGAAAGCAAAGAGAAC^^ 

CrTGCCAAGTCCAAGTGCAGGGACCTGAAGAAGAGGCTTCATGCTG 

CT ACCAG CTAAAGAG CAGG CAGAT CCTGGG CATGCG CT AGG C CGGCAAGG (^TGGAGCTGGGTCAGTGTTGAT ACTGACAACC CAC 
TAGAGATTGGTATTCAGCTGCTGTCGTTTTGTC 

TGTGGTCTGCAAATGGATGTGTGTGTCTGCTTCCTAATGTCCCTGTGTGAAAAGCAAGCT 

2 5 AT C CTGTGTGTG ATT ACT ACTTGG AATATGAGATGGGG CACT AT AG ATACT AT AG AT CTT AGGAG AAAAT AATGTT AGGG AAAG AA 

T ATITAACCnXK3 AGGGCTCAGAGAGGCTCACACXXn^AGTCATAC CTT CCAAGTGAGGCCTCAGCGGGTCCA 

TGACCTCACTGCTAACGGGATGCGTATGGC1AGGGCCCACAGGGTTGCATGTCAG 
CCTCTC^CCTGACCTCCCTGATGAGAAGGCACTCACCAGGATTCCCATTCCAGGAAGG^ 
AAGTGGAGGGGAGGGGAGGCCGAGGCCrcAAGCTTTTAGGATTTGC 

3 0 CCTGCATCCTAGGCAGGCACTGGGGACAGGAGGCTCTGGGAGCCCTGAG 

GTCTGTATCTGACCTCTAGGTCAGTTGATGACATTAGTTAAGTGTTTCTGTA 
CAGAGTCACACATGTGGTCCCCAAACATTTAATTTCTGAAGAGAACTGTCTTC^ 
GGCAGGTTTCCTTTGCTCTCTGTCCACAGTCCCTTGAGATGGCTTCAGAAGAGAAC^ 
TATGCCCTCCACCTGTGTGAGGGCCACvArrGCTGTTTGTCTA 

3 5 C CTAACAAG T ACTGTTGGCTG AGAGGGG AGGTCAG AAGCATGT AATGC CATT CG TTTCATGGCT AAACACACAGACTCCTTG CATT 

TT AAAATT CTGTG CCTTT ACTTCCTGGAGCT TG TG ACTTG CCG AGGTTGG CTTTCG CAATGCTTT AT 

GTAAACTOTGCrGTTGGCACTTGCAGCrGTAGAAGCTCTCGGAGGGATTACT 

CTGCAATGCCTGGGATCTTTTGCAACTCrGACTTTTT 

GAAGGGCCTGACAATCTGCCAGCAGGGAAAAGACTGGGAACTCAGTGCAGTGGTTTT 

4 0 GTGATACTGACAGCAAACCCAAGTAAAATCTGTGTAAAAAAAAATCCTAAGTATGGGGTCCTAAATGTTCT 

ACTrAATTGGATTCCATTGTTTATAATTA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING 

4 5 ATGCCGTCCCGGGTGGAGGACTACGAGGTGCTGCACAGCA^ 

CX3ACGGCAAGATCCTG^TGTGGAAAGAGCTTGACTATGGCTCCA.TGACGGAGGTGGAGAAGCA 
TGCTTCGGGAGCTGAAACATCCAAACATCGTCCGTTACTATGATCGCATTATTGACCGAACCAACAC 
GAATACTGTGAGGGAGGGGACCTGGCTAGTGTCATTTCAAAGGGGACCIAAGGATAGA 
AGTGATGACTCAGTTGACGCTGGCCCTGAAAGAGTGTCACAGAAG^ 

5 0 CAGCCAATGTCTTCCTGGACAGCAAACACAATGTCAAGCTGGGGGACTTTGGACT 

CTTGGCCTGCCTCCTGTATGAGCTGTGTGCACTAATGCCTCCCTT^ 
AAGGGAGGTTCAGGCGCATCCCCTACCGCTACTCTGATGGCTTGAATGA 

CGACCTTCAGTGGAAGAAATTCTGGAGAGCCCTTTGATAGCAGACATGGTTGCAGAAGAGCAA^ 

5 5 ACGGCGCT CAGGCG AGCCTrCGAAGCTGCCGGACTCCAG CCCTGTGCTGAG CGAGCTCAAGTTG AAGGAAAGCCAACTG CAGG ATC 

GAGAGCAAGCACTCAGAGCTCGC^IAGGACATTCTGGAGCAG^ 
CTGGCCAGAGCCGAGAGCCTCATGAAGAACTACAGCCrrcCTGA^ 
TCTTCCATCCTCAGCCATGAAGAAGAAGGTTCATTTCCACGGGGA 
ACCTTGCCAAGTCCAAGTGCAGGGACCTGAAGAAGAGGCTTCATGCTGCCC 

6 0 AACTACCAGCTAAAGAGCAGGCAGATCCTGGGCATGCGCTAG 

HUMAN SEQUENCE - GENOMIC 

CGCCACACAGCCCTTAGCGTGTTGCACTGTTGCAGTGGACAAGCT 
AGATGCCAGGCCTCACAGCACAGCTATCACAACAGAACCTAAACTGGAA^ 

6 5 CATCTTTGTTGTTGTTGTTGTTGTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCT 

CAACCTCTCCCCTCTGGGTTCAAGCGATTCTCCTGCCTAA^ 
TAATTTTTGTATTTTTAGTACAGACGGGGTTTCACCATATTGGTCAGGCT 
AGCCTCCCAAAGTACCAGGATTACAGGTGTGAGCCACTGCGCCTGGCCAA 
AAGATCTTGAATCTCGAGTAGATCCCAGCTGGTCTTCAGTTTTTATCrc 

7 0 TGCCTCAAGTTTCTCATTGGTAAATTCCTATGGGCTGTTGTGAAACAGAATAAGATGATGCT^ 

CAGTAACCAGCTGTTCTAAGAACrAGAGGAACAGGCTGTAGCTT^ 
CAACAGCACTGAGACTGCTGTCCTGGTGGGTTCCACCGAAGGATTAACACT 
TGGACCGAATG CACACTCGAGTTTGTTGCAAGTACCTGAG AG CT 
ATGTCTTTCTGTTCTATCCCCTGAAAGTCAAAACAAAAGTTTCTTG 
7 5 TTGGACACCAGACTAGATTGAGGACTAGCTAAAACAGAGCCGGGGTGAAAGCAGCTTTGAA 
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GTCAATTTACTGTAGCCATGGCAACACCTGGGCATTACCACCCCTTTCCATGG 
CCCCTAGAAATCH'CTGCATAAACTGCCCCrTAATCTGCATGCAGTTAAAAG 
GCTACTCCCTGCCTAAGTTAGCCCTCCTCCGCAGGAGCAGTCATG^ 
C1X3GCTTGCCCTTGAATTCTTTCCTGGGCAAAGCCAGGAA 
5 TGACTTCCTTCTGAACCCTCCTCACACTAAGCAGATGCTCAGCCACCTC 
TGG<3CTGGGTTGCATTCAGATTGGTGTGATCA.CCACTITC^ 
GAAGGCTGTGAGTTGTGACATGTCCTATAAG^TACTATGTCATGAGCATG 
GGTGCCCTCCTATGTaSGGGTGGGAGGCCTCCTGGGCAGGATTCriTT 
GGGCAGGTCCCACAGCTGCTCTGGAAjGGCTGACCAjCKjCTGTCCA 
1 0 ACTTCTGCTGGCTtKSCTTC^CCCCACTCAACTTG 

TACAACTCTG CC CTT ACCTTATTTG CT CTTTCTGCAACATCTGG CATCATGGACCACTTCTG CATTCTGGAAACCATTTCTTCT AT 

GCTTGCACAGAATAACCTCCAGGGTTCCTTC 

TCTGTTATCACCTCCCAGCrCTCTGTCTCTGGCCTAGACCAATCTGTTGAGACCTT 
1 5 CTCATAGGCCTITGGGGCTCAACATGAACTCAGTATCAGCTT^ 

ACGCTACCATCCATCTGGTCACTCAAGCCAGGAATTTrc 

CCTTCCAGGCATTTCTGGACTCCCCTCTTCATTCCCACAGCCATrc 

TAGCTTCCCACGCTGTATTCCTATGCCCTGCATTCACTCTTTACCAGAGAAGTCT^ 

GCTCAAAGCTTCCACTGTGAGTCCATCACCTTCAGTAGACTTCCTCAACrrCTTG 
2 0 TTACAGGTAACGAACCACAGGTACCCTTTGGTCCACCAACCCAACCTGCAGCTCACAGCAGGCCT 

CCCTTCTCCTAATTCCAACrGCTCCTTTAGATGGGATTCAAGTGTCACCT 

AT CTTTATCAGAATCTGGGTC CTCCTCTTTG CATTTT AGGCAAAGCTCTTT CAG AG CAATT ATCATACT ACAT AG AGAGAGCTG AT 

TTACACAGCAGTTCCAGTGAAGTTTTGGTGAGAGGGGATGGAATCACA 

AGAGCCCAGAGAAGGGATGTTGAAAAGCTAATGCAGA^ 

2 5 CAGCCCAGTGTTCCAGTGAGGTGAAATGGATTCCCTCGGTCAC 

CCTGTAATCCTACCACTITGGGAGGCCAAGGCAGGAGGATCACCTGAGGTCAGG 
CCCCACCTCTACTAAAAATACAAAAATTAGCCGGGCATGGTGGTGGGTAACTGTAAT 
GATTGCTTGAACCCAGAAGACGGAGGCTGCAGTGAGCCAAGATGGCACCCCTGCACT 
ACCAACAAACACACAAAGAAGACAGAACTAGAAGAGCCTGGGAACC^ 

3 0 CCCGATCGAAAAGGTAGGCGTGTCCTCTAAAACCTCCrAAGTCCCTTAGAAACACATTATGGTCCT 

CTATAATAAAATACTTTAGACTGGGTAATTTATGAATAACAGAAACTTATCGCTTACA 
ACGTGCCAGCAGATTCAGTGTCTGATGAGAGCCCGTTTCTCATAGATGGCATGCTCTATGTGT 
ACATGGCAAACAAACTCCCTGGGGCCTTTTGTATTAATATAAGGGCACTAATCACTCCCCCAGAGGCT 
ACCTTGGGGGTTAGGTTTCAACATACGAATTTAGGGAGGTCACAAGCATTCAGAC^ 

3 5 TAACATTTGTCTTATCACATTTAGGTGATGAATATTCTTAGAGA 

AAAATTATGAATGTCTGTCTTTGTCTGAAAGTTTATCCTTATATTGAAAAGAAATGTTTTT 
ATTCAAAACTATTGCCTTTTAAAATGTGATGGAGCTCT™ 

CTCCAGCTGCCTGCCATTTAGCTGAAGCTGTCTCTCAGTCTACATCTTGTCCTGTTTATCT 

■ AGT CTG CCT AAC AC CT CATT CTTTGTTCT AGAACGTGGTT ATTT GTTGAGCAT AT AG T ATGTC ATG CATCTTTTTTTTAAAAG CAT 

4 0 (TTCTTAGCTTCrGATTGTTATAAACAAAGAATCTCTCClT 

ATCCCTGACTTCAAATTGCTTCAAGTAATTGTGGAGATAAGACT 
AATCTGGGGATTTCCTGTAGGGACAGGCGGAGTAGTGCTGCCGCGOlGCCGCCAGCT 
CTCTAGTGGGTCTTTAATGCAACATAAAAAAACGAAGCTGAGGGCAATGAAGTATGGC^ 
ACAAGTGCTGGGTTCCTGCTTTGTTCCAGGCAC^ 

4 5 ATACATTAGTTACTGGAGACTAAGAAGATGGAGAGGAGAACAAATTTGGGATGAAGATC^ 

CTTGGGATGCT CATT AATCTTCAAGTAG CTT CTTTTAAACAT^ 
TTTGGGAGGCCGAGGCGGGTGGTTCACTTGAGGTCAGGAGTTTGAGACCAGCCT 

AATACAAAAATTAGCCAGGCATGGTGGCCTG CACCTAT AGTG CCAACT ACTCGGGAGGCTGAGGCAGG AGAATCTGTTGAACTCGC 
GAGGCGGAGOTTGCAGTGAGCCGAGATGGCGCaVCTGC^CTCCAGCCTGGGTGACAGAGTGAGA 

5 0 AAAAAAGAAAAAGAAAATTAAAATTAAAATTAATCAGGCCAGGCGTGGTGGCTCACGCCT 

GAGG CTTG AG C C CGGG AGTT CAAG AT AAGC CT AGG CAACTT AGTG AG ACCTCATC C CT AC AAAAAGTT AAAACAATTGGCT AGGCA 
TGGTGATGTGCACCGGTGATCCCAGCTAACTCAGGAGGCTGAAGTGGGAGGATTGCT^ 

GCTGTGATCGTGCCACTGCACTCCAGCCTGGGCCATAGAGTGATACCCTGTCTCAAAAAATAATAATAATAGCAA 
ATCGACATGTACAGAGTGCCTGTGGGAGAAAATGCAAGTGCAAATACT 

5 5 ACAACATGAGCACAGTTAGCAAGAAAAGTGTGAAGACAGTGAATTA 

GGTTGGCCCGACTCTTATAGAGTTGCTCATCTTCrrTTCTCTGTTTA 
TCTTGCCACAATTTGTCACAGTGGAGTrTCTCTGTGTACTTAGGTTACATCCATCT 
ATTCACTGTGGTATCCTCAGCATCTGGCACCTGACTCAGCCTCTATTTATACTTGTTGGAGG^ 
GAGAAGATACATGCACATGAGAAGCTAGTAATATGGCCTAGTATAATTCTAAATGA^ 

6 0 AATTAGAAGATAGATCATAGTGGGTTGAAGTCAATCAGGAAAAATTTCAGAGAAGA 

GATGGAATATGAAGGAAAAAGGAAAGAAGTTATAGGGGAAAGGATGTTCCCn'GTAACCTGTCCCT 

CATCTCCCAGTGTCTCTTTCTATTCTGGTGTCTAGTGGTCTCTGCTGGGCTGTCCT 

CCAACCTTTAGGGTTTGTGATGTGAGTGATTCCTATC 

CCAGGCTGGAGTGCAGTGGCGCGATCrrCGGCTCACTGCAAGCTCCGCCTCCCG 

6 5 GTAACTGGGACTGCAGGCGCCTGCCACCACACCCAGCTAATTTTTTTGTAT^ 

GATGGTCTCGATCTCCTGACCTCATGATCCGCCCACCTTGGCCT 
CTrrm iTrrrn ^ 

TCCAACCTCTGCCTCCCAAGTTCAAGCGATTCTCCTGCCT 
TATTTTTGTATTTTTAGTAAAGGGAGGGTTTCACCATACTC^CCACACC 

7 0 ACCTTATTGGCCACGCTGGTCTTGAACTCCTGACCTCAGGTGAT^ 

AGACACCACACCCAGCCTCCTX3TCCTACTCTTTATTATTGT 

AATACCTATCATACCTGGGCCATAGCAAGTGCTCAATAAATACTTGTTAAATGAGCA 

TTGGGATGGCTCCTTTC CAGAT ATTTCGTCTCTGCCT AGAGCTGGGGCATTTGGCAATTGAGT A C 
TTGCATAATTATTAATAAGGGGCTTTCGCAGATTTTTAm 
7 5 GCTATAAGCAGTTCGGAGTTCAGCTACTGTGAGAGCGATGTAAACAAA 
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AACACTGTAGATGGAGGATTGACrrGTCCAAGTACGCCAGAGAGGCTTCTCAGA^ 
GGACACGTTATGTCCTCTGGGCAGACATGACTGGAGTGGAT^ 

G CAAAGATGGTCGTGGTGGTGAGGAACTGAGAAAGTGTG CGTTT AG AG CAAAAGCAGAAGTGGCCAGAGAGAAGG TG CAGGACTGG 
ATTTGGGGCAGGATAACCAAACGGCTCA' m 'T T ' l T T T T T I 1 1111 1 1 T C I '1 GAG ACGC AGTCTCACTCTGTTGCCT AGGCTGGAGT 
5 GCAGTGGTGTGATCTCCACTCACTGCAACCTCTGCCACCTGGGTTCAAGCG 
ACAGGTGCCTGCCATTGTGTCTAATTTTTGTATTTTTAGTAGAGACG 
CCTCGTAATCCACCCGCCTTGACCTCCCAACGTGCTGGGATTACAGGT 
GTTTCATGTTGCTATAATAGGATGCCACAGACrrGGACAATTTATAA^ 

T CCAAGATCAAGGGGT AG CATCTGG CAG CAGCCTTTTCTCTGGGTCAT AAAAT AGCAG AAGG C ATCACATGGT AAG AG AAACAG CA 
1 0 AGGGAAAGCCAAACTCATTTTTATAACCAACCGACTCTCAAGATAACCCACTCCrGTC 
CCCTCATAACCCAATCAATTTAAAGGTCTCACCTCITAACACTTTTG 

T ACATTCAAACCACAGCIAAGCT CTATGGAT ACCAATCTTG CT AATT AACACAGCCACTT 
GC^GTG CCTT ACZAT AACITTTG AAACCTAAATTGGTCCT C CCTTGAATGTT AATTT AAAAATT C 

TCCTGGTTGGCAATTCCAATTATCTTATGTTATATTCAGGGCATGTAAT 
1 5 TATTAATAGAAAATCAGGTGTCTTTTAATCCTTTGCTGAACAGATATTTA 

AGCTTTCTGGTAATTTACATATTCCTCTAACAAGCCCTGGATGGAGGCCTCT 
ATTTGGCTGGGTAAGCTGAGAAC<3TTGGTGTGTGGGTGC^ 

GTGTATTTGGCACACATAAACGGGACAAGACGTGTAAAGTACTTGGTAGGAGAGCGAC^ 

G AGACTGGGTGTCACTCTGTCGC C CAGG CTGGAGTGCAGTGGCGCACGG CTCACTGT AACCT CTGCCTC CCAGGTTC^ CAATT C 
2 0 CTCATGCCTCAGCCTCC CAAGAAG CTGGG ATT AC AGG CCTGCAACCCCATGCTCTG TT ACTTTTTTTTTT^ 

TAGAGACGG<3GTTTCX5CCATGTTGGCCAGGCT<K3TCr^ 

AGGATTATAGGCGTGAGCCACAGCGCCCGGCCTAATTTTCGTATTTTCAGTAGAGAC 

GAACTCCTGGGCTCAAGCGACCCACCCACCraX5CTTCCaVAAGTGCT 

GAGTCTGTATTTCTAATTTGTATTCAGCTCTCAGCATATCCTCAACGTGACCCC^ 

2 5 GAAGGCGTGTGCCGGGGAGAGTAGGTAGTGTGAGGGGCCAGAGTGCAAAGGTCAGAAGGCATCTGGTGGAAGCGGTC 

GCCTGAGGGGAGGTCCTTAGATCTCGGTTACCTTGGCGATCTCTATCAGAGGGTTAG^ 
CTCTCTCCCGCCCTCCGTTTGGCTTAGCTCGCCGGGTTCTCCCACAGGTTA 
TTCCCAAAATGCCTAGGGTATTGTTGTGTTCTTCTTAGCCAATGAGAGAGGrc 
TCAGGGCCCGCGACGGTTAAACGGGGCCCAAGGCAGGGGTGGCGGGTCAGTGCTGCTCGGGGG 

3 0 TTCCTGGTCCCTGGAGCTCCGCACTTGGCG^CGCAACCT CCTTC CCGG 

GCTGAGGACTATGAAGTGTTGTACACCATTGGCACAGGCTCCTAC^ 
GAGCGTGGGACCTACCCTGCGCCTGCCCCrrGCTTAGGTTTCGTCGCCTCGGGGCGAGCG 

AGACCGLUTC T T T1 X5ATATATGTATTTCCATCTTTTCCCGT^ 

3 5 CCTTCCATTGGTGACAGGTAACTAATTCCTGAGCTGTTTTTAGAGCTCTAAGA^ 

TACCAGAACCCGAAGCTTTGTATCTGCAGTCAGACTCCCTCAAAATATCCATTATT 
TATAGGTAGAACTTTTACAGAAAGTAACATTGGACAGGAAACTGCCTGGCTGCTG 
TGGGAAAACACGACTGGGGTGGAATGGGGCAGGGACTTTGACCACCTTCACCCC^ 
CCACTAACCAACTGAGGGTAGGGGTTGGGGGAGTCCTCATGTGCCTCATTCCCGTTTGTCTCCCC^ 

4 0 GAACTGGAGGAAGATTATGGAAGTAAACTCACAGCTCCTTAAGTTCATTGAACTAAA 

ATTAAAAGGGAGGTAATTTGTTAAATCTTGGAACTTTCATTCTT1XSTTTTTGA 

CirCGTGAACTGAAACATCCAAACATCGTTC^TTACTATGATOK^ 
ATATTGTGAAGGAGGGGATCTGGCTAGTGTAATTACAAAGGGAACCAAGGAAAGGTAAG 

4 5 ATTGGTCTAGTTTCATTTATTTTAAGCATCTTGTT 

AGTACTTTAGAAAATTCCTGGTGAATCAACACCCATATTTTATATCAAAATAACAAT^ 
GTAATCCCAACA.CTTTGGGAGGCCGATGCGGGAGGATCACTTGAGCCCAGGATT^ 
TGTTTCTACAAAAATTAGCCAGGTATAGTGACATGTGCCTGTAGTCCAAGCTACrrTGGGAGG 
CTGGGAGGTCGAGGCTGCAGTGAGCCATGATCGTGCCACTGCATTCCAGCCTGGGTGACAGAGTAAGACCCT 

5 0 AAATAAATAAATAAATAAATTCCCTATTTGTACACTGGATAATTTATATCATCCCATCT^ 

GCTGTATAACTGCTTTGGTTTTAGGCAATACTTAGATGAAGAGTTT^ 
AATGCCACAGACGAAGTGATGGTGGTCATACCGTATTGCATCGGGAT 
GTCAAGCTTGGAGACTTTGGGCTAGCrAGAATATTAAACCATGACACGAGT^ 
GTCTCCTGTAAGTATTTCAGAATTTAGCTACTTTTTAAAAC^^ 

5 5 CTCGCTCTCTTGCCCAGGCTGGAGTGCAGTGGTGCGATCTGGGCTCACTGCAATCTCTGCCTCC 

ATCAGCCTCCTAAGTAGAGTAGCTGGGACTACAGGTGCATGTCACTATGCCTGGCTAATTTTTGTATGTT^ 
TCATCATGTTGACCAGGCTGGTCTCGAACTCCTGACCTCAAGCGATCTGCTCA 
TGAGCCACCATGCCCAGCCAAAACATCTITTTAATGTTAAGCATTCTGAAGCAGGGA 
TAGTAGCCAAGAATTGAAAATACTGTTATATTTTTCCACATGGACCACCAGACCAATA 

6 0 TGAGTTATGTGCATTAATGTAAGTATGATGAAGACAGATTCCTAGTCTAAGGTTTGCGCCGTTTGTCATCT 

TTTTTGAGCATTCATGCAAATTAAAGCATATTCAGTTCAAGCTTTTAT 
ATGAAATGGATTTTGTCCTTGCCCTCAAGAGTTGTTTCCCAATGTGTAGGTCCT 
GTTTAAAATGCAGAGTTCTGGGTCCCAAACCCAAACCCAAATCCATCGAGTGAATTTCGA 
T rx T 1TriTTTTTTj ^^ 

6 5 AGCTCAGGAAGTTTGCTCACTTCAGATAGCATCTTTT^ 

AATATCTTGCTGAGTCCACTCACCAATTACCTCTATTGCTGGATATTTTGTT^ 
CTGCATATCTTTGTATGAATTATTCTTTCTTCCTTTTTG 

TGTTTATGGCTGTTATTAGATATTGTCTGATGGCTTTTAAAAAGTTTAAGTCAATTGT 
TCTGAACAGATAAAGATGAAGAAATATTTGTTAGTATTTTAGTAGTTCACCTTTCT 

7 0 AACAGATCTTACTATAGTATAAAACTTTGTAAATTAATGATTTTGGTTTTGACTAAA 

GTATCAAATTCCAAATAGAATGTTITTAACATAAAGGACAATTATCCAGAAGT 
TCCITTTCCCTCCTGCCCTTGTAATTAGAGTGGTTCAGCTTTCCT 
GGACTTTAAAACACTAACCAGCCACTAGAGATTGCTTCTATTCCCTCTGTTC 
GATGTGGAAGAAAGG CAAACAAT AAAAT AGGTAAGGATAAT ATT AGTT^ 
7 5 AATGGAT AATT CAATT AGCTATTTATGTCTATATCCATTTAAATGTGGACTAGATACATAT^ 
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GTTCTTTTAAAATGGTGTTTCTITAATTTGC 
TAAGCACTTTGATTTGTGAATC^AAAATGAGTGAATGAATC 
TTATAAATTTCAAGATAAACTATAAGCTAAAAACITTGAGGTTCTGTTTTC 
AAAACAAAATAAACTCTTAACACCTTCCACTTTCTTACCC^TTT^ 
5 AATCAGATATCTGGTCATTGGGCTGCTTGCTGTATGAGTTATGTGCATTAATGTAAG 
TTTGTACTGTTTCTCACATTTCACTGGGTTCAGTTTTTTGAGCCT 
CCTGGGTAAAATTAAGGTAGATAGATGAACAGGCTAAAAGAAACTATT^ 

CTGTTCAAGGGTCTTGGCTGGTTAGGTGCTTTCTTATTAGATTC CACATAGGTCAGG CAGGTAAGG 

TAAAGCTACTGTAAGG ATCTGGAAGAAATGAAACAAGACTCGGGTAG AGCT CAGGGAAGAG CT CCAATT C 

1 0 TTTTCCTTTGTT CT T CTTTTCAT TTT CT TT CCCTTTG AAACT ATTCTGT AG AG CAG TGATTCTC AACT AGGG AT AAGGG AAG CAAA 

TTTGCCTCITATGGGGCATTTGATGATGTTTGGATAAATT^ 

GGGCAGGG AGG CTACAAAAT ACCT ATAATG CACAGG ACAG CCCCCCACAACAAAGAATTAT ATG AC CTCAAATGT CAAT AGTG CCA 

AGGTTGGGAAACTT ACTT ATAAGAAGAAAATC CCAG AAGT AGJtf^CTTAAAT ACAAACTGG AGCTTAAG CTGTGCTTTTG ATTT AG 

TTCCTATTGTATATTTTCATTTTTACAATTTGAAAATC 
1 5 CTTTTAGCCAGAAAGAACTCGCTGGGAAAATCAGAGAAGGCAAATTCAGGCGAAT^ 

ATTATTACGAGGATGTTAAACTTAAAGGTAAAGATCATAAGATGATATATTGTCn^ 

ACATTCTCTCTTGTAGATCAATGGAGACTGAAGAAACGTTAAATA^ 

CATATGTAAATTTC^CAAATTGCACATCCTTGCTC^CCCACCTTCTG 

GATGGTCGGTAGCAAGGAAAGATTGCAAAAAGAATTCAAATTCTCGTTCCCCTTGT 
2 0 GGGAAGGGGCCAGAAATAAGAGTTTATTACTCrcCCTCTAGCTCTAATTGAGAGA 

AGGGCX3CACGGAGGACAGCTAAAAACAGAGGGAAAATCAATAAA 

ATGTTCTGGGGACCAGGCTGACATAAGAGAAGAAGGAAAA 

GAGTCAGTGAGAGAGTATCTTGCTAAATGTAGTTTAGAAATGCAAGGGCAG 

AGCCCCAGCACAAGATGTGCCAATCTTCAGGGATTGAACCTTCTT 

2 5 ATCTGCATTACCTTGTGGACCCAGAACCTCATGAGTTAAAACCACCAGT<1AAC^ 

AGGAGTCCTTTCTTTTXSTCTITGGAAACACATTTTTGTCCT^ 
GACTATAATACTGAGTATTTTTTCCCCAGGATTACCATCX3ACCTTCTGTTGAAGA 
GTTGCAGACGAGCAAAGAAGAAATCTTGAGAGAAGAGGGCGACAATTAGGAG^ 
GAGTGAGCTGAAACTGAAC-GAAATTCAGCTACAGGAGaiAGAGCGAGCT 

3 0 AATGGGGGTCAAAGCAAAGGAGAACTACCAAGTGGGrTATTTTGATGTGTCT 

AGATCTTATTAGAAGCITTTTGAAAAATACTTGAGAAACCAAAAT 
CATX3TTGGACATCTTTCTCTGAGATCAGTTGCTAGTTATATATTATTGGGCC^ 
TTTTACrcrAGAGTTACTTTTTTCTTTTTCGTCT 
CTGCTTTCCATAATAAAAAACCATGACTTTTGTCTTTTGAAA 

3 5 AGATt^GATATGTTCAGGCAATTClTTTCTTTTTTTTTAA C^ ^ 

CTGGTCTTGAACTCCTGAGCTCAAGTGATCCTCCCATCTTACCTCCCAA^ 
CTX3TTCAGGCAGTTCAATAGAGAATCTAGATACAAGATATTTTAAGTCT 
GAGTATTAACTGATTAGATGCTAATTTAGTTTTGATATTGAAAGTGTGTCTACT 
GGGTTATGCAAGGTAGCTACAGAGTGAGGTTCTTCATTTTAACATCAGGAG 

4 0 TCATGACTCACCTGATTTTACrTCTGCAGCTTTATGCCCAATCCA 

CTTACTG T AT AGGTACAAC CT AT ACTT ACTTCXIAAAC CTCTGT AAAAAAT AAGAAAGCTTTTT CTT AT AA TGTTTC 

ATAAriTC " rrri'C w l TT TTTTTTT TT GAGATGGAATTTCAC^ 

CAACCTCTACCTCCCGGGTTCAAGCAATTCTCC!TGCCTCAGCCT 

T AATTTCTGACTTTTT AGT AGAGATGCGGTTTTG CC ATGTTGAC CTGG CTGGTCTTGAACT C CTG ACAT CAGGTG AT CCACCTG C C 

4 5 TCC^CCTCCCAAAGTGTTC^X^TTACAGGCGTGAGCCACTGTGACCAGCCATAA 

TGAATTTGGGG CCATGAAT AAAT AG T AAC ATTTGTT ATT AAT ACT AAT CCT ATG AACT AC CAC CT AACTG AATTGTGTTTTTTTT C 
1'l Tl VriTGGGTGTGGTCATTCAACCAGAAGAAGGAGATTTTTGTACTGTGAGGCC^ 

ATCTTCAGTGAAG AG CAGAGATAGCTTTAGTTTTTTGTTTTCCCAAATGAGTTACT CCCACAAAAGATGACCAAAGGTTGAAGCT A 
ACTTTGTATTTGTTTAAACIAAGTCCTTTCAAGAAAATCTA^ 

5 0 TCAGGTTGACTGAGGCATAGTGAGAATCCTTTCCCCAAACATT 

TGTAAATTGTTTTTTAGTCAAGGTCAGTTGTTAATTTTTTTT^ 

GTTCGTGAGAGACTAGCAGAGGACAAACTGGCTAGAGCAGAAAATCTGTTGAAGAACTACAGCT 
GTCTCTGKjCAAGTAATCCAGGTATGAGAATCAACTTGGTCAACAGAAGCTGGTGC^ 

CCCAAAGTG G CTT AGGTAG CC CTTTTCATTT ACAAAT CTCAAATTTT AAGATGG ATTT CATTGAATATA CATTTCAATGGAAAC 

5 5 AAAATTCTGTTATAGCAGTGATTTATTTCCTGGGTTGAT 

CTTATCAAACTACATGTTTAGTAATTGATACTTATTTTTATTATCATTATTTAriT 

TGTAGCCCAGGCTGG AGTGCAGTGG CACyUlTCTCGGCTCACTGOtAGCTCTGCCTCCCGGGTTCACG CCATTCTTCTGCCTCAGCC 
TCCCAAGTAGCTGGGACTACAGACACCCGCCACCAAGCCCC^CTAATTTTTGTATTTTTAGTAGAGA 
CCAGGATCGTCTCGATTTCCTGACCTCCTG ATCCGCCTX3 CCTCGGCCT 
60 CAG CCGAT ACTT ATTTTTAAAAATT AGGG CATTTACAAAACTATTG T ATGTTTCAG ATTT CAG AG AG ATAAACTG CATTT CAAGT A 

AAAATAAACATTTTAATATTGCCATGCCTCATTTTTC^ 

GTCACGTTTGGAT CTTTG CT AACAC CCTTTCAAG TTG AAAG AC AT ATTTG TTG AAATTTGATTGTTT AAT ATCT ACAT ATGTT CTC 
TGCATTTCCTTCTTTATTTAAAGTCTGGTATATAGATATTCAGCTTTCTCA 

TGC CGCACTG AG ATGTGGTT ATCG T AT AT ACCAGTGAAC C ATTGTCT AACATT CT CCATTTT AT AGAAACCCTCAAT CT AATTT AA 

6 5 TCGACTCTTTAAGATTATGTAAAATGCAAACTGTGAAACATCTTITTC 

GTTTCAAAAATCGAGCATCTGAATTTTCIAATAAAATTCAGAGTT^ 

CTATTATTCCCATCTTTAATAAATTGAGATGTAGT^ 

AAC CCT ACTT* AAGTT AG ATTGATGATCACAATAACTTITGT AGTG AAT A^ 

G AAACAAGGATT AT ATG AAACTTTGG CTTCAT CT CTG TT AAAATACCTTCTTTTT TCCC AAT ATTX5CAGTCTACCATTTCAAT ATT 

7 0 ATAAAAATATTTATTCTGGCTTTTCATAGATTCCACACT^TCATCCATAAAGACAA 

AATGAGGAGAAAATTTTACCTGGATCAATTTCATG(^TCAAACCCA 
CGGTTAAAGAGTTTATTTACATTTGCCCTGATAGTCTTTGAGCAAAAG^ 

TAGGATACACAGATAAGTATGGGAATCTACTTATATGTGAATCATGGCAAATATTTAAAATGAGCCAT 

GGGGCAAGCAGACATTTTAGGGTATTGAGTG CTG AAAGT AGTG AACACATC AG AG CT AT AAG AT AAG CATCAT AGCAGTGGGGAG C 
7 5 GGGTACCATTCTGAACTCAGGACAGTTGGAAATATTCTCTCCCCATCCTTGC^ 
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GTATATTGAAATCATCACAGCAAGCAGCA^ 
AATGGAAGAATTTTAAAAATGTTTTTAAAACACTTT Crrrcri 'A 
AGTCCCAGCTACTCTGGGGCTCAGGCAGGAAGATCACTTAAG^ 
TAGCCACTGCACTCCAGCCTGGAAAACATGGCAAGACCCCATCTCAGACAAAA^ 
5 ACAATGTTGACTTACAAATGTAAAGGTATTTGTGGTAATAAGAAG^ 
TCCAGTTTGCTTTGAAGCAGAAGCGACGTATGAAGCGGGAA^ 
CTATI T C I TCTGCCTGGAGCACTCTTCATCCA^ 

CACCCAGCACCCAGCACCCAGCATGTCCCTGTTTCCAGAGCTATTTTTCCT 

TACTTGTTTGTTTGTGTTTGGTTTATCTCCTCCTACTAGAATAGGGCCT^ 
1 0 AAAT ATTCGAACAT ATTT C CTTT AACACRAAAAAT AAAAGGATTCT AG AATCTGT AAG C CTCCAGCCC CCAG T AATTCTGTCAG CA 

GTTAATAGTGTTGCCACTTTAAGTAAATATGATTATTATTTATTTTTTCCATGTA 

TTGATTTACCTTTTCCACACTGAATTTGTTTAGATT^ 

TGAACATTTAACCAAAACTGTAGGTGTTCTCTATTTAAAAAAAAAAAA 

TCTGTAGGTAAGGTGTTCTGCATAGAATGGCACTTATTTTCAAGGACA 
1 5 CCATAATACTCAGTGTGTTTAAGAGTAAAATTGATTGTTGGTTGTTAAGA^ 

CAATATTATTTTTTCAATGCTGTGGATTAAAAGGCTGATTTCT 

CAGGCACCTAGATTTCAAATGTGAGCCATAAGAGCAAATACTTTTTGATAA^ 

ACTCAGAGCCCTTTTTCTTTTTCCTTCTAAAGAACTT 

AAAGTAAAGAGAACATCATGAGGAGTGAGAATTCTGAGAGTCAGCTCA 
2 0 CACGCTGCCCAGCTGCGGGCTCAAGCCCTGTCAGATATTGAGAAAAATTACCAACT 

GCCAGGTAGAGAGACACAGAGCTGTGTACAGGATGTAATATTACCAACCTTTAA^ 

CTTGGTTCCATGAGCCATGCCTTTCTGTATAGTACACATGATATTTCGGAATTG^ 

ATGTTCACATTTAATTTTTCTTTCTTCTTTTA^ 

ATT ACT AG T AGGAACATG AG ATGTGACATTCTAAATCTTGGG AGAAAAAAT AATG TT AGG AAAAAAAT ATTT ATGCAGGAAG AGT A 

2 5 GCACTCACTGAATAGTTTTAAATGACTGAGTGGTATGCTITACA^ 

GTTTTTGAGCTTAGAAAACCCAGTTAGATGCAATTTGGTCATTAATA 

TTCAAATCTGTTAGCTTTGTGAAAATTCATCACTGTGATGTTTGTATTCTTT^ 

TCATTTACCTACTTCTTTCCCACTAAATAAAAGAATTCTTCA^ 

GATGGGAGTTGTGTTTATGATTTGT AGGTCTTCATTGTAGG G TACAGAAGGAAACATGG ATCTG ATG CTCAAGGGAG TGGCATTCC 

3 0 AGTCCTAAACTGTATGGCGCTTTACCTCCCATCTCATTAATCCATGTATGTA 

CAAGAGACGTTTGGCAATGTCTGGGGGCATTTTTGATTGT 

GGG ACACTGT AAACATTTT ACAATG CACAGG AC AG CTGCTGACAAC^ C CTG AATGAT ACA 

ATCTCAAATGCGAAAATCCTCAAAGACCAAAATCCCTAATGTCTTAAATCCCIAAAAGA 

AATCACAATCCTG AAAGATTAAAATCCCAGATGTTAAAATCCTGAAAG CCGAATTTGAGGTTAGGGATT AGTGC CGTTTCAGCTGT 

3 5 ATACAGGATAATTGCATCATGTTAGGCAGAA<2TATTACCTTGTTA 

ATGGATATCGAGTTGATAAGGGGTGAACTTGTTGATTTATGTGTCAACrrTGACT 

ATT ATTTTAG ATGTGT CTGTGAGGGTGTTTCCAGAGG AGGTT AATATGTG AGTCT CAGT AGACT AGGTGGGGAATATCTTTCCTCA 

ATGTTTAGTGTTTTGAAGATCATAAAAGCGACTTTTCAGGTGAAAAGTACAAGAAATCT 

AAATTCTGCrCCTTCATGAATAGTGCCACGCTTGCCTTAAAAAAATGCCC^ 

4 0 TTGTGAACCAAAGACACTTGCTGATACTCAGGTTCCT 

CTGTCGAAGAAGATAGACTGCrTATATTTACCACrAAATCTAACAGAAAAACTATCGCATC 

CTTT CAAAACT ATCCCT ACTG GTTTTT AT CAACT AT ACATG ATTCATG CC CCTGTTGGTTCTGAAG ATT CTAG AACTT AT CT AGTC 
ATTT AAACZAATTATTTG AAG ATTTGCTG(^CTTTGCAGAAGAAAAT AG CTTTCG ATTGAATCCCCAAAC CAT AACAACAGATTTGG 
AATT AGGTG TG AT CAAAG CTT CT AAAAGTG AATTTCAAAGTGTT AC CAAT AAAGTTTGTTT 1 ITT T C CATTC AGC C C AATG CATTT 

4 5 GATAGAAAATTAGATGAGTAGATAGGCCAGGAGATACAGCAATGACAAAAAC^ 

AGAGCCCTTTGCTCCTCCCTCAGAATGAATGGAGATCAAAAATCAGATGCTT^^ 

CTCCCGGATTGTTAGGGTGCTGACTGAGGATGAGATGGGGCACCOVGAGACAGGAGATGCTATCGCCCAGCT 
AGTACAATGCCATCGGAGGCAAGTTATCACCGGGGTTTGACGGTGCTC 

CTGATAGTTTCCAGCGGGCCCTG ATGGTGGGTTGGTGTGTGGAACTG CTGCAAG CTTTCTTCCTCGTGGTAG ATG ACATCACGGAT 

5 0 TCATCCCTCACCCACTGGGGACAGATCTGCTGGTATCAGAAGCTGGGCGTGGG 

AGCATGTATCTACTGCCTGCTGAAGCTCTGTTGTTGGGAGCAGCCCTATTACCTGAACCT 
ATCAGACTGAGATTGGGCAGACCCTGGACCCCATCACAGCACCCCC 
CAAATTGTCAAGTACIAAGACAGTTTTCTACTCTTTCTACCTTCCTAT 
GGCACACACCAATGCCAAGAAGATCCTGCTGGAGATGGGAGAGTTCTTTAAC^ 

5 5 CCACTGTGACTGACAAAGTTGGCACTGCCATCTAGGACAACAAATGCA 

AAACAGTACCAGATCCTGAAGGAGAATTAGGGGCAGAAGGAGGCCAAGAAGGTGGTCCAGG 

TCTGCTGG CCGTGTTCTTGCAGTATGAGGATG ACAGTTACAG CCACAGC CACATTATGGGTCTCATTGAACAGT ATGCAATGC CCC 

TGCTCCCAGCCATCTTTCTGGGGCTGGCGTGCAAAATCTACAAGCGGAAAAAGTGACCTAGAGACTGCAAGG 

CTCTCAATAAATTACTGTGTAACCTTGAAAACAAAACAAAAAACTTCAGTTTAAA 

6 0 GGAGGGAGGATCTCTTGAACTTAGAAATTTCATAACAGCCrGGGCAATG 



G CAGTG AG CTGTGATG AAG CCACTGT ACTCCAG CCTGGG AG ACAG AG AG AG AC C CTGTCTCTTAAAGG GCAGGTGGGTGG GGGGTG 

GGGGAAGAGTGTTTCCTTGTGTACAGTATGTATTAAGAGATGGGGAAAAAATGTATCATTCGCCTGCCTTG 

GATGACATTCTAGAAACTTTTAATGAATTAAAGCCACATTTGCCTGAA 

6 5 TGTACATGGTAGGATAAGAAAATACACAATGGTGTTGCTGTTTGATCACCAGTACTGT^ 

CATGCAGAACAGATTTCTGAGTACCCAAAACAACACAGAAGCATGACACAGAAAAT 

GTGT AT ATGG AAT CAT AG AAG AATG CCAGGAAG AACAG CAC CATGT AG AAAATGAATG TG AAC AT ATT CT C CAAGT AG AGCCATG C 
CCT AAAAG AGAAAAAGC AG CT ATT C CT C ATG ATGCAAGACTTT AAAAT AT AGTT AATG ATTG TGGAGG TTGG C C AG CT CTT ATGG A 
CTATCTCCATGCAATTG CCCAT AATCCAGTCCTGTAATACACTTITTTGT ATGTCAAATT AT CTTTTTTTITTCTGTTTT ATGTTT 

7 0 TTTTTTCACT ATTT AAAATTGTTGGCATTC M rri"I lT AC AATTTG CT ATG CT AT AT AATTC AT CTTCAAAT CATTTCTG ATACTGGA 

GGCATAATTTGTGTAGAGACTTAGAAAGTTCTGTTTTT^ 

GTAAGCATTGTTCCTGTACATAAAAACATTGAAGCATCCTCAGTAAGTGAAGAGGTGTCC 

GT AAAATG TTTG AACAT CTTGG CT CTTTAGGTGG CTG CAT ATGCAG T AG TG AC C CATCCTGTTTTTGAT AGATCTC ATCAAAAGAA 
TCTTAC^TTGTTATGGTATTTCAGATGACCAAAGGCATAAAACTGGGTGCACAC^ 
7 5 TCCCTTCTTGACCTA TTTCTTTI 1111 111 1 IC dl 1111 ITlTTrTTTTTTTTTTTTTTTTTGAGATAGAGTCTCACTTTGTTGC 
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CCAGGCTGGAATGCA&TGGTGTGATCACAGCT 

GATGGAGTCTCACTGTTTTGCCCAGGATGGT CTTG AACT CATGGGCT CAAG CAATCCTCCCACCTTGGCTGGGATT ACAGGT ATG A 
GCCATTGCGCCCAGCCCCTTTATGACCTA' X " rr C' l ' I rATGAAtATGGCTTGTCTGTTCATTACTGATATAATTATGCAACTTTCATT 
AGT AT AT ATGAATGTTT ATGCTTG CACAAAAATGTATGTT ATT ATTG CCT CAT AAAGTGACCT ATG AAGTGTT CTGT 
5 GTTTTCATATGTTTCrCAAATAAATTCCCTTTAAAAATGTA 
G AATTTTT AAATT AT ATTTTCCGG AATT ACATTTTTGG^ 

TTTTGACCTTTTGGGACTTCAGCATTATGACATT CGGGATT ATGGCATT AAAAATTG T ATCTC CTGGGACTATGATCGGCTCCCAA 

TGTCAAT AGTG CCACTTTTGAGAAACCTGATCT AAG CT CAAACTTG TTGGGACATAT CTCAG C CTTG AAGGAAAGTT ATTTTGGAT 

TTTGATGGATATTTAAATATGTATATATCTGACTTTTA 
1 0 TGGCAATGCTAGTCCTTTTAGCTTAATGAAATAAGACCAAGTTTT 

ATGT AGGATT ATT ACACIAG CK^CAG CAT AGTTC CATCi^^ CAGAAACTGATT ACCAGAT 

CTTT AATCCT AA(IAGAATCACCTTG ACATGAATGTTG AGT AT AACT CAG CT AT AAAATTTG CAAGACCT AGTGCAAAATG 

GTAATGCGGGACTCTTGTTCGAAAAGCAGGAAACAAGTACCCTG 

TCTCTCTCTTTCTCTGCTTATTGTGATGTTATTTGCTATT^ 
1 5 AATTTT ACCATTT ATGTCGTGCAATG CAGATTTT AAATGCAAAT AG CAG AGCATTCAG CTGAGAT AGG^ 

ATATTTTGTGACTCTTCTCTGAGGGTGCCTGAAGTTGACCAG 

CTCTGACACCTGCTCAGTTCCCCTTTCCTCCAGT 

TTTGCACAGGCCAACGGCCCCAGCCCIATGGCAGT^^ 

CCAGGGTACCTCGATGGCCCTTGTCAAGACATC^CCACCAG^ 
2 0 ACAGCTGCTGCCATGACCnXSCCTCGGGATGCATTGGAGCCTAGAT 

CTGGTCATGGGAAGGGGAGGTTGGGTGGGGCCCAGTGCACTAGAGG^ 

CTGATGGGAGGTGGGATCATGCATGAGGGGAGGCCCTGAGCC^ 

GCIAAATTCAAAGACACAAGTAGTAGCCAGGCGTGGTGGCTCGTGCT^ 

ACCTCAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 

2 5 GGTGGTGGACGCCTGTAATCTCAGCrACTCAGGAGGCTGAGGAAGGAGAATCT 

AAGATCACACC^CTGCACCCCAGCCTGGTTGACAGAGTGAGACrrCCGTCT 
CAAGTATTAAGAATTTCAAGACITCAGCCTCAGAGCACTAGACTC 

GGGGTCCTGT ATGGCTGTGCAGGTT ACTCGG CTGTG AAACCACCACTTT CTCT AGG AAACT C CTTT CT ACCTTAG AAAAT AAT ACG 
TTTTCTTCTCACTTTGAAACCTGAGGCATGCTGGACAACrr 

3 0 GCTTAATTCAACCAACAAGCAGTTATTGAGCACCCACT 

ATTGAAATCCCTGCCCTCACTGCCCCCTTOU^TGGGGCGTGACAGTCAGCAACTAATCA^ 

TT AAGGAG CAGAG CAAAGTGGG AAAG AGG AG CAG G AGTC^GGG CTTG CAATTT CAGAAG AGGTGGTGATG AGGGGTG CAGCAGGAA 

GAGGAGGAGACTAACCATAGCAGGTTACAGTGGAGGTGGGGGTGGAGGGGAAAAGCAAAGGGA^ 

AGAGAGGAAAGACAGCAGACCAGTGGTGGGCTAGGGGGCTGGGGAGGGGAGCGGGG^ 

3 5 CAGG ACTTGG AAACTGAGG AGACZATTTT AAAT CTGAT CATGAAT CC C CT CTC CTTGCT CTT AAGCT AACAAAAC CTTGTTTCCAGT 

TAGTCGCTGAAGCTGGGAACGTCTGGGGTATTTGAGCAGTCATGGT^ 
GGTTCCTTCCAAACGATGTTTCTCCAGCATCGCACT 

TAGACAG TTT ACATGTTT AAAT AAT AATG AG TT C AACC CAG CACAGG CCTACTCACAGG CG CTTG AAAAAGGGTCACTGG ATTTGA 
ATCTGATTAACCGACTGTCCTGGCCGACAGAACCGCAAGGTGGCGCCATGGCTCTGCTAGAGCCGCAG 

4 0 GCCTGAGCAACTGGGAGTTACTGCGAGAGTCTATGCCX5GGCATTGGATAAATGCCTTCT 

TACCCCTGTGAAGTAGGTATCATTCGAATCACATTTTACACGGGGAAACTGAGAC^ 
TGGCAATAACCCTCCACTGCCTGCCTCCX3CAAGAGAGC 

GTTAAGCGTCATAACCCATGGCTTTATCCAAGCCATTTGAATTGAAGAGGAAGGTGGACGC^ 
GCCTTTCCAGACTTGCGTGCCAGGAGAGATGGGTTAGCGTGGGGGAGGCCGGAGA^ 

4 5 CACGTCTCTCTCCCGCCTTTGCTCAGCATCATCTATAGCATC<5CTTTGCATTTC 

O^CCXSTCCAGTGTCCCCCACCCCCTAGAGCCATCTCATTCAGGACTGCCCTCCCCT 
TCTCCTTCCAGGGCAAAGCCCTTTCAGTTAGGCAAGCTAGTTCCCTCAGT 
GGCTGAATGCAAAAGGCGACAOUVGGCAGCTGTTTAAAAAAGAAT 
ATTGTGACCTGGTGGCTGGGAAAATAAGGATCCCTATTAGA^ 

5 0 GAGGG CTGTG CTTGCTGCAC^CT AG CTGATGAGACACGTGGC^ 

TACACAGGGAAGACAGAGGTCAGCTGCTGAGGGCCTCCTCCCATGCGAAAGGTTGCATTCCACC 
GCTAACCTCCATGGAGGCACCTTGCTTACATTTCTGACACCACAGCCACAGCGACAGCTCTGGGGC 
GCTGAAATCCCAGACAACCAAAGCTGCTCATCCCTTGATGC 
CCTGGACTCTGAGACAATCTCTGGCCATCITGATGCAACAGAGTGTC 

5 5 ATGATTCACTCTATTAAAAAACAAAACAAAACAAAACAG 

GAAAGTATATATAGTGAAAATTAAAAGGTAAACATTCCTCACCCATTAGCCTAG 

CTCTGTGTATGGGGGTCTGTGTGTGTGTCTGAGTGGATCCATCTGAGTATGTGTATTTAAAACACACGT 
TCACACCTGTAATCCCAGCACTATGGGAGGCTGAGGTGGGCAGATCACTTG^ 

CCTGAGCCTGGGAGGTGGAGCCTGOVGTGAGCrATGATTGCGCCACTGC^ 
AACAACAACAACAACAACAAAAACACACTGGGAAAAATA 

GAATCCTTTCTTGTTGGTTT AG AT AAATAAACCTTTCTTTTT CAG ACAG CTG C C CTGTGTCATCATTT AC CTT CATCCCAAGTT CA 
TTCTAGAAGGCAGCITACCTTGAGTCrTCAGGGAACTAGCCT 

6 5 AACTCAGGGCCTGGCAGCCGAGATGGGCACACATCTTTGAA 

AATC^CCGGTCCCAGTGCTATCCTTAAGAATCTCGTGCTTAAGGCCAGGCTCC^ 
GGCCGAGGCGGGTGGATCACTTGAGGCCAC^AGTTCAAGACCAGCCTGGC 

AAATTAGCCAGGCGTGGTGGTGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAGTCACT^ 
AGGTTGCAGTGAGCCAAGATTGTGCCACTGCACTCCAGCnTGACCAAC^ 

70 

HUMAN SEQUENCE - mRNA 

GGCACGAGTAGGGGTGGCGGGTCAGTGCTGCTCGGGGGCT^ 

TGGCGCGCAACCTGCGTGAGGCAGCGCGACTCTGGCGACTCreC 

CATTGGC^CAGGCTCCTACGGCCGCTGCCAGAAGATC^ 

7 5 G CT CCATGACAG AAG CTG AG AAA(^G ATGCTTGTTT CTG AAGTG AATTTG CTTCG TGAACTG AAACATC CAAACATCGTTCGTT AC 
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TATGATCGGATTATTGACCGGACCAATAC!AACACTGTACATTGTAAT^ 
AAAGGGAACCAAGGAAAGGCAATACTTAGATGAAGAGTTTGTTCTTCG 
ACAGACGAAGTGATC^TGGTCATACCGTATTGCATC 
CTTGGAGACrTTGGGCTAGCTAGAATATTAAACCATGACACGAGTTTTG 
5 TGAACAAATGAATCGCATGT CCTACAATGAGAAATCAGAT AT CTGGT CATTGGG CTG CTTGCTGTATGAG TTATGTGCATT AATG C 

GAATTGAATGAAATTATTACGAGGATGTTAAACTTAAAGGAT^ 

AGCAGATTKKnTGOUSACGAGCAAAGAAGAAAT 

TCCCTGTATTGAGTGAGCTGAAACTGAAGGAAA 
1 0 CAGAAAGAACAGGAGCTTTGTGTTCGTGAGAGACTAGCAGAGGACAAAC^ 

GCTAAAGGAACGGAAGTTCCTGTCTCTGGCAAGTAATCCAGAAC^ 

TCAGTGGGGAAAGTAAAGAGAACATCATGAGGAGTGAGAATTCTGAGAGTCAGCT 

AAAAGGCTTCACGCTGCCCAGCTGCGGGCTCAAGCCCTGTCAGATATT^ 

CATGCGCTAGCCAGGTAGAGAGACACAGAGCTGTGTACAGGATGT^ 
1 5 TGTTGAAT ACTTGGCCCCATG AGCCATG CCTTT CTGTATAGTACACATGAT ATTT CGGAATTGGTilT ACTGTT CTTCAGCAACT A 

TTGTACAAAATGTTCACATTTAA , ri"rrrcriTCT'i'L X IT T AAG AACATATTATAAAAAGAATACriU'ClTGGTTGGGCTTTTAATC 

CTGTGTGTGATTACTAGTAGGAACATGAGATGTGACATTCTAAATCTTG 

GGAAGAGT AG CACTCACTG7AT AGTTTT AAATGACTG AGTGGT ATG CTT ACAATTG T CATG T CTAG ATTT AAATTTT AAGT CTG AG 
ATTTTAAATGTTTTTGAGCTTAGAAAACCCAGTTAGATGCA^ 
2 0 GCTCTGTAGTTCAAATCTGTTAGCTTTGTGAAAATTCATCACTGTGATGT^ 
AGCTGTCTGTCATTTACCTACTTCTTTCCCACTAAATAAAAGAATC 

HUMAN SEQUENCE - CODING 

ATGCCTTCCCGGGCTGAGGACTATGAAGTGTTGTACACCATTGGCACAGGCTCCTACGGCCGCTC 

2 5 TGATGGCAAGATATTAGTTTGGAAAGAACTTGACTATGGCTCCATGACAGAA^ 

TGCTTCGTGAACTGAAACATCCAAACATCGTTCGTTACTATGATCGGAT^ 

GAAT ATTGTG AAGGAGGGGATCTGG CT AGTGTAATT ACAAAGGG AACCAAGG AAAGG CAAT ACTT AGATG AAGAGTTTG TT CTT CG 
AGTGATG ACTCAGTTG ACTCTGG CCCTGAAGGAATGCCACAGACGAAGTG ATGGTGGTCATACCGTATTG CATCGGGATCTTAAAC 
CAGCCAATGTTTTCCTGGATGGCAAGCAAAACGTCAAGCTTGGAGACTTTG^ 

3 0 GCAAAAACATTTGTTC^CACACCTTATTACATGTCT 

ATTGGGCTGCTTGCTGTATGAGTTATGTGCATTAATGCCTCCATTTACAGCTT^ 
AAGGCAAATTC^GGCGAATTCCATACCGTTACTCTGATGAATTC 

CGACCTTCTGTTGAAGAAATTCTTGAGAACCCTTTAATAGCAGATTTC>GTTGCAGACGAG 
GCGACAATTAGGAGAGCCAGAAAAATCGCAGGATTCCAGCCCTGTATTGAGTGAGCT 
3 5 GAGAGCGAGCTCTCAAAGCAAGAGAAGAAAGLATTGGAGCAGAAAGAAC^ 
CTGGCTAGAGCAGAAAATCTGTTGAAGAACTACAGCTTGCTAAA 
TAATCTTCCATCCTCAGTAATTAAGAAGAAAGTTCATTTCAGTGGGGAAAGTAAAG 

GTCAGCTCACATCTAAGTCCAAGTGC AAGGACCTGAAGAAAAGG CTTCACGCTGCCCAGCTG CGGGCTCAAGCCCTGTCAGATATT 
GAGAAAAATT A CCAACTGAAAAG CAG ACAGATC CTGGG C ATG CG CTAG 

40 
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Table 89 

MOUSE NOMENCLATURE 
ICSGNM Nr3cl 
Celera mCGl3634 

5 

HUMAN NOMENCLATURE 
HGNC NR3C1 
Celera hCG37601 



10 MOUSE SEQUENCE - GENOMIC 

TGGTACTTTK5TTGCAACTTACTGATGATGCA 
CrTTTCAACCCrACTTCCTGTGAGATGCA 

TTCAAGTGTATAAACTGTGTTGGGTTTCAAATGCMACCTrAGGTTGAA 

GCT ATTTTATGAAGTG T AG CAAAAT ATCCTTTG CAC CACTTTAGAATGTG ATGT GAAGAACTT ACAG ATTCGG CTG CCC CTTT AGA 
1 5 AGCCTGCATCTGCCTGTCCATGTGTTAAAGTGCCCCCTCACATGTGTCT^ 

TTTAGTAAAGCTGAGCTTTGCTTTTACAAGAAACAAA^ 

ACATTAACTAAGTAAAGGAGGTGTAAGATCrAAAACGGAAGCAGTC 

GCAGACTGAGACACCATCTACCAGAGGAGCATGGGAGAGAAGAGATTAAATCAGAA 

CCACAAATGCCACAAATGACAAACAGTGTACGTAATAATCGCTGCAAT^ 
2 0 AACATTAGAT CTCTGCTT ATGGCT ACATGT AT CTAAT ATGTCAG CTGGTGTTTT AC CTC CTCTTTAAT C CCTTATTGCTTCAAG CA 

AGTTCTTCTAATAGAGAGGATTTAACCAGGACAAACACCAATGGTTAATAGACAGAAAGAGAAGCTGGTAl 1 1 1 AAGTTGTTTATG 

ACGAAAGAAAAGAAAAGGGCAAAACAATAAAAAAAAAAAAATA 

TTC CATGTT AACTCT AGAAAAAG AAACAT AT ACC AT AGC AG AG CTGATT CT AACAG (HTGAAAT ATTT GTGTG AATTTTTT AT AAC 

2 5 GTTTCTCAAAGTCACAATGTTCAAGAGGCATTTACT^ 

GGATTACCATATAGCTAATATTTTCAAAACTATTCATTAACAAGAAAGCATATTTATATT^ 

TACAAAACAAAAAAAAATGCAAGCTGAGCCATGAAAAATGATCATAGCTGGAGGA 

GAGAACTCACATGATAACCAGTAGCACAGGTGGGCCTGGAATT 

GTCACATCAATTCATCCCAATAAGTCCAAGTTCTGTAGTCTTGGTGAAAGACT AAG CACTGGTGGCCAG CAGTGGTG AACACAGAG 

3 0 CCAAGGTTCCTCCAGACATTTCTCTGGCTACCAGCTGAGCT 

ATGAGACATGAGTATGAAAAGAAGCCATTCTTCTATACAACCTAAATGGAAAGATG 
TGGAAGATGCTTCAAAAGATGAGAAAGAATCACAAACAAGAAGAAATCTAGTTTG 
TTTTTTGAAGAACCAGAGAAATTTGACATTAAAATGACTAATAATTATTAGGG 
GATAATTCAGTATATAATATGCCTCATtlAATTAATATACTTCATATA^ 

3 5 ATTTTATT CTCACTTTACAAGTGGTCATAACTAACGTATAGTGAGTTACAG AG CTTGTTCAACATTACCCTATATAGATACTGGCA 

TTAGTCCCAATACACTCTGAGAATATACCACCCAGGAAAAATGAAATAGCACTCATAAAA^ 

GCAACTAAACGTTCACTGATGGATTTTAGGTTTTAGAAAACAAA^ 

TTAAATAAATAACCTTATTAAGTTTAACAAAAGACTAATCACCACTTCT 

G ATGG ACTT AGCAGCAAAATTGGACTTG TTTCACTGCAAT AGAG AG C CCCTGT ATATG CTGGGT C AGG ATC CT CAT ATGATTTG AT 

4 0 TGATGGTTTTGAGGGGGATCACCTGACCCTTTTGTTATTGGGTTATT^ 

GAAAAT ATCAAAACATT AAG CGAT ATAT AAATG CTTTCCTTATT CTT CACT CTT GT CT TTC TTCCTTT ATTG CTGCTG CTTCTG CT 

CTCrGTCTCTCTCTCrCTCTCrrCTCTCTCTCT 

GGGGTGGGGGGTG CATG TAT CTGTGGGATCACATGG AG AC CAAAGG AGGATTTTGGGTGTCCTG CTCT ATCACT AT CIT ATTCCCT 

4 5 TAACAGGGTCTCTCATTGTATCTTGAGCTTAGGTAGAGACCAG 

TGGGGTGGGGAAAATTTACAAGCACCTGATGCCATAACAG 

AGCAATCTTTG CAGT CCAAATGCT ACAG AAATG TAAATTGTGATTT GGTTG CAG CT CAGGCTT ATT AATATTTTGTTTGTCTT AGG 
ACATAATAGTCAATACATATTCATCAATTCACTCAATTAACACCCACTGAGC^ 
GAAGTCTCTGTCTTTCAGTCTTATATTCTAGTAGGGAGAAAGAATATAACAA 
50 TGTTACCAAATTGTGCAGGAT ATTG TAAAATT AT AATGTGTTCT AG TT GTTT ACGGGT AAAG TTTGG AAATT AAAAT CCAC ACGG A 
TAAG CAT AAAAAATCTTG AGT AG CATTCAT AAACTTTGTGTT AAG T CATTT AG C CAATT CAACCTCAAGT AAT ATT ATGCC CAG AT 
GATTAATTGACTGAGCATTATTATCATAACTCTGTAAAATATTA 

CATAATGAAAAGGATAGATCAGGTTTTTGCTAGATAGAGTCTGAATATATCAGTTTTATT 
ATACATTCrCTCTGAGGTAGGGATTTATACTGTCTCTTACCTTGTAATAA 

5 5 TGTTAATGAGTTAAGAGTAATGTATAAAAATATTGAATAGTGTCT 

CTGTTATCAAAATCTTAAACTTCATCrcTGTACAATGGACATTTCTCTG 

TGCTTAAATTCCATGAGTGCITAAGTCTTTCAAGAATTCATAATGCCCCTT 

TTAACATTACTGGATTCCATCCTTAACCCTGAGGAGCTA 

AATCAAGTCGGACTAACATGTCGGACAGTTTTATGTCAACTTGATACAAGTTAAGGTAAGT^ 

6 0 AAAATGCCTCCAAGCCCATTGTGGATGGTGCTTTCCCTGAGCTGATGATCCT 

AAGCAAT ATTC CTTCATGG CTT CAGG ATT AGCT C CTG C CTC CAAATTCCTG CTGTTTGAAG T CGTGTCCTT ACT ACTTTT AAT AAT 
GAACTGTTTATATGGAACTGTGAATGAAATAAACCCTTTCCTCCCCATGTTC 
ACCTTGACTAAGACAACTAGCGACATAGTTCATTGGCCTAGTGAGTACTITTCCTAGAATATAGGA 
TATAATAAAATATCTATTTTATTCAACACrTCCAGCAAACTGAGCAGAAGAG 

6 5 T TT TT CT T ACTGTGT AGATATGATT AGTTGTTG AGAGTTT AGG CTAATTTTTAAAGCAGTATGGATTAAAGCATG AGATGTTTTTC 

TGTTGAACT AAT AG ACT AAGG CAT AGTTAGGC CACT AAAG AGAAAAAACTOTAAAAATTCCT 

GGAACTGAAATTTAAAACTTAAAATGAAGTATTCTTCTACATTAAGAATTACTAGGCATG 

AAGTAGGAATTCTTGTAAAATCATAAATTATTCAAAAATAAA^ 

7 0 TCCTTTTAAAAAGTGATCATCTTCAAGGGTAAGTATGAGTTTTATT 

TCTCTTTGAAGTCTGGGGACCAAATTCAGAAGATATCTCATCCCAGTAGTTGAGA 



rCTTTTTTGTTTTCAACACTAAGTGGTTCAAGGGCTCTTCA^ 

TCCrTTGCCAAGATGGCGGCCCTGGCTCCCATGGAGGTAGCGATTGTGCAGC 
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ACCTCTGGCCCAGGGGCCGTCTTACAGCCACGGCCTACCCGCGCAAA^ 
AGAAAGG<3CGCCTCG<yriTTGAAGCTAAAGAGCACCrCTG^ 
GCGAGAGACTTCC^CACGCCCTTCCCCAATCAGAGATCTCCAA^ 
GCGATCGGCTGGGGGAGAGAATGGGGTGGAGCCACGG<^GCCTCAGCTCCG 
5 CAAAAGTTACTTCCTCGCACCCCAAAGCAACACCATAACACCTTACT 

AGGTAGGGGGCACGGTCCCAGAGTCGCCCCAGTCTGCCCAAAGGGGOGGGCAGCTTA^^ 

GGAGCTAAACGCTGACGTTGTAAAGATGCGTCTT T 1 GI I TTATTTGGAGGGCCAGAGGGGTCCCTGGAACTCAGAAAGAAGGCAGA 

GCGAGGCACTGAGCCTGGAGCAGCAAATGTCAAGATTTGGG 
1 0 GGAAGGAGAGGTCAGGAGTTTGGGGTAAjGAGGAGGGCGGACTTCXSCC^ 

GCACACACCCCCTCCCGCCCCGCAAGGCTTCCTTAATCACAN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNKNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNN^^ 
15 NNNNNNNNNNNNNNNNNTWNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNI^^ 
NNNNTOJNNNNNNNNNNNNNNNNNN^ 
OTJNNNNNNNNIWNNNNNNNNIWIWN^ 
N>nWNNNNNNNNNNNNNNNNNNNNNNNNNNNm 

GTGTCCGCGCrCCCCCCCCTrCCCCTCCGCCTCCTCCATTTTGCGAGCTCGC 
2 5 GTCGG<3GACGGATTCTAAGTGGGTG*3AACAAGA 

GGAGCGGCGCGGGGGCCCGGCTTGTCAGCCGGGAACGGGTGACTTTCCGCGATAGGGGCTCT 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 nnnNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNN^ 

NWNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
40 NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN1W 

NNNNNNNNNNNNNNNNNNNN^ 

TTTGG AAAG AG AAAT CTT CAACTT AT AGATAAAATATTTTGT ATTG AG CAT ATT AAAC AAATTTTTAAAAAAGGAAT AT AACTCAG 
TAAATCITCGTGTTAGGAATTTTGATAAGCTCTGTTCTATTAAACTATTTAAATACGT^ 

4 5 ACTCTACTCTGCTTTTAAGGATGCACATTTTAGTCTC 

CTAGACAGG CAT ATTT AG TTG AAAATGG AG TTTCAGAGT AAGAG CTTTTC C CT AG C CCTG CAAG ACAAG CAG CC CAG CACTGCTTT 
TTTTrrrTTTCCAATAAAATGTGCATAATTCTTCAAGG 

GTGAATCATGAGGTGTGTAAACAATTATAGTATAGCATTTATCAAACGGTTTATGTAGTGGTT^ 
AAAGGGGTTGGAAATTTAAGGATCACGCCTTTAAAAAAAAAAAAAAAAAGT^ 

5 0 ATCACAAATAACAAATTACAACCTCCATGCTTA 

CGACCGGGCAGATGAGCTGCTTTGTGGATGGGAGCAGGGAGGATATAGCTTTATTTTA 
AAGACTTTGGTTTGGGAGTTACCTAAAGGGTTTATTTAATAGGCTGTCTCT^ 
TTATTATGTC C T ri TrrTTTTTTTTGTAGT^AATATTTGCCAATGGACTCCAAAGAATCCTTAGCT 
CCCCAGCAGTTTGCITGGCCGGGGGAGGGGAAGCGTGATGGACriTG 

5 5 CGTCTTCACCCTCAGTG<3CrrcCTGCITCTCAGGCAGAT^ 

AATGCACAGCAGCAGCAGCAGCAGCAGCAGCCGCAGCCAGATTTATCCTVAAGCCGTT^ 
GACCGAAACAAAAGTGATGGGGAATGACTTGGGCTACC^ 
TTCTGGAAGAAAGCATTGC!AAACCTCAATAGGTCGACCAGCCGTCCAG 
ACCCCGACAGAGAAGGAGTTTCCCCAGACTCACTCTTGATCCATC^ 

6 0 TGGCAGTGTGAAATTGTATACCACAGACCAAAGGACCT^ 

AGACAAACGAGAGTCCTTGGAGGTCAGACCTGTTGATAGATGAAAACTTGCTTTCTCCT 
CTGGAAGGGGACGTGAATGAGGATTGCAAGCCTCTTATTTTACCGGACACTAA^ 
ATCAAGCCCCAGCAGTGTGGCACTGCCCCAAGTGAAAACAGAGAAAGATGATTT 
AAGAGAAACTGGGCCCGGTTTATTGCCAGGCAAGCrc 

6 5 GGCGTGAGTACCTCTGGAGGACAGATGTACCACrrATGACATGAATACAGCATCCCTT^ 

TAATGTC^TTCCACCAATTCCTGTTGGTTCTGAAAACTCGAATAGGT 
CTATGAACTTCGCAGGCCGCTCAGTGTTTTCTAATGGATATTCAAGGTAAGTC^GCGC 

AAGAT AGTTTGAT AAACTT C7VTTCTTTTATCAGTGCGT ACTTTTTT CT AT ACT ATT TGT AT ACTTTTG CAAAT AACTCAGT ACAAA 
TGGTCITTTGAATGTAT AG^CATCTTAGAAACTGCAAAGGACTCGT CACTG CAAAAGCCCTTTGG^TGTTTGGTT AAATAGTAATC 

7 0 GTTT CTCGTTGG TCAGTGTTAAAGATTTCATTTGACAACTAAAT CAGAATTGCT CACT CACAAGTAT CTG TT AAAGT AACATTT AT 

CAT CT ACAAGT ATCCAAAG TCAAACT AGT ATGAAAT AAAAT AAGATCTCT AGTCAT AG AATACATTT ATTG CACAT ATT AAT 
TTAGCTGTTTTAGAATATATTATTGCAAGAAAATGCTTGAAGTGTTACCTTTC^ 
TGAGTATTCAATCTGAACGGTCTGTGTTTTGAAACTGTTAGAGCATTTC^ 
TACAGCAGACAGTTGGTGACTGCTCCrTTGCCTCTGTAAAGGAGCT 
7 5 AGGTAATTCATGATTAGAATTTTCAGGATTCAGTTAGCAAGTACCATATATGTCTCTCT^ 
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TACTGATATTTCTGCCTGACCTCACAAAG<nrCGTT 

AAACAAAAATGTGTAAGTTAAAATAAATGGGCTGGTCGGCTTAGGTTCTTCCT 
TTATATATCTATTTTAGGTGTGATAAAAAAGAAAACGTGGATTGTAGTGATATAAG 
TTTTGTAGAAAGCAGTCCTTTAAAGCCTTGTATGTAATCCCATCAACTGTTT 
5 TCACTTGAGAAGACTGTACTTAAAATTCATAAGAAATGATTTCA^ 

TTCCCTITTrCTGAAGGCTTCCCAGACACTAACAAACAAAAAGGTATCT 
TTAACTTTAGACTATGATCTATTTTTCTAAATAGTTTAATTGTATCTTATCA 
CCTAAGAGACGGGAAGTTAATATTTATATTGCCTATTAGTAGTGACTACTAATTTATT^ 
TTGGGAGAAAACGAffTGCAACACTTTAACTTTCTGAATTAGGGATTTA 
1 0 GCATAAACTGTGTTTTAAAAATGTTAAGAAACAGGCTGTGCTTTAAAATTTC 
TTGACTTTCATAAAGGAATAGTTAAAATTGACTTGATTATCTATAGGGAGCT 

GACAAAT AAT ACATTTAG AAAATTTCTTGCC CTGAAAACCATCAT AAAACTTCACTTTTTATT AT AAGAAAAA C 

AAACTTAACTTGCATGCACTATGAATTTTCATTAAGACT^ 

ATAATGATAATGCITTAACTTTTACACTTAATGACATATAAA^ 
1 5 TGGCCATGTCAGTGTGGCACTGTCATGTCCAGGAAA 

TTCACCAGGCAGAATACCTTAACGGGACTGTGGGCGGTGCTTGCAA 

AAGCTTTACCCAAATAAATTGAGACAATGTTTCCTTTCTGAGCATTA 

AACTGAAATTGGTTGCTCTCTCTATATGATGCACXSGTAGTTGCCAAACAT^ 

AGTCAACATG CTGCATGJUX5CCACAGCTTGLACTGAATGOT CTCTX3 
2 0 TGGAAGTTTTAAAGAAG<3CATACATTTTAATTTGAATTCCTAAATATT^ 

ATTCGAGGGAGAAGGCAATTTCTACAGATATGTCAGTATACCATGAC^TTAGGTT^ 

GAATTTTGTTTTGTCTTGTTTTTAAGGTTTAAGTTCATT^ 

TAAAATGAACTTCCAAGTCTAATGGTGCAAATTAAAATTCAGTGAGCTT 

ATTTGATTTCCTTCCACCGTGTTTCGCCTCAGCATTAGTAGTGTACAT^ 

2 5 GGGATTAGAGCTTTTGGCTATGCTGTTGACAT 

T ACTG CAAGATGGCC CTCT AAAC CCAGTGT C C ATGG AGGT AGGGCTTCGG TG AGTGGT ATTT AATG AGGACACT AACACTTGAGTC 
ACTTTGGACATCCCTCATTATTGGTCAGATCTGCATCAAGTTITATAAAACT 
CCCCTCTCTCATTTTGCTTATCAATCACGGGCTAGATGTTTTCTGAGAC^ 
TGAGAGGTACTTGTTTAATGAGTGACCCTGTTCCTTGCCCACTAAAAATTCCG 

3 0 GTCTCirGATATCCATTTAATGATTTCTGGATGTGATTAGCT 

AGAATTGCTTCCTCTTTTAAAAGATCAGTATGATTGATTTTAACAG 

TTGG AATG ACTAAAACAGAG ATAGGG AGT AAG CCTTTAGT CAG CG CTT AATG AAATTCTTAT AATGTTTCTTTGGCTAAT C CTTGT 

ATTTTTTTTGATTAACAATTTAACTTGTGTTCATAACGTTACT 

TCCATGTATTACTGAATGTTTATGAATTTTC^ 

3 5 GATGGACCAACAGGAAGAGAAAGCTCACATTAGAAGCCTCTAAGATTTGCTGAGGAG 

TGGGTGTCAGTGTGAAAGGCTAAGTGTTAAATTCTTGGTGAGTTATGTAAGTCTCAGTGCACT 
CTTCATTTAGATTTCGGTTAGGTCTGCAGCATAATTCT 

GGTTTTGGG CTCT ATGGTG TG ACG ACT CAGGT CATGTTG AGT AGTG TTT ACTGTTTTG AAG C CATCTTGG T AATATTTTGG T CAAA 
AGTAAGTTAATATAATAAGTATTTGTTTCTGTAGTTCTGAGGTGCGCAGAATATAACTTGGCrr 

4 0 TAAGGAATGTGTTGTTGGAGTAAGCTACAACCAGAAAAAAAAAATCACTCCTTC 

TAGCTTCATTTAGCCAGCTGCTCTCATTTCTCCTACCTTGGTTTTATA 
CAAATCTCTAAAGTGTTTCCGTACTCTCTTTTGAGTTATGTC 
TATACATTACAAGGTTAAATGAAGCCAGTGGAGTCAACCCCTCTCCAGTAGGGGAG 
ATAGAGGGAGAAAGAAAATTTCAAGGACAATTACAGAAACCCATTTGGATCAACTGTTC 

4 5 CT AT CAGG AATG ACAG CAACGGTG CTTTCTTGTGAGGG CT AAG AGTG ACGGAAAAACGAACG CTTCT AGTT AT CTT CTTT CTTGTG 

ATGTTATCACCAAAATAAGGGCAAAATGCACAGTATACTTTGATGCCAAGATTAATT^ 
AAAGAGAATGTGATGACTTGAACTTAACATAAT(^CTG 

TAC^TATAAACGTGCCTTAGGCTGTTA(^GCCTCTTCCTATTTCATAGTGTACTTTTALl ntlAAATATCTt^rTTATAAACTATCA 
ACTGACTGTGGAGTATAACTCriTTCCTTGTGTTATAGGTTTTTCnTTAG 

5 0 ATTGG ATT CTG T AAGT ACT AGAT AG C CACTT AAACTATTTGT ATTG CAGT AG AATC CTT AAAAACAGTT ATTTTAAATG AAAG GGT 

GTC^TCCTTT AACTT ACTCTCTGAG CCAATTT AAAT ATT AAT AG C CTATTTTGG TTG AG CT ATG TG CTT AAACAT ACTT ATTT AG A 

GTAACCCAGCTATTAATTTTAAGTACTTTAGAATACITTTT^ 

TGTGTTTAATAAGGAACTTCTTACTTTAGATGTTTTTTCCATT 

GTGTGTGTGCGTGTGTGTGTGTGTGTGAGTGTGTGTAAGGAGAGGAGAAAGGTTGAATTACATGACTTACATAGCT^ 

5 5 ATTATTTATATATTATTTTGCCTTGAATTTTTATTTTTTGTTAG 

CTTAAAGCTTATTATGTTTACTTTTAAAATAATAAATTAGCCTTTATTCTAGTTTATTT 
ATGGCTATTATGAAAGTATATTTGATAGCCAAATTTTGAAAGCTATTGTGAAATGATATCCAGT 
ATGTTGGGGCAATGCAGCGACTTATGACCTTATGATTGTCACAT^ 
TCTCATATTTTTATGTTTTGTTTTATAATTGGGATTTCAACCT^ 

6 0 TCTCTTCCAAATAATCTGATCTAGATAATCCTAACAGTAATTGTAGGAAAAACAAA 

CAATTTATGATTCTGGGTAACAGTGTGCTTTGAAATCAAAG 

GTCTGTATAAAAATTTGCAATTCTCTTATTAAACTGATAGGCCATGGTATATTAAAGACCATT^ 

TCAAGTACAAAACAATGAACTTAGAGTTGAACTTTGATTAC^TAC^GCTTAA 

TTATAGTTGGACCCAAGAAGTGGGCCCTTGAAATCTCAAAGCrTGTTTTATGTGTTAGGTAAGA 

6 5 ACTTTCCAATGAGAAATTGAGACCATGGCAGTACTTAGGAAAGAACACATACTTC^ 

TTGGCAGCT ATTTAAACCTT AG CATCT AGGGCTCTTCCAGGGTCTAAAAACAAT CAGTAATTG ACATAATACTGAATAATAATAAT 

AATAATAAGCAGGTAGTTCATAAAAGTCACTGATCCTTTCTTCCTAGTGTTAGTAGGTTC^ 

GCAGCACTATGTTACATATGCrrATGCTACATACACrATATAACATACACTATATTATATACA 

TG CACT ATG CT ACAT ACACT ATGCT AC AGTTCATGTT ATACACATCTCG TACTTTCATGT CATG TG ATGTCTTAATTCTTTGACAC 

7 0 GCTTTATTCrGCTTATCACATTAAGTGTCTGAGTCrrcTCAGCT 

GAAGTTTAAGACATTTAAAGAAAAAAAAC^GGAAACCGTAATGCTGATCTGAAAGGAATTAATTACT 
CTCAACAATCTTTAACACTAATTGTTCAACAGTGAGTGTGTGTGTGTGTGTGTGTGTC 
CCAGGCCrCTGAGATATATAGATATAGATGTATAGATATATATATTCACTTAOTGATTGCTCTAAGTTT 
ATTCCTACTGGATAGAATATGTAGGCGCCAC<>CGCCAGGCTGTCAATGA 
7 5 CTAGGTTTGGTGGCAGAAGCC^CGTAACCGTGGAAAATGAGAATCTATTCCftTAAA^ 
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GGCAAACCCCATAACATGTACATATACATACACAAT 

CTGAAAATTTAAAAAATAAATTACTAAGATGCAAGGTGGTAGTTTCAAAGCTC^ 

CTCATTTGCTAATAATAGTTCAAGAGTCTGCATCTCTGGTGTA^ 

CTACTATGTCCAGTTTTCCCAAATTGGGTCATGACTC 
5 ATTAATTTTTTTCAAGATATTTTTTCTCTTTTTGTAGGGG^ 

GATAT ACT ATCT AAATTT ACT AGCT AT ACTG AT AAT C CACATTGCTG ATAAACATG AGACT ATC CCCTTTCTATTTI^ T CTG 

ATACTCTTAGGTAATCATGCATCCAAGGATAGTTTGTTAATTAAGAAGCT^ 

CTAGGTGCATAAGTTTGGAAGGATTATGTAACTTGT 

GGCTGGAGAGAAGGCTTATTAGTTAAGGGTAGTTACTGCTCCT^ 
10 TCACTACTGCCTGTAACTCCAGTTTCAGTGC^ATCTGACACCTTT^ 

ATATGTATGTATGTATGTATGTATGTATGTACATATACATAAACACAAATGTACATATATTCATGTATATGTC 

TGGTCAAGTATGGAAAATAATTAAATGGCAGTTTAAATAGATACATACGTTATTATAAACCTC 

AATGTTTAATTAGAGCATTATATCAAACATGTCATCTGAATCCCAGC^ 

TTTTAAAAATATTTCCCCCAAATAGAAAAACCC^VGAATAA^ 
1 5 CTCTTATTTTGAAGCCATCTTGAGACACCTTGTAAACCATAAGAA^ 

TAAC CAGAAAATGTTATTT CT CTG TTTGATT ACTT ATGTGCAAACITTCAG AAATTT AAC CAG CCTCAGGTT AT AAT ATTTGTTTC 

TGTTCAGGATTATTTATGAGATCAGCTAGGGTGTTCCCTTCC 

CTATTGTGCAGCACAGGCTGCCCCCAAACCTTAACCT^ 

CTTCCCTTCGAC C CTCAAG CAG ATGGTGCT CTGTGGTTAAG T CCCAG TGTGTG CTTT CITIT AAGTT AT AAAGGAATTTAAGT AAA 
2 0 CAG CTT AT AT AC T T C I' 1'1"1"1'T T AG T ATCGTG CAG AAAAAGT AAAG ATTACTT ACTTCTAAGCCTGA1T1T lTI' CT CTCT CTCT CTC 

TCATGTTT ATTTTATGTTCAAG AAAGAAGG C CT ATCTTTGG CTTCAACTTG AAAGGAATTTGG TGACTGCAGAT ATGGAGTAAAT A 
CAGTGTGAATTCCTACAGTAATTTTCAAAGTATTCGAGGTCCrr^ 
TACAGCCAACAGTTCCAGTTCACTGAAGTTATTTATACTAGCAGTCT 
TTTTAGTCCTCAAAATGTAAAAAGGAAACTTAGCTTCTACT 

2 5 GCATTTATTCTCCTGTCTGTTGTTGG CAGCAGGTT ACGATGGGT AG AAG CTTGGCCTTTGAGACCIATGGC^CATGCTGCAGCAAACA 

GGTAGCGTGTCTTTTGTGGGATCATTGTGTTAGGACTOAAATTCT^ 
TTTATATCTATATTTAAAACCATCTAGTTTCTTAAATTCTTATTATTCTTC 
GACAAAGAGCCACCATCATGTTAGGGCAGTATTAAAGTCTCTGGGCACTAGAGCCTCA 
AAGAGGGGAAAAATGCATAGTAATTCTTAGGATTAACTTACTGATGACAAATGAC 

3 0 CCATGGTACTCTAATAAGGAAGCCAAAAGGAAAAATTAGAAAGCCACGGG^ 

TTCTGTCTGGTGACTGGCTGTGTTTCTTTTCTGTACGCTTCAG 
TCATTATGTTTGTAAAAGGAAGAGCTACAAAGACATTCTAGGAAGCAGTTCTGGA^ 
CAGAGGTCTTCCAGCTGCAGCTGTAGTGTGGAAGGCCTGCTGAGCTGGGT 
GAGAGCAGAGAGGGCAACTTTCATACCTCTTCCTCCTTCCTTCC^ 

3 5 TGCTGGAGATGGAGCATGTGGTGAAGGGTTTGTGGAGATGAATCAGGAATGA 

CAGAAGGCXTGCTAGAAAGAGGATGAAAACTCAAAACTGACCCATTTA 
ATAAAGGAGTATGACTGCACTTGCTACCCTAACTCTGTGCATGCAAGACC^ 
TACCTATGTGGGGGTGTCTACGTATTTTCTGAGAGGAATATGGTTGTCAGTGTAGGTCCCCTGG 
CTTTTATCTGAAATACTTTCCCCAAATATCCACGTATTTCCCTGTCCTTCTTTATTTAGGT 

4 0 TGGGACCTCTGTCTCTATCTCAAATTGTACATCTTCCACACCTGATTT^ 

CATTACAGGTTTGCTCCACTCCTAAATCCCACCATCGAAAGAGTAGGACAGAGATC 

GAAATCAGGAAG AAGGCT C AG ACTTT AGTCTAAGAAAACCAG TCTTAAG ACAGT AGTTG ATG CAAAT AGACAGAACAT ACAT ATGG 
TTTTAATTGTTGTCTGCTTCTAAATTTACTCTAAAGCTTT 

AGCTTGT AGATGATGTTTTTGAGGG CGTTTGGTTGTTTTGTTTGTTTGTTTCCTCCACTATATAATCTTCTGCTCTAG 

4 5 ATTTATTATTCTTTTAAAATTTCATAAATGTATGCAATACATCTTAATC^ 

CTCCITTCCCCAATGCATCCCCCAGTTCTCTGCTTTATAAAAGC^ 
GGATTTATTATTCTAGC^GATAAGCCAAAAAAAAGTACTCATTTATATCAACTTA 
TTTTGTTGTTTTAGATAGTTTATAACACTGAATATTCTAGAAGAGGAGGAGTC^ 
TTCAACTTATGGGTGTTTTGGGTTTGATTTATTATACAATAGTATCATCAAC^ 

5 0 ATTTCAGATAACTGTCTAATTTGGCAAAGACAGACAATGATGATGAGAATAATAATAATAATAA 

CTATTGAAAGAGAATTAGATCAGGTCTTTTAAGAGTGAACTTCT 

TTTAGAGGGCATCCAGAATGAAGCTACAAAAAACACTCAGTGAATATCTGGAGAATTAAGAG 
GAGAAAGTATTTCATGATATTGTATCAACTTGGGCTAAAGACATAACAGAGTT^ 

AT ACAAATGTCCTCAAAAATT AAGTCCTAAATGC ATCAGCAAAG C AAATGGTT AGTTTATTGTCTACAAT AC CTTTT AATTTGAAA 

5 5 TATAGGTTAAAACATCTCATAGAAAATACCATTTGTCTG^ 

CAAAAGCTAGGATCCCGAAGACAATAGATGGTGATCTGCATGTT^ 

ATCACTAAAGAGTGTTCC^ATCCCTATTGTGTAGAACATGGGGTAGAAACATGGGATTCCAGCTGGAGG 

ATTAGTATCATATTGCACATCAGACTTCCTTGGTTTGAATCTTTTAAGACCTATGTGACCTTGAT 

TCAGTTCCTTCACTTTTGATATGGACACAAGGAGACACT 

6 0 TCTTAGTGATGTTGCTT(IAGACTGGAGAGCTTCT^ CTACATGGAGGTCTGTTTGGCCACAGCACC AAG CG AGTC 

TCAGTCTTAAGATGGGTTCATTATGATTTTCAGAGTAATTATTATACTTAAG^ 
ATGTATGTTTTGACTTACATATGAAGGGGAACTAAAAAAAATGTATGTGTGATAGATGTTA 
GTTTAACCCCTTCCATCTTCCC^CTCAGCCACATGAGCTGTAGAGAGTCACT 
TACAATTTAAATAGCACTTTTACAAGTTTACAAATAAGGTTTGGCCATTCCCTCCCTCT 

6 5 GAAGAGATTGAGATTAGATTTTAGGATAAACATTGTTGCGAAGTGCCrACTTGGTG 

TGGGAGGCGGGGAGGAAGGAAGGAGACGGATTCTGGAACTTTTGTCTCGCTGAATAAA^ 
ACAGAATGAGAGCAGGCAGCTTTGGGATGGTAGCACACAACCACTTGTC 
TTATGAGAAAGGTTAGGGGTTTGTX3TCAGATAAGCACACCAGTGA 
ACTAGTCCTAAATGAAAATAAAGGAACAGAAGTGGGGAAAGAGATGTACT 

7 0 CAGCAATGGCAAAGGTAAATGCTAAGGTTTAAGTTCCAAGGACTTCCTGCCGTCT 

TTCT ACTTTT AGGGT AACTTT TTG AAGTG T C CTT AG ATGATG AAAAAACACACTCG ACTTTGTCTGT CTGTCAGT CTG CTT ATTTT 
GATTAACCTCTCTTCTCTAGTTCTATCAGCTTTTCTC^AAATGACATGAT^ 
GAGATACACCTTATGCTATATCCATTCATCTGTTGATGGACACCTAAGCCAATCCCACA^ 
7 5 TACACTGAAGTGGGAAAGCATCTCTTCTTCCATCCACAGTCGTCATGTT 
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GTCATTTTACCTTGGGGAGATATGCATACT C1TT1A CTGGT 

CTTTTTGAGTT AACAGCCATTCTGAATTTCAGT AT CGTTTT AATTTTG ATTTTC CTTG CTGGTTAAAT AC^TTGAGCA'l M ri"rri"l"r A 
AATGTTGAAACCATTCGTACTGTGGAGTGGTGACTCTTC 
TCTGAGATCTTGATATAGCCTGAATTTTAATCCT^ 
5 T ACTT AG TTG ATTGTTT CCT AGG CTGGG CAT AG CCTTrTGAAGTTTTGTAAT CGAATTCTTCAATTCCTTTGTTTC CTGTG CT ACT 
GAGTCCTATTCAGGAATAGCCTCAGTTGAACT 
CATTGAGGTATTTAATCTATTrrGAGTTGATTTTTTT^ 
CCTATCCIAGTGTCCCTAGCACCACrTGTTAGAGAATCTTITCT 
TCCATTGGCTGGAGCAAGTT<X3ATTTATTCCrrGG^ 
1 0 TGTTGTTTTGTTTTGTTTTGTTTTGTAGCAGTGCT 

AAATTCCAACACTCAGAAGAAGCCCATT AAAACTITGA CTTG ATTGT AAAGGT ACTATTTXT ACCITCTT AG TTGGGTTT 
ATTCrGAATTTCATTCACTTATATTCTCTCT 

1 5 TGATGAATTTGTGTATTGGCrrGCAA^ 

TATGTATACCCATGTCCrrnTCTAGCTCAATGGGTATAGCTAAAATTCCAAGTA 

CCCC CGT CTTGGTCCT AATTTT AGGGGAAAGTITTT CT ATATTTCTC CATTCAG CAAGATGTTGG CT ATGGATTTGTCATGCAT AG 
TCATTATTTCTGTCCTATGACTTTCCrreTGTCA 
TTGTTAATGACTATATGCATTTTATTCTTGATTATATTTATC 
2 0 CCTGGAATAAAACCAAGTTGACATGGTATTTGATCAT^^ 

GTAGGTGGTTCAGAAAGAGTAGTTTATAAAGTTTTGTTCTGTCT 

CAGAAACTCCAAGTAGGCGGGAGCTTTTC^CTCT^ 
ATATCTTCTTGGCTCTATGTTATTAGGTCATATGTACCT^ 
2 5 TTTTCCAAAACAITTTCTAATAATGCTGGTATTTC 

TTGCCTCAAGGATTGTTAAGGGTTGTCITTTGGGGGAGGTGGGGGC^ CGTGGAG CT CT AAATCTTTTCTCTAGATTCATTTTATTG 



G AGGT AACG GCGG CTTCG AG CTG CC CAG CACGGG CGCTGGG CAT CACACT CAGGT CTT CTG CAAG AACGCTG T ACG T C CTT AATC A 
CCCACTGTGCCGCCAGCCCTGAGACTCAACCTTGTTTCAGG 
3 0 ACTCATCTTATAAUtjrCTTTCCTTTTTATTGA 

TTTTCATATCTGTATCCTTAATGTATGUICATACGTAGCTATAAACTTTCCT 

GTCATTTGCCCTTCTCCTTT AGTGTCTGAG ACAGGGTGTCTCAG CAAAG CTG AAGG AACTCACTGTTTCACCTGAGCTGACTGGCC 
AGTGAGCTCACTCCTCTTCACCAGGGTTGTGAGGTTACAGTTACCCATGTGCTACCTGCCAT^ 

3 5 GGGGTCGGACTCXSGGTGTTCATGGTTCTATACCAGGCCTCTTACCAACTGA^ 

TG AAATAAT AATT CT CTTCT CTCATTTCAAATCG AAGGG AT CT ATTT AGT ACTTGTTGGAGGG CAG AT CT AT AAAT AATAT ATCTT 

GTAGTACCTGAATTAGAATATTTGAATCCCCITTAGTTTTCAAGTGTAGC^ 

TCCATACTTTATTTTTTTTCrcTTACATTACTCT^ 

CTCTGCTGCTCAGAAATCATGTATGATGAATTAACTTCTCTTGCCITCAGGATTCT 

4 0 TGTGCCTAAGTGCAGATCTTTTAAAATCTGACATATTTGGAGTC 

TGCTGATGTTGCATGTTAAGATATTTACATGTTCCCTCT 

CCACAGACCTCTCAAACTGTTGAGTGCTTTCC^GACCCTCT 

TCCTCTCCTX3TTCTAGCTTTAACATTTCTATCCCATTCTATTTCATCT 

ATC AATTTT AT ATTTCAG TTCTTT AG ACATGG TTTTCTTT ATGCCAG TGTG TAT ATTTGAAAT ACTT AGCGCATAAG CTTTCCAAA 

4 5 GCAGTATTTCTCTGTTGATTTCTTGATGTCCTGTTCATTGTCTA 

GAGACATTTTAATATGGATTCATCCCTAAACTGAAATCCAAAATCT^ 
TCTGGAGTATTCAGATTTCATGTTrCTGTTTTAGAGGTTT^ 

CTCAGTAAGGGTCCCTCTGCTTGTAGTGTAGTAATGGCAGTCGGCAGTTGACATGC^ 
AAGCATTACAGACAGTGTTTAGTGTCTGTTCTCAGTT^ 

CATACACATATACATACACATATACATATAAATACACACATATGCATGGATACATATATATATA 
AAACTTTGTGCTCATCTAATAGTTGAAGAGAAATT^ 
CTGTGTTGCAGAAAGNNNNNNNNNNNNNNNNNNNNNN^ 
NNNTTTTCCCGAGGGCATTATCTAGTTTATTTACTATTCCT^ 

5 5 AATAAATGTCCTCAGTAGCTCTCCCTGTCCAAGTATAGCAAGCTACATCAAGCCAGTTT^ 

TTTGAGGGGATCTAAACAAAGTATAGGAGGAATTCTGTGCCTCTGTGT^ 
GCCAGTCACAAGCATTGTAAGCAGGTGCCATGTTTATAGGAGCAGGAAATTTCTTTGGGATT 
AAAATAACTTGGAGAAAGCTTCCAGAAATACATCTTTT 
GTGGTGTTGTCAGACTGTGGGTTTAACTGAAGCACATCT 

6 0 CAGATTTGCCTTTCAACCTGGTCGTCGTACTTTACCCCTGT^ 

TTTCTGGTTCACTAGAGTTCAGGATGAACGTTTTCAGGTCT 

GGCCATGTCTTTCTGTTATCTCACTTGCAAATAGCTGTGAAGAGTCCCrGGGA 

ATTGCTCAGCITTCACTAAGTATTCTAGGGAGCGTTCGGACATCTA 

ATTCTATGTGGCAGCAAACATTTTGGGTTAAGCTTCATTTTAGTGAACTG^ T CAAGAAAGGAAG CTG AAAGTTTGAG ACTT AAGT C 

6 5 TGG CCTTGTGGCTTCAAACAG AAAG CT CACAAATGT AAAA C CT AC CTGGG CT AT AG AG AAAATTCAGGGACAG CCTGGG CT ACTG C 

CT CTCAAAAAATTAAAG CATT AAGGG CTT AAGCAGTAGTAGAATGCTTACT^ 
CCAAAGAAGAAATTGGAGACTATC!ATGTTTGTTCAAAAGAGGGTTTA 
CATAGAAAATAAAATGGTTACTATCTTCTATCATATATAACATTGTGTGTCACAATGG 
AATAAATAAAATCTTGATATAGACTTCATGTACACATTTCTTAACTTG^ 

7 0 CCTTACTCCCTTGTATATCTTTGAAACCTCTTTCTTTTCATC^ 

TTAGCAT ACTGTCTTCTTGGCCCACCTGTG ATGTAT CCATT AACATCCTCTTTTGTGTGG CTAAATGGAGGTCTGGGAGTGTATAG 
T AG ATTTTCTTTATCCATTT ATCATTGATGGG CATTGTGATGCTGQVGGGAC^TCCTTAC AAAGGGTT AATTC CATCACCCAGAAA 
TTATTGCTGGGTTTTACAGAAGCTCTGTGTGTAACATTAGAAACCT 
ACCTGCTAAGAGGTCCTTTTCTATACATGGTCACCAATATTTG 
7 5 TCTCTTCTCTTCTCCATTCCACCCCAATTTTGAGATTAGATCT 
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TTCTCAAGAGAATTATGTCCTTTGTGGGAGA 

GAGTTGTCAAG ATTTCTTTCTAAGATGACACATTTCTCCCTTC CCTTTCTTG CCTCC CAACCCCTCCCCACTGTCCTTCGAATCCA 

TGCCTATTTTTCCTTTGTTATCGCATGTATACGTGCATATACATATTTTTTCCTAAATATCT 

XCTCCAGGAAG6CTATCTCTATCTACCATACCCAGATTTACTCAGTTC 
5 TTCTCTGTCCACTTTGGCATGTCCATTGGTACAGTTTTGGCTCA 

AGTTTCTGACATTACTAGGAGACAGCCCCATAGCAAACCCTGTGACCCTCTGGCTC^ 

GCGTTCCCTGAGCCTTAGGTGTGGGAGTGTTTTGTAGATGAATCC^ 

GGTTTTCTCCAGTGGTTTCTGTCAGCTACAGAGAGAAGTGTCTTTC 

CTTAGTAGGGACTATGTTGATTTGGATTTTATAATTGGTATC 
1 0 GGGCTGCCTCTTGTTTTCCCCTGTATTTTATAAACTTAAGTTTAGTGGA^ 

TTrGATGATATATTCAAGGAATAACTACAAAGACCAATGTCAAAGACTTT^ 

CATAGGACTTTTGTCCTATTTTAACTTTACTTTTATCTATCATGTAAAGGTACATCT 

CAGGTCTTGTTTATATCTCrTTATTGCCACrGTCAAAAGTCC^ 

TTTGTCTCAGTGCTATATTGTTTTAACTGTGTCTGTGCAGTGTAGTAT^ 
1 5 TTTAGGTGTGCATTGTTTCTCCGTATGAAATTTGGGATTATTGTCTTCTATATGTCT^ 

CTGCCCTAAATCTTTATTGATTGCACCCCATCCATGAACATACTCAGTGTTCGCA 

AGTTTCCAGTGTrGTACATTTTAATTTCATTGGTTAAATGTATTCCTA 

TGGGCTGAT ACTTT AGTTT AAAAAGTGGT AT AGATTTTTTT A^ CAAG TTTCAGAACTTG CTTT AT AT AGTG 

TTATAGCACTCACCTCTATGATGTCTGTCTT(nTCTt3ATCTC 
2 0 GGGTGGGGGCAGG AATCTCTGGTAAAGAGAATTTGTAT AG CACAAGTGTTTTTAAATGTCT ATT AAAAGAGTCAGT AAGAG CTGGA 

GAGGTGGCTCTGTGGTTAAAAGCACTGCATGCTCrTCAAGAAGGTT 
ATTCTAGTTCCATCGGTTCTGACGTACTCTTCTGGCCTTTC 
AGAGATAAACAAAATTTTTAAAAAGTTTTTAAATA 

2 5 GC^CCCTCAACAGGAGATGGCTCCGCTGTGTGCAGAGGACAGC 

tgctggtttctgttacaggtcittsaggttgtagtcagccattaacca 
tcaacagaagctggggacctcgagaaggtgagagagagggcctaagaatgccttagag 
gggttgcattatgatggaactgtgaaagtgtatattcctccagtga 
agtgttaggaatgctatttccgaggtggtattctgcaaaactataaaaggaaaa^ 

3 0 catccccgtttgtagtttaggagatggggttaagttgcctttcaataattcagtt 

tgttt ag aattttgt actt attaaataaaat acttat cttt c aaatttatcc aattttag ctttt catgctgacattctcaatttt 
atagacagtatattcttttattcaactaccaaagtgattttttttattgtgtaaaactg 

ATAAATGGAATTACTGCTATAATTATTATATTTACTGAGTCAAATTTCAGTIT 
GACAAGCCACTTTGCITrCCTGTCGCAAATACACTAATAACACATTA^ 

3 5 G CTT CTGT AGCCCTTGGCT CTG AGG ACGCAC CTCTG CTGTTGT AGG AGG AT^ 

CCTGCATGTGTGCCTATGCACCACTTGCATGCTTAGTACCCAAGGAGACCAGA 
AAACAGTTGTGAGCTCTTATATGGGTGCCGGGGCCCAAACCCAAGTCCT 
TCTCCAGTCCTGCCGTGGAAGTA TTTTTTT CCCATCT^GTTTA 
AATTACTAACTCTTGGAAACTGATCAAAGTCTAATTTAAAT 

4 0 TACAATTAATCTCAGCAATTTGGGTGTGTATTTATTTTTAAGCTCTTAAAAGGAATGGGCTA 

CAGGCTTGCCTAGTATGTACAGCATTGGAGTCCATCCTGGCTX3GGGT 
GGGGGAAAGGAAAGATGAAAGTTTAACCTAAATGTGATAATTAACTATAAGCTTTTT^ 
CTATAATTT'AGTAATAAACAGACCACATTCGCATGTAATCTCTATTTGAAAGAAACAA^ 
GCAATCCAAGTACCGCAGAGCCTGAGCCATAAGACCACTG AAGAG AATTCACGAAGAAACATGCT 

4 5 CCCACTAG^TCTGATGGGCTCAAAAACTAAAACTT AGCAG CCTCGGGTTG CTGATAGAGT AATTTGCTTTGCCAGTTG AACTTGGT 

T ATT AAAT ATAACTG CTT ATGT ACGACAACAG AAATAT CATGG TTTGTTT AT AGTACAGT AAG AGGTG T ATAAGTT C CT AAAAT CT 
ACTGGAAATTTGACTAGGAAAAAATAAATAAGGAAGAAGCTGGAAAAGTT 
TAGGAGGCTGAGAGGGCTTCAGGTTTTAGAAGGATCTTC^ 
TCACCTTTTCACTGCATACATGTGGCTTGAAATCATTTTCA 
50 TATAGTTTAAAGTTGTCCTATAGTATAATAGATAAAGAGTTTTTATTACCTGTTACCCCTGAATTAAAC 
AAAAGGCTCTAAAGGGCTTAACACCAATCACCATAGCTCITGGGCATAGACT 
TTTGATAATCAGAGCTTTTCTGTTTTCTACATACTCT 
ATTTCAATTAAATTATTTAAAGTTAAAATCTATTTTAAGGTAGATGTGTG^ 
CAGAAAAATAACCATTCTACTGTTGAGCACATCTTAGGATTGTTTATTAAAAATCACTTTAA 

5 5 TCAAAGATTTTGGAGTTTTCAAAATAATACCTACAAAGTTTAATAA 

CTGTGCCTGTAAGTACTGCTTCTCTGTGATGGTTGGAGGC7VGGGAGTAAACA 
CAGGCAGCCTCAAGCTTGTGACTTACAGTCAGGGTTAGGAAAATGCTTT^ 
GGAAGGCACATCATTTATTAAATACCATGCAAGATCTTGTAAAGCTTACATGTCTTAAAAGT^ 
AACCCTTTAACAGGAAAATTTGGAGTTTTTAGAATCTATTTAACCTTCTC 

6 0 TTTGACCTGTTGTCAATATATTCAGTTTACTGCCTAGTAGTATTTTO 

TCATTTTGTTTGTGTGATAAAAAGCTAATTTAAAATAATTATTTTCTTATTTAAAAGAG 
ACATCACACCTACATCTTAGTTTAGAGCTGATAAACCACTTTAACTACCTAACT 
AAG CXSAAG ATT CTGGAT AGGTTTTTGTTCTTGTTGAAAAG CT AGT CAAA 
GCAACTTGGGAAACATTTAAGTATCATGTAACCAAATTATTGACT 

6 5 GGAAGTAAGGGTTTGTGTGTGTGTGTGTGTGTGTGTGTGTTTACTTGAAAAATGG AG AGTG AAAT 

AATATATGAAACTTGAAATCTAAGAAAGCACTTTGTTTTAACAATA^ 

CTCTTTTACTTTTAAATACTTTATTCCATGCAGTAAAACTCT 

TTAGTGTACTITTCAGTGACTCTAAACTAACACAGTTTCTAC^TGTGC^ 

CTTGGTTT AAAACT ATGG AAAC^UITTTTT AG AGGACATGGT AT CTT CTAGT AAG CTGTTTATTTCTG CTTT CTG AACTAAT ATTTG 

7 0 CAAGTTTTCTTGGGTGGTATAACTCCTATTTCCCCTTGGGAGGGCAGCA 

GTTGTCATTTCATATGCrAATGTATTATTTATTAAAATTAGAATOSG 
AAGGACATAAATGAAGGTATTGAGCATGAGTGACTTAAACTAATTAATTCT 
TGTGCCCAAGTACTTAGCATATAAAGTACATGTTTGTCACCCCTAAGTGTGG^ 
GCTAGTCACCAATTCAGTCTTCCTCACTAGCTACAGGACTTCCCAGGGC^ 
7 5 TCTTCTCAGACCAAGATCTGAAAAGAAGATCTATGACAACAGTGGTCT 
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AAAAACAAACATGAGAAGTTAAGATTTCTAAAATA 
AAAGAAAAAGTCTCTTGGCATTGTTTTATAAACCAATAAAATC 
TCACTAGGQTAAAATTTCAAAGTATCAGTCTTAGCAAGATAGTTTGTTTCT^ 
CAACTGTCTGCCATAGAGAGTAATTGAGAACCAAGAAACA^ 

GCTTGATAACAAGTTATGTAACATACACATGATTATGCAAAATACTGGAAAGCAGTACT 
ATGTTATATGACTACATAACATATCTTTCTTGGCCTGTGACCTTTATCACCT 
GTTAACCTG CCTTGAGTATGACAGTTGTGAAACTGGG AAGTCAAGGGACTGX5 
TAACTAGGAAGCACCACACACTTCCAAGATGGCTATTGCTTTAAAAGAAA 
ACTTTGCTGAGAATTGAGAACCAGTGGTAATAGGGTGGAAATGTTTTCTCT 

TTAAAACAC CTCC CCCTTTGTT AGC CACATGTTTCACAG CAC AAGAAGG AAT ATATTTTT ATGGCAGATTTTT CT CTT AG ATGTTG 
TTGAT AT AGT ACT AGACTAAAT AAAG ACCCAGATCGT AAGAAAATGGTT CTCATCTGA^ 
AAATGTGGTTAATTTTTTAAAGCCATTATTTCGAGGCAGACTTTTC 

AATGAGG CAGGGGTTAAAT ATCAGGTAG AGGGGTTGGATTTT AC C AACACTGTTAGACCTT AGGAGCCAT AT AAAAGCATCT AACA 
GAACTCAGGGCACTGCrGTTAGTGTTACAAACAGCAGTGCrrCT 

C CTCACTGAAGTTG AGAGGG ATGGT AT ACAT ATTAT AAAGG C AG CAAAG AATGATT ATG CTTCTGAGCTAGGCTGTCTTGCTCCAT 
ACTGTACCTTATAAATACTACTCTGGCAGTTAGATATAGGATAAACTATGTTTGGAG 
AGAGTGACTGCCAGACTGCCTAAGATCTGATAAGAAACA 
CAGAAAGCTTTAGGAGGCAGTTGGTTAGAGCJ\CCGTTGGTTTCT 
TGCTTGTCTCTAAATTCAGTGACACTCrCTGCT 
TTCTTTCTAAGGTTTTTGTGGTTCAAATCTCrCGGGCTC 
AGACCCTCTTCrCATTCCATTCCTCCTCCTGATAAATACCATCACT 
ACCrATATGTATAACTACCTACTGGTATCTTATCTTACTGACAGGC 
GGTATATTCTAGTTCCTTTTGCTCCTTTATCCAGTACCT^ 
TAGACAATACTCAAGTTGCCTTTTCTCCTCCCTGCCTATAG 

AGG AAAACITTCTACAACTGAGAT ACACTCCTTCATTCCCA.CT CAATTGGT CCTACGTTAAGGCAGG CTCCAATG ATT 

TTTAAGTGGATTACTGAGACATCTTC^CTC^CATCCTTCCCATCT^ 

AGATGAGATCAGATCACTACTGCTCGTAAGGAACAATTCAACTGCCCTTACTGCCCCT 

G T AAG AT ATTG CCTT AATT AGGTTTAT CTTGCTG AG ATAAAG ACCTTGATCAAAAAGAACATGGGGAGG AAAGGGTTT ATC C 

TTAAGGTTGTGCTCTATAAAGAAGGGAAGTCATGGCACX3AACT 

TGGGTTGTTCCCTCTGGCTTGCTCACATTGCCTTTTC 

GGTAGGCCCTTTCATATCAAATCCTACAGGCAGTCTC^TGGAGGCATTC 

TTGGGGAGGGGGAGACTATCCAAAAATCAAAACATGACAAGTGTCATGATT^ 

CCTCCrC CfTr TCrTCTTACTCCrCCTCACTClTCTTTTrGTATTTGGT^ 

CATCAGGTAAAGGTCGAATCAGAGATAGGATAGTTACTGCCACAAACT 

T AAALTTCT TT T AATTCTTT CTTT CCTGCTG CA.GTG CTTTTT CAT AAT AC CAAT CTTT AG ACTT AGT ACTTGG CCTCCAACCAT CT 

TTCCTTTCCATTCCTTCTTCATTCTCCAAAACCTAACTAAACTAT^ 

CCATCACAAGACITGGCAGTCACCTGTCCTAGCCTTTATCT 

ACCTCACAGCTCCAATTGCACAGTGACTGGGAATTTGCATAAAAT^ 

TTCTGTTTTCTATCAACCATCCAGTGAGCAGTCAGGGAAAATA 

AATTGTCAGCAGAAGAAATTCGGTTTTTTCTTGTGTATTTGT 

AAG T ACT CT AACCACT C AG CT ACAGCCTGAG CCCGT CAGAGG ATTTC AT ATTTT ATTT AAG CT ATCGCT AITTT AACAAAAAGG AC 

ATGTTTACAAAATCTTTGTTGAGTGAAGAGCCAGTTAAATAAATAAATGAACACA 

GTCTAGCAAAG' lTrrcnTT AGAGTTAAATGTACTACTT^ 

GCATGTGCATGTAACCGCAGTGCTGTGGTGGGCAGAGGTGGGAGATCACTAGGGTTTAAT 
GAGAGACCCCGACTCAAGGAAATAAGGTGGAACATGATAGAACAGGACATTTGATATCT^ 
CTGCATGCCCCCTCCCCATATATACCATGCATACACACATTAAAAATAATA^ 
GAAGGAAGGTGTTTATTCCACCTTATACTTGGAGCCCACACTGAGG 

ACCAACCACAGTGAGCAGGGCCCTCCCATAGCAGTCATCAGGGAAGCAGTGCTC 

CCCAAAGGCAGAACTTGATTTGGGTTGATATAAAACTGGCCTCTCATTTCACT 

TATTGTGTGTCrGCACTCGGACATTTATAAGATAGCTACCT 

CTCTAACTTC CTGCCTATGTAGTAT*TTTTGTGGTGATTCCTCATGGAACTGCAACTAAG CTATATCCAGAGTTGGCAGGCTATAGTC 
TCAGAT C CATCTGGCCCTT AAATT AC CATG AACT AAT G ATTTTTTT AAGGTTT AT AAAAT AC AACAAGGAAGGAATGG CAT AAGG C 
TCCAGAACCTCCTGAGAAGTCTTTTGCTGACAGATGTGGATTGATTGAGCT 



TATGAGAGTCTGAATATAGGACGGCAGTACCTCTCTTTTCTCT^ 
GCTTATCTGCAGATAGTCAGGATGACACTGCATT^ 
6 0 CAAGGTTTAATTAAAATAAGTTGTAATC^CTGTGTTAGTATCTAAAAGTAAAATTGCCACA 

TTCAAACTCACTATGTAGCTGAGGTTGCACTTGTAT^ 
TGCTTTTTAATAATTGAGTGGTATAGTTrATACTTT 

GAGGTTAGCTGATGCATGAGAAAAAGTCAACAGGTAATCTGGCAACCAGGCACTGTAATCCCAGG 

6 5 TTCAACGCTTGCCTGGGCTAGTGAGAGACTCTTGAAAAC 

AAGCCCAGACATGAAGTGAATAATAAAGAACCACATGTTTAATAACGCTCG 
CAGTCAGCAGTGGGGAAGTGTTTAAAAGGCCTCACCTGGGTCTGTGCTTA 
TACTGCTAACTGAAGGACTTTATTTTCTAAAACAAAAAAAGTTTGA 
CAGTTACCTACCTGAATTTATTAATGTCTTTTCACAGTAATGATTTGTCTT^ 

7 0 TTAGGGAGAAT AG CTTTTTGAAG CTGTT C C CTCAGT A CCAAGT ATGG AATG CCAGCAG CTGTTGGCTCTTGTTTGTTTG TATAAT A 

AAATGAATTTCCAGAAGAGTCACrGTGTATCCTCTAATAGATATTAGAAATGACACrTAGCATAAT 
ACTATTAGCCCTTTGTCACTGTXSATACTCrTTAAAATAATTATC^ 
GTTTGCAGGTGCAAATGTTAAGTGTTTTT AAAAT AGAAGTAAGGGGGAGAT 
TCAGGGTGAGACTGTAGAGAGGGAGTCCTCTGCTCTTAGCTTCCCTTGC 
7 5 TTGATTGCCIT AGG ATCAAGGG ATG CT AT AGGGCAT C CTG CTTTG CTGAC CAAAACT CATG TT AACTGTGGAGG AAAAAAGGGAAT 
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GCCATACAGTTGGGCAATTAATCACCAAATGCAAGG^ 

GTGAAATAGCAGGTGGATTACTGAAAGOTGTCTGTGGACCTGATGCAGTCAATCCC^ 
GGACAGAAGCAATTGCAGTCAACATGAAAGTACCACAGTCCCCCAA 
AGGTACAACAAAGTACCTCACTCCAAATTATATACAGGGCC^ 
5 GACCAGTGAACCGGAGTATCAAGAAATAAACTGATAGTAGCATAGCTGTCTATAGTCTTCTGATC 
CTGAAGAAAATAGCCTAACAAGTrAAAACTTTACATCTGCATGAAAAAA 

T CAAAGTGGATCCAAAACTGG ACAG AAACTTTGGACCTG CTAT CAGTGGCA 

AGGACTCGTTCAGCCTGCTTACCCAGGACCACITGCCAATGAGGGACACTGCC 

AATCAAAAAATACCCCATCGGCTTGCCCACAGGCCAATTTGGTGGAAGTATTT^^ 
1 0 TG AGG CTTGTGT CAAATTGAT ATAAAACT AGCCGG CACATCCAAGAAGT AAT AG C^AAAACT AG C CATTG CATG AAATT AGAATGC 

AGAATAGGAAAGAAATCAGTCTTCAGCATGAAGAGACAGCCTACAGAGTAGGAGGAGGTC^ 

AATCCAGAATATAGAAAGAATTAACCATCAGGAAACAATATCTGAAAAGTGA^ 

GGAAATGTCCAACATCTGAAGACATACGAGGAAAAGACAACATACTTCAGCCAGGAGGACTACCA 

TATTTTTGTGGTGCCTGCTAGGGTTGGCTGTGACAAGATGAAGCT 
1 5 AATGGATGATGAACGCAACAAGCTTCATACTTCCTCTGAGAAGCAAGCACATGGCCACAGAAGTAG 

GGCTCTGAGGTCTGAATGGGTGGGATTGACAAAAAAGATTTTCCT 

TAACTAATTACTTACCCCATCCCTATCTCTCTCACAAACTTCCTCCTCCATCTCC 

CTCTGTGTGTCCCCCCACCCTGTCAATATCAAGTTTCTC 

T AGGGT AGGAGATT CCACAGACAGGG ACAG CC CCTGCTC CAGTTGTTGGGGG ACCCACATGG AG AC CAAG CT ACACATCTG CT ACA 
2 0 TATGTGTGGGGAATCTAGGTCCAGCCCGTGCATGCTGAGTATGGAGCACTCTTGTTATAGGCT 
ACAAGTCTACTTACGAATCCACTCAGTGGCCCCAAAAGAACTAGCAGAACCTGAAAGAT^ 
TCCCTAGTTTGCAGTATTAGACCTGGAACrCCCAAGAGAAGCCCATAAGCAAAGAGT 
GTGTCCTGTCACTCCAGATGTCCTGTAACAC^ 

CTAT AGAATATG TTAAACTTTT AG CCAAAAGAATGAAGAAAG CCAAAGAAT ACAG G AACAT AT AT C CAAGAGG CAT AG ACTGT CCT 

2 5 CACAGCTTCTACTCCTTAGTCTGAGTCTAGCCAAAAATTAGTOT 

TOVGGGCTAC^GAGGTAGATACAGGACAGCTCTGGGACTGGCTGGTCATACA 

AGACCTGTCrrCAAAAATAAACTGAGAAGTGATTGAGGAAGACGCCCAACrCAGTC 

CAGCAAATAGTGGCAAGGATGGCAGGTAGTTATATTTTGGTGATTATTCTG 

3 0 GCTAATTTTTTTAGAATTCCATATAGTGTGTTTTGATCACCCCACCCA 

TTAGTTAACCACTAGAACAGCCCTCAGTGCACACTCACAGGAGTCAGTCAG 
TTGGCACTCCTCAAAATAGTCAACTCATGAAATCACT 

CACAG CCACT AAAAAAT AAAATTG TTAAG CAAAATAAACCAT ACTTGGAAAAG CAAATGT CATG ATG AAACCCATT ATTTTGTGT C 
ATTAATACATATGAGTTACAATTTATATAAAAATATTTTAAAGTCT^ 

3 5 CATGAGATTTTGAACTTTATACTATTTATCrTGGTAAAACAATT 

TAAATATTTAAGTAATATCATCTGTTGAGAACGGAAACT CT CTAGAAAGTTAT AAATCATAGCC 

T AT ATT AT AAATCAT AG AAAC CAAT CACATTCAAAGGCAC C CTCTT AATG CTC CT GT AG AAAGACT AG AAATC CACTTT CAAC AG C 
CAGGTAACATGTCCCTGGAAAGCAGTCTTACTTGATTATCTGAATTTAAATG 
ATTACTCTGAACACTGAATGACTGCAGGATTTATTTTCACAATAGTTTTCCTGTGAGTGTTG 

4 0 ATTTTGGT AGT AAACTCTG TTT ATT ATTT ATT ATT ATTTTT ACTT AT T CACTTTG CAT CTTGCTCACTGCT CC CTC CTGG T CACCA 

CCTCCCACAACCCTTCMCCAACACCrrCCTCrCCTCTGTC 
CACATTCTCrCCCACTGGGGCCAGACAACAAATCCCACA^ 

CATGAAGACCAAGCTGCACATCTACTAAATATGTGAGGGGACCTAGGTCCAGCTCATA 
TAAGAGCCCCAAGGGTCCAGGTTAGTTGGCTCTGTTGGTCTTCCTGTGAAGTTCCT 

4 5 TATTCTTTCATAAGAGTCCCTGAGCTCCATCCAGTGTTTGGCTGTGGCT 

CTCAGGACAACGTGCTTCTATCTGCAAGCATAACAGAGCATCATTAATAC^ 
AAGTTGGGCTAATTATTGGTTAGCCATTCCCTCAGTCTCTGCTCCATCCCCTGTGCCT 
GGTCAAAAGTTITGTGATTGGATTGGTGTCTCTATCATTCCACTG^ 
CrATCCCCAATACTGTGAGTCACAGCTAAGGTGACCCrCACTGATTCTTC^GTAC 
50 GATGTGTTCTCTACCTCCTCACCCCTACCAGTTGCAGATTTCCATTCATTCTGATGGCCATCTCTCCTC 
TTCTGACCCCATCCCCTCCCCACCCAGCCCCCTCACTCCATCTATCTCTTAT^ 

GCATCCTTGTGTCTGTAGAGTGCCX5TATGGTTATCCTGTATTTTATGGCTGATATCCACTTATAAGTGAATACA 
CCTrTTGAGACTGGCTTACCTCACTCAGAATTATGTTCTCAAG 

GGCTAAGATTAAAAACCCAAGAGACAGTACATGCTGGCAAGGATGTGGAAAAGAGGAACATTCCT 

5 5 ACT CTTACAG CCATT CTGG AAATCAATTTGACACTTT CTCAG AAAACTGGG A^ 

TG^GCATATACCCAAAAGGCATTCTACCGTCCCATAAAGACACCTGCTCAACTCTGTTCAT 
AACTGGGAACCACCTAGATATCCTTCAACTGAAGAATGGATAAAGAAAATGTGGTA 
GACATGGATTTTGTAGGCAAATGGATGGAACTTGAAAAATACCATCCTGTTGACATATTTTAAG 
TGGTATATGCCTTTAATCCCAGCACTTAGGAGGCAAAGGTAGTCAAATCTCTGTGAGTTA 

6 0 AGGAGAGTCTGTCACCCTNNNNNNNNNNNNNNNNNN^ 

CAGTATTGGGGATAGGAAACCTCAAGAGGCCACCCTCTGTAGCTAGACAGGAACCCCA 

ACAAAACTTTTGACCCAAAATGTATCCTGTGTGCIAAGAAATGTAGGCCTCCCCCCCT 

TCTCTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACACACACACAACT 

TCCAAAGAATGTCCATATTTAGAAAAATTTTGATTGTCTACATTTATAGATTGTGCTAGTGTCTGATTATTAGT 

6 5 AT AGTGT AT AT AACTT AC CATTT CCTT AT ATCTTCT AT AATGAACTG CAG GGG CTT CAT AAAACCAG ACTCTTGG T CTTCCT ACTT 

AT ACTAC^AGTCCTAAGTCACAGCATGTTTCTTGGCTTCTGATTCAGAG CTTGATCTGTTTTCCTTGTATCTTTG CTCAGGATTTCT 
GATTCCATCCTAAACAGTTTGAGGGATCrrTTCGTTGTTACTC 
ATTTCTCTTCGAATGTGATTATCAGATATACGTCCATGGTAACTTTT 
CAACAAACCCCTGGGCCGTAGGTCTCTTGTCAGGAGCTACCAAGTT^ 

7 0 GGCCCCAAAGTGGCtSATATTAATATAACCCATTCTCAAAGATAATATCAGGGTACG^ 

ATACGCTGCTAGCACACCAGGTTAGAGCTGGCATCAATTAATTTTCAGGTCA 
CTTAAGGCATAGTATATTTCTGCCTACCTC^TCTCTTTGTTGTCCCT 
GTGAGAGCACTCATGACTCAACCATTGCACCATCCTAG^ 
CCCTGCTTCTTCATTTTCTTTGCTATGAAA 
7 5 GAAGACATTTGCCCTATTTATTGTAAACTAAATAATTCrATATTCTTCAACA^ 
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TGAAT ATTCAAGAAT ATTTAAT AG CATGTT AGT ATTGAATAATG C CATCATAATTTC AACTCTTTGGTT AG AAAT AT ACAGTACAG 
GGTACATACTCTATTAGAAO^TCTGCAGTTGGAAiGACTTACT 

ATAT AGATGT CTCTTAAAAAGACT AGTGGTGAAT AG TT ATTTTT AT ATTATT ATTGACATTCATG TT AAAAAGT ACT CAGTGT AAA 

GCTTTGCAAGTTAAAGAACATTTCTAATTATTGAGTTAAC^ 
5 TTACTTCCTTTAGCATTGCAGCATCCCCTTGCTCATT^ 

GTACAACrTGAACTTCCT ATGT CATGGGCCTC CACTGTTG CTG CITGT ATTTT CAT AGAAAAAGCTCTTGAGTTTCCTGGCTG CAC 

AGGGGGGTATGGATGGGTGAGTGACAAAAGGAGAACTT C CGGTTTTG CAACAG AG ACTGAAGAAAAAGAAACACTCATC^^ 

ATAC^AAACATGTAGGGAAAAGGAAACTTCCCTCAAACTATCACTTCTAT^ 

AGTGTGTTGCTTTTTAGAGAAACCrTCAGAGACTGCAACT 
1 0 AGAACAAGTGAAACAGAAAAATAAGATGACCCAGAAATAAAACA^ 

AAAAAAGAAAATGAAAAAAAAAjGACTTAAAACAATAGAAGGAA 

TCTGCTTATTTATTTTGAATTTTTAAAATTAAGTGAA^ 

CAGGTGCAACATAATGGTACACTGTCTCAGATAAATAAATACCACA 

CCAAAACATTTTTAC!ATAAGAATATTTAAGAAGCTTGGAG 
1 5 CTGG AGGGTT CT AGAGTGACT ATAAGACAT CATCACACT CTGCT CTTT CT CT AT AAAAAT AACCACT ACC CTG ACAAAT ATG ACAA 

ATATACCCACACTATTTACATACTTAAACCTTTC 

TTTTITGTTTTCTrrGTCTTAACACTGCATACT 

GATG CTT AG AT ATCT ATTC CACAATTGTGAG CTTTGCCT AGTTTC^ CT ACAGTGAAAAATG CTG CTGCCAGCATC 
CTTAAATGTTTTGTGATACATGGCTACTTGTCCATTGATGGAGGTGCTGAA 
2 0 CCAAACTG AATGG ACTTCTTAACTTGCTT C CT CCrCATTGT.LT CT-ITT CTACCAAGG TTCT AC^T AT ACAAGAGACATTCAGTG TG 

GATTAAGTTTGAGACTTCATAATTACAAGTGAAGTTGAATAACT^ 
AAGTTTATATATGTGTTGTGTTGTTTGCTTC^ 
TTTATTTGAAATCCACATTTC^CTTAACCTAAAGATATT^ 
TTTTGTCTGCTCACAC<3TCCCTTTTAGGGTAAAT^ 

2 5 AAAGAGTTGTAACCCAGCCTACCATTTCrcrAAATAGAAGCCTGTAAATGTTCTAG 

ATAATT AGGTTTCCTCCT AGATATT CCTGCTATTTAAAAACATTC C AG AAGT CTGTTTTGT AGACTATTG CTTAT AT CTT CTCATC 
TTTGAAATCGT AAAC CT CAAAAAATTAAAAAT AAATGG CAT AG CTTTT C CAAG AAC CT AGGAAAATG AG AAT AAAGTT AACACAGG 
GTGT CAATTTT AGTT AGAG CAAT ATCT AGCTGGT CATTTT ATTTT AAAG AAT CGTG CACTTCAATAAT AAT AT AG CT AGCAAAT AC 
T AGT AT CATC^IG CTCAAT ATTGGT ACTGAGAG CT CT AAG CATT AG AT ATT CATCATT AGATTT AAAAT ATTTT AAAATTT AAGATG 

3 0 AAACTTT AT AGACAT AT C CITAAAGG ATT AAATT CT AAAT AAT CTAATTCTTTACAAGAGAA^ CAAG CACCACAAAAT AAAAACA 

AGGCAATTTTTATCAACTAAACATTCAACAGACrrCTTGTGAGA^ 
CTTTACTGGGATCACGTGATAGGCAGCITAATAATAC^TG^ 

ATATACACACACACACACATACACATATATAGCAGAATGATATTGAATAAGTTCAAGGT 
CCATGTATAAAAGAAACAAGAAAAAACACCATTATC^TACATATAGAAAATACCATTATG^ 

3 5 T C ATTGGAT ATC CTT ACCTT CCTAG AAGTTT AT CTT AAG CAAATT AAAATT AAAGT ATG ACCAGT AGCTGGAG AATG ATT AAACTG 

TGTCTGAATCCCTTAACAT CATGTGACCTG AGAGTGGGATGCTTGTCITC AGCATTG ATGAGGCCCTG AG CAAC 
ACAGGAAC CACAGATGGTTACTCTGTAACCCTAG CATGTGGG AGGTAGAGGC TGAACG ACCAAAAGTTTG ATCCTTGGCTCCCT AG 
GGAATTCAATGCCAGCTTGGGTTAGACACCCTGAAAAAAATAAGATTATCTATAGATA 
AATAAACAGTGCCCCCCCAAAAAAAAGTGAGGTACACAGCAATTGCCT 

4 0 AAGAAGGAAATGATGTGTATAAGGAGGTCTGCTCGTGAACAGA 

GCTTAGGTCATTTCATTATTTCTCACTGTATATTTCTAAGTG^ 
AGAAAQUVGGAAAAAGGTTTCAGGCCTTCTGTATTCAGTAATGAG 

ACTGGATG AAGAAATGCAATTGCTTTAGCATGTACAGTTCTATAG TATATGATACTGTTCATTATTAGCAAGAAGAAAAGCAAATA 
GTCTCCTATTCAAGATGTAATCCTGTAGCCCACAAAGACAGTTACCCAGCTGA 

4 5 CTGCTTCCTCGCCTGAGJVGCTTCCTC!AAGGTACTGCTC 

AGTGACITTTTCTTCC^CrGGGTTCCATTCTTGAGCC 
CTGCCCTCTTCCCTGTTCATAGAACTGGCAGGTTCCITCCCAATG 
ATAAAATAGTTTCATTTCATGACTCTCAGWVCTCTCAGTTGTA 
ACCAAGATTGGCCTCAAGTTCAGTTCAATCACCATCTCAGCCTTCCAAATA 

5 0 TGTCTGTGATTCTTATTGTAACAGACTGTrGCAAAAATGGTCTTGGTAAAGCC 

CAGTCCACTTGGTAATGTTTTCATGTTCTCAACAGTTGCCAGTTTT 
AGACATTTAGTCTCAGGCTACTAAGGATTTAAGCTCTA 
AAGTGGTTTTGTATCCTGTCTTAAACTACGGATGACATTTT^ 
CCAAGAAAACCTCTGAAATGTTTTCTTTTCACCTTATTATAGCATAA 

5 5 CAGAGAAACAACTACGTAGTTTGACAAGAAAAAAAAAAAC^CTGTAATTCA 

ACTTTCTCTGTAGAAAACAGTAAAAATGATAAAAATTTCCCAGTAATTTA^ • 
AACTTAAGAATCCCAAAGAAACCTGACCTTCATTGTTCAAAGAGCCATGTTTTACTT 
TTTCATTAGCTTGCTACAGCATTAATGCCCAAGTAGTGAATTGGTTrTTGTTA 
CATT AAGTGATGATTTAATTT AAAG TTTCAGTTTTAAAC^ 

6 0 CCAGGTTTGGGAGCTTCTGTTGACAAAAATGTTCTCTCTGCAGTGTAGTGTC 

CTCACATG^GAACACAATCTGAGAAGGCATTCTCATGGCTAATATTTACATCCTTG 
TTTAATACTTCGATGAAGTAAAACTTTGAGCTTTATTTGTC 
TGCCTGCCTGTATGTATGTAAGTATACTXjTGTGCATACTTGGTGCCATGCA 
AGTTACAGTTGGTTGTAAGCCACCATGTGTGTGAAAGGAACTGAAC 

6 5 AGTCAACTCTCCAGTCCCTC^GGTCTCCTCTTAAATATTTAAAATAGCTTAGAGA 

CTGTTACTTG^CTGTAAATGCTCATCACAGTTAAAGCTGCCAGACAGCA 
AAAAGCTTCTGTGCCCCTGGAACTrTTACATTGAGAAAAACGTCTCTAAAACAA 
TCCTATGTTGTTTCTAAAATC^TAGTAAATTCCTTTTTAGAAGAAACACAAGTTACT 
TACTCTGAAACAAAAACAAAACTTGTTTTCTATTTTGAAA 

7 0 AATTGAGAATATATGTGGGTAAATCACrACTTCATGTGTATAGCACACAACATC 



AG ATTTCT ACCCT ACAGTT ATT ACTG CATT AT ACAT ATTCT ACAACAC CAT AG CT AG CCAAAAGTGCAT ACAT ATTCCAT ATT ACT 
CTGCTATAGTTAGCCAAAGGTTTGTCATTGGTATGTGCTAAAAGCGA 
ATTCCCTCCTACTAGTGATTGATGTGTAGAACATTCTGGGACTGGTGATCTTT AAAAACA 
7 5 AAATGTGATATCTTCTCTCAGGTTGCTTTCATGGATCA 
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TCCGTTTCCTTCTAATTGAGTTGACTATATAAATTATAC^ 

AGGTAACAAGTTAACATGCTCATACTGTTTCTCTAATAGTAGGTGTGTCTTCTA 

CCCCTCCAGCTCAGGCTGGATAATATATACTACCTTCATTTTACTTAAATTA^ 

TTCTCTATAGCTGTTTATTTTATTTACGAAT^^ 
5 ATACTTCAAGGCTCCCCTGAGATCAAGAATGGTGTTGAGCGCATAAGACATGTCTCT 

TGATT AT AG TGTTTT AATGACTCACATGAGTTTT AAAAT AAAA^ CAAGTGT ACCAGATT AATT CAATATAAT AAGGAT 

CTAAAGAAAATGTAAATACCATGAGATGCTTTTGTATCATTAAGT^ 

TCCTTGTGTTCATGGCTGGCCTCTACTTCTCTATTCTCTCrrCTC 

CCTCAATAAACTCTATTCTATACTATTTTTAAAAAAGTAATTTCTT 
1 0 TTCATATTTACTACAGTATATCTCTCTGGTGCCC^ 

TACTAGGCAAATGTCAGGCATAGCTGAATTCAGATCCAGGTAGTGTTTT 

CTTTATTCTTGATCTATAAATTAAACTTAGAATTATTTATGTAAGTTC 

ACTTGTAACAT^XSTTATTACTGTGATGGGACATTTTGTT^ 

CCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGACGGATTTCrrGAG 
1 5 CCAGGGATACACAGAGAAACCCTGTCTTGGAAACAAAAACAAAACAAAACA^ 

TCrAGAGGCTCCri'CTTTACTTCTCGATCn'AGTAGTGGGAATAATGTGTCTCTCCTCTT^ 

AT AT CCAAGGG AGGAGTCAGAAGACT CATTGGCACAACAGAAACTAG CAG CT CGTG CT AACATGAGAATT AAACTTT AGCCTTTGA 
TGAGGTAAAGAAAATGGTCTCAAACAAAACTGAAGCCTCTTCTGAGGGATTT^ 
TGTTGCnTTCrGTTCTTAGAGATACCTCTGGAAATTTGA^ 
2 0 TTTGAAAATTT AACACAGTT ATAAAAT AATTCTTTG C CTTT AAAATG AT AG AACT AAG CT CAAGG AGTT CCCTACTTTT ACTTTGT 

GTTACTGAGACGTAAAGTAAAAAGCATTGCTTCTC^ 

GTTAATCTAATATAAGAAAGACCACACTGGTGTGTAGGAAGCAAGAAGGCATGATTGAACTTTACTA^ 

CAAAGCCAGTGTCTACACCTGTAATGTTAGTCCTTTTAATAAAGA^ 

TCTGTATTGGATATAATCTTATAAACACCTATCATATTAGGTGCTTCAAATTCT 

2 5 T ATTT ATG CATTTTT ATTTTT ATT AGAACCACAT AT AAG AGG C C CATC ATGTG AAGAGGGG CG CAAGC CATGGCCCTGGGCCTCTT 

CAAAGGAGTT ACTAGTAGG AAGAATACACACAT ATCTAAAGG CACCAAGG CCACTGTGTAAAATATTACACAGTG ATTGAGTCCCA 
AAATCTTATGGTTCAGAAATGCCACAGTAGATGC^TCAT^^ 
ATACTAGGCACTTCGTAAATATTTATACAACACAGGTTCTCTGAAATGACA 
TTGGAAAAACGAAAGGGGCCAGACTGTGAGCCATATTGTGCTCATACT 

3 0 AGGACTGACTGTCXIACAGTGATGGGATTGGTTGGAATGGAAAGAGAAAAAGCTAA^ 

TGGTGAAAGATGAACATATTTAAGGTGGTCACAGAGAGGACAAGAAAGAGAAAATT 
TTGGGCTATGTCTCCAAAACAAACAAACAAAATAGTAATAGGTAAGATATGA 

T AGGCTAC CAAG CAC AGT AAATGATT CAG AT CTCT CTTT AAGGGTGTGATGG TGT AT AC CAATGTG ATGG TGT AT AC CTCT ATTCC 
TATCATCCAGGAGGATTCTGGAGTAGGATTCAAAGTTCCACTGTCGCAGCTAACAAGTGTTAGT^ 
3 5 TC^TATGATTTTGCAGGTTCATGTGGACTGAACCAAGTC^ 

GTAACTGACCATTGCCTCCCTCATAGCCTTGCCTTCTGTGAAGATGCCACGCTTATGATTCT 
TACTAGCAAATCAGTTATCCTGTGTGTTCTTAATAGTGAAGAAGCATCAAATGTAGTA 

ATGTAATAAAAACTTTG ATTCATTGTC CTT AC CAGTCTT C CTTCATCTT AT AC CATTG CTT CTTCCTT AAAACTTGTG CTTCTT AAC 
AAACACTCAAGCTAACAGAACCAAGAACACAGCATGAGAAAGAGAGTGCATGGAATTTACAA 

4 0 AAAAGATG ACACAGTTCTTAAGAGGTAAATTTCTTAG C!AT AAGATTCAGG CTC AGG ATTGACTGTTCATG AGAAAGAAATACATTG 

TTTCTTCCTGTGAAGTAGATTAGACTTTAGGTGACCAATTTATTCTCAA^ 

G TCAAG C AAATG ATGC C CAG AG AAC CAG CAGG ATT CTTT ATGT ACAC CTAAAAGTG AAACCTTTCT CCAT AAATGCAAAG CAG AG A 
GAATAATAACCAGACTGCTTTGTATTGCTAAGATCAGGTCA&ATGTGTCCTACTGA 

4 5 TCCCCTTGAGTACITCTGTGTTGGTCTGAAAGCTAATACATAGCGTATTAT^ 

TAAACAGAAAGTCAGAAAATATAGTCACCTGGTGTGTGCAGCTGAGTGTTTACATT^ 
GTTGATTTTAATGTGTAAAATTCAGCTTCTACTGCTAGCCCTAGAGG^^ 
AT AACCAGGC CTGTAAAGT CTCTGGCTTCTACrCCATTTTG 
TCAATAGAACAATCATTTCTTrrAAAAAATTTTATTAGTTTGTGTGTGTGTG 

5 0 AT ATG ACTGCT ACATAATATTAT CCTCCTCTCCAACT 

TTTCTTCAGTTATTAGT0X3TGTGTGTGTGTATAAGATATAGCCTC 

TGACTACTTGGTATTTGATAACCAATTAGAGGGCATGTGCC^TCACTGACAAAGACT 

TGTTTAATCTAGAGATTAAACCCTATGAGATATCCnTCTTCCACATTGGCATGTCAGCTGCT 

AGGCAACCAT ACCATTG AGATTTCATGAGTAT AACTGTCCT ATC CAT AT AGAAGATC CTATCTCAGAGCTGCTCCTGTCCT CTGTC 

5 5 TCTTATAAACTTCCTGCTCTCTCTGAATCTTAGCTGTAGGGATTGTGATCTAG 

AAAAACX3TTTCTAGCTCITGTCTGATTTTATTCATTCATCCAT 

CAC CACTG CT G ACACCT AAACT AGG C CATT CT AGT AGGCGGCTG TCTTGTGC ACCAGAGGGTGTTTACCACGT CTTGGGCCT ATT A 

TCCCTTGCAAAACTAGCTTTTCCCCCATTTGTGAGGAATTAAA 

CCCCTATTGAGAATTACAGTCCGATAGCATTTGCACAGAATGAGGAGCC^ 

6 0 TTAACCCAGTGATGGCAGCTGCATTTCCAGGGTTTCCCCATG 

AAAATTACTGATTTAAATTCACCTAAACTGAAGGCTTAACTCAATCACAACACT^ 

GAACAAAAAAAAGCTTCAAGTTTAGTATACAATAACA'lTl CI 1 " ! i I ACTCAGTTCATTACTATCCCATGT AGAAATAGGACAGT AG 
AGGATTTATAGATACTAAATGATTACCACACTCTTATATT 

AATG ACCC CTG AGAAT CT CCAAG AC CAT ATCAG AAT ATTTTTGTG ATG ACACTTG CACTTGTG AACTT AACATTTTTTTTGT ATT^ 

6 5 TTTTATTTACTATCACCTCATTTAAAGGGTGTCAAGTCCACTGAGCATCCTTGAGAA^ 

CACGTATCTTTAATATGTGACAAGATAGTAAGAGACAG<IAGTACT^ 
GAAATACGCAGTTACTATAATATAACTTTAATTTCTGCCATA^ 

TTTGTTGGACCCTACCCCTCCTTAGTCAAATTT AATT AATTT ATTT ATTT ATTTTT AAT 
TTCTAGTGCTATCCCAAAAGTCCCCCATC^CCTCTCCCCCACTCTCCTCCCACCCCCACTCAC^ 

7 0 TGTACrTAGGCATATAAAGTTTGCAAGACCAATGGACCTCTCTT^ 

CTAGAGACACGAGCrCTGGGGGTACTGGTTAGTTCATATTATTGTTCCACCrrATAGGAGTGCAG 
TTTCTCTAGCTCCTCCATTGGGGGCCCTGTGATCCATCCACTA 

TAG CCT CACAAGAGACAG CTAT ATCAGGGTTCTTTC AGCAAAAT CTTG CT AG TGTATG CAATGGTGTCAG CGTTT AG AGGCTGATT 
ATGGGATGGATCCCTGGGTATGGCAGTCTCTAGATGGTCCATCCTTTC 
7 5 GGGTGTTTTGTTCCCAGTTCTANNNNNNNNNNNNNN^ 
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CTTACTCAGGATGATGCCCTCCAGGTCCATCCATTTGCCT 
TTGTGTAAAGTACCACATTITCTGTATCCATTCCTCrGTTG^ 
CTGCTATGAACATAGTGGAGCATGTGTCCITCTTACCGGTTG^ 
T CCGGT AGTACT ATGTC CAATTCTCTGCAG AACTGCC A 
5 GAGGAGTGTTCCTCTTTCCCCACATCCTCACCAGCATCTGCTGGGACACT^ 

TCTCAATTCTGAATATCTATGCTCCAAATGCAAGGGCAGCCACATTCATTAAA^ 
GCTCACACAATAATAGTGGGAGACTTCAACACACCACITTCATC^ 

GAAACTAACAAGTT ATG AAAAAAATGGATCT AACAG ATAT CT C CT AAAACAAAAGG ATAT AC CTTCTTCT CA 

GCACCTCATGGTACCTTCTCCAAAATTGACCATATAATTGGTCACAAAACAGGCCT 
1 0 ATGCATCCTATCAGATCACCATGGACTAAGGCTGATCITCAATAACAAAATAAATAATA 

ACAACACTCTTCTCAATGATACCTTGGTC^GGAAGGAATAAAGAA 

ACAACATACCCAAACTTATGGGACACAATGAAAGCATTTCTAAGAGGAAAACT 

GAGAGGACACATTAGGAGCTTGACAACACACCTAAAAGCTCTAGA^ 

AAATAATCAAACTCAGGGGCGAAATCAACCAAGTGGAAACAAGAA 
1 5 GAGAAAATCAACAAGATAGATAAACCCTTAGCCAGACT 

AAAGGGAGACATAACAACAGATCCTGAAGAAATCCAAAACACTATCAGATCCTTCT 

AC CTGG ATGAAATGGACAAGTTTCTAG ACAGAT ACCAGGT ACCAAAGTTAA^ CAGGTT AATG ATCTAAACAGTC CT AT A 
TCCCCTAAAGAAATAGAAGCAGTCATTAATAGTCTCCCAACCAAAAAGAGCCC^ 
GACCTTCAAAGAAGACGTACCCCCTCCTTAGAAATATCATAGCAACAATATTTGCA 
2 0 T CAGT AT AG CCC AAT CAGTTCCATTCTCTGTG ACIAAAAG CTGG TTTTT CC CACTG ATGTCTATGG CACACAGCCTT C CGTG CAGCC 

TCCCACATACTGAGTTCITATCCGCTACATCrAGATGGCTTCCTGAAG 
AACTCTGTTGCCATCTACATAGATCACACCTCAGTGACCTGAACATA^ 
TCTACTGTCCTAGATTTAAGACTTAAGATTTGTTTCCCTTAA^ 
ACTTTGAAC^CAAAC^CAGGCGGATTTCTGAGTTTGAGCCCAGCCT 

2 5 AAAAAAAAAAGAAAAGAAAAAAAG AAAGAAAATAGTG AACTTGTATGTTT CAATTCTTTGCTGCAAGTACT AG ACATCT AAACATG 

TTCACAGGTGACACTTCTCAGCrTCCTGATAAAAACrAGAGCTCATC^ 

GGACTGCCCAGACCAAAACAAATCAC^ATTCTCCCTGGTCAGCCACGTGCTACTGTTTACATGAC 
CTTATCACAAGGTTGTTCAGAGGATAAATGAGATTCCCTTATGG^ 

TGAAAT AT ATACAT AT ATTTT AT AT ATGTG AATATT ATGTTT AAAACAGG CAAACGAAACAAT AT AG AATTG AC CTT AAAAATCTT 

3 0 TAAACAATG TGCTT ATTTT C C AT AAACT AAATATACAAC<LAACTTGCTGATCCCTAAAAAT AGATGT CAGATG AGAAG AG ATTGGC 

TGACCAGGAATCGGCGCGTCCTCAGAACAGAAGCTCrGGAGGCTTTGCTCC 

AGCAGTCATGTCTGGACAGTCTCTTCCATTAGAGGTTCACACTTTGCACTCAGTCCTCTT 
CTTGTGATATGGAAACAAGCAGACACTCTATTGACAGTGCTCAAAGAAC^ 

3 5 GCTGCACTATTGACTAAGCTACTGGATAAACAGACAGTTGGCAGCATGCAG^ 

TTGC^TTTAATGACTTACTTAATAGCATTCTAACACATTTTTGTTTGTTTCT 
AGAACTGGCAGATCCCAGACCAGGCTCAATTTTATAGCAATCITTCT 
ACAACTCCCAGCTACAGTTACTATTTTTTAATTTTTGTTCCTA 
ACAATTTTTAGTTGTGAATGTATTAATCTTTTTTTTTTTTTTTGCATTT 

4 0 CATAAAGGATGAAGAGACAAGCAACATCCAGTCTAGCTTATTCCT 

ATGTGTGTGAACTTTCCTATATAACTGATAAACAACATGATTTACCATC 

TTTTTAAAGTCTGTTTGAGAGGCTAGAACAAGACGAGGAATTGGCAATATAATCAGAAGCAAAAGGCT 
CATATGCTTTTGTTCCAGAGTCAGCATGTATGGCAAAAGTT^ 

ATGT AAT AG CT AATTT ATC AATT TTT CAT AAACCTT CGTTG ATTCAGG T AAAG^ AT AAAT ATCCATT GTT CTTC CAAATTCTCTG A 

4 5 AAAATT AAAGACTTTGAAT AAG AACTTT AAACT AAT AAT AT AT CT^ TGCTGGT AAG AAG TTTG C AG AATG AAAG T 

GAGCCCTC^TCAGAGCAGCAGACTCGGTTTTTGTTGTTTTCTCAGG 
. TAGAAAACTAGAGGGTTTCCAATATACATC^TAATGTTTCTAAACTAATCT^ 
ATCAAACTCTAAGGTGCTATGTGGTTTTTTTTCCCTATAGTAGAGAGC^ 
AAGGAAAAGAAGATTGATGTTTA71GGGTTAGAAGG AATTT AGGGGTGTGGTAGAGA 

5 0 ATGGGT CT AAT C C ATTTTTTTAATCITGC CAAT CACAT ACTT AG AAAAT AATG CATTTTT AAAAAAT ACT AATT AT ATGTT AAATG 

AACCACAACATCATTCTGTTTAGAAGCCAGAGAACTAATTGGCTCTTGCACT 

ATGCATGGGAATATGCAAATATGTAGAGACCAGAGCAGCAC^TAGGGCAT 

CIATGAATGGTAACCTCACTGTCTGTTTGTGAGCTCTTCATATCCACCTGACTCTCTTCCTCAATG 

ATTACGGCCTTCTT/TTT AAGAAAGTGGTAAAG ATTTGAACJTTATATCCTCATGCC^ CT AGG CAAATGATCTT AACCACTGT ACCA 

5 5 TCTTGT CCATTT AATGCATTTCATATTTCATACAGAAAAAT AG T ATATTC CAAT AT ATTG AAATC AGGT ATTG GTGCTTG TT AG CA 

CTTAAAAGACAGAATTCAGTGTGAACATGCTGACATGGAATGAGAAGGGGGAAGAAATA 

CCAAGCATAACCCTGTTT ATGCTTCCTATAG CCCTGGAATGAGACCAG ATGTGAGTTCT CCTCCGTCCAGCTCCTCCACAGCAACG 

GGACCACCTCCCAAACTCTGCCTGGTGTGCTCCGATGAAGCTTCGGGATGCCATTATG^ 

CTTCTTTAAAAGAGCAGTGGAAGGTAGTGTGTGTTTTGAAGAGTTTAAITITTGI'1'CCTATTTC^ 

6 0 CATTGTATACAAAGTCCACTAATGATAGACATTAGTTCAGGTGCAAGCATGGTGGCT 

GTAAGGAGAAGAACTAGAATAAGTAATTAAGTTCAGCCTGAGAAAGAACTAAAGCAAGCATTGTAAGGA^ 
GGCAGTACTCTGAGTCTGGGC^GTCCGAGAATGGTGGCTTTTCATTCCCTGTAAGCAAG 

GACTG ACAAAATCAGTG ACC CTG GGG AGG AACAAT AAGGG AGG GACAGCATTCTTTG CTTT ACTTTTTTCCTGTT AAAT AAAT AA^ 
GAGGTCATGTTAAGTACCACAATGGCTGAGGCCAAATTAACAAAGTACAGAG^ 

6 5 TTGAAAGAAGATGACTGCAAGGTGATCATGTGTTGACGGG 

CCTACCCAAAAGTATCTTAATGCAATTAAACCAGAAAATAAAGCTACATTGATC^ 
CTTATTTAAAATGAACTCATTCCCTTATTTTTATGTAAAAATAAGTATTAGCTA 
TTCAGCAAAGCGTGGTGGCTAGATTACATTAGTAAGGAAGCCTTTAGCTC 
TCCTCCCTAACTTGAGTTTCAGTGTTCTCAAAGGCACITTC 

7 0 AAAATTTATCTAGACTTGTAAAATCCATAACATAGTTTGTAGCATT^ 

ATTACATATATAACATAACATTTTAACACATACTAC-AAATTATACTTAAACCCT^ 

AGAGTTAAGTTTTACACATTTGTTATTTATATAAGACAAATCTATT1T1T 

CCAGCTGGTCCTTTTTACTTGTATTCATTTTACCT 

CTCTTCACTTGCCGTGTGTGAAGTACATGCATCCAAAATCAATTAGTGGCCCTC 
7 5 GCCAGACTAGCAAACAGGTTCAGAAAGTTGAAGTAGCCAAGCT 
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CAAAGATC CT CAAAG ATG CAT CTG ACAAG CCCTTGATAC CTG CT AGATT AT AATTCAGGT AGATG CTGAAAT AAAAGT AAT CACAA 

GAGATTT AACT ATT AGCT ACT AT AT ACTGAGTAGTT ATGGTGTGTT^ T ATCGTCTG CAT AATT ACTT CCAAG CTTGGCA 

AATTTTCTAACAGTTCATAAGAGAACAAACTCCAAAAAGTTAATGT^ 

ACTGAGAACTGCTATAGAAGCGTAAGCTGATAGTATTAGTTTA 
5 CTTTTACATGTGTTGTTTCCCATTTCCTATACCATC 

CTAGAAAGGATTACACTACACCTTATTAGGAGACAGATATCACTGGCT 

AGGC^CTGTCAAGTGCTAAAACTAGATTTAGCCAAAAGCACCAACA^ 

TTCCCTTGACATTAAGAAGTTAGAAAGGAAATTTATCAGGTGGCAG 

TAATTGGTAGTACATGGCC^TTCTCATTCTTTGTGA 
1 0 ACT AGTGGAT CATTCT ATT AT ACT AACCTG AGC CT ATAGCT AAG ATT CTCAGTT AT ACT ACACACTGGATAAAG AAATAAAT ATCA 

GCATAATACACAGTCAATCTTTTAGTTAGACATAAATAAACACT 

AGCAACACTG ATAT AAGG ATT CTCTGAAAGACAATGG ATGT AGGGGTGTT^ 

ACAATCTGTTGTTTCTCCTTTTGCTGCATGATGGAAGTT^ 

TGTATAAACACTGTGTGCCTXX5TACCTGCAGAGCTAAAGGG 
1 5 GCCATGTGGGTGCTAGGAACTGAACCAAAGTCCTTTGCAA^ 

TAATAGGATTTTATTGAGTTAGTTCTTCTATTTTGTGTTTAAGAA 

AGCATAGATATTAGGCATGGGAAAGTAACCCTGAAATAA 

TATATATATATATATATATATATATACACACACATACATAGATATGACTATTAGTCAACT^ 
AGACTAATACAGTTAACTATGTACTCCAAACTCATATTTAAACAACATC^ 
2 0 TTTAATCCAAGTCTTAGATTGCCCTAATAAAGAAAAAAGTCTCT 

CIGTTACAATl'ClTTCrCATrrC'lTTGAATATGATTTATTCTAATA^ 
ACAGTTAAATCTTGTGTAGGAAGCTTCTATACCCCACCCCAGTCAA^ 

ATTAAAGAAG ACAGTG CATGTGGGAAACAGAAACACTTTT AGC CTTTTC CTT^ CAGGAG AG AA 

CTGAAGTTCATAATATGTTCITCAGTAGTGGATACCAGAAAATGGAGCCACTAGCT 

2 5 GACTCCATG CAT CTTTGTATTCT AAAAGT AT ATT AAATAATGATCTTTGTTAGTTTG CT A 

ACCAGTGGCTAGAAGTTGGTAACATTTTACTTCATTTAAAATTG^ 
AATTAAAGTCATCITTTTCCTTAGGGATTCCAAAAGAAAATTT 
ATAATAAAAGTAATAATTCCAAACCCCJVTTTATAACTGTTAAATAACTATAGAAA 
TTATGATTTGAATACATATATATATGAAATACTGTAGTTGGGTTC 

3 0 caacattgtaattcatgcagataaacagacttaacatccttagtcagtgact 

cagaaagggg at agg ctit 'i t c 1 1 t 1 aaaat atc ctt acag at ctgtggaatttt aacaatg cccttttaatctcttataggacag 
cacaattacctttgtgctggaagaaatgattgcatcattgataaaattcgaagaaaa 
tcttcaagctggaatgaacctggaaggtaatggaagcttaaggaactgtctgc 
aagctagcaagtgaaccatgtaatgaactctatatatgcttaattagc^^ 

3 5 CATTTTG TG TG TT AATTCAACTACTTTTGTT AT AT ACATTG A^ C CATGGT GT AT ATT AAGT AAAG AG ATT ACAT AA 

TAITGAAAATTACATAACTTGGAGCIAATTTTGTTTTCAT 
AAAGTATTGTACATGAGATCTTAGTTCATATTCCTCTCTTA 

TTTATTACAACTCTGCCAACAACATTGGATGGGTTGTTTTATGGTTTAAGTATAAAGGA 
TTTGATATTCCTGCTTATATAAAAAGTTATATAATTAACACTTATTTAAAAACTTCCCT 

4 0 GGAAATAGCAAGCTCTGTGAAAGTAGCATTCTTCCTAACTCATGTTGTTAGTATTGACTGTTACTG 

TCAGCAAATATAAAGAGAATACAAATTTAGTACCTGAGTCTGAGCAGCTTACTT^ 
GTATGGCACTCTACAAAGGCAATACTATTACAGGACCAAGACGGAAGGCCATCTGAGCCTGTC 
AGAAGTAGCACTGAGCTGCAAATACAGACCTTGAGGAGGAATTAGATAGCCATGACAGAGACGTGCT 
GAGCAGTCTTTGGGATAGACAAATGGATCGGTGGCAGGGGATCCAACTTCACTTCTC^ 

4 5 TGCCTCTAACTGGGAATTGAACTCAGCATCTCTAGAAGAGCGGTCAGTGCTCTTA^ 

AAAAGTTTAAGGTGAACAGTAAACTCTTGGTTCCCTGTTAGAGGAAACAAGTCCT 

TG<X5AGTTCCATGGGTAAAATCACTTGCTAGGCACGCATGAAAGACC^ 

AGCTATGCTCTTGCCnATTAACCACAGAATTTTGAAGGATAAGGTGG 

GTCCAACGAGAGAGCCTGCCTCAAAAGAGTGATAGACTTGGATACCCAACTTCTTCCCCTG^ 

5 0 CATATGCACGTGTACATACACCACACTCIACACATGTACACACAGATTC^ 

TCAATAATAGTGTCAGAAATTAAAGACCTCTTATAACTTGAAAGTATAGGCTACAAAGTTGAT^ 
AATATTCACAGCTCGAAAAACGAAGAAAAAAATTAAAGGAATTCAGCA^ 

CTAACAAAACAATAGTTCCTGCCGCGCTGCCACAGCTTACCCCTACCCTGGTGTCACTGCTGGAGGTGATCGAGCCTGA 
TATGCAGGATATGACAGCTCTGTTCCAGACTCAGCATGGAGAATTATGACC^ 

5 5 CGCAGTGAAATGGGCAAAGGCGATACCAGGTAAGGCATGAAGGATGAGCCACGGCATAAATCTTTGTGTT^ 

CTCTGAGTTGGGCTTGGTGAATAACTTCGTGACATGCCATCTGCTTATCATACACAATCCTTG 
AATAGAGCTGCTCTGAGACTCTAG^CGGTCTGTCCTCCTGACGGAGCTCACGGCAGAGCTG^ 
GAGAGATCCAAAGTATACTGGATACTGTTCCCTGACTTTGRGGTTACACA 
AACTAAAATTCCATTCACAGCGACACCTGGAAGACAGCATTTAGAAATTC 

6 0 AACAGCAGG1TTTGTTGACTTTGTCAAAATTTACTAACACAAAGTTA 

TAAACTTTATTCTAGCTTTAAAGGTATTTTTAAACA^ 
CCCTTACTTTACAGATAGCAAAGTGGTTGGCACGTTTATGGCTGCATAC^ 

AGATGGATGTC^GGAAATTTTAAAAATCTAAATCrGTTAAGAAAT AGGAATCCTAAG CAAAGACTCTCGGCTTTCTCTTTTACTAT 
ATATTCTACTTTGTGTGCACTTAAATGGACAAAATTGGCCAAAAA^ 

6 5 AGTGCCCAGGCGTTGATCrGGTCAATGGCTGTCTACTAAAGATCACATCCAAACAAGGCCAACT^ 

ACCTTTATTTCCCATCTAGGATTCAGAAACTTACACCTGGATGA 
ATTTGCCCTGGGTTGGAGATCATACAGACAAGGAAGTGGAAACCrc 
CATGGCCTTAACCCTCCACAAAGAACTATAGCAACTAAGCAGTGCTC 
AGTTGGTTATCCAGTACCATTGGTCATCCCTAGAAGCATGCATACAAGTAAC^ 

7 0 TAGGAATATATATATGAATATACATGCATGAATGTAACAGCAATTCATGCACAAAGTGGCC^ 

GG AATGTGAAAAGGTTTAGAGAGAGGAAAGGGAAGGGAAGGGGCAAATGACTTTGTAATCTGAG 
AAATAAGCAGACTAAAAGACCTGTAAAAAATAACTAATCCCCTAATGTTTCCATAAAAGTAT 
AGCTTATTGAGTTCTGAGTATATAAAGGATTCACAATTTTGTTGTTTCAA 
TATTGAATAGTCTCTCCATGTTCTCCAACATTCTAGTTCTATAATAGGTTATTGGGAATAG 
7 5 TTCATAAATTTAAAAACCAACCCAAAATTCIAGCTAATATTTTAGACTAGCA 
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CCCAGGOtfaAGAATGACTCTACCCTGCATGTAT^ 
ATCCTATGAAGAGTATCTCTGTATGAAAACCTTACTGCTTCT 

TAAATATGGCAAATGCATGTGAAAGTTCTATAACTATCGTGCATATATGTGTATATA 
ATATACATATATATATATATGCCAATTTCATTGTGTTTTCCTAGTAAGATG^ 
5 TATATATTAAGTGTTATCAATATTAGTAAAATAAATAGGTATC 

GG CTCCAT AT AGAGT CTTCCTTC CTG CT ACCCATTCATCTAT ATCTAT AT AAGATGTGGAGAAAGAATG CCAACAGT AATG AGTCT 
GTCTCTCT ACTG TACG AGTGTTT CT AG AT AAACAG TG ACAGCT AC CATGAATGGGG AAT AGCTGAGAAAAT AAT CANNNNNNNNNN 
NNNNNNNNNNAATGTTTAAACTTrAGACTT^ 

TCTATATTAGTTAAGTAATAAAAGGAAAGTC^TACCTGATATTCTC^ 
1 0 AACCCAGTTTTTAAGGACTCGGAATATGCAAAGATGACTTATTAA^ 

GGACATGTCT AGAATGAG CCAATCCAGAGT AG Tt3 AGT ACACCAAATACTGGTTGAGGACACAGAGT ACAGGAATT CTTTTTCACTG 

CTGAAAGAATAGCAAAGTGGAATGACCATTTCAGAAGACAGTTTO 

TAGCAATTGCTTGATCAGTATACAAAGGAAATGAAAACTTATGCAGC^ 

TTAGGCGCAATGAGATATATCATCCAGTAGACAAATCAATAAACTGACATATG 
1 5 TGGGCTGTTGTGCCAATAAAAAAATATTATTGAATCITAAATATAT^ 

CT CT AAGAAACAGATGGAGG CAGATGAG ATGACTTGTGT AAAATGGAAGCTCAAC ATGCC CAG CAAG CC AACTTTCCTCAGAAATT 

TTACAGGTGCTTGTGGTGAAGATAAGAGGAAAACTCTCCCATTTTTATGTC^ 

TAGTATTTCCAAAGCAAAAGCAAGGCCTGCCCTATAAGAGAA 

ATTCTAGACCCTTATAATAGCCCTCCTOACrrTCACAGGGAAGAAAAGGGGAAAATGC^ 
2 0 AGTGAAATGTAATCITAAGAGAAAACATTTCATTGGAGCCT^ 
AGGCAAGCATGATTCTGGAAAAGTAGCTGAGTGCTACACT^ 
TGGACTTTTGAGACCTCAAAGCrTATCCTACAGTGTCACACA^ 

ATTAAACACZAGTCCAGAGCCTAGCCAGATTACCATACTATCnTCCACCAAATC 

2 5 AGCTTT CAAT AAAAAACAAAGC CAG CATTAAATCTG CCCC AAACCACTGG CATTATCAC ATTGGAATTT AAAAT AACTGCAAT ATT 

TTTGCT AAGGACIACTACIAGGAAAAATAAAG AC CACATGAG AATT ATG AATGAAGAGT CATAG AAG CT AAGGAGGTTCCT CTAAGGG 
AGCACTAGAGATAACAGATGCTGAAATGATAATGCACAGTGCT^ 

G AGCAT AGGG CAACAAAAA CCTCCT AGACTCAAAT AAAAAGAGGG AAA TGT AGAGAGATCAT AAAAAAT AT 

TTTAAGTAATAAAAGTTCAAAAATTTCTAAAATTAGTTATATGTACCAAAATGTC^ATGC^ 

3 0 GAAAAACAATAACAACAAAAAAATCTATATTTTGGCATATCCTCTTCAAA 

GTTTTAAAACAGGAATGACAACAACCAGACITCTTATCAGAAATC^ 
AAACAACCCACCAGTCCTGAGACAATAGATCCAACAGAATTACCT 
CCTAGAATTCTAGCACTCCAGAGACTAAGGACTGAAATTCAAGCCAGCCATGGCTAC 
ATGGGTAAAATAAAAACATTCTCAAACAAATGIX3TCTTTTTAAAGG 

3 5 TTTAAAAATTGTT CAG AAG AGAGG AAAAT ATG TOU3 AATT ATGTGCTG CAT AA^ 

GCCTTTAATCCCGGCIACTTGGGAGGCAGAGGCAGGTGGATTTCTGAG 

GGGCTACACAGAGAAACCCTGTCTCGGAAAAAGAAAGAAAGAAAAGGAAGGAAAGAAGGAAGGG 

ATAGAAAAGGGATGCCTGAGAGTAAAATGTCAATTTTCTTCTTAATGTTA(!AT 

GATCTGGAGAGGTGGCTCAGCAGTTCCCAGCACCCACATAACCACTCAGAACTGTCT 

4 0 TTCTAACTTCTGGGGT ACAT ACTTG TGG TGT ACAAACAT ACATCAGG CAAAACACTCATG C AC ACAG AT ACACAC AC ACACACACA 

cacacacacacacacttttatttatcttacaaattaacaatcgcagacta 
agaggtattgtagttaagttaggaggg«aaatttcagatacactctggcato 
gaagg<;agaacaggttagttgtaaatgtgtactg 
catgagcatgtctggcacactgaaggctgatgggagtctccagagtcta 

4 5 tt'ccccaaacctgatgtgcttggtcagaacagaagcacctctcggggccaagctaa 

agttggaaaaactgctaagtgtgagcaaaggcaatccccagaaaciaa^ 
agcacaggtctgcttcagtgagctccttgg^gctgtttctaa 

agacaacaagaaagagg aatggaaaccag cattt cctcctac^^agattaagttcagtataatggagaaagaag^aaggagaagaa 
agcctggaag<3agaatgacagacttaattccctggtgacactgcagacgctcgg 

5 0 aaagcctgctgaacggaaggagaaggacagcgagagttgtgcccacagcgataagggagac^ggttca 

aagctttc^aatcagaggaagatgattaataatgaagtggatctccttcagt 
ttcaaagaaaaagactgaaatovaaggcrcratagaagtgcttctaaa 
acagacaccctttcctggacotacrrgacctcacctggtacagtatgaaggttgtaaaatggaag 
tggtacacatcacaaattagctatacacaggatgacacc-gtttgca^ 

5 5 catttggatctcccatggttatgaaccccaattctcagttagcctatactttcacttagctat 

catgaattttagagtgtattgaatgtacattgtaacgatgaaatacgcatgaaatgtgct 
ggaaaatttaactgacttcatcttaccttttagttcctaaggaaggtctc 
tatatcaaagagctaggaaaagccattgtcaaaagggaaggaaact 
tttggactccatgcatgatgtaagtaccaaacataaatccaaaacaa 

6 0 ccittaatcacagcagacacagaggtagagcaagtagatgtctatgagt 

gagatacgtaagagatcctatctcaaaagcccaaagacagagacagagagagac!agagaca^ 

TGGTTTAATTAGTGTATGTGTATAGTTATATACCAATCAGTCATTATAGTGATTATTCAAGATAAATAAi i r i 
TTAAGTCITACTAGTCTGGTCATTTGCGTAAAGATGACTCT 

TACATGAATGAGCTATCAGATTACACTAGTTTCACCTCCCTGTGTATCAGTACTGGTGC^GTCTCCTTTGAGCC 

6 5 TCCTGACTGCTC^GATC^CAGGCATGCCAGGCTTCATTCTGCACACAGTTTAA 

AAATTATTTTCTTATCTGAGGAATTCCAACAAAGATTGGTATATTTGTC 
GGATCTATATACCAGGTCITCTGTGTGTATATTACGACTTTCAGTTTAGTGTGTCTC 
CCrGACTCTTGTTCCTGTTCTTGGGTTCTCCTTTTTCTCTTTGTm 
TTTATTTTATTATCACCTCTTAGAAGCACAATCTTTT^ 

7 0 ATACATTACATGAGAAAAGAATTTATTTTCAGTAAACAAAAAAATATCT 

TCGCAGTTATGCTTTGAAGCCTCACCATTACCGTATCTTCT 

TTTGGATAAGTCCATGAGTATTGAATTCCC^GAGATGTTAGCTGAAATCATCACTAATCAGA 
TCAAAAAGCTTCTGTTTCATCAGAAATGACTGCCrTACT 

AAACTT ATCAACITtn'CTTGT AG ATGTTTTG TCGTT CTTTTTG TTTG TCTTGTTTGTTTTCT ATACGCACT ACATGTGGTCT CT AG 
7 5 AGGGCCAAGACTTGGCAACAGAAG CAGATGAGCCATCACTTTTCAGTGACAGG AAAG CAGACAGTG ATGTGCATTGGCTGGTGTAT 
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CACAGAAACTAGAACAGTTAGTGGAGACATGTCCACTATCAGAGAAGGA 

CAACTTTCTGCTCAACTTTCAGTTGGCTGGATAACACTTTCTAGAC^^ 

CATTTTGATTTATGCATGGAAACCTGAAAAAAGTTTACACGTGTATATCAGA 

CTGGTTTT AACAATTTCCTTT AT ATT CAGTGAACT ATG CTTG CTCGTTTTT CTT AAAT AATTTTTGT ATTC CAGTT ATTGTAT AG C 
5 TGTrrAAGATGGGCAGCTGCCTCACAGCTCTCCTAGACGCT 

GGCACTCAGCTATCAGAAGACC^CAGAAATTGACTCAGATCTCCA^ 

CATGGGGAATTATATAGGTTGTGCAGATTAACCGTCCTAACTGGTATAGAGCACCTAG 

TGATGGTTACAAAAGACTAATTGTAAAACAGTGCCCACCAACAGGCCCCG 

AGCAACAAAGTTTGTAACTCAGCTCITTCAGGACCTTCGGGAGTA^ 
1 0 CTGTGATAAAGCCGAGAGATTGTTTTAATCAGACCAAGTAACTTCTCT 

AGAATGGCrAGACACCCATTTTCACATCCCACCT 

TTTTGTTATTTAGAACTGAATGTCAGGCATCCATGTTTGTCCAA 

GTGAACTTCTTCACACTGAGAGTTATCATTTTAAACAAAACAGAAGCT 

TGACTCAAAGCTTAACACACTTACTAAGCCACAAGAAATC^ 
1 5 AAAATCTAATATTAAAAATATGAAACTTCTAATATATTTTTATATTTAGTTC 

ATCTGGGAAGGGAAGGGCTACTGCAGCTGTACATGCAATTTAT^ 

TrrTTAGATGAGAl' TO TTTT A TCATGACATGTTATATA rn^ 

TAGTATATAAGAGCATCCATACAGGCCTCAGTGGTCTTGGAAATTAAAACAGGTTTGCT 
GGCCCTGTGTGCAGTGCAGGTCCTGAGGGTTTGACCCG^ 
2 0 GACCATGGCCCTGTCAGTGCAGGCTGGCTTTATTAAATCCAGGACAGAAA^ 
CAGGGTTGTTCCCACATGCAGCCTCCGCACGGGTTAACAGAAAt^^ 

G CAAT ATGT AAATACT AAAGAATCGCAT AGGAAGC CAAT AAT ACACT AATC CTCTCCAT C CT ACAAG AG T C CATTT CCAAG T AAG A 
TG AGGACATG TTTATGTT T'l'L T T T G AATG CITTTTG AATG TTG TT ATTTTCAGT ATTTTG CAG AAATT ATTT AAT AAAAAAAAG T A 
TAAT CATTTGCr rriTG AATTCTCTCT AAAAGGG AATGTT CAG T T ACTTT AAAAT AATT 

2 5 GTAACCCAGCTGGATGTGAAATTTATGGTGCCTAAGAAATACCACTTGAAGATTATCAATGACAGTGT^ 

TCTCAAAAAT AGATT ATTGT ACATTT ATGGAATGTT AT ATGGTT AAAC CCAAAAAGCACAT CACACAT AAATCT G CTTT CAGTTCC 
AACCAGCTTGGCTTTCAAAAATAGAGCTCCAAAAAAAAAAAAAGGAAAAA 

AGACATTCATAAAACTACTATCGGAAGTTTTCCATTAGATGTATAAAGAGCTATCCTTTGGTATG 
TAATTCTGATTGAGTATAAGTGAGAGAGATACGGTACTGTTTGAGAGCAGCTCCTTTTCTGC^ 

3 0 TATGTAGATTTTATAATAGCTTCAGTGAGAATTGGTAACATGCCTGTATGACT 

GGTAGGACAAAAAGGGACATTCTGGTTATTTTAGGCACTGGCTTGGAACACT 
GAAAGGAAGGGGACCTGGGAAGCACTGCCTTCTTTAAGGAAAGACACACCAATAGATGAGATC^ 
GAGTGTGAGTCCTTAGTGACGAGTCAGGTGAGCTCTGGTGAGCTTG^ 
GGGACAAGCAGGGACGACAATTCCAGCTGGACACTGGTCCCAGTATTTTGCT 

3 5 GATGAACCATGGTAGCACGTTTTGATCTGTCAGCACTCAAGGATC^ 

AACGATCAGCrGTAGTTGAATGTGTTTCTTATGTGCCTGGTTTCAGTGTTA 
CCACTTCGGATGGAAGTTTTCTCATT1TCCAGACTATTTTCGGTCAG 

GGTTGGCTTCTATCTAGGACATGCCTGAAAGGATTTTATTTTCTGATAAATGGCTGTATGAAAATACC 
AACTACATATAGATTTCAGTGTGTCAATATTCTATTTTGTATATTAAACAAATGCTAT 

4 0 TGTATGGCATTATTAAGAAGCTTTTTCATTATTTTT^ 

TTAAAAATAAAAGTTGTAG'1'TTTTTATTCATGCTGAATAACCTGTAGTTTAAAAACCTGTCTTTCTA 
ACTGTAAAGTTTTGTGTGGAAATGTTTAACTTTTATTTTT 

GAATTGGCAGTAAATGTTAGTCAGCCATTTACAGCAATGCCAAATATGGATAAACATC 
GTGACTCCCAACATGTTTTCATAGAACTTGTGCAGTTACAACCATTCTTGGGTTTT 
45 ACAGCAAAGTTGCCACTTGAAGGAACGTAAGATGCTGAGGTGGGC^^ 

ACACTTATTCAAAAGCGT ACC CTGATGCCGACAGCTCTAAAGGTCTTCTCACCT AT CTGCAGAGGTCACATGGTTATGACTCCCCA 
GGGGAATCTGGGGAACACACAGAAGCACAGGAGAAACTGATGCCTGTAACTGCTTCAAAG 
TCTCATAGCACAATTAATCTCTTAGTTTAAAATCCACCGTGCTCTTAGAAATACACATAAGT 
AAGATAAAGGCGTGGTGGAGTCACCATTTATCTTTACACTTTACTaSGTAGCT^ 

5 0 TATAGCACTATCGCITATGAAAAAAGCCATTCTAATTTAAAGAGTAAAATTC 

CCTACTATATAAATATAAGGAAATGACAACACTTAGATACTTAAGACCATGAGATTGGGATGTCCGGAAC 

TT AAAGTACAACAGATGCACCAGATACGGCTCAGTGGTCTACAGTTTG CATTTG CATATGCATCTACAGAGCCCAG AAGAGAATAG 
TGTCTTCCTCTTATTGCTCrCTGCCATATTCTCITGAGACAAGGTCT 

AGG ATCTTCCCGTCTG CCCTGC AGTG CTGGGCTT ACAGGCAGCTGTAG CCATGTCCAGTTTGTGGACTCTGGGT ATTCAAATTCAG 
AAACCTGGTTTTGTAAAGTTCCTITGCACACGCTATACTAGTTAGTCTTCACTCAATGAAT 

CTCAATTAGT AAAACTGCACTGGAAACCTGTGTGAGGAAG CCATTTC CAGCGTCTC 
AGACAACATATTTTCTGCTTTTCTCTACTGCTT CACTG AG CACTTACT ATGTCTCAGACACCATGTTTTTTTTTAACAT ATT AT^ 
AATCITTGCAGCCCTTTAAAGCMTATTCTTATTCT 

6 0 GACTTAACAAAGCTTGGGAATGGCTAAGTAAAACGTGACGA 

GGAGAGAGGGATGTGAAGCTTCTAAATGCAGCAGCTGAGCCTTGTGTCCCTGTCCTCCGAGAT^ 

CATCC ATlTlT TAA A l* r C Tri A TTlT GAGATTGTAGTT^ 

TCAAACTCATGGCCTTTTTCTCATTAGCATTGTTGTGTGTGTGTGTGT 

ATAAGCATAACCTACTCAATCATACGAATGACCTGTATGTATGTTTGGACCCCTAGAGGATGAGGGTGAAAAACTA^ 

6 5 GATTACAGAAAACTCTTTGGTTTTTCAAATATACAAGGAATCTGCGTGTCACTTA^ 

GG CAGAAGAAGAT AAAG AG CTTACAGTT ACACTG AGGAGGAGG AAAG ATG TTAG CAAG TGAAAG CTCTGATTCAAGCCAAATCTGA 

CCCCACTGTAACAACAACAACTTCTAGCTAAACCTTGGATATAAAAAACA^ 

TAGCTGAATAGTTTGTAGACTTTGTTCATCTTGAGAGTGGCTCCCAAC^ 

7 0 AGGGAACACTITACTCTGTTCGTTTCTAGAA 

ACCGGAATAAGGGGGATTCGCAGTCTGTTCATCCCATGTGCCCTCCTCAGTGAGCTCTAGGCAATG 

AAATGGCAGTGTOU3AAATGGAACCATGCACAACTAACTACAAA 

CTTCGTTTATTGGGAAATGGTTCTCTTTCTCACITTCT^ 

CATAAGTGAC^GGAACAGTGACAGAAGATGAAAGCACACAGCCACTAGTGCTACCACTATC 
7 5 TAGATATGATTCTTATATTACATCATGTTTGGGGAGAGGTGGCTGGGCA 
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TTGTGGGAGTTGATTCTCTGCTTCCACCATGTAGGTCCCAG^ 

GTTGAGCCATCAT ACTGGCCCT CATTTTTG CTTTTAATTTGTTTCTGTTG CTCAGTCTGT CCCAGACACCT CACCT AGATG T AAG 
TACAAAAGACCTACAG CTCGGGG CTTGAACAGTGTCCX^^CATACACATTCACACCAGCGTTGCTAACAAGAG CCAGGGTCCAT AA 
GGCTTTGCTTAGTAAGTACTCTCAAGATACCATGGGGTTXntSATG 
5 AGATTCCAAGGAATAAAGAAAGTTAGTGTGAAGGGGGAGGACAGTATTCATAAA^ 
AGCTCCTGATTAATTTAGTGGCCTCCCACTOVAAGGCCACCAAGGCATTACTAAA 
TTCAATATGAAGGCCTAATGACTTAG^ATGTCCACATCTGTTTGTCTAT^ 
TXSAATGTGCATAGTAGCAAAACAGCCAAAATGTGAAAGCAATGTAATGAC^ 
ATTATCTATGGCCACTAAAAGTAAAATACAGATGCATCCCATAACTATACGTCATGCTATA 
1 0 AT ACATG ATTTTTCTT AGATGCCCAGAAAAGGCAAAT CCATACAG CTCAT CAATGGTTG CCTATGGCT AG AG ATAG CA 

TCAC^CCCAATGTACTGAGTTAGTGAAACrTCTAGGTG^ 

AAGT CAAT AGAATCCCAGAACTCT ACTTCATCAGAAATTACACAAAACT CAAAAA 
GTCTAAACAAAC^CTTGGCAACGCACTCTAGTATTAAAGGAAG 
GTTAACACTAAAGTCTAGCGGGACTGGCGAGATGGCTCAGG 
1 5 CTGGGACATACAGAGGAGGAGAGAGTCAATTCCAGCAAGCTCTTCTCTGACCTCCAC^ 
TGTACACATATGCACACACATAAAGGAAATAAATGTGGAAAAAAGTAATT 
TGAAATACGTATCAAACATTTCCTGAAATGCCACATATCCCGATTTCTTTTAAA^ 

TGAAAGG CG C CT ACCAT AAGGT ATCTGTCAG CAT CATGCTCAATGGAAAAAG AAAACCTTT CCTCTT AAG AGCAGT AACAAAATGT 
TTACATCTACTTTTACTACTTCCATCCAAATATCATATGAGGGCTG 
2 0 CAAAGGAAAAAACAAAGTTATTTGTAGGTATAATCATGATATCTTAAATGGT^ 

AT CT AG CCT ACTATT CT ATTCTT AGGTAG AT AT CCAAAAAAAT CGAAAGCATCTG CCCAAG T AAG CATGGTTTTT CACC CTCAGTT 
TCCCCAGC^TCC^GGATCCTGCTTAGAGTCCAGGGTTTTACACT 
GTAGGTAACACTGGGTTACGGCACATTGTCTGGTTTTGAGCAGTTTCA 
TACTAGGAGTAATGGCAGTTGAGGGCTGCAAGATACAGTGGGCCAAC^^ 

2 5 TTTGTTCTGCAAATTAAAATTACCATCTGACTGACTCTTAGAACATCAGA 

ATTCAAGGTGGTACAGTACTGGCAATAACTAAGTAAAATTATT^ 
GAATTGAACCX3ACATAAATGCAGTCAAGCGATGTTTGACAAAGGAGCAA 
GCTAGGAAGAGTGGGOSTCACCTGCAAAAGGTATCTCTAGATGCAGACr^ 
AACAGAAGTTACAAGTTTCTAAAAGACATAGAACAGTCCACATGATATGCTACCAAAG^ 

3 0 GGACITGATTAACATTTa^TTTCTCCTATGCAAGAGAJC^ 

AC CTTTC CTCATGAG CACACAAAGG CTC CTGAGACC CAG AAGT ACGCT AG AT AAACAG TG T AAAAG CTG AC CAAGG AGCCTG CC CT 
GAGAAG AT AG AAAGG TG AAGGG CACATG AAAACTGTT C C CGTCACAG ATT AT CTGGG AAATT AAAAGTTGAAG CAAAATG AGAT AA 
CACTCCATATTTTTAGAATGGCTCAAATGCACAACAACT^^ 
TTGCTAATOSAAATTATGTCTGGCGTGATGGAAGA^ 

3 5 CTCTGTATTTAGACTTTAAGACATATACACACAAAAACATCT 

ATATGGCTTGAGTAAAAACCAAACATCAAACTGTGGAATATGATTCAACACTAAAAACCAGGCT 
GGAGCCTTAACTATACATTACAAAGGAGGGGAGCTGATCTGAAAGATG 
AGGGCAAAACTATACAGAAAGATGTTTAGTGCTTTTAGAAGCAGAAGTT^ 
GGGCAGCAAAAGTTTTCCACATCTCACXSGTGGACATGCrCTAACT 

4 0 TAAACTAAGAGCAATGGTGGTGTCAACATC^^GGTCCTCTACTGTAATACGTGAATTTG 

TGCTATTCACGAGGGGGCTAAGGGAGAATAGAGAAGAGCTCTGTG^ 

MOUSE SEQUENCE - mRNA 

GGAAGTTAATATTTGCCAATGGACTCCAAAGAATCCTTAGCTCCCCCTGGTAGAGACGAAGTCCCC^ 
45 GAGGGGAAGCGTGATGGACTTGTATAAAACCCTGAGGGGTGGAGCTACAGTCAAGGT^ 

CTTCTCAGGCAG ATT C CAAG CAG C AG AGGATTCT CCTTG ATTTTT CAAAAGG CTCAG CAAGCAATGCACAG CAGCAGCAGCAG CAG 

CAGCAGCCGCAGCCAGATTTATCCAAAGCCGTTTCACTGTCCATG^ 

TGACTTGGGCTACCCACAGCAGGGCCAGCTTGGCCTCT^ 

TCAATAGGTCGACCAGCCGTCCAGAGAATCCCAAGAGTTCAACACCTGCAGCTGGGTGTGCTAC 

5 0 CAGACTCACTCTGATCCATCTTCAGAACAGCAAAATAGAAAAAGCCAGCCTGGCACCA^ 

AGACCAAAGCACCTTTGACATCTTCCAGGATTTGGAGTTTTCTG C CCCCAGGT AAAGAGACAAACG AG AGTCCTTGG AGGT 

GAGACCTGTTGATAGATGAAAACTTGCITTCTCCTTTGGCGGGAGAAGATGA 
TGCAAGCCTCTTATTTTACCGGACACTAAACCTAAA^ 
GCCCCAAGTGAAAACAGAGAAAGATGATTTCATTGAGCTTTGCACCCCTG 

5 5 GCCAGXaCAAGCTTTTCTGGGACAAATATAATTGGGAATAAAATGTCTGCCATTT 

ATGTACCACTATGACATGAATACAGCATCCCTTTCTCAC<1A.GCAGC^TCAGAAGCCTGTTTTT 
TC^TTCTGAAAACTC^AATAGGTGCCAAGGGTCTGGAGAGGACAACCT^ 
TGTTTTCTAATGGATATTCAAGCCCTGGAATGAGACCAGATGTGAGTTCTCCT 
CCCAAACTCTGCCTGCTGTGCTCCGATGAAGCTTCGGTATGCC 

6 0 AAGAGCAGTGGAAGGACAGCACAATTACCTTTGTGCTGGAAGAAATGATTGCATCATTGATAA^ 

CAT G CCGCT AT CGAAAATGTCTTCAAG CTGGAATG AACCTGG AAG CT (XLAAAAACGAAGAAAAAAATT AAAGG AATT CAGC AAGCC 
ACTGCAC^AGTCTCACAAC^CACTTCTGAAAACGCTAACAAAACAATAGTT 
GTCACTGCTGGAGGTGATCGAGCCTGAGGTGTTATATGCAGGATATGACAGCTCn^TTCCAGACT 
CGCTCAACATGTTAGCTGGGCGCCAAGTGATTGCCGCAGTGAAATGGGCAAAGGCGAT^ 

6 5 G ACCAAATG AC CCTTCT ACAGT ACT CATGG ATGTTTCTCA.TGG CATTTG C C CTGGGTTGGAG ATCAT ACAGACAAG CAAGTGGAAA 

CCTGCTATGCTTTGCTCCTGATCTGATTATTAATGAGCAGAGAATGACTCTACCCTC 
TTATCTCCACTGAATTACAAAGATTGCAGGTATCCTATGAAGAGTATCT 
AAGGAAGGTCTGAAGAGCCAAGAGTTATTTGATCAGATTCGAATGA 
AGGAAACTCCAGTCAGAATTGGCAGCC^TTTTATCIAACTGACAAAACTT^ 

7 0 GCrACTGCTTCCAAACATTTTTGGATAAGTCCATGAGTATTGAATTCCC^ 

AAAT ACTCAAATGGAAATATCAAAAAG CTT CTGTTTCAT CAG AAATG ACTG CCTT ACT AAGAAAGG CTG CCTT AAAG AAAG TTGAA 

TTTATAGCTTTTACTGTACAAACTTATCAACTTGTCTTG 

CACTACATGTGGTCTCTAGAGGGCCAAGACTrGGCAACAGAAGCAGA 

75 MOUSE SEQUENCE - CODING 
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CTTGTATAAAACCCTGAGGGGTGGAGCTACAGTCAAC^TTTCTG 

AGCAG CAGAGG ATT CTCCITG ATTTTT CAAAAGGCTCAG CAAG C AATG CACAG CAG CAG CAGC AGCAG CAG CAG C CG CAGC CAG AT 
TTATCCAAAGCCGTTTCACTGTCCATGGGACTGTATATGGGAGAGACCGAAACAAAAG 
5 GCAGGGCCAGCTTCGCCTCTCCTCTG<3GGJUVACAGACTTTCGG CTTCTGGJU^ AAAGCATTCCAAACCTCAATAGGTCGACCAGCC 

GTCGAGAGAATCCCAAGAGTTCAACACCTGCAGCTGGGTGTGCTACCCC 
TCTTCAGAACAGCAAAATAGAAAAAGCCAGCCTGGCACC^ 
CATCTTGCAGGATTTGGAGTTTTCTGCCGGGTCCCCAGGTAAAGA^ 

AAAACTTG CTTTCT C CTTTGGCGGG AGAAGATGATCCATTCCTT CTGGAAGGGGACG TGAATG AGGATTGCAAGCCTCTT ATTTT A 
1 0 CCGGACACTAAACCTAAAATTCAGGATACTGGAGATACAATCTTATCAAGCCCCAGCW 
GAAAGATGATTTCATTGAGCTTTGCACCCCTGGGGTA^ 

GGACAAATAT AATTGGGAATAAAATGTCTGC CATTTCTGTT CATGGCGTGAGT ACCT CACT ATGACATG 

AATACAGCATCCCITTCTCAGCAGCAGGATCAGAAGCCTGTTTTTAATGTC 

TAGGTGCCAAGGGTCTGGAGAGGACAACCTGACTTCCTTGGG^ 
1 5 CAAGCCCTGGAATGAGACCAGATGTGAGTTCTCCTCCGTCCAGCTCCTCCACAGC^ 

TGCTCCGATGAAGCTTCGGTATGCCATTATGGGGTGCTGACGTGTGGAAGCTG 

GCACAATTACCTTTGTGCTGGAAGAAATGATTGCATCATTGATAAAA 

GTCTTCAAGCTGGAATGAACCTGGAAGCTCGAAAAAC^^ 

GACACTTCTGAAAACGCTAACAAAACAATAGTTCCTGCCGC^ 
2 0 CGAGCCTGAGGTGTTATATGCAGGATATGACAGCTCTGTTCCAGACTCA 

GG CGCCAAGTGATTG CCGOVGTGAAATGGGCAAAGGCG ATACCAGGATTCAGAAACTT ACACCTGGATG ACCAAATGACC 

CAGTACTCATGGATGTTTCTCATGGCATTTGCCCTGGGTTGGAGATCATACAGACAAG 

TGATCTGATTATTAATGAGCAGAGAATGACTCTACCCTGCATGTATGACCAATGTAAACACAT^ 

AAAGATTGCAGGTATCCTATGAAGAGTATCTCrGTATGAAAACCTTACTGCTTCT 

2 5 CAAGAGTTATTTGATGAGATTCGAATGACTTATATCA^ 

TTGGCAGCGGTTTTATCAACTGACAAAACTTTTGGACTCCATGC^ 

TTTTGGATAAGTCCATGAGTATTGAATTCCCAGAGATGTTAGCTGAAATCATCACTAATCAGATA 
ATCAAAAAGCTTCTGTTTCATCAGAAATGA 

30 HUMAN SEQUENCE - GENOMIC 

CTCCCCTGCCTAAAATTCCAGACTGCCCATTGCACTGATTCCCTG 
TGTGGTGAAGTGATAGTGCAAAAATTCCATAAGCTCCTCTAAGTTTTCCA^ 

ATCCCATGTAAGGCCTACATTACTTCCTGTGTGATAGGTGGGGTAGATATCATTATCCTTCCTTTACAGArTGAAA 

AG AAAAATCAAATG CCT AC C CCAGGCTCATTCAAGT AAGT AAATGGCAG AAT AGGG T CT AAAAACCAGGTCTTGT ATTG AAAATT C 

3 5 AACAAGAGGATAGAGAAAATTTAATTGACAAATACTCATCT 

TTTGGAGTTTGATATGAAGTATCITCTTATCTTGTTAAGGGATCCCT 
GTTAGCTTAAAAACAAAACAAAACCCTGTGTGCTCTAATAAGAAGA 
GAAAGATTCCACATAGCAGCGGGGCAGATTGAAACACTC^ 
AAGAAAGCTAATGACATATGAGGAGCCACAAATGCCACAAATGAA 

4 0 TTGCTCTAATGCCTCCAAACATTAAAAACATTGAGTTCCTGCrTATGGC^ 

CCTTGAATCAGGTATTGCTTAAACTAAGTTCTGCCAGC^TAAAGAATAAAGTCAACCAGGACAC^ 
GAGCACAAATTGTGATATTTTAGGTTGCTTATGATGAAATCATTGAAAGCATC^ 

ATAATTTAGCATTTCCCAGTCCAGGCGGGAATGTTAATTCTAATTACAACGAGGAATACCAAAGTGGAGT^ 
CTTGCTGTACTAGTGTGAATTGTTTATAATAGGCATGTCTGGCATAAAATACA^ 

4 5 GCT CAG CTGGG TT C CCTTGAAC CTCIT AC CTT ATAAAGTT AAAT AGG AAT AG AAG T ATTTTC CAAAGT CAAG AT ATT ATTTT AAAG 

AGACAAATAAT AG CTATGATAGCTG C^lGGAATAATTTTTTAAAGT AAGTTCTAACCACC CTGTTTGTGTGGG CTTAAC 

ATG TT CAAAAT ACACAAT ACJ^CATGT ACACAATACACGT AT ACATGGTT CCCATTT AT^ TG AG AG 

AGTACAGAAAAAAAGAATAAATGGAAACTGAAGCCATGACAAAGAATCACC^A 
ATGTAAGAGGAAATATATGTGATAAACAACAGGAGACATGGGAATGAAT^ 

5 0 ACAGACACT ACT CAAT ATCGATCTCAACTG AT CC AGTT CT C C AGTCTCCCTGG AAAACAATGGATTG ATGCCCAGGGTG CG CTGAA 

CACTC^AACCATGGTCACTTCTGCACTTCACACGATTCT 

AAACATGTTTATGCCATTTGTTGCTTATTACTGTACTTACTTAATTTGA 

ATTGTTCTATAAAATCTAAATGGAAAGATGCCCTATAGCCTTCCTTC 

TTAAAAGACTGGTGTGGGGG CAGG AGTGAGGAAT CAT AAAAATTT AG AAAT CTG CACGAAAATTATTCT AAAAGTACACATTGAAC 

5 5 ATCCCTAACTTGAAAATCTGAATTTTGAAGTGCTCCAAAATCCAAA^ 

TACATGGAAGTACTTACCACAAACTTTGCTTCATGCCCCAAATTATTTTAAATAT^ 
ATATGAAACAAAAAATGAATTTCCTATTTAGACTTGGGTCCCATGGCCC^GATATCT 
AAAAAAAAAATCCAAAATCTAAAACACITTCAGTATTAAGCATT^ 
TACTTTGTTTAAAAAAAAGGGGGAAAATAGTAATTCTACAGATGAATAATAATTGCT 

6 0 GGAATTCGTCTAAGCACTTCACATATATCAAACTGGTTTAATCTT^ 

GAAAAAGGCACAGAGAGTTAGCCTGCCCAAGGTTACATTCCTTTTAGTGGT 
ACTGCTAGACTATATCAATGCATTACAGGTATAGTTT^ 

GCTTTAGACCTT CAT CAAAAGATTAGAAAATAAATGTT CAAT CACAGGTTTTA^ 
AGTT AT CTTTCTT AAAATT ATTTCCTTCTTTTAACTAAT^ 

6 5 TTCTAC1AAAGGGGTAGGTAAAGGTGATGATCTTAAATACCATACTGGGAGTTGGGAT 

TATATACTTGAGAAGGAGCACAGCCTGATTTAGTTGATGTTTTGGAGGGGTAAT 

ATGGAGATGACAAAACATGAAGAGAATGGGAGAACTCACTTGGGTAATATA 

T ATTAGTCAGGCTTAAGGTTTTATTGGC CTT AGG ACATAGTCTCTAAGACCATAT AT^ 

GACTGAGC^CCAACTATGCACTAGTGATACAC^GGGTGAAC 

7 0 G AT ACAGAGAAATAACAAATGAGGGCACAAAT AATGAACT ATCATTG TAATGTGCT CAGAAT ACT AT AAG AACAT 

AGAATCAACJ^TCT AAT CTAGTTT ATTTT AAAAAAAAGTGGG 
CTTGAGGGTTATCTGTACACTTTTTATTAACCC^TTTGGCC^ 

ATGACAAGATAGATCAGGTCCTTGCCTTGACTGAGTCT^ 
7 5 C TTCCCACTCAGATAGGGATTCATCCTATTTCTTGTAGTAATC 
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ACTTAGGATACTGTCCATAAAACACTTTCAATAGTG 
TTATTATTATAATGCTAAATCTCACGTCCTTTCTGGCAC^ 
TAGTCTATC^GAATTGATATACTACTTCATGTTAATGATTGAGATCAGTGAGGA 
AATGCATAGGAGCCATGTGAACTTAGACAAGCATCAACCTCTG 
5 AT CTCATTT AATTGTTGTCAG CAAAAGTGGGCAG CAGAAGAGTGGGG ATCAAT AAATGTT AG CCACAAATCAT AACT ATTTGT ATT 

ACTCTGAAAAAGAGGGGTTAACTTATAGTTCATGCATGCTTCCAAATG^ 

AAATT AAAAAATT AAC^UyjCACATTT AG ATTG CTGAAGAGAGAAATT AGGAT AAG 1 1 1 1 rTTTTTTTTTTGCTCAATTAAAAACGT 
TTTCAATGGCATATTTAAAACTACGTATTCTTCCCCATTAAGA 

TGTGAAATCATAAATTATTTAAGCAGAGTACCTACCACTAAGGATTTGAGCATGTATGA^ 
1 0 ATTCAGCAGTCAGACAGGCTAATGGCATGGGGAGATGTGTTCT^ 

TATATAATATATATGTGTATACATATGTATACATACATGTATATATAAATAGAATTCTTAATCATTTTA 

ATTATTAGCATGGGCTATTTAACTC^CTTTTAAAACGTGTAAAACAT^ 

GTTGGTAGTTTAGGGGCCAAATTAACGAAAGACATCTCATT 

AGCAGTGTTCTCATAGCCTCACAGATGAG CATCCAAAG CAAGGGCCTCCTTCTAGTTG ACTTGTGCCAAGGGACAGGAGAAGCGGA 
1 5 GTCTGCCTTCrrcGGTCCAGAAGGGGGTTTTC 

C£ TTGGCAT ACGGTTGGTTCC CT CTT CTTTTC CGAGGTGGCGAGT AT CTCTTC CTTTG CCAAGATGG CGG CT CCAGAATCCTCTGG 
AGGCGGCCCCCGTAGATCGTCTCO^ACAAGAGGCTreCTGAAAGCCTACTTCT 

GTTTACC CTTGAGAAC CAAGG AAGGACGG CTTAGGCT AC CCG CGATCG CGAAC CTTTGCCAAGATGGTGG CCGCGGGG ACGGG CTG 
GCGACACTGTACCCTACCAAGATGGCGGCGGGCGGCTTCCXK3GACGCGCTTCCCCAATCGTCTT 
2 0 CCGCCGTCAGTCTGAGCGCGGCGGGAGGTGAGAGAGTGGCTGTGGCCGAGCGCCCGAGCAGGATTAGGTGGAG 
CGCCCGTGTCAGGAGCTGGC^GOSATGTCACCTGTGGGGGCGCAAAAGTTACCT 
TTCCCAGAGTCACAAAAATCATAATCTGTGCCGCACAAGGTAGGAGGCTCGGTCCCGGCATC 
GCTGGGGAAGGGAGCTGGGGCGGGGGCTTCCCGCACGGGCACCCCTCGCCCCACGGCCCTCT 
TCCCTTCCCCTTGGACTGAGGGGGAAGCTCCTAACAGGA^ 

2 5 TGTTTTATTTGTAGGGGCAGGGGTCCTATGAACGTGATAGGGTGAGCAA 

GGTGGGGCCTGCACCGGGAACTTGGACGCGGGCCCTGGCCGGGGTGGAAGAAGAGGTC^ 

GCCAGCAACTTACTATTTCGCCTGCAACTTGCTTTTAAGCCTC CCG CCCCCTG CTTT CCTT AATCAT AAT AATAAAAAAAAAGTGC 

AAAGAAATCCAGCTCGCTGGAGGTTTTGCATTTGGCGTGCAACTTCCTTCGAGTC 

AACTTGGCAGGCGGCGCCTCCTTCTGCCGCCGCCGCCGCCTrc 

3 0 GGGTGGGTTCTGCTTTGCAACTTCTCTCCCAGTGCGAGAGCGCG^ 

TGGGGGACCTGCCGGCACGCGACTCCCCCCGGGCCCAAAGTACGTATGCGCCGACCCCCGCTATCCCGTCCC^ 
CCCAGAGGGCGTGTCAGGCCXSCCCGGCCCCGAGCXSCGGCCGAGACGCTGCGGCACCGTT^ 
AGTGACACACTTCACGCAACTCGGCCCGGCGGCGGCGGCGCGGGCC^ 
GGCCCGCCCX3CTGTCACCCGCAGGGGCACTGGCGGCGCTTG 

3 5 AG CTGGGCGGGGGCGGGAAGGAGGTAG CGAGAAAAGAAACTGG AGAAACTCGGTGG CCTATTAACGCCGCCCCAG AGAGACCAGGT 

OXSCCCCCGrOINNNNNNNNNNNNNNNNNN^ 
NNNNNNMNNNNNNHNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNTO 
40 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNWWNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNITON^ 

NNNNNNNNIWJNNNNNNNNNNNNNNNNN^ CGAGGCGGGAGGGGGCCG 

4 5 GGGCGCGCTCGCTCCCCCGAGGTGC CGCTGGG ACCGGAGACAACTCGGGGG CCGCCG CGGGAG C CTACAAACTTTTATTAGCCTCG 

GGGAGTGGGGGTGGGGGGCTGGCAAGGGCCGGGCGACGGTG ACGAAAGGGCAGCGCG CGGGTGACAGCGCTGG CCTCTTCCTCTCC 
CTCCGCCGGCGTCCCCTGGCCGGGC CGAGGGGGAGGAACCTGACCTCGGACGGCGAG CGG AGCCCCGCCG AACTGCCGGGGGCTTC 
GAGCCTCTCATTCCTCGCGGGAATCCTGGCCTCTTTrCTCCCCCTAGTGTCCCCT 
TTCGTGGTGAACGCTAAGCCGCGTATGAATTTTACTCG 

5 0 TCCGGGGAGG CCCCGGCGG CGCCTGGCTTGAGGAGGGCGTG CGGGGCGCGTGAGGGTGCACACGCGGGGGGCTGACAGCCCGCAAC 

TTGGAGACTCCGGCCGGGGCCGGCGTTATCTGTTAGAAGTGGGrc 

TTAACCTCGCACTTTTTrCTCTTCTCCACCCCCGCCCCGC^^GGGCTTG CT CTTTAGCGTTTGTTGTTAATTCGCGCCTGAGGTTT 

CTAAGTGGCCCCTTTTAGAAAAAGACCCCCTGTAACCGTAATGGTTTTGTGCTGCGA 

GTTGCAGACTTGATAATTGC^CCTTGTAATACCACrT 

5 5 TITGCAACTTAAACTGGGGGATAATGTCGCT^ 

CTGTCAAAAAAGCGGCATCGAAATTCATTGGGCTCCATTCGATACCTCGTGTTTAGA 
GAGAGGTGGGGAGATAAGCAGTTTACCCTCAAGATTTGTAGTGGCAAGTCCAC^ 
TGAGGGCCAGTGAC^TTTATGCTGCCTAACGTCATCGCATAGGA 
AATGCGCTTCrCCGTCTTAGCCCATCrCTTAAAACACCCrc 

6 0 AATTGGGATAGGTGAAGGTTTGGATAGGTGAAGGOVGAGAAAATTATTTTGAACATTTTACTG^ 

ATGAATGTGATTTTACGGTTCrrGTGTTTTTCCATTTTTCAGTACTTCGAT^ 

GCATTTCATATTG AGGAT CT CAAG C AATGT AAACAAATG T AGCTT AAT CT AG ATGTTTTTGTGAGTTATG AT AAGGGTCAGCT AT A 

TTTAAGTTATGTAAGCTAACAACGTAGTGAGAAACTACTACACCTTCTCTTCTGCTCTTT 

TAAAGTGTATCrCATTTCATATATCCAAAATTTGGAG^ 

6 5 GTCGGAAGATAAGCAGATCAGCATTGTTTATTTTTCAAAGAAA^ 

TGTGACTAGCTGGTAGGAGGGAAATTTGGAAGTAATTAGGGATTGAGAATTCTAGC^ 
TTCTCAGAAAAGC^IAACAGCCGTGATTGAAAAGAGGTAGGAATTTTAATGATC^ 
ATCAACITGAACCTTCAGAATAATCAGATGTAATGAATTATAATGTCTGTGATTAACAAAGCTAC^ 
GAATAGCCAAGCITAGTTCGATACACTTTTGCCCTCAGCTGTGCAAATGGATTGCAT^ 

7 0 GAG CAGGGGACJ\TGGCnTTTT ATTCTGG AAGAT AGAAACT ACT CTT CTC^T AACAAAGAATTTGATTCGGAGTT AACT AAAAGGTT 

CATTTAACAAGCTGCCTCITACTAATCC^ATCAG^ 

TGTTrTGTAGTTGATATTCACTGATGGACTCCAAAGAATCATTAACTCCTC 
GAGAGGGGAGATGTGATGGACTTCTATAAAACCCTAAGAGGAGGAGCTACTGT 
CXjCTTCTCAATCAGACrCCAAGCAGCGAAGACTTTTGGTTG 
7 5 CCAAAGCAGTTTCACTCTCAATGGGACTGTATATGGGAGAGACAGAAACAAAAGTC 
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GGCCAAATCAGCCTTTCCTCGGGGGAAACAGACTTAAAGCTTTTGGAAGAAAGCATTG 
AGAGAACCCCAAGAGTTCAGCATCCACTGCTGTGTCra 
CAGAACMC^CATTTGAAGGGCCAGACTGGCACCAACG^ 
TTGCAGGATTTGGAGTTTTCTTCTGGGTCCCCAGGTAA^ 
5 CTGTTTGCTTTCTCCTCTGG CC<XIAGAAGACGATT C^TTCCTTTTGGAAGGAAACT CGAATG AGGACTG CAAG CCTCTCATTTTAC 

CGGACACTAAACCCAAAATTAAGGATAATGGAGATCTGGTTTTGTC^ 
AAAGAAGATTTCATCXSAACTCTGCACCCCTGGGGTAATTAAGCAAGAGAA^ 

AGCAAAT ATAATTGGT AAT AAAATGTCTG CCA'rrrCTGTT CATGGTGTG AGTACCTCTGGAGGACAGATGTAC CACT ATG ACATGA 
ATACAGCATCCCTTTCTCAACAGCAGGATCAGAAGCCTATTTTTAATGT 
1 0 AGGTGCCAAGGATCTGGAGATGACAACTTGACTTCTCTGGGGACT 

AAGGTAAGATCAGTGTTTTTCTGTTTCTTAAGAATGGTACATTTAAGGT^ 
TTCTCTAAAGATTCATGTGCTTTTTTATATGAATAAGTTTAAGTGGCCT^ 
TGTACQTAACCATTTCAG GTTTT TTCCTTAAATACT 

AAAATAATGTTGCTCATTAACAGATATCTTAACGAAAAATTATATAAATTCAGGAGAGTATAATC 
15 TGTG CATGGTCATT CAG CTG TTTT AG AAT ATGTTCTTAT ATTACAAT AAATGAT AC CCTT ACTT ACAT AGTCAAAAGTTGTGCTGC 

CTT ATTTGT AAATTCGTT AAGTGTT AGCTTG AGATT AAAGAGTT AA CTT CAAACTGA 

ATCTTCTGTTAAAGAATTTGAGTTTTGAAGTTGCTAAAGC 

GCTCACAGGGTTCTTGCCATAAAGTAGACAAGTTATGTCTGCTGAT 

AACAAAATTTTACGTTCAAATGTTCCTGCAAGTTCTCAAGTAGATAAC^ 
2 0 G TTTGATTTTTCT Crri ' Tl ' TTT ACTCAGT CGTTT AT AAG CAT ACTGAT ATTTTTGT CTGACC CAAAAAG GT CAGAAAATGGAATTA 

TCIAGAAAAAAGTTCTAAATGTAGATATACGTGTTGGTAGGGGTGAATTTCTCT 

CTAC^GTTTTATATCCATTTTAGTTGTGAAGGAAAATATAAAAATGTC 

TTAAAAGAAGACATGGTAGACAGTGATTTATTTCTATC^ 

TT AACT AGTT ATTCAG ATTTTTGAAAAT CCTTTTTCTGTG AAAG CT ATCCT AAC CTGGAGGATGT CT CTTTT CTTT CCTCTGT ACT 

2 5 TAAGAAGCTTTTCTTGTTAGGGAAATAATTTAGAATTAGATTT^ 

AAAAAAAAAAACACCAAAAAACCTTGGTTCTTAC^TGTCTTAATC 

TTTAGAAGAACACTACTTCAACCTC^GTTGAAGGTTTAAA^ 

TTGTGTTTTAAATGCrAAAAGACATGCTTGTGTrTTAAAAT^ 

TTAAAAGAGTTAAGTACGGTTGATTTGACTAAGCTATCTGTAGGAAACrrCTTAAATT 

3 0 AAAATAAAACATCTTAG<3AAACTCTTGGGGATTATTAATGGATTTTGCCCTGATAA 

XATAAAGAAAAGGCAAGGGACAAAACTTATTTTCCATTT'GCTATGAA 
AGATGAACATAGTTACTCTTAAATCACTAAAGCTGATTT^ 
TATTTTTAATTTGGGTCCCATAACCTTATGGAGGTAGTAGGCAAGGC^ 
TGAAATTATTATTTTTATGCCAATCTGTGATTGGGAAATA 

3 5 GGGTATTTTGTTAATATTTGAAGATTTTATACCTTATGGCATTAACT^ 

CCAAATAAATTGAAACAGTGTTTCTTTTAAACCATCATTGAAT^ 

CACATTGAGACCACAACTTGAATACTGACCTTCTGACrrTACGAAGAAAAAT^ 
ATTAAAAAAAGTTATAAATTGCAATTTGACTTTTTAAAA 

4 0 cnTTATTCTCTGGGGGTGGGAAGAGGCAATTCCrTCCTTC 

TATGATTGAGCTTAGTAAAGCATTTTCTAAGTTCATTTATAGTAAAACAAGAGAAACCTTATTCT 
AACAAACTAGTCATTCTAACTTAATATGCITTTAAAAATACTGAAGTTCAGT 
TCTGCATTATTTGATTTTCAGCCTTGTTTCATTTAAGCATTA^ 
CTTrrATTGGGATTATGTTGAAATTTTAGTCTTAAAAAATTGTTCGTATCC 

4 5 AGGATGAACTTGTAAATTCAGTTTTCGGCAGAGTCTAAAATTAAGTG^ 

TTTTAATGTCTCATTAGTAATCGCATCTGTATCTGGTTTTATAAAAGTAATGAAATTGAAGACCT 
AGCAAATTCTGCTAACATCATGTTGAATGTTTTCTCAGAAA 

CCrCACCCCCATG<IATACTTTCTTTACTTCCTAC C , l ' l"l ' 1 TTCTT T TI C 1 1 1 TCCTTCTAAAGATTCTGG CAATGGGTGTTTCAGTG 
TTTTTTAAGCTTAATATTTCTGGTACTC7VTTTATGTAAAGTGATTTCTGAAT 

5 0 TT ATTTTT AG CTTT ATG ATG AGAAT CIT ATTTTTT AAATCTGT AACTTGTT ATGG CTACATGATT AGT AAAAAAAGTTTTT AAAAC 

ACACTGTGTATTCAGGTGTGTCATTTTAGTGTGAAATGACTAATGCAGAAATATGTGACTAGCATG 

TACTT ACGATGTTT CT ATGG CT AGT CC C CTTGTATTTTTATAATTC^TAAC!AT AAT TTATTTTGGTCTTGT CT ATTTGT 

GTTACATGTATTTTAGTCTGACCACTTTTGCTACTTATTTAATGTTTAT^ 

ATAAATGTAGGTTAAATGCAAATTGTATTAATAGTGAAATGGATATGTGGGTAGAGATCACTTTA 

5 5 AAGGAAAGATTGGATCAAAAGGGTTTACTTTAATGTGACTGCCTAATGTGAAAGTCGGAAC^ 

AAATCTTAGTCTACTCTCGCCTGCAC^TGACTGAAACAGCCCA^ 

AGGCTTAGTTCTTAAAGCACATTCTT1TTTACTTTAATGATTATTCCTAAT 

CTTAGGTGGGCAAATTAAATAAGCAAAATAGGTTTGTGCTGAATAGCATTTACCCT^ 

AG AGT AATTGTGTAATGCAATATTT ACAGGTATTTG CCAG ATT AATGGG CACTTGTTTTC^TATTTCTGAGTCATGG AAAATATAC 

6 0 ATTGATGATTCCTGTTGCATAAAGAGTTTTCAAGAAAATTTTGTTGAATTAA^ 

ACCTTTAGAAAGGATTTTTAAAAGCCCATGCTGTCCTTATTTCT 
TTCTATTAATACTTGGGTTTTGTATTTTTCAAGTAAATAAAATATTC 

AAGTGTTC CTTT C C CTGTTG AATT ATGTGTG AT CATTTCT ATG CT AAACT AG ATT AG GGTGTGACTTGTGATGG TGATTTTTGTTC 
ATTTTACATATTAAGAAAGAAATAGAATTTTATTGCAGTTCAAAATTATTTGTAGAC^ 

6 5 TGACAGGTTGCTTTCCTTAGTGCTCAATACTGTTGTAAATGTCTCTAAATACAGAATTTCCAGTGGAGTT 

GTGGAGGGTGAAGAGGGAGGAGCAACAGAGATGTGGGATGCTATAGATA^ 
GTTTGGATCIAACrGTCATGAATTAGAGGTTTAAGAAAAGAAAAATTTAGGACT 
AATTTAATTTCTTTTATTTCAGGTAGAAATCTAAAACTGACCATG^ 
TGCTTTATTATCAAAAAAGGATAAAATGCACATTTTACTTGAAGA^ 

7 0 TTTAAGCTGCTGTTTAATAAATGAAAATCTAATGACTTGAATGTAGTCGACCTAA 

TCATAGTGCCCTTTTACAGCCATTGTCAACTGACTGGAGAGCAAC 
CTGCTGGAATATTGAGAAAATTAATTTTTCATAATATGCAGAATAAATTATC^ 

TTTAT ATTGCAAT ACATTC CTTAAATTCAGT ATTTTGAATG AAAGTGTGTT ATC C CCG AATTTT ATCACTT GT CCAATTT AAAT AT 
TAATTACATCCCAATAGAGCTGCATGCTTAAACATGCTTTTTCAGAG 
7 5 TCTTTTTAGCAAGTTTCAACAACATTAATCCTGTCTATAATGCAGCAAGTTCAG 
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CATTCTCACTGTAGGGCACCAAAAATATATATAAGGGGAAAAAAGTTTTAATGATATGATTAGl^ 
TACCTTGAATTTTTATTTTTGTAACTAATAATTTGAG 
TAACTTACTATTTCTTTTTACAATAAAATTAGCCITTATTrt 
GAAAGTATATTTGATAGCCAAATTTTGAAAGCTATTATGAAATGATACAATTCACT 
5 CAGTGCTGTGACTTATGACCTTATGATTGTCACATGCTGAAC^CTAA 

TTATCGTTTT AATG'rrri' C1TTT ATAATT ATTG AGG AT AAGAG CTT C CTT AATTTT AAG ACT ATTT AAATTG CAGATTTTG CTTTT 
TTATTTTTTTAACCATCCCCTTCCAAAGAATTTGATT^ 
AGACCTAGTTTCTAATGCTAAGGGAGACAGTCCATGGCCrCCAACT 
TTGAATTGTTITCAAAACACCTGTGTGTGTGTTTCT 
1 0 GGTAGGGAATGGTATAACAAAGTTGGTTCTTTGACA'ri'rrC'ri 1 AT AGATT ATCG AATG T AAGACAAAT AGATGTG AATGCAGATT 

TGGTGTTTTT ATAAG AT AAGG ATTTAAAAT AATGTAGTTG CACCCGAGAGG T 

GGGGCTCTTGGGTTCTCAGAGCTTGTTTTCT 

ATTCTACCCCCTAATTTTTTCGAATGAGAAATTGAGATCCATAGAAAGTGTT 
TTGCCAGTTTGAATATCTCAATTCTTAGTTATCT 
15 ACAAATAGTTGATTGAC^TAATACAGACTAGCCACATATTITATAAGAGTTA 

GTCTGTATTCTTCTCTATTCA TTTT GTTAA 'lTT'n 1TJ . T ACCTAATAATGATTAAGCAC CAATT ATGTG ACAGCACTATG CT AAG C 
ACTTTGCATGC^TTCATCTCATTTAAATCTCAACTCTGTCA 

TCCATG AAG CACAGAAGGCAG CTGTT ATTT ACCTTAATTTTACAGATGTG CATTT AAAGAGAAAAAGAAAAAAAA 
AACCAGGAAACCTTAACACTTATCTGAAGGGAAATATTTAATATTGGGTATGTT 
2 0 ACAG CGAATCTTT AAAT AAAATAT AAAGGATCAGGC CT CTG CTCTC CTG<1AT AT ATTTGT AAAGTCACTT ACTGC'l'T'rri'GTCACA 

GTTTCAATTTCTGTAAAATAGTGAGAGGGTTTTTACCTGACAG 
GCCAGGCGCGGT<^CTCACGCCTGTAATCCCAGCACTTTGGGAGGC 
CTGGCTAACACTKjTGAAACCCTGTCTCTACTAAAAATACAGA 
CGGGAGCCTGAGGCAGGAGAATGGTGTGAACCTGGGAGGCGGAGCTT^ 

2 5 OSACAGAGCGAGTCTCCGTCTCAAAAAAAAGAAAAAAAAAAGAAAAGAAAAT^ 

TAAAGCTCTCCCAAGTCCACTTAGCTTAAACC^^ 
ATTGCCTATAATCAAGAATCTTTCAGGGTCTGCTATGTC 
TTTAAAGCCTTCAAATGCC^CGTAGTATCITCACAA^ 
TTTTCAGTGGATTTTGTTGAGGGATATTAAGTATC 

3 0 ATTAATAAATGCTAGAGTATTCTCTTTCTGGTATTTC 

GGAAACCTTAATTTGCCTGAAAATGGGAATGAAATTTTCAGGTTT 
AAAAGCTCTTGAGTATATTGAATACCAAAATTTATCCTAACTGCATAAAGTTGGGAGG 
TTTCTTTATTGATCAAATGGTTGAAAAAAACTTCAGTTAAAC^ 
TAGAATTATTAAACGGCAGTTTAAATTGCTAAACATACCGATC^ 

3 5 AAAAATTTTAATCAGAACATTATGTGAAATTTGTCATCTGG 

GATTG AGTTTTTTTTTT AAAACAG AAAACCCAA CACCTT AGG AGG CT ATGT ATATTG CTC CTCTTTG 

AAACTGCCTTCAGAACCACCTTGTAAGCCATAAAAGAAAATCGGACTCATTG CACTATAGT AACACCTAACTGTTCTTG CTCAAAG 
AAAATGTATTTATCCCTTAGCTTTATTTGTGTGACTCCAAATCATATGAGTAT^ 
ATGATAACATTTATTTTCTTTGAGGGTTTTTAA 

4 0 CCTGGAATAAGTGAATAGCTTCCCAGGGTGACTATTCTGAAATGGGTGA 

GTTATT AAGGAACTTT ATGATAAC7VGTTT AT AT ATTCCCT CTT CTTGG CAT AGT AATGAAGTAATAG AGACTATTCACCTCT AAG C 
CTGATTTTTTAAATAAGTGTTTATTTTATGTTTAAGTA^ 

AAAGTAAACATAATGTGAATTCCTACAAC^GTCTCCCAAACAGTTTAATTTCTCA 
ATTAAGTATCTGATTATCAGTATAGCAAGAACAACTCAAGTATACTGAAGTC^ 

4 5 CAATATAAAATTAATATATTTTTGACTTTTATTCCTCACAACCTGAAA 

AATGAATTTTGTTACTACTATTTGCTAAATTTAGTCATGTTTA 
ATGATGAAGACATGTGTAGTGGTGTCAAAAATAGGATATTCATTTTGTAACT 
GTGACAACAG CCTCCTTAAGAACTTT AGATTTTTTAGAATTGT ACT 
AAAAGATGATTTATAAGTTTTCAGAGCrAAGAAATGGGAAGGAAGAGC 

5 0 TTTTCTCTGCACrcCTTTATTTGCCTTGAACCTCT^ 

TGCTTCTCATCACTAAACACATATCATGGGCTGGTGTTAGTGCAGTCTGTG^ 
AAAGGAAAGACAAATTGGTGAAGTAATTTCTAATTCAGTATTTTAGGGAT 
CTGTACAGTGCTTTGGGGGTCCCACAGACATGTTTAAACAAGAAAACAG 
TAAGAAAGAAAATGAAAATTCTAAACTTCAATTCTGGTGCCTGGCTAAATTTGA 

5 5 CTGGGAAATCAATAAGCAACCTAGCTACGTTATTATGTTCGTAAGTGGAAGAACT 

AATTCTGGTGGTAGGTAGAGTGGGAGGTTAACTAGATGATCrCCAAGGTCCTT 
AGCTAGACACTGGGOITATGGAGATGAAGAAGATGAATATCCCCAGCAGCATGGAGAG 
CCCTCCCTCAGTTTGCTTTTTGAAATGTGAGCTTGAAAGATCTCAAACT 
AAGTGCATGAATGAATGGGAAACAAGATTTGATTCAACTATTTGGA^ 

6 0 AAAAGATGGAAAGAATTAGTTGACTAGATGAGGACTGAGTACATAGGAATGAGCCAACAGGAGACT^ 

GTATTATGTGCATGCATGTTGTCATCAAATATCACATGATACAAGACAAGGAGAAAA 
TACCCTAGTTGCAAGATATTTTTTTCTTCTAGTCACTT 

CAGTGTATAAGCCCAAATCTGAGGAAAATTTACCTGAAATGTTCTTTCCCCAGATACCCACA 
CAGCTCTGTAAAAGAGATCTCrGACTGCTCTATCTGAAATAGTAGAATCTTT 
6 5 GTGTCTTATCTGCTCACCTGCTACAATGTAAGCrCCAT 

AGAATAGTGTCTAGCACATAGTAAGACCTCrACAAATACATGTTGAATACCTAAATAAACAAA^ 

T AT AT AGG AATGGATT AT ATTT CT AATCTTTCTCG AGTGAGGAAAATGTCAG CAG AT AGTG AAT AT CACTGAGAG AGAGATG AT AG 
C CCAGGTT ATCTT C CC CAG ATAG AAAT AAGCCTTAAG ACTG ACAGG T ATT 
TTCAATTGTCAGTCT CT AAATTTG CTTT AAGACTT CGAAAT GG ATTGCTTTT CAT AATTT CTT AGAAT AACT CTGGT CTGTTT AC C 
70 ATTGAAAAATT AGAGT AGCCAATGTTTGTAAATGAAG TT AAAAG CTTGCTCAAGG C 
AGTAACATAGTAAATTGTCAATATAGGAACTTTTGTAGCAGAAGCTTTATC 

AG ATG AGTCT AATGT AT ATGTTTGT ACTGACTTAC CT AGAAGGTCAGGCAAGAAAT CGGTTTCCT CATTTTT CAG AT 
TGTAATCACTGAGTACCTTAAGAGAGGAGGGGTGTTTTATTTTTC 
TAGATTTGACTTATTCTGTCTATAGTATATAACAGTCAGGAGGTTGCT 
75 GTTC!AACCTAATATGGCAATGACAGTCGCAGAAAAGAGAAAATGCAAGTTAAGTAG^ 
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GTCATTTAAGAATGAACTGCTCTAATGTTCAGGAAAAAAGAGG 
GCCACAAAATAAATGAATAGCTGAAGAATTAGGAGACAACAATCTTAAAATGTGG^ 

AT AG AAGAG ACAGAATAG AAT AACATAAAT ATATGAGTG ATT ATTG TT CTTGAAACCAGTCTTTAAAACATGGGAAC1ATTCCCAAA 
AATCAAAGCCAGATAAATTAGGGAAATCTTAAATGGCACAATATAACTAGTGATTTCGT^ 
5 AATTTGAAATTTAGATGTAATTAAAGCAGATAATAAGAAACATAC 
GTAGAAAGGCTGGAAGCCAGAAGGGAATTAAGTTTCTGTTC^ 
CAAACTCTTATTACTCTTATCAATAGTCTCCTAATTGTTTCTCT 
CCAGATTAATCTTCCTAGAACACCACTTTCAGTATTATTCCCCTGATCAAA 

TTCTCCrTC T TTGT ACCACAG CC CATATGCACGATGAT AGATGGTGGGT AG C CACATG AACTCTC CAT AACCTrTGGAGGATTTGG 
1 0 GTTATACACAGTCTGTTATCCAAGAAAGCATATCTGAGTGTAAGTGAGC^ 
TTTCTTCTTCTTTTTGCAAATCCCCCTTCAGA^ 

TGT AATTACAT AAG CAT AAACAGACACAT ATGAGATTTT CTGGGTTGCTIXSCCTTTAAATAAAAGAATGGGATT AT CTT AT AC CCC 
TTTGTCTGCAGCTTGCTTTTCTCACCTAACAAGTAC^ 
1 5 AGACAATATTCTATTCATGTGGTTTCGTGA^ 
AATAGCAACTTTTTTTCTAAATATAAATG^ 

ATC7VGGGGAGGGGCCTGGTGAAAG<3TGATTGAATCCTGGAGGCAG^ 
ATCTGGrrcCTTGAAAATGTATGGCACITCCCTCTTCACr 
TCCACTGTGAGTGTAAGTTTCCTGAGGCCTCCCAGTO 
2 0 TTTGTAAGTTACCCAGTCTCAGTTAGTTCTTTATAGCACTGTGA^ 

TTGCTATAAAGATACATGAAAATGCGGAAGCAACTTTGTAACTGGATAATG 
AGAC^GGAAGATGAGGGAATGTTTGGAACTTCCTAGAGAC^^ 
AAGTCCAGGCTGAGGTGGTCTCAC^TGGAGATGAGGCACITATTG 
AGACTGACAGCATTTTGCCCCrrGCCCTAGAGATCTGTGGAT 

2 5 TTCTAAGTAGCAAAGCATTCAATATGTGGCCTGGCTGCTCCT 

TGGAACTTAGGTTTAAAAGAGAAGCAGAGCATAAAAGTTTGGAAAATT^ 

TCTGGGGAGCAATTCAAGTGAGCTGCAGAAATATGCATAGATGAAGAGTAGCCTAATGTTAATAGCCAGTAGAATAGGGAAAATO 

TTCCAGGGCATGTCAGAGACCTTCATGGCAGCCCTTCCTATCACAG 

GCCCAGGGTTGCGCTGCTCTCTGGAGCCTCAGGACATGGTGCCCTGCATCCCAGCTG 

3 0 CAGGAGATAATCTTGGGCTGTTGCTTCAGAGGGGGTAAGC 

AGAAGGCAAGAGTTGAGGCTTGAAAGCCTCTGCCTTGATTTCAGGA 

AGAGCCCTCATGGAGATCCTCTGCTAGGGCAGTGCAGAGGAGATACATGGGGTTAGAGCCCC 
TGCCTACTGGAGCCGTCAGAAGAGGGATACCATCCTC 

AAAGCTGTAGGCACTCAATGCTAGCCTGTGAAAGCAGCTGCAGGGTCTGTACCC^ 

3 5 CTTGGGAGCTCACCCCTTGTGTCAGCGTGGCTTGGACATGA^ 

ACTGTCCTGCAGG1TTTTGGACATGCATGGGGCCTGTAGCCTCTTT 
CGAATGCCTGTATCCCAATTTTTTCTTGGAAGTAACTAGTT^ 

AGAAGACTTTGTACTTGGACTTTTGAGTTAATGCTGGAAGGAGTTAAGACTTCCGGGGAC 
AAATGTAAGAACATGAGATTTAGGAGGGGCCGGGGGCAGAATAATATGGCT 

4 0 TAATCCCCACATGTCAGGGGAGGGGCCTAGTCGGAGGTGACTG^ 

AGTTCTCATGAGACCTGGTTGTTTG AAAGTGTGTGGCACTTCCCC CTTCTCTCT CTGTCT CCTCCTCTGCCATGTG AACATGTGCT 
TGCTTCTCCTTGAACTTCCACCAGGATTCTAAGTT^ 

AATTAAGTCTCTGTTCTTCATAAATTATCCAGAC^C^GGTAGTTCTTTATTGCAGTGTGAA 
CAAAGTTGGGTTCCTGAGTCAGG^GTATGTGTATTTAAAATTTTAACAGA 

4 5 TCACAGTTCCCCCTGGCAGTGTTTATACTTTTCTTTATAATAAAAATACA 

TTAACAATATGTGTAATGTTCAAGACCAATTTAGTGC^ 

CCTCATCTGATGGTATCCTCTGGCAGCCACAGACCAC^ CACATAGTAGGTACCCAGT 
AAATGTTTGTTGAATTAATACTGTTTATATATTTCTAATTTATCTC 

5 0 CTATAACACCTAGCACTTAAGGTACTO^TTTAGGTAGGGCTGTTT 

CTTTAATACATATATTTTACTTTCTCTTAGGGAAAACTGGAACTG 
GC!AGTTACTATATGTGAGGCACAGTTCAAACTGC!AGAGGATAC!AACAGTGG 
ATGGTTTATGGGTGTTACATTCAAGACATTTGTAGGACA 
GGAATATTTTTCCGTTCATTCAGCATTTACTTAGTGCCTGCTCTGCCAGGAA 

5 5 CTGAGAGGTAGTGAACTGACCAGGATGACTGTGGGCAGTAGACTTGTTTCCCAAACT 

GAGGAAAGGATTAGATATAGGGATTCATGTCAGCATACACCCCAGGC^CATTTC 
TACCCAGTAGACAAACGACGAAGAAGTATGCTCCCGTCATTGTGAAAGAATCAT^ 
GATCTCACTTCATATGAGGAAACTAGGTCCAGTATTGTGAGTAAGAATTAGGACTCTT 
CAACTGTGTGAACTTGACCAAATTCATAACCCTGTAAACTCTGTTTCCT 

6 0 ACTG CAT AGCACAGTGTCTGGCACTT AAAAAGCACT C CTG AAGTT ATTTTT AG TG ATGTG TTTT AAGATT AGAC AACT C CTT AATG 

CCAAAGGTTTTTACTTGAGAACTCTGTCTGTTCT 
ACTGGCTTCATTATGATCTTAACTTTATT^ 

ATTTATGGGGTACAGGTCATGTTTTAATGCAAGCATGCAATTGTGGGGGTGAT^^ 
GTTTTGATTTGCATTTCTCTGATGATTAAC^ATGTTGAACATTT 

6 5 TCTATTCAGATCTTTTGTCCATTTrT/rAAGTTGGATT 

TCCCTTCTCAGATGGGTAGCTTGCAAATATTITCT^ 
TTAGCTTGATGTGATCCCATTTGTCCATTTTTGCATTGGTTGCCT 

GTGT CCTG G AG AG CTT CC CAAATG TTTTCTTTTAGT ATCCT AG TTTCAGGT CTT AGATTT AGGG CTTT AGTCCATTTfTATTTG AT 
TTTrATATGTGGTGAGAGATAGGGGTCTAGTTTC^TTCTGCCTATGGATATC 

7 0 CCTTTCCCTAGTGTATGTTCTTGGCACCTTTGCTGAA 

GTGTATCTGTTTTTATGCTAGAACTATGTTGTTTGGGTTATO 
AGTTTTATTITTTTTGTTCAGGATGGCTTTC^CrATTCC 

TCTGTGAAGAATGTCATTGATATTTATTAATAAAGATTGCATTGAATCTGTAGATTG 

TTGATTCTTCCAATCCATGAGCATGGACTAT CTITCTTTTTT^GTGTGTCCTCTTCAATATTTTT CCTCAGTGTTTTATTGTTTTC 
7 5 ATTGTAGAGCrCTTTCACTTCTTTCGTTGAGTTTAT^ 



1865 



WO 03/008583 



PCT/US01/51291 



GATTTCTTTTTTAGATTGTCCTCTGTTGGCAT 

ATTTATCTGTTCTAATATTTTTTTGGTGGAGTCTTTAGGCTTT^ 

CTTCCATTCCATTTTGGATTCCCTTTATATCTTTCTCTTGTCTGATTACT^^ 

GTGGG CACT CTTGTCTTGTTGT AGATCTT AG AAG AAAGG CTTT CTG TCAGTTTGTTG 

5 CAGATGGCATAACTTTCAAACTAATTGATTATAGTTAGGAAGTGGATAC^ 
CCATAAGCTTGAAGAGGAGCTGAAAAATGCATATGTGATGCATATGCTTC 
ATCCACACAAGTTCCTCTCTCAGTCACTCATCAACrAC^ 
CTGAAGAAGAGCAAGCCTTTCATGTCTTGAGAAATTCTTGG^ 
CTTCTTCTTAAAGAGAATACTAACACTTTGAACGTCATAATACTA^ 
1 0 TTTTGTATGATTCAGTGTGAATTAAATCCCACAGTGTAAAGGACTTT^ 

GTTTATAAGTAGATTTATTACACCAAAGCACCTAGCAAATTCTTGAATGGATCM 

AAGTCAAATAATTAAGGATTTGTTAAATATTTGTCTTAA^ 

AATLTriTGTTTTCTCTGCTTGCTCAAAGTAGCTATGTTTC 

T CCCACTTAGATTTGGCAATT ATCACTCCTGT AAATG AGAT ATT ACA.T A^ TAT CTTTCCAAGAAGAAT A 

1 5 GTATCCTTCCATATTAACAGTTTAGAGCTGACTGCTTTTAAAATTTAGTG^ 

TCTACAAATGAGGTGGGCAGTTCTGCTGATCTGGCCAAGCTGAACTTATCTCA^ 

GCTGGTTGGCTGTAGCAATGAATGGTGAAAGTAGGCTGCCCTTAACri" 

GAACCATGCAAAACTCTTTAAGACCTAGGCTTGGAACAACTATATTTCT 

TCTAGATTCAAGTGGGTGGAGGAGCTGCAATTACACTGCAAAGGAGTGTGACT 
2 0 TGCGATTTGTCACAGTAGTTGTAGGAATCAGGTCTATTTAAAATTCTGATCCTTCTGTG^ 

TCTGGTGGAAAGGCAGAATCATTGTGACAGAAGGATAAAATCTTGGAATTTA 

CTT ATTTCTG CAATCCTGCAGAAGTT AATCTTCCTGATAGGCATTCAGTAATGATTGATTCAC CTG AACCTCAGATTCTTT ATGTA 
TTTT AAAG AAAGGGCT AGGT AAATG C AAAG CACTT ATGT AACTG CTTTT ATT ATTGCAAACCTGG CT CCC ACACT CCAT^ 
GTAAGACTCAGTGTCTTCCTTGAATTAAAAAGGAAG^ 

2 5 AATAAAGTGACCACTCACTGGGGGTATTTTCCTGTAAGAAAAGAATGGTTG 

CCTTGGGGTAGGATTCCCTGTGATTCATGGGTTGAGAGGTGTAACATTAGACACAG 

GTGGTCCATCCTATACACACTG^TGCCTTAATACTATATGTGGCAATTATCACT 

AAAGAACAGTATCATTCCACATTAACAATTGAAAGATGACTGCTTTTAAAAAAT^ 

ATTTCTTGTGTTAAACATAGTTGT CAT ATTGG ATG AGG ACT AAACAC CT AAATT CAT C CAACT AG TAG TAATAGAAAAGATG AAAC • 

3 0 AC ACACACAG T AAAACT AG ATTAATTT AATTT AT AGAAAGGG CCAGAT ATCT 

TAATGCCTCTTTTAGGAGAGGTACCAGGTAAGTGTTCTCAAAG 
AACACAAGCAAGTCTGAATTTTTCCGCACCATAGCTACCTTT 
TGTAATTTTTTATTAAGGGTGAAATTTAGTGTAGAGAGTAAAGGCATTTG^ 
AACTGCCAGCCrrTTATGGATTGCAGTCTTCGCTGTTTTGATTGAC^ 

3 5 AGGGTCTCACTCTGTTACCCAGGCTGGAGTGCAATGGGGCAAACTTGGATCACT 

CCCACCTCAGCCTCCCAAGTAGCTTtSGACTATAGGTGTGCACCACCATCCrTGG 
AGTGCTCTGTTGCCCAGGCTGGTCTCACATTCCTGGCCTCAGTTGATGCT 
GTGAGCCACTGCACCTGGCCCCCAATTTCATCCITTACAAAGACTACTTTCAACCATA 
t CATATTTGGGATCCAAGGATATTTTCCCAAATGATTAATGCTAATTTCA 

4 0 AGTAAAGAAAGACTCTTAATTTGGGAAAGACAGTTACTTGGAGAGAAGAGAAAC^ 

TGAGAGGATCATAAAGAGATAGCCATAAAAATGTTTCTCCCTATATTGCCTC 

ACCTTGTACCTTTTCAGCGCTGCACTGCGTGCrCATAGGGAGGAAAGATA 

TCTGGCTAGAGAGACATGATGCAGACACATGAAATAATTAAGAAACAATTAA 

AGAGAATGGTAGAGCTAGTGTC<5GCGAAGGTATAGCCCAGAGCATCATCAGATGATTCTTCCTTATGCAAATTCA 

4 5 C^TCAAGTATCATCCTGGCATGCAGCAGCTCCATAGGTAATGCCCTAAGGCTAGCCT 

GTCATGGCCTTTTTTTGTGTGGGGGGAGG^AATG<5CATCCCCTTCCTGTCTGC 
GCTTTGTTTTAGTGGATTGTTAAAAAGAAGTGAATGAATTTATGCTTCA 
GGTATTTCTATTTAACAAATGACATAACTTGAAGGAATGAAATCATA^ 
ATACTTTGAGACAAAAGAGCCTTCCCAGTGTAACCGAGATCACAGCACCTACTTCA 

5 0 TCCCATAGAGCAGTAGCTGCCTTGTTTTTCCTCCCTCCTCCATCATTCATTCTAAATCTCCAGTCCT 

CCCTGATACCCTTAAGTCACAGATGGTGAATCAGTCAAAAGTAGTATTAAAAACTAGTGGTACACAGCTACACCTGG 
AGAAAAATACGGATTTCTGTACATCATCTTCCCTCCCTGCTCTTACCCCCATTT^ 
GTTTTTGAAAGTCCCTAAAAATTTTGGATGATCA.CCTACATTTAGAACCACTGCACTAAG 
CTGTTGCCAAGGAG<3TGATTATGCAAGCTC^AACCCTGATAACATGAGGAGAATCCCACAA 

5 5 ACATTATAATAAAGCCAAAAGCAGCAGGCCTACCTGACTTTCTCCT 

TAGAGGTAGCTCTAGATGGAGAAGGTCCACTAGGTGTCATGGAAAGAATCATGTGCAAGAAGTAAAGAGGTGCTCT 
CCCTGCTTAGGTGTCTGTGTCCTCACATGAGAATTTATCCACAGTTCTTTCCCGCT 

GAG ACAGCAAAAAGCT ATGGTCCAGTCTCCTTCCATTGTATCAT CT CATOU^TGT ATTTCTCCCACT ACCCTTGTGTGAAATACAA 
A C lTl Tl TtaGCTTATTGTGATTATGCAAGGTCrr^ 

6 0 TTTCCTT ATTTTG CTTTAGAAAAATTCTCATTGGCATAGATCTAATTTCAGGGAG CCTCCCTTGAAAGCT AAATAACATTGAGAAT 

TCATGAAAAT ATAATGTAGAG CATTATGCCTGTTAG CAT ATT AGTTTAAATAGAAGTGGTTCATG AAAATTTTTG AAATGCCAGAC 
CCTGTCCTGTGTTTTGTATTCTCCCAAATACTCATCCAGATACTGTTCAGAATGTAACATGATTATT^ 
CTAGTTTTTAAAAAAGTTACTTTATACATTAACCCTTATGTTCCT 
AGTAGATTTCTTACAGAAATTGTCAAGTGGCTCTCTGCrGTTAGCATGGTTACT 

6 5 TTTCTGGAAGTGGACAACTTACTTGTAAATAAAAGTGCATAATTTGTAT^ 

TGTGAGCAGCATGAATGTGTGTGATATGCGTACATACGAATTATGTCTCTTAAAAATCT 

AAATCTACCTCATTCTTTCTAATAGCCATATGGGTATACCATAATATATTTAACTAGGCCC 

GTAGCTACTATAGTGTTGC^GTGTGTATCTGTGTATGTATCTTT^ 

G CT ATTTCAGATT 1TTTAGG TTT AAATCTT AGG AAAGGTTTTGAAATTG TCTT AAGT ATTTT CAG AAG CATT AAAT CATGGTT 

7 0 TTTTTACATTTTTCTTTTAGAAGTTTTATGT 

CATTTAAAAATAGGTGGCATGACTAGACrrTCTCAGCCGAAAGAGTGAGGTCCCA 
TAGCTGCTGACCTGATGTCTGCCTAGAGTCTGGCAAGGTGATGTGTTGAATCTAGTGTCT 
ATGAGATTTGTGGTTTTCATCACTTCATGGTAATTATCCCAAGTTATAAGATGC^TCT 
C7UU3TAAAATACATGTGAGTGCATGTATGTGTGCATACAAATTACTTCTCTTAAAAA 
7 5 CTCCGAAGGATTTTGTGGAAGCTGTGTTCAAGTACAGCTGCTGAGCGTATGTC 
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TAGTGTCTTTTT G ACTCACT Gl 'TTTTTTTGACTCACTGTC 

CTGTCTCTCCrCCCTGCCCTCGCCCCCATTTAAAAGTATAGTGGCAGAACCCAAGAATCAG 

TTGGATGATCACCCACCTGATCATGTCTrrrTTACTCACTATGl 111111 1"! l"i TTTGAGACAGAGTCTCGCTCTGTCGCCCAGGC 
TGGAGTGCAGTGGCATGATCTTGGCTCACIXXLAAGCTCCGCCTC 
5 GGGACTACAGGGCCTGCCACCGCGCCCGGCTAATTTTTTGTATTT^ 
CX3ATCTCCTGACCTCGTGATCCACCCGCCTCGGCCTCCCA 
CACTATGTTTTTAAGCCCTTGTTTTCATTTGCTC 

TTT AAAATCTGGGGAAT AT AAAT ATTGGAT AATGATCATCCTGAAAAAACAA CAT AAT ACTTTAT AT AT CAAT A 
AAATGGCAAAATAAGACAGTTGTTGAAGGACAGAAAGAGTAACTGAAGT^ 
1 0 TAGGCATCATA TlTl ' TT AAA TITITr TTAT 

TACXLACCCAACTTAAGGAACAGCTCATTGCTATTC 

ATCTGCAGGACTATACTAGTAAATTTTGTAT'rr 1"! 1 GCATTATTTTGCTTTGTTTTATGATTTTACTACCTATCTACATATCCCTA 
AATAATACATTATTTAGTTTCATATGTTTTAACm 

AAGTTCTAGGGTACATGTGCACAACGTGCAGGTTTGTTACGTAGGTATACATGra 
1 5 CATTTACACTGGGTATTTCTCCTAATGCTATCCCTCCCCTAGCCCCCCACCCCC 

AACTCAACATTGTTTGTAAGATTCATCCATGTAAGCTGAAGCriTTlTATAGAGA 

ATACATTTCTCTGTTCCCCTGTTAGTGGACACTTGGATTGT^ 

CCTGAAACACATTTATAAGAGTTTTTTTTTCCCCAAGGGAATTATACCT 

ACAACTTCTGCTAGGTAATGCCAAATTGTTTCCAAGGAGCGTTAGAA 
2 0 TTGTGGCAGTATGGTGGGTGTGAAATATTTATGTTTAGTTTTTCT^ 

AAGGCAGTTTTTTGTTTTTGTTTTTATTTTT^ 

TGCAAAGTTCCTAAAAGGAAAACrTrTTTTGGTATTCT 

GTAGTTTAGAAGTCAGAGCAATAATTGTGATTCTTGAACAGAATGG 

CTTGTTGTGGGTCAGTTTTAGCCTCCCCAACTTTTATG1TAA 

2 5 GTGACCAGCTGTGATAAAACACAGGGAACCCTTGCAAAGGATTTCAAAATT^ 

CCAAGGTTTCRGTTACCCGCAGTCAACTGTGTTCT 

TTTATGCTTCCCACCCATCCCACCTGGGATGTCAATCATTCCTTTGTTCAGCAT 
CACTTGGTAGCCATCCAGGTTATCAGATTGACTCTTCTAGTATTACAACACT 
GCCCCAAAGTGCAGGAGTGGTGATCCTGGCAATTCAGATATGTCAAAGAGAAGCTGTAAATTC 
30 ATT CT AGACTTATAT AT AAAGAAAAG AAATCATATG CTGAG ACTGCT AAG AT CTATG AT AAGAATG AAT CTTTT AT ACATGAAATT 

GTG AAG AATGAAAAAG AAATG CG TGCTGGTTTTGCTGT CAT ATCTC AGACTG C AAAAG TTTGCAG C C AATGTGT AT G AT AAGTG CT 
TAGTTAAAAGGAAAAAGGCATTTAAGGTAAGTATATATAGTGTTTG^ 
AGTATAAGGGGAGACTACTCTTTTAGTGTTAAATGAACACTAAGGAACAGAGAT 
TTTGAGTATAGGTAAACIAGTTGTTTGAGAAAGAAGAAAAATGTGA 

3 5 ATTATAGTCACACAGAACTCTTAGTATTTACAGAACGTTCAC^ 

CTACCACTGCTTTATTC^TAAAGATATAAAAGGTAAGAGAGATGAAACATATT^ 
TGGGGTGGGACCCCAAGGCTCTTCTCGCCAAGCCCAO^TTTTTT^ 

TTCTACTGTGGACAATGTCTGTCAGCAAGCArrGATCCCCTGCCTTCAGCACTCTTACGTCTT^ 

CAGGAGTTT ACAGT CTGCTGG AG CAACAG AAAAG AATG AT ATGAAATATG AAG AG AC CAAAAT G ATTT AT AAT AAGGTG CT AGACT 

4 0 ATGTAGTAAAAATCTGCTTTAGCTGTAAGTCAAAAGCAAGAGCAGTCTTTTCAGAATGGAATAGAAATGTTG 

TTCAAAGTTGTGAATTTTTTTCAAGATAAACATGTTTTATTT^ 

CT AAT ATT ATAGAAGATGTACAT AT AATG AC AAAAG C AAACATTTT ATTTTT AAGGAC CACAAT CT AAT CT AAAACAAAATTTCCC 

CCTTTTCTGGTCTTTGGTTAATTAAGGACTTATTTAAATATCAAAGAAAGA 

ATTAATGTCCTCCATACCTTACACAGATACTTGACTTGGCTATG^ 

4 5 CTTTATGCT AAATTGATAGATCTCTAGG ATCAGATTGCCATCACTCT CAG ATGCG AAGCTT C CAACCACTTATAGGTTCCTGATAT 

CTTGCTTTTATACAGACCTTAATTTCTCTrCCTTTAA^ 

TTTCC&AGTC-GGAATTTTTGTGTTTCTGCATCTATC&TG 

AGGCTGAAGCCAGACCAATACAGCTTCAGCTAAATGCCGACCIT 

AGTCTGTAGCCGGTGCCCTTAATATGTTGGCTTCATGTCTCATGACTCTCTTCTGTAAATAT^ 

5 0 CTGCTGTAGAAGATACATTTGCAAAATTGATGTATCCCCTC^ 

ATTTGAGAAGAATGCK^CGATATGCATTATTCm 

TCTGG AGTGTATGCAAGGGGT ACCATCTrGTCCTTGGTTCCTGG CTGCT ATTCCAAGGTGCTAT AAAGTC AG CTAAAG AGAGCAAT 

CATAATACATTGATAGCATCCCTCAATGTGTTTCTGAGCTTACTTC^G 

AGGGCAGCATGGGCAAAGGGATTGGAGGGACTATTATTATAAAGATCCACTGAACTGCT^ 

5 5 GACTGGAAAGAGCCAGATTCCAATTTAATCTGCTCTTCTATGAATTCTTAGC 

CAAGATTTTCAAGACATCrrTAAAATAGGTGAACrCCAAGGTC 

GATCTGATATTGTCAAAATGAATGAGGACTTAGTGCTCTAGTAAGTTTGGAACAGAATGA 
TCTTTTTTGCTTTTTAATAGATTAAACTTCACCGTAGAAC^ 

6 0 GCCTATTCTGGACATTTCX5TATAAATAGAATCATATATATGTGGCTTCTTGTACCTC 

TCATCCATATTGTAGCATGTAACAGCACTTCAl TTTCT 1 ' 1 ' 11 'ATGGCT^AGTAATATTCTGTTATGTGGATATACTACCATATTTT 
GTCTATCCACTCCTTAGCTGATGGTCTTTTAGGTTGTGTCCATTCTTTC^CTATTAT 
AAGTTTCTGTGTAGACATATATCTTTATTTCTCTTGTGTGGATACCTAC^AGTAG 
TACCTTTTGAGGAACTGCCAAACATTTTTCTACACT 

6 5 TCTGT CCTTGC CT AT ATTTATT AACTGTCrriT C1T ATT AG CCAACTGCTGTGGT T CG AATG TTTGT C CCCTC CAAAACTCATGTT 

GGAACATAATCCCCAATGTGGCAGTATTGAGATGTGAGGCCTTTAAGAAGTGCTTG 
CTAATCCATTCATGAGTTAATGTACTAATGGGTTATCACnXKiATTGGGACT 
GTAAGCTCAGCCTTCTCACTATGTGATTGCTGCCCTGTGTCACCTTGG^ 
ATCAGCTGTGGCCCCTTGACCTTGGACTTCCCAGCCTCCAGAAATGT AAG AAAT 

7 0 CACGTATTCAGTTATACCAACAGAACACAGACTAAGACACC^^ 

CCCAAATGACrAACGATGTTGAACATCTTTTCATCTGCrTTTTG^ 
TTGCCCATTTTAAAATTAGGTTGTL' i i i ^ 

ATATATGTGTATCTATATATGC^ATATATTCTAAACACTAGACCCTTATGAAATAT 
AAGGCCATCTTTTCACTTCTTGATAGTGTCATTTGACTCACA 
7 5 TTTTTGGCACTGTATCTTTAAAAAGTrGCCTGATCT 
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GCTCrTACATTTAGGCCTTTGATCCATTTTGAATTAATTTGTGTATATGGTG 
GATACCTAGTTGTCCCAGCACCAlT"l^'rTGAAAAGATTAlTLXrrCCCCATTGA^ 
CATAAATATATAGGCTTATTCCTGGACTCACAATTCTATGAGTCTGTATGTCT 
TACATCTTTGTACAAAGTTTTGAAATTGGGAAATGTGAGTCIT 
5 TTCGTTGCAAACTCATATGAATTTTAAATCAACTCTCCATTTCTGGAAGAAAAAA 

TTG AACCTGT AGATCAGTTTGGGGAAT ATTGCCATCAT AACAATT AGT AGGTCTTCC^^ C CATT 

TCTGTAGATATTTAGTTTCTTTCATTAATATTTTGTAGTTTTCAATAT 

TTTGGGTTTTG ATGCTTTTATGAATTT G ri'l'I CI '1 AATTTCACTTTAAG ATTGTTCATTG CTACTG ATT AGTAATGCAACTGATTT 
TTGTGTGTTGATTTTTGTATCCT<3CAACCTAGCTGAAATCA 
1 0 ATAAG ATCATG TCAT CTGCAATT AGAG AT AATTTTACTTCT CAATCTCG ACATTTTTT ACTTCTTTTTCTTG C CT AG TT 

GCCCTAGCTAGAACCTCCAGTGCAGTGTTGAATAGCAGTGGTGAGA^ 
TTTCAGTTTAAGTGTGTTGTTGTGGGGTTTTCATAGT^ 

TG'mici i"Ji r r gattgtttt aatcagg aaaggg cattagattttg tcaaatg ctttttctg cag ctattgagatitti'gtgtg tt 

TTTCTGGT CTTTT ATGGTTT AT CAC ATT AATTG ATTTTCAT ATGTCAAAC AAAC C CTGTGTT CTTGGG TTTCAT CTCACTTGG TTA 

1 5 tggtttataatcctttitatatacttctagattcagtttgccagta 
tggtctgttgtagctgaccagtaagtatagtaagctgtatagtttact 
ggattgttggtaattgttctttaaacatttggtaaaattcactagtga^ 
aacttaatgtctttacttgtttgttataagtccattca^ 
tcatttc^tgtagttatctaattggttagtctataattatt 

2 0 ATAATGTTCACTCTTTCATTTCGGATTCTGGTAATTTAAGAGTCrTCTCT 

TTT<jTCCGTTTTCAGGGGAACAG C ' rX '' J. I ' ll l ' T i' l I TGAGGCAGAATTrCCATCTTGTCACCCAGTCTAGAGTGCAGTGGTGCA 

ATCTCGGCTCATTGC^GCCTCCGCTrCCCGGGTTCAAGAGATTCTCCTGCCT 

CCACCACGCCTGGCTAATTTTTTATATTTTTAGTAGAOAroGGGTT^ 

GGTG ATCTGC CTG CCTTGG CCT CC CAAAGTG CTGGGATT ACAGGTG TG AG CT ACCGTG CCCAAC C CAG CTTTGGTT ATTTTTGTTG 

2 5 ACCTACrCTATTGTTTTTCTCTTCTCTATTTC^ 

AGTT C lTCTTTTTCrAGTCTCTTAAGGTGGATAATTAAGTTCCTGATTTGAA 
ATGAATTTCCTTCTCAGAAATGTATATGCTTTCACTGCATC 

GTATAGTCTTCTGATTTCCATTGTGATU'rCXTCCCCCTCTAACCCGTTTATTATTTAGG 
ACTTTCCAGATTTCCTTCTTTGTTAATTCTCAGTXS 

3 0 ATTTArrTGGCTTGTCTTATXaACCT 

TTGTTGGGTAGAGTGTTTGACAGGTGTGTTGGTACATAGTTCTGTTCAAATC^ 

CT ATTGGAAGTAGG ATATTGAAAT CTC CAACT AAT ATT GCTGAATTGTTT ATTGTTTTCTT CAG TT CTG TCACTTTTTG CTTT ATA 

TATTTTGAAATTCTATTGTTAGGTACAAGTAAGTTTATGAT^ 

CCCTATAAGAACAATTTTTATCTTAAGTCTATTTGTCTATATTAGTATAGC 

3 5 ATATTTTCTTCTTTTACTTTCTATTTGTGTTCT 

TTTATTTGGGGAGTT^AAACCATTGACATTTAATGTAAT^ 

ATTTTGTGTCTCTTGTTCTTAAATTCTTCCATTACT 

TTTCTCATTTCTTTTGTTGTATGTTTTTAGTTATTTTCTTAGTG^ 

GGCACAGTGGCTCATGCCTGTAATCCCAG CACTTTGGG AGACTG AGGCAGGAGGATAGCTTGAGTCCAGG AGTTCAAG ACCAG CCT 

4 0 GGGCAACTTAClt3AGATACTGTCTCTACAAAAAAATACAAAAATTAGCCAGG 

AGAGGCTGAGTTGGGAGGATAGCTTGAGCCCAGGAGGTTGAGGCTGCAGTGAACT 

CCAGGGCAAAACTAGCCCAAAGAAATGAAGGAAAAAAAAAATCTAATTTA^ 

CTTTCCCTCTGTATACCTCTTCTGCTTCCACTCTGTGCTGTTATTGTCATAGATTTTCATCT 

T AG ATTT AAAAAT ATTGCTT AGT AGTTGT CTTT AGAAT CCG AT A CG G AG AAAAGG AG AT AT AAACAAAAG ATG CATTTTT ACTGT C 

4 5 TTGTATGTTTACTTATGTAATTCCCTTTCCTG ATGTTGTATTTCTAAAGG CAAAGTAGGGTT ATTGTG AGTGT CCTTTTGTTTCAA 

CCTGAAAGACTCCTTTTAGCATGTGTTGGAGATATGCTAATGATGGACT 
CCTTCATTTTTGAAGGATAGTGTTGGCAGGATACAGAATTCTTGGTTGACAT^ 
CTGTCTTCTGACCTCCATGGTTTCTGATAAGGAATCAACTCTTAATCT^ 
GCTGCTTTCAAGATTCATTCTTTGCCTTTAGCTTTTGGTAGT^ 

5 0 CirGGAGTTTGTTGAGCTTTGTAGATGTATTTCATCA 

CCCCTTTCCGCTCCTTCTGTCACTCTGATTATTTGTCTGTTGCTTTGTTTGGT 
TTTTTCCTCCCCATTCTTTTTTCTTTCACTTCCT 
CTCCAAGTGACATCTCTGAGATGAATTTTTTTCTAGAGAAT^ 
TTTATCATTGCTATCTTTATATTATTCTCAGTTTAATGAGATA 

5 5 ACTGAATATATTTAAAATAGCTGATTTTAAGTCTTTTTTTT^ 

TGCAGTGGCACGATCTCAGCTCACTGCAAGCTCCACCT 
CTGGGCTTCCTCAGGC71TAGTTTCTGT T1T CT 1T CT 

TAACTGGACATTTTGAATATTATAGTGTAACAACTCTGGCAGTCAGACTGTCT 
TAGATGTTTGTTTAGTGACTTTTCATGGCTAATTCTCTAAATTTTATATTCTTTG 

6 0 TTAGTTTAGTGGTCACCTAATAATTAACAGAGATTTCATTAAATGCCTAGAM 

GTGTATGCATTAGGGCAGG CCTTGAACTCTT ACCCAGGGAGTTTACAACCCTG CCTT CCTTT ACTACCAGCTTCTGCAG AGCAT 

TAAGGTCAACAGGTGGTGAGAGTTTGGAGCCTACTCCATCTTTTCCTGAGCATAT 

ATTTCCAGGAGTATGTTGGACCCTTTCAAAGCCCTTAC^ 

TTCAACAGTTATCTGTCAGGCAGCAGCAAATTAAGAG ATTAG CAT AAATGTTTT CAACTCCTCCACCCGTCATGTGCCCCAGGGAA 

6 5 GCACTAAGCCAGTTCTAAGTTAGGCAAAATAAAGACAATCCTTTTGAGGTGGTCT^ 

AATTAATTACAAGTCITTGGCTGGATACAGTCGCTCA 
TCAGGAGATTGAGACCATCCTGGCTAACACGGTGAAACCCTGTCTCTACT 
CGCCTGTAGTCCC^GCTACTCGGGAGGCTGAGGC^GGAGAATGGAATGAACCC^VGGAGGTG 
CTACTGCACTCCAGCCTC<X3TGACAGAGCAAGACTCCGTCTCAAC^ 

7 0 TGCTCTCTTTCATACCAGGAATGTGGAATGTGGACTGTTATTTTCATGGCTACT 

TGGGTAAAACACCACAGAGTTTGCTGTTCTTACCAAGAATAAGCTGGGGAA 
GGCTGCTTCAAGCCGTTGATTAATTTTCAGGTTCCGAAAAAGTTCAGTTTGAC^ 

TCT AGAACTTGAGTTCTTTTTT CXIACCAGTTTTG CTGACATTG TTTT AAAAGCIACTTTTTGT AAAAC C CAAATG TTG TCT CT CT CA 
AGGCTAGCCAATAATTAAAAATACTGTTACTCCCCTTTGATTTTGGAAATGAATTC 
7 5 TTCAAGCTCTTTTACCATTTATCTAAACTTT AGAAT 
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AAGTTCAACTTTGGAACAAATGAACTTl'r 

T ATTCGTGGGACATCT AGGAAT1TCTI' ACAG CAG CTGACAAAT ATTTCAAG TCAGTG CCTGGT AGT ACTGTCCAC CAGGCAACAG C 
TTCAGTAGTAGAGCGATCTTTATCTATAAGGCAGTGTTTGAGCAATTGTTTATTAGT 
GGGGTTATAGAGGTAGCTCAGAGAGTTGGGTGCAAGTAGAGAAATCCACCCGGCTTGCATTA 
5 TCCTTTGAAGAAGGAGTTCTAAGGTTTAAAAAATTACCTTGAATGCCAC^ 

GGAAAGTAGAAAAATT AAGG CACTG AT ACT AGTGTT AAG AATGTTGG TT AAAG CTT CTGG CAATT AATTTTTT ATTT CCTTTTTT A 
ATTTTATTAAAATTrAACAATTTTCAGTTTATGCTGTAA 

TACCT ATTTTGT CTTT AGTCATT CAGCCATG CTTCCT ACTT AT ACTG AAT AAG CT AG CTT AAT CT AACAATCAAAAAAGAAAG CTG 
TTGCCTAAGTTAAGAAAAACAGTTTGAACTGTTT^ 
1 0 TGAGAATTGTCCnXSCAGTAGATCATTTTATCT 

TGTGATTTTTGTGTGCTTGTTTCTATTTTTCAC^^ 

GCAC^GGAAATTTACATTAGAACTTGGCTGTGTATTTACATGTATCTAA 

ATATTAGATGAGTCCCTAATGAGATACCAGAAACCCCGGAAAGATCOT 

T AGGT AAG AC!AAAGC CT AGTACT ATTTGTGG CAT CAAGGTT AGG TGTTT AAAGAC CTGTATT CTT CT ATTGT CATGTTGAAATTG T 
1 5 TCCCTTGATGTAGCAATAGAAAATTTTAGATTAGG 

TT GTGTTT CT AAAT AATTT AACAGTG ATCCAATT AGCTACT T AATTGATAAACTTTTCACT CT CTTTT AAATTG 
CCATCITGAATTTTACCrGTTTTTTAAAGCTC 

AG CACAGAAAGT CACAGAAAAATGT AAACAT AGTTT AAAACTG AATTC CCACT AG AAGTACTTTT ATT AAGTTT ACAA 

ATATTAGTAGAGTGGAACTCATGCATTTAATATGTTTGAAACT 
2 0 TCTTTAATCATGTGTGTGTAAAATGTGGCTTTTr^ 

AGTACTCAACAGAGTAAAC!AAACCACAAATACCACTTAAACTGTACACA 
CAAGTCTTCACTITTCAAATTCAGAAATGGGTTTT^ 
TTTATTTTTGAGACGGAGTCTCGTTCTGTGGCCCAGGCTATGGTGCAGGA 
GTTCAAGTGATTCTCCTGCCTCAGCCTGCCGAGTAGCTGGGATTACAGGTGCCT 

2 5 GT AG AGATGGGGTTT CTCCATGirGGGCAGGCTGGTCTCGAACrCCTGG CCrCAGGTGATCTCTCrGTCrCAGCCTTCCAAAGTGC 

TGGGGATTACAGGTGTGAGCCACCGCATCCAGCTAAAAACTTGATTTTTAA^ 
ACATTTATTAGCAGCCTTGACGCTATACCATATGGCTGTTTATCA 
TGGCAC^AAGGACTGAGTACTTTGGGAAATAAGAGAGAACTTTTGTTGAGGATGGT^ 
ATAAGCAAAAATCTAGAGACTCATTGTAGGCACrrCAAGTATGTATTTGTTAGAA 

3 0 TGAGAAAGCCATACTGACTGGAAGATAGTTCCTACAAGAAACT 

AAGTTTGGGCTTTATTTTTATACGGGGAGAAAGTTTCACAGGGGTTT^ 
GTTTTAGGGTAGTGGCTTTCAAACTGTTTAAATCCAAACTTTGATGA 

TTGTATTGGTTTTATCAAAACATCTITCATTGATCTTACTGAT^ 
AAGCTTTAGCAGTAAAATAGCAACTTTTATCTGTCTTACGTATTGGAGGTTCT 

3 5 GAGTTTTGCTCTTGTTCACGGGGCTGGAGTGCAATGGTGTGATCTCGGCTC^ 

CCTGGCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGTGCC^CCATGCCCGGCrAA 
TCCATGTTGGTCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCGCCTGCCTCAGCCTCCCA 
AGCCACCGCGCCCGGCCAAGATTTAATTTTTTAAAAGAAAATATTTTGCT 
TAAGACTGATTAAAACTAAAGGCAAAGAGGAGGCTCTTATGTTTGGAATTCT^ 

4 0 AAT ATTT AATGG TACCAG ATATTAG ATGG TT AT AATGG CAAAAGTGTT CAAAGGATG CTATCAT ATTCATG ATTCATG AT CAAAAT 

GAACATTATAAGGCTATCCCTCrTCAGAATTAAATACGTTACTCCT 
TTTCTTCCTTTCTCATGTCCrcrTATGTCCACTGCTGAGCT 

GAGTTCTGTATATAGTAAATGTTTAATTTATTGGGGTGGTGAACGGGAAGTGCTGCT 

CCAAAT C CTTAC CCG CTTT AC AG AACAT AAACTTCCT ATT CAG TAG TAC ACAAT AACTT AACG AT CAAGGCAT CTT AACTTTT CTG 

4 5 TTTTCAGATGAAAGAACTATCGTTTGGCTTGATCAAGTATT^ 

CAGGGTrTTACGTTAGTTATTAACCAAAAGAACTAGTTTTAGTTCrGGAAGTCT 

AGAAGATGAAGGGAGACTTTCC^AATC^CCTATGAACTTCTAGTAACTATACCACCITAAAATAGAC 

AAGATATGTATTTTTCAGTGAAGACAACTAAAATGTTTGCAC^ 

TACTACCTTGCACAACATAAAAAGAAAAAGTGAGTGTGGAATCTAGGAATCTACGTGGCTCTAGGAAATTTTTTA^ 

5 0 TGAAGGAGAGCAAGAGAAAGGGAGCATGGCATTCCCCTG 

GCACACrGAGGATTACAGAATTCTTTTTAAATGTTTGTAGAATTAT 
CCAATTTCAGCTTTCrCATGTTGCTATGCTCAATTTTGTATACCATATATAGTT 
TTCTTTTTCCCATTGGTTAAAATTrAAAGAGAAAGTGGAAGCTAGAAATTT 
GTATCAATCTAAATTTGAATTTTCTTTGTGCATAATCTTTTTTCAAGCTATTTAC 

5 5 TACACrAGCAATACXSTTATAAATATCTAACTTTCAACCTATTTACAGTTGATGC^l'T'r 

CTGAAGAGGTGAGGAAGTACCACTGCTGCCTCAGCArrATTTCGAAATTCTGTTTATAAACTATACAATTTC 
CAGC^CCTTTCCAGOTACTAACTATTGGGACAAAGATA 

TTGCACCTAGTAAGATCTTAGAAATAACGTGTGTACTCTGACCTGTAAACTAATCCTAGTATTCT 
TGGGCTCTTAAAAGGAAAAGTAACGTACATCTGATGATCATTAGCACTGAGCT 

6 0 ATAGGATAATTTAGTAATTTAATAATGTGACIAACATTTGCGTGTGTTTTTTTT 

G TAAAAGAAG CAG CAG CAG AAAT AT C ACT AT AGG AT C AAAAG ATTG CAGGAAC CAAAACT CCAAAATT ATTGGG CATAATGT ACT A 
AAAACAGGGCAGTGGAGGAAAGGGACAGTCCIAGACTAGCTCTGAGGGTCCAA^ 
CTCTGCTATTTGGGCTTC^GAATTAAGTGAAATTGTTGATATACTAGACAGATACTTC 
TTCATTTTTTCTATTTTCTATTTCTCTGGATGCTCCIATTTCnT 

6 5 TTGGTTATTTAAAACIAATTTAAATCAACTTCAGTCTAATTG 

AATTTTATTTAAAAGATATTTAAACTGATGAGAGGATACTACCCATTCCAC 
CTAGTTATTTGGTTTAAAAATGCTC^GATTATGGAAAGTGGATTGGAATATTT^ 

TT AATAAACTT AT AAATTCCTCTTTCrCTCTTG AT CTATCTTG AAAAGACACTCT ATGTCTCT AGGCATTCCTTCTCTGTGGTGTG 
ATTGGTAGACAGC^AGTAAACAACTTACTGTAAATGGGCACCATC^ 

7 0 AGGGTTAGGAAAATGCCTTTTAACTTGTTTGTCTCTGCCTCTTTTAAAC^ 

TAAGGTCTTTTAGGGCTTATAAGATCrrTTAGGAATGGCCTGGAAGTTATTAGT^ 

GATAATGAGAAGTTTTTAAAGGGTCGTTTTATAGTTAAACGGAATT^ 

CACATTTGGGAGTTTCTCTGCCCTACTTTCAATGTATTTAATTTACTGACCATCACT 

AGAAACC^AAGAGTTTTGGAGTTTTTCCCCCATTAGATGTATTTAT^ m AAAGAC AGGGTCTTGCTCTGT 

7 5 CACCCAGGCTC^AGCACAGTGGCATGATCCTAGCTCACTGTATTCTTGAACTCCT 
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G TGG CT AAGT ATCAAGT AAGAAT CACCTGG C AAATT C CAAGG CTGT AT AC CAG ATTT CCT AAATT AG AATTTTGGGGTTGGGT ATC 
TGAATTTTAGTAAAGCCCTCCAAATGTTTCTGGTATTGC*rT 
TTCTGTC^TATTTAGATATAG^CATACCTTGTTTTATTCT 
TTGAAG<3TTTCTGGCAACCCTATATTAAGCGAGTCrc 
5 CATTTTGXSTAATTCTCTCAATATATCAAA CrrrX " r CATCATTTTTC 

CCTCCCACCTTCAAGCAATCCTCCTGCCTCAGCCTCCCC^ 
TATTTTTAGTAGAGATGGGGATTCCCCATGTTC^KTCAGGCTC 
CAAAGTGCTtK^ATTACCGTGCCAGCCTGATGTTACTATTTTAATT^ 
1 0 ACTT AATTGGTAAAT ATTGCG CATGATCTGACTGCTCTTC CAACTGGC C ATTC C CTGT CTGTCT CCCTCTTCCTGGG ACTCTCAAA 

TCCCTCAGAGACAATAATATTAAAATTAAGCTAATTAATAAC^ 

CTCACTTT AAAT AAAAAGCT AGAAGTGG CTAAACTT AGra C CAAG ACAGGCCAAAAG CAAGGACT CTT 

GT ACT AAACAGCT AAATTGTG AATG CAAAGGAAAAG CTCTTG AAGGAAAT AACT AGTG CT ACT C CAG CAAACATGTG AATGAT CAG 
AAAGTGAAACAGCCTTCTTGCTGATACGAAGAAAGT^ 
1 5 AAAGClTAACTCTCnTCAATTCTATGAAGGCrGTGAGAGGTCAGAA 
ATGAGGTTTAGGGAAAGAAGCCAGCX3CTGTAACATAAAA 
TC-TAGAAGATCTAGCTAAGATTACTAAATAATAGATTTTCCATGTAGAT^ 
CTTTCATAGCTAGAAAGGAGTCAATGTCTGGCTTCAG^ 

GTTGAAGCCAGGTCTCATTTACCACTCCAAAAATCCGAAGACCCTTAAGACr^ 
2 0 GAAACAACAAAGCCTGGATGACAGCACATCTGTTTATACT 
GCTCAGAAAAAAATGATTACITTCAAAATATTGC^ 

AACAAGATGGATGTTGTTCTCATGCCTG C CAACACAT CATCCATTTGT AG CCC AT GAATCAGGGAGTGATTT CAAG TTT CAAATCA 

GTAC^TTTTGTAAGGCTATAGCTGCTATAGACAGTGATTGCTCTC^TC^ 

TTGGCCATTCTAC^TGCTATTAAGAATTTGTGATTCGCAGGAC^AGGTCAAAGGA 

2 5 TTCCAACAGTTATAGATGAATTTGAC4GC4GTTCAACACT^ 

GAATTAGAAGTGGAGCCCGAAAATGTGACGGAATTGCTGCAATCTC^ 
CAAATGAGCAAATAAATTTTTTTCTTGAGATGGAATCrrACT 

TT AGAGTATTACATAAACTTAATTGGT AAAGCAGCAG CATGGTTItS AGTGG ATTCATTCCAGTTTTGAAAGAGTTT 
TAAAATG CTATCAAACAGCATCTCGTG CTACAAAGAAATCTTTTATGAAAAGAAAAGTGAAACTTC^ 

3 0 ATTGCCACAGCCACCCC^CCTTCAGCAACCACCTCTCTGAT 

AAGATTACAACTCACTGAAAGTTCAAATGATTCnTAGCATTTTTA^ 

CAATG CT ATTG CACACTT AAT AG ACT ACAGT ATAGT AT AAAT AT AACTTTT AT ATGT AG TGG C AAAAAATT CGT CTG ACTT 
GCTTTGTTGCAATATTCACTTTATTGTGGTCTAGAACCGAACCTGAAATATCTC^ 

GTAAAAAACCCAGCATATAAAAAAAACGTAGAATATGTTGAGAGTTCAGTAATATGGATGAAAATGTTTTTCT 

3 5 ATGATAAATTATAATTAGGGAAGGATATAAAC CAAG AAAATATGTCTG AGATAGCCAATTCTTG CAGTTCATAATATGAAAACTCA 

TTATACCAATCTCAGTAAGAATACTTTTAATAGCTGTTATTTCTTTGGC^ 

AT CAAT CCT AGG ATCACTTT ACAACCACTT AC CC CAT ATTACAATG T AGT ACCAAG ACAAG C AG AC CAAATT AT AG AAGG ACAAAG 

TATTAGAAAGCCACCCCAGAAAAGTTAAATGACTAGG AATC CT AC C CCTAAAATTAT ATAACAAAACTCT 

4 0 CTTCATCTATTATTAGGTCATCTTTAGAACATCTTCTCTTAAATTTGTTATAC^ 

GAAAGCCTAAAATGGCTTTTTACCTATATAATTATTTCC 

GTGAAGTATGT AAAAACTACCATCTTTCAATTAC4GAACCATCGG ATAG CTTCTACAGG ATTG CTGGGG AGAAC CTTTATAAAGAAA 
GTTATATCTTTATAAATTTTTTGTCATTTTACTTAGCTGAGAATATAA 

AACAGATTAATATTGATCAAATGTGTTATTAAATGCTAAAACACCATTTTTTTTCTCTGTAAGCCATGT 

4 5 AAAAC^GACAATTGTCTGTGTTTTATGACAGTTCTGTTCTGTGAGATGCT 

GGACACATCTTTTTTTCCTCAACAAAAGAGGAAAATTATTCTT 
TTTTAAAGTCAGCATTTTTTTATAAAGATAGGTGTTTC-GAATC 

TCTAAGCrGTACCTTAGAAATATTAOTCTGGCAGTTGTACATAGGATGAGCTGTATC 

5 0 TCGAAGAGAGACTACATACGAAGGCAGTTATATGAGTCATTACTAAAGGTCT 

AGCAGTAGC^ AAGGAAAT AAAAGGATAGATGTGC^SAGTCATTTG^ 
TTCTGCTTCCACTCCATTCACATTGACCTTTCCAAGGTTATCAGTGACCTGCTTG 
CTACTGTTGGCACCAGCCCTGTGCTAGACAACAGGATCCTGTTTGTAAAGGCAT 
CTAGTTTTCCTCCTTCTACTTCTCn'AGCCTCTTGGCAAGTTCTTCTTTCAGAGTTTCT 

5 5 TTTTCTCCTTCTCTAAGTGATCCCATCCTTTTCT 

ATCTCCAGCTCAGATGTATCTCTTTGCCTCCTGACCCATATTTCCAACTATCTAACTC^GTATCTTTT 
CTCTCAAACACAACATGTCCAGAATAATTCATTGACTTATTCT 

TATATGGTGCTATCAACCCCAAAGCAGAAATCTGGACATAATCCCTAACTACCCTTTTCCCCTCTCT 
TTAGGCCTCATAGATTGGACTAAATAAATACCTCGCAAACCCrTCTACTTATATTCTTAAC 

6 0 ATAATTTTGTAGCTGGATGACTGCATCATCATCTTGACriGGCT 

TTCAAACATAAACATGTGATCAGATTAGTCCCCTCTTTAGAACACCCTAGC^ 
CATGGCTTACAGC^TCTTTTATGATTTC^TGAGCTTTT^ 

TGTGTCTTTGCATAATGCTGTTCCCTATACTTCACCTCACGTCrrAACCTTCATCTCCTTT^ 
AGCTGAATATCACATTGTCATGCAC^CCCATTCTTGATCTCCCACGTTTG 
65 GGTGTGTCCCCTATCCTAGCATTTGCCTCATTGTATTACAACTACTGTC 
GGGAGAGCAAAGGTTCATGTTTGTGTTTTTCACT 

TGTATGAAGAAATGAGAAAGAGAACAGGAAGAC^GTAAGTTTCAAGACTAGG^ 

GTT AAAACCT AG AAGAAGAGTTTG TTTT AGGAAATACTGTG TTTT AAAC C!ACT AT AACTG AAGCAAAAAC CCAAC^ CCTGGGTGTG 

7 0 AAAGTCTTCTAAGAAGATAGATTATAGTAAGAAGGATTAC^GGO^TAGAAATATGA 

ATG ATGAGTCATCATGAAG AG AGTAGG CAATTGT CCAGTGAAGAAGGGG ATG CT AACCCTTCTT AAC CTTGAATCT CTCAGGT AGA 

agcagttagagaaggaacagccatcatcagatagtgttctaag^a^ 
gcaactaagaaac^tatactaccactgtttaacaatcgccacaa^ 

agg aacttgg aggact aag aagag ag aaatctx3tagaagaggctt ctct ctctg atcctccctccaottcagtttt accacatgt a 
7 5 atgcaacaataattaagaatttgtgtaaaatttciaccaggttggcatgca 
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AT AAT ACAAGG AG CATTTGT AGGGAAAAAT ATTT ACAAATAC AGT AAGACCT ATT CTCITTCT AT ATTT ATGGGAAAATTTTAAGT 

TGTGCCCTTGTTTCATGTGTGTTTCTATTTAAAGATACCATACTTAATATAT 

CAGCACTATAAATCATGTCTGATCAAAACTTATGATACATGTACTT^ 

TCTAGGTAGTACTTCAGGACTATGCATAGAGGCCATTTTAAACA^ 
5 ACTAAATTTACCATXSAAAAGGA(^CTTCTTTACAGTTTGAGAGCT 

GGAACATGCATATCAGTCGACTCAAATTTTTTGCTAT^ 

TACAAATAAAATGTAGCAAATGTGTAAACTTGCAGATGTGGAATCT 

TCAGTTTTTTAAAGTACTATTCTAAC^CCTTATAACCTGCCCCA 

GTAGTTTTATCTGAAGTCACATTTCAAATTr'lXSTAATTGAAG 
1 0 AAATCAACAGGTAATCCACCATCACAACTGTGGGCT 

GAAT CAT AGGCAGGTTT ACTCTGTCAAG CTG CCTGT ATTTTGAGGG TGT AGT C CTCAAACCAAAAAGACAC CAAATGAACAAACTC 

AGATGGCCTCACTG<KK3AACAGAGATTCAAAGCTGACACTGGA^ 

TGGGCACAGAATGTGGAGAGCTATATTTGCTAACTGAAGAAATTTAGACTTT^ 

GTTGCATAAAAGCTGCATTTTAGCAGCATATATTTTGGTAGAGCTC 
1 5 TACTTATCCCTTGCTGTTTGATCTAAATTTCTACAATGAGAATGTC 

GCCGAAATTCITGTGTAGCAAAAATTAACACTTAAGTTAGCCCT^ 

AAGTCACTCTCTATCCCTTTGAATAGACATTAGAAATAACATGTACTTTAAG 

TACTAGTGTGATGCCCTGTAAAAAAATAACTAACATTAGAGTTGAGGCCTAGAAATAG 

2 0 TATTAATGACTGCAACTTATAAAAATGTTACAGACTATATTCTTCCC^ 
CTTTTTAGTTTGGTAAGACAATTTGAATAAACTGCAATAATTGCAAAAG 
AAATATACATTTCTTGTACTATATAAACATTCTAGAAAA 
CTCTGAAAAGGGATTTCCTTTACAGTGCTGTGTACTAAAGCCT^ 

GG T AGCATCAGAAGTCX5 ATTTT CATT AG CCTTAT AC CATTCACT ATTTCTG C CAAGCAAT CTT AAATT ATAAAAGAAT CTTATTTG 

2 5 ATTTTGTGATTCTCTTtaTTTTCTGCTCATAAAGAAA^ 

TAATGTGTAAAAAGACATTTAAAGTATAGTTGTGTTAAGTTTTTGAAGTTTA^ 
CATATTITAGGAATATTTCATTTTAGTCACAACTAGGATATAAACATTA 

AAATTTGGG AAATTTTGATTTGTT AG ATTT AT AAATTTT ACAG TAACACAAAAGT CT AATTT CCTAT AT ATTTT CAAGG C C C CT AT 
ACCTTTGTCAAAATAAAGTATCAATGAAAAATGAAAAAATCATAAACT 

3 0 TAGATAGAAATCTGTTTTCCrCTTCTGTTACATCrCCT 

TAGGCTAGAACAGATTAGGAGATTGTGCTGTATGTTAAGTGGCAGATACCATGGAATTCn'AA 

GAGGCACAATGACCCTGACACAGCCCCTGGGTTGACCACAGCAGATATCTCACT^ 

GATTATCTCTAACTTGTCAGTAACTTACnTTGATAACCTAGATTTAGGAGTC 

CTGTTT ATGAAAGTCAT AACATTGTATGTTTAGCATAATGGTGAAG AG CCTGCCATCTGG AATG<3TCTACTTATTTGGG ATC CACA 

3 5 TACAGTAAGCTCTCACTTAACATCATCAGTAG<3TTCTTGGAAACTGTGA 

CACAGGCTAATTGATATAAACAAGAGTTAAGTTCCTGTGGCATATTTC^^ 

CAAAACACTTATAATATTAACCACTGAAATAAATGTGAGCTATATATATACATTTAAGAATAATAA 

CCCAATTTTTGGTGAACCAGTGAGTGATAGTGATCATAGTGATGGTGGATGAAAT^ 

TAAGAAGCACCCCCTGTCACCACATAGCTCAGAAATAATAATTAGGGCAGGCTTGCTGAGCATT 

4 0 CATG CATTTG AATG ATT ATCG CAGACTTT ATGAATTTTCATTTTAT ATT AATTTGTAGG CCAGG CACAGT^G CT C ACG TCTGT AAT 

CCCGGCACTT^GGGAGGCCAAGGCAGGCGGGTCACTGGAGCTCAGGAGTTCAACA 
CTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGTACACACCTGTAATC 
GAACCTGGGAGGTGGAGGTTGCAGTAAGCCGAGATTGTGCCCCTGCACTCCGGCCT^ 
CAATAATAATAATTTGTATTCATTCATTTTCCAATGTGTTCATTCCAGT^ 

4 5 CAGAGCAACTGACCCTATAGACAGGACGCCACCCCATTGTAGGGTGCACTCAAATGCAC^ 

ACATGC C AG TT ACCGT ATCAC ACACAG CTTCGGGATG TGGG AGGAAAG CG AAG TAT CTGG AGAAAAACT ACACAG ACATGGGAAG A 
ACGAGCCAACTCTAt^CAGACAGTGGCCCTGGACAGAGCTGG^ 
GGGCATGGAGTCTCACTCTGTCACCCAGGCTGGATTGCAGTGC^ 
CAAGAGTTTCTCCTGCCTCAGCCrCCCAAGTAGCTGGGATTACAGGCGCCC^ 

5 0 GAGACAAGGTTTCACCATGTTGGCCAAGCTGGTCTGGAACTCCTGACCT 

GATTACAGGCGTGAGCTACCGCGCCCAGTCAGCATCATTTTTTTT^ 

AAAG ACCTG C CG T ATGG CTATTTT CT AGTTGTGTGACTTT CTTTG GG AAAGTT AG C AAC C CT TTCTGAG CTT AAATG T C CT CATT C 

ATAAAATGGGGCTAGTAATAATGCATAAGGTTTTTGT 

AGTTGTTGTCTTTG CTATTATTTTGATGTXSGTGGTTGTTT^ 

5 5 ATTGGCTTTATAATATTAGAAAAGCAAACTTTTGTACGATATGGGTATGAAAAATTC 

ATTTGTCTTAGTCTTTTTAAAGCTTAGATTTTCCAAATGA^ 

ATGTCATAGTTAAATAACTTAAAGGTGCTACATCTTATACAGTCCAAAAGGAACATAATT^ 

AAATAGCTGACAGTGACTGATTTATAAAAGTAAAATATCTTTTGTTAAT^ 

ATTGAGTGAATGAGATTTGCAGTTCATTTATCrATGATGCTGGTTTATTTAATCTCT 

6 0 GATAGAGGTCATGATAAAATCTAGGCCCAGTGCCACAACTAAATCCCTGT^ 

GAATAACACCTTCCAAGAATATTATGAAGATTAAAAAGTTACGTATCATAAATACACACAG 

CTTGTAAGAAAGAGGAAGCATATGGCATATTCTGATGGTTAGGGATATGGACTCTGTAGCTGC^ 

TCCACTAATGGCTAGTTATATGAAATTGTGCAGATAATTTAACTTCTCTGAGTTTGCATTT^ 

T AAT AGTACCT AC CT (^CACAT AGTG TT AATTTCTATT AG TGG TTCTCATT AA.G AT AG T ATTGTTG TTCATC CCTGG TTGTT AG C C 

6 5 ATCATGTATCTG AGTTAGAG AGTCATTGATTTTAGAAAGTCCCGAGGAGACTATCAGGT CAAG CAACCTGCCTCCTG CTAGACAAT 

TAGCTTTATCCATGAGTTACCAAAGAGGGAGCCGAAACCCAGGGAAGCT^ 

AGAAAGATATCTC^AATCCCAGTAGTTGCCCATTTCCTAGTTCTGGGCTCTGCATTGCACTAGAATACT 

AAAAGG CAGT ATGACCATTGTG CTTGT CACTTTC CATTCC CT AGATGCTAT CTT ATATTTGT C CTT ATG AAATTT AAC CTGTG ACT 
TTCAGATCACTTAGAACCTTGGTTCGACAGTGTTTTCTAGTC 
70 CCCTAAAAGAGCTATACTCTGCK3TGCCA 

AGCTCGTAGCAAATCAGAAGACTTCTCTGACACCTCTCTATGTCTAA 
TGTATTTTAACTGGCCTTTTCCTTGTATGTCTCCTACAATGAGCTGTTGA^ 
GTTATAATTCCAATTACAAAGCACATTTCCT^ 
GATTCCTTCTTAAAACATTTTGGGGCATCTCCTTCTCCT 

7 5 ATTTCCAACTTCTCTTAAATTGATTCAGCAAATGTTT^ 
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GAATGCATTGAGTAAACrCCTCCCCTGGAGCT 
GTAGACCTTCTAAGCCTCAGTTTCCTT\ATCCCTTAAGTGGG^ 
AACCAAGATAATTCCTGTAGATGATTTGGCATAGTGCCT 
AAAAAATGAAGAGACTAAACGAAGTAACCAGAAGGAAAGAAATT^ 
5 GTGAAACACCTGCATGGCACCrr rn X JTT ATTCATACrGTTTTGACTC 
AGACTTGTCATTTTGAACATGGCATCAGTGGAACAGCT^ 

TCAGGACATGTGATAAAAAGACATTCA TrritS CCCCTCCTCCCTCTCTGTA 1 1"! 'rCTTOCTATAAAATTATTGATGTTAAGCCCA 

TAGTACTAATATTTCAGTTCAATTCATAATAAAATTTGAGGGCATTTGAATATAT^ 

CCACAGAGTATTTGAAGTGGGTCTTTTCmrCC 
1 0 GAACAGGTTTATATTAAATAACATCATTTCACTTTCAACTTTCT 

CATGTTCTGTT AGAACAGTTTTGG CAGTT AG AAG ACTTCTCTTCTTGTC CCAT 

GTATCCAGATGTCCTGAAAGGACCAGTGGTAGATGTATTTCTATTTTTGTC1 1 1 TL"rrrriTCTTTCTGGCATTCTAGTTGCTGAG 

TGACTGACTTTTGTTTTCAGCTCTTCTCACAATCACC^TTGTTCT 

CATGGGGCCATTTACCGTTAATTTTITAAAGTACTGAAATGAGAACCTCA 
1 5 TCCTACTAAGCCTTTTGCAGATGGAAATTTCCTTTAAATTG^ 

GAGCTGTGATTTGTTTAATATATTTCAGG C rrCTTAATAAATCAAGTCATC 

GCTTATCAAACCTCTGAAAGAAGAATTTTGTGTTTGCCCACAGACTGAAGAACTG 

ATTCATATTTAATTCCTGGTACTGAGGGTGGGAGGAGG<3AGAGGAGCAGAAAAGATACA^ 

GGTGATGAAATAATATGTACAACAAGCCCCCGTGACATGTGTTTACCTATTTAACGAACC 
2 0 TTTAAAAATATATATTTGGTAAATCAATTGATGTGTTTTAAAAAATATCGCCTT^ 

CCAGCACTTTGGGAGGCCAAGCCGGGCGGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCC^ 

CTTAAAAATACAAAAAATAGCTGGGCGTGGTGGCGCGCACCTGTAATCCCAGCTACT 

CCCAGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCCATTGGACTCCAGCCTGGGCGACAGAGCGAGACT 
AAAAAAAAAAAAAAATCATCTTTAAAGAGATAACTAACCCTTCCCCAGAAGGCAGGG 

2 5 GTATTCCCTATAAATTTTAGAGTCAGCCTGTCAATTTCTATACACACACAAAAAAAGCCTG 

GAAATTAAATCAATTTGGGTATAAGAGACTTCAATTTGG 
CTCCATTTAGTCAGATATTTAGTTTATTTCAACAATATTTTCAGATC^ 

AAGCTCTTGGACATCTTTTGTCCCATATCTATTGTGTATATGTGTTTTGCTAGTTATTAAATTATA 
CAATTGTTTGTCGCATATATAGAATGTTTTAAAAATATTGTGTCCTGTGACCATGCTAAATTAA 

3 0 CATTATCTTTCTCTTGAATTTTCATTGTCTTCCCCTTCTGGGACTCCATTC^ 

ATGAAG TT CTG TT AATTTTT CTT CATTCTT CTTTTTCTCTG TG TT CTT CAACTG AATG AATG CCATTAAT AATTTGGT ATGT AATG 
GCTGACTTAAACITCCTTTTGTTTTTAAGATATTTCTACT 

AG CCT AG AAGG AAGG AGG AGAAGG AT AGGGTGAAAGG AAGG GGAGAGCTTCTAG CTT CAGGACAGAG AT CAG AACAAACAACAGAG 
CAGTCATCTTGGATAAGGAAACTTCCCTCAAACCTATTACTTATATCCTCAGAAATA^ 

3 5 GATTTTGAAAAAGGGAACATTCAGAGAATAAAACTAAACTCTTGAAAGTTAAAAG 

GGATTGAAAGAT AAAAGT AAG AAAATATCC CAG AAAT AAG AGCAAAAAGACAG CAATG T AAAAT AGGGG AG AAG AT AAG AG AATT A 

GAGAACCAGCTTAGGAGTTCTAGAAAGAGAAAATGTAGACAACAAAAGGT 

CTATCTGTTTCTTTTTCTATTTTTTATTTTGA^ 

AAATATGCAGTCATGTAT^GCTTAACAACAGAGATAGGTTCTGAGAAATGCATC^ 

4 0 AGAGTGAACnTACACAAATCTCAATGGTATGTCCTACAGTACACCrrGGACCATATGGTATAGCTC 

ACAGCATCTTACTGTACTGAACACTGCAGGCACCTCTAATACATCGGTAAGTAT^ 
AAATACAATATAAAAGAGGAAAAAAATAGTACACCTGTATAGGTGCT^ 

GTCATTGAGTAGTGAGTGAATGTGAAGGCCTAGGACATTTATTATATGAAGTCTACrGTAGTGTAAACT 
ACTAAATTTATAGAAAAATTTTCTTCAATAATAAATTAACCTTA^ 

4 5 AACATTTTGACTCCTTTTTAGTAACACTTAGCTTAA^ 

AGCTTTTTTCCATTTTTAAGATGTTTTTA 

ACAAAGTCAGTGTCATCAGTGTTCAACCTTCAGATGTTATCCCACTG^ 
TTTCTGTGATAACAGTGCCTTTTTCTGATATACCTACTGAAAGACCTGG 

CAAG AAGT ACACT CT AAAAT AATGAAAAAAGT AT AATACAGT AAATACATAAAC <^C CAACATAGTCATTT ATT AT CATT AT CGAG 
50 TATTATGTACTGTACACAGTTGTATTTGCTGTACTTT^ 

CATAAG CATGGTGTTGT ATT ACAATGCACAG CTGCAG CT AAGT GAT AGGACTTT TT CAGCTCCATT AT AATTTT ATGGG AC CAT CA 

CTATAAATGCTGTCCATCATTGACrrGAAATTTATGTCGTGCATGACCATACATACAATTTAATGAAA 

GCAGTGTCTAATTACCAACCAGGGCAAGAAATAGAATATTC 

CCTATTTTGTTCCTCCTCCCCAGAGGTAACCACTGCCCTGACAAATGTGATCGTTGTTTTCTTGTTT^ 

5 5 CATCCTTAAACAATATAACTCAGTTTGTATATTTTGAATTCCATGTTAATAGA^ 

ATATTATATATGTCATTTTGCATCITGCTTTTTTCATTCAACAT^ 

CATTGCTGTATAGAATTATATCCTCAGAGATAAGATATATGGATGTTTATAAATCATTCCACT 

TAGTTTTTATTTAACCAAAAAAATGCTGCTGCCAACATTCTTACACATTTTACT 

ATTTCTTTTTGAATACATATCTATTGATt^AGTTGCTACATCATAGG 

6 0 TCTTCCAAAATGATTACATCCTTAAACTCAGGACACATCTTATTGTCAAATGT^ 

ATTTT ATTGT CGTTTTAATTTG CATTTCCCTGATTACT AATTAAG CTGAGTAACTTTTCATATGTTTATTGGCCATTTGG AGTTCC 
TGTATTGTAAAGTATAAGTTTTTIT'GTCCATTTTrCTAGTTTT 
CTCTGAGCACAAGTAACTTGGGATGCTTTCCTTTAGATTTAGCCTAATTCT^ 
GATATATATGTGTGTGTGCGC^CACATGTGCGTGTGTCTATATAT^^ 

6 5 CACTCTGTCACCCAGGCTGGAGTGCAGCGGCACAGTCTTGGCTC 

AG CAACCCGAGT AG CT AGGATTACAGGT ATG CC ACC AT ACCCGCTAATTTTTGTAT^ 
AGGCTGGACTTGAACTCCTCACrrGAACTCCTCACGTCAAGTGATCTGCCTC 
GAGCTACCGCGCCTGGCCTC^ATATTTTTTAAAAATATTTT^ 
CTGACCTACCATTTCTATAAAAAGAAACCTGTAAATGTTCTTAAACAG 

7 0 TACCCCCAGTTTTTGAGCCTTTCAGAATTTTTTTTC^ 

TTAGGAAGCCAACAATCCTTGTCCATCTGCTTTCT 
ATGCTTTTAAAAACATATACrcGGTTTGAGAGXaGAGCTGAAATAAAAG 
TGACTGGTCATTTTATTTTCAAGCTCACATACACTTCAAA 
CTGAACATCACAGATTATAT(IATCATACTTGAAATATTTGAAATTTT 
7 5 CT AAG TGTTT AAG T ACTT AACATC CAAAACATTCTTACTAATCAAG AG AAGACAAACAC CCCAACAG AG A CAAATTTT AT 
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CAATAGCCAGTTCACC^GATTTGTTTTCTGTTAGA 

CAGGT AAGGAG AGG CAGTTTAAGGATC CATGTGATAAACCCT AAAGTTGTCCAT CGG CTTTC CAG TC CCTTCT AGGAATTT AACTT 
AGGGAAATAATCAGACATTTGCAAAGGTGTGTAGAGTGGTATTTATAATAGTGA^ 
GAAGTTACAGCCAAATACTTTACAACTACTAAAGAATCATCTAAAATATCTATTGA 
5 OUSATGAATAGTACC^CACATAAAAAGCAAGGTGCAAATTAGCCATTTATATTGTT 
AAAAGATGCAGGCTATATATGGAAGTGTTTGCTGGTTTTCTGTCAAAAGAATGG^ 
TTTTCTAAATTGTCTAAATAGTTATAGTTTTTATAATGTAAAAGTATCTTCC^ 
CTATCTAGCTATGTAATTCTAAATGAATTTACAGCAGTAATCTT 

G ATT AGATGT AAAAACATGAGTTGGGCTTT ATGGTTACAG AGAGTTTTCCT TG TGGGGAT CAT AGCTGT ATTGAG TTT ATTCA 
1 0 GTTTTCCTTTCCCACATGAATGAAAAATGGGGCCAGCCTACAACTC 

ATTTCTTGATGCTAGTAGGGAGAGAATCAAATTGCCTCCTATTCAAACCAAGACCCACAAA 

CTCCCTGCAGTGTCAAGAAGGTGTGAACCCCTCATGTTCTCTATTG 

TTTTGGCTTTATGCTACATTTCAGAAATCATAATAACCTTTTCTGG 

TTTCAAGTGGATGCTTCTTATAAACTATTTATACCCTTTTGCT 
1 5 TAACTAAGAAAACTTACTGCTTCCAATCTAATTTAAAAGATCTCCCT 

TTTTCACAGCAGAGACAAGTGAACATTTATTTTTATGCC"^ 

ACTCTCCAGATTCAATTATGTCCTTGGGCITGGTCGACT 

TTACCAACATTAAACCCTAGGATACATGCAACAAAGCAGGACTCCTTCCT 

AATGTAGAAAACGCTGGAATTTTTCCTTGGAACTAGACTGTGATGAGAGGTGCT^ 
2 0 TTTTTGAGACAGAGTTTCt3CTTGTTCCCCAGGCTGGAGTGCAATGGCGTGATCT 

AGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGCACGTGCCACCATGCCCGGCT 

GATGGCATTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCTGCCTGC 

TT ACAGGCATGAGC CACCACG ACCGGCCAGCTACTGTC^ CAGG AAATA 

ATCTTCTCTGAAGATACTTGATAAAAATTCCCAAAACAACAAAACGCATGCTr 

2 5 TCACCT AAGAGTATGACAACAG CAAT AAAAAGT AATTT CAAAAAGTT AAG ATTT CTT CAG CAAAAT AGATGATTCACAT CTT CAAG 

TCCTTTTTGAAATCAGTTATTAATATTATTCTTTCCCCATTT^ 
TTGTTTGTTTGT T TT TT GAGATGGAGTCTCGCTCTGTC^ 
CTCCTGGGTTCAAGCGATTTTCCTGCCITAGCCT 
ATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGATGGTCT 

3 0 AGTGCTGGGATTACAGGCGTGAGCGACCGCGCCCGGCCAGCAATACAGCT 

GGCTATGCATAGCCCCATGTCCTAATCAGGCA.TTCACTGATCCCAGCAGGTCTCCATCT 
AATCTCAAGGTTACTCITAAATACTAGTAAATGCAAAAAGAACT 

GCAGTTTTCAGCCCTG CCTCACCTGGGTCTAGTTCAGCTG ACGGATC CCAGGTGTGCCC 
ATCTCCTTGAGGAAGCCCACTCTATTTTTGGTAGCATGATGGGCC^CTGAGAGGTGGA 

3 5 GAAATGCTTCCCTGGGAAGGCAATTTCATGAATGAGGTCITCCAAGCAAATGAAG 

TGTTGTCTGTCAGAGGGATGGTCTTATTCTTGACCTTGGCTTGTCCACGTTTCAAAATGAGTT 
AATCCCCAGGTCACATAAGGTTCCACTATATGCAGCATTTTTAGATTCT 

CTTTAGG CG AAGT CTTG CAGTGGTTCT CTG CACCCG T AAACT CACGC CAT CAAT C CTTT CG ATG CGT ACAACAAAGG CCAAGG AAT 
GTTrATCTGGCAATTCCAAGGCATGAGGTTTCACTTCTAGTCGTCTGAG 

4 0 AGGAATGATTCCAGTCGCTTAAACCTGAGCCCTTTTCCTTTCTTCTGTCT 

TTCITTTTTTAATTATAATTTTTATTTTA^ 

TGGTTTGCTGTACCTATCAACTCATCAGGTATTAAGCCCGGCATGCGTTAGCTATTT^ 
ACGGGCCCCAGTTACACTCTTAATCCTTATAGCTCAGATGTTATGATCC^CAGTGTGGTTCT^ 

AAAAAAAAACACT CAAAGTG CCCG AACTTTCTT AAAAT AAT C CTGGT ACAG CT AAACTCATG C ACTG ACTGTCCACCT AAT ATTT A 
4 5 ACAGTCTGTGTTGTGATATATTGTTTTAATGTTCTGAATGCTTCT 

ACACATTTAGTCTAAGGTTGTCAGCTATTTATGCTACAAATTAATGA 
GAGGTGGTTAGATGTTTTGTGTTCTAATTTTAAACTATGGATA 
CTGCCCACTCCCCAAAAAACCCTGGAATTCTTTTTTGTT AAACT 

GG CAAATTTTGTG C CACAGGGGTTTG CTGT ACAG ATT ATTT CAT CAC C C71GG AAATAAACACAGTACTTGATGG AT AGGTTTTT AG 
50 TCTTCATTCTCITCCC^CCCTC^GTAGGCCCCAGTGTCTG 

GTTATAACTGAGAAGAACATGTGGTATTTGGTTTTCTATTCCTGTGTTAG 
TGCCGCAAAGGACATGATTTCATT(nTTTTATCX3CT 

TGTTGATGGGCTTTTAGGTTGATTCTATGTCTTTGCTATTGTGAGTAGTACT 
AGAATCATTTATATTCCTCTGGGTATATACCCAGTGATGGGATTGCTGGGTCGAATGGTA^ 

GGATCTATTATTTTCTGACTTTTTAATAATAGCTGTTCTGACTGGTGTGAGATG 
CTTTCCCCATTACTTGTTTTTGTTGACTTTGTTGAAG 

CATTGGTCTATGTGTCTGTTTTGTACCAATACCATGCTGTTTTGGTTACTTTAGCCTTGTAGTAGTTTGAA^ 
GCCTCCAGCTTTGTTCTTTTGGCTTAGGATTGCTTTGGCTATTTGTC 
6 0 TCTAATTCTGTGATGAATGTCATTGGTATTTTGAGAGCAATAGCACTGAACCCGCTAAT^ 
AATATCGATTCTTTCTATCCCCTGCAATTCTITGTTGTTGTATTTA^ 
TGAAAGTGACAACTCTAAAAATTATGTTGGTAATTAAAATTTTAAGTAATGACT 

TTGGAGCTGTT AACAG CAGTGTCCAATCTGCAGTGGT ACTCAGCAGTTTCTGTTTCCTG CATGCAGAACTGCTTAT ATGAAAACAC 
AGTTTTAAAAATGCTTTCCTATGGCTGACATTCACATTCTTATTCCTT^ 

6 5 ATTTITGCAATACTTTTATGAAGATACAAACTCTGACAAAGCTTTT 

AATCTGACAATTATTTTAGAAAAAAGGAAAATATTATTTACTATTATT 

TGCACATCTCC CACTACAACAGTAGCTACCATTT ATTCTTTCTCAAAAAAAGTG CTAAGTGTGCCCTTGAAA 11 1 1 1 ACATTGTG C 

AGAATATCCCTAAAATTTTAAAACAAAAArrACATOVTCACTTGCTT^ 

GTTAGCAAATAGCATTTTAGAAGAGACACGTTACrnTTCTAATGAATGTTCTA 

7 0 TGTGAAAAACAAAACTTATATTCTATTGTTAAATTTTCAAAAGTGAAA 

TGAATTGACAAATGTATATCTAGAa^TATGTGAAAA^ 

TGTTTTGACTGATATTTAATTATATAATTTACCAAGAGGATCTGACCAGAATGTA^ 
ATTTAAAAAAAATGTAGTC^GCCCTTTTATCTGTATCTGGAGAATGCAGGGTAAAGGA^ 
AAGGTGTT AATTT CTTACCTATGATACCTGTTACTTTGGGTATCAT^ 
7 5 TCTAATTATTATAATATTGTTCTAATTTAATCTATCITAAT 
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AAAAAATAAAGCAGCAGTAGACTTGGAATCTTC 

C CAATTCTCTAACATTTCTGAGTC CT AGTTTCTGCATCTGT CAAATGGGATT AGAGAT ACCT ACTTTCAGGATGTG AT ATGGTi" 1X3 
GCTCTGTGTCCCCACCCAAATCTTATCTTGAATTGTAATCCC 
AAGTGATTTCCTCCATGCTGTTCTCGTGATAGTGTGGGAGATCGCAAAACATCTC 
5 TTCT CTCTCTCCTGCCACCATGT AAGACTTTCCTTGCTTCCT CCT CCC CAG 

CTATGCAGAACTATG AG T AAATT AAACCTC CCTT ATAAATT AC C CAGTCT CAG AT ATTCTITT ATAGTAGTGTAAAAACTG ACT AAT 
ACAGAGAATTGGTACTGGCAGGGTTXKMTACTGCTATAAAGATA^ 
GTGGTTAGAACAGTrTGGAGGGCTCAGAAGAAAACTGGAAGATATA^ 
TTGACCAAAATGCTGATAGTGACXSTGGACAATGAAGTCCAGGCTGAA^ 
1 0 AAGG TT ATTTTTGCT ATG CTTT AG CAAAAAGACTGGCAGCATTTT AC CC CTGC CCT AGAG A^ 

GX3TATTTGGCAGAAGAAAATTTCTAAGCAGCAAAGCATCCTAGTC 

ACG AAGAT ATGGTCTGAAATTGGAACTT AGGTTT AGAAGTG AAG C AGAAC^ CTGAC CATGTAG 

TAGAAAAGAAAACCCC^TTTTCTCGGGAGGAATTCAAGCCAGC^ 

AGTAAAAAGTAATAAGTAATAAGTAACAAGGAGCCAAATGTTAATAACCAAGACAATGGAGAAAATGTCT 
1 5 ATCTTCGGGGCAGCCCCTCCCATCACAGGCCTGAGAACTAGGAGGGA 
CCTCACGACATGGTGCCCTGCATCCCTCATGCTCCAGCTCCA^ 

AGAGGGTG CAAGCCC CAAGCTTTGGCAG CTTTCACGTGG TG TTGGGCCTGCAGGTG CG CAGAAG ACAAG AGTTG AGG TTTGGGAAC 
CTGTGCCTATATTTAAGAGGATCTATAGAAACGCCTGGATGTC 

TCTG CT AGGGCAATG CGGAAGGGAAAT ATGGGGTTGG AT CCCTCATACAG AGT CC CCACTGGGG CACT ACCTAG TGGAG CTG TGAG 
2 0 AAGAGGGCCTCTGTCCTCCAGGCCCCAGAAAGGTAGATTC^ 

CTGTGAAAGCCACAGG^TACCCTGCTGAGCCAC^GGGGCGGAGCTGCCCAAGGGTATGAAA^ 
CTGAATGTGAGACATGGAGTCAAAGGAGATTTTGGAGCTTTTAGATTTAAGGGCT 
GTGGCCCTTCGTTTGACCAGTTTCTCCCATTTGGAAC^ 
AACTTGCTTTTGATTTTATAGGCTCATACGTC^AAGGGAC^ 

2 5 GCTGTAATAAGACTTTGGGGGACTGTTGTGAAGGCATAATTGGTTTTA 

CAAAATAATATGGTTTGGCTCTGTGTCCCCACCCAAATCTAATCTTGA^ 

AGGCAGTTGGATCATGGGGGGGTTTTTTCC^TGCTGTTC^ 

TTCCCTTGTGCrCTTTCTCTCrCCTGCTGCCAGGTGAGACGTGTCTTG 

AGGCCTCCCCAG CCATG CAGAACTGTGAGTCAATTAAACCTCCTTTCCGT ATAAATT ACCCAGTCT CAG ATAGTATCTTTATAGC A 

3 0 GTGTCAGAATGGACTAATACAGGATAGTAATGAAGATTACAGAATATGTAGATGAAGAAGTGCT 

TGTAGTQVAATTGAATGTATACATTGTGGTACTTCAGTGTCCTTTAAATTGAATAACTAGAAATTT 
CACATCAGATGACATGTTAATTTATGCCTATACTTTTTTCTA 
TGGTAGTGTGGCAAGACATCTCATTTCTAGTTAAGGCTGTATAATATTAAGTTC^ 
GCTTTTCIATGTCATCATGTATGCCCAATTTGATACTTTAGTG^ 

3 5 CAGGGTTGAATACATTAGTCAGACATATAACAATGTACTTCAGAGTTCCTCTAAGGGCAAAAATCG 

CTCCTATTTATACTTTTGTATTTTTGAAATC 

TTGGAATGCCAATATGTTCATTTAAGACITAATATAGTAAGTATCTAAAGAAAA 

TAATTATAATTTGTGCTCCATTGTGATATGAAGGATAGAAGGGGCAGGATAGATAGAAAACAGAAAT 

TACCTTAAGACTGTCTGTTAAGTGACCCACATAATCTTAAAAAACTCTGTCAAGCT^ 

4 0 GGCAACAGTCACAAGGTTATGAGGTGCATAGATTTTGGAATTAGGCAGA^ 

TTTGGG CAAAT AAAAGGC CCAATTTTTGTATT CTT AT CTGT AAAATG G ACT CAGT AAAAATT ATTTGAG ATAATTT ATTTGTGT AC 

TGTACCTAGGCATGCAGCTTGACACAC^GAATTACAA^ 

ACCTACTGAAAACTTTTTTCAGTCTTAAATTTTTTATCT 

AAAAACT ATTTCTTTCCTCCTTG AT CTT CT AGGGGT AAGGTT ACC AATGTTTTCATT ATTT ACT AAAT AT AG CAG CCC CCACCCCT 

4 5 TATTCATGGAGGATAGGTTCCAAAACCCCTAGTGTATGCTTG 

TTTTCCrATACATACATACCrATGGTTAATGTTTAACCTACTAATTAGGAAGAGTAA 
AACAATTGTAACAATATTCCAGCAT(^CTATTCTTGTGCTTTAC^ 

TGTGATACTGTGGCAGTCCAACTGGTAACAGAGATAGTGATGCGGTTTGGCTGTGTCCTC^ 
TCCCAGAATTCCTATGTGTTGTGGGAGGGACCCAGGGGGAGCTAATTGAA 

5 0 T AGT T AATAAGTCT CAC ATGATCTG ATGGGTTTATC AGGGGTTT C CC CTTTTG CCT CTTCCT CATTTTTCTTTTGCCACCAC CATG 

T AAG AAGT AC CTTTTG CCTC C CGC CATG ATTCTGAGG CCT C CC CAGCCCTGTGG AACT CT AAGT CCAATT AAAC CT C "ri"G TT C 
CCAGTTTTGGGTGTGTCTTTATC!ACAAGCATGAAAATGGACTAATACAGTAAA 
TACCOUU^TGTGGAAGCGACTTTGGAACTTTGGAGGACTCAGAAG 
ACATGTTGAATGGCTTTGACCAACATGCTrGATAGTGATATGAACA^ 

5 5 CTTrTTGGGAACTGGAGCAAAGGTTACTATGTTATG 

AACTTIt^CTTGAGAGAGATGATTTAGGGTATCTGGTGGAAGAAATT^ 
GTTAAAAGCATTCTGTTTTAAAAGGGAAACAGCATAAAACT^ 
TTTTTTGAGAAGAAATTAAAGCTGGCTGCAGATATT^^ 
TGTCTCCATGGCCATGTCAGAGACCTTCACAGCAGCCCTTCCCATCACAC^ 

6 0 CCAGGCCCACGGTCCTCATGCTATGTGTAGGCTAGGGACTTTGTGCCCT^ 

ATATAC^GCTCAC^CTGTGACITCAGAGGGTGGAGGCCCCAAGCCTTC^CAGCTTC 
AGTCAAGAATTGAGGTTTGGGAACCTCrGCCTAGATTTTAGAAGACGTATGGAAACACCT 
AGGGCAGGGCTGTCATGGAGAACCTTTGCTAGGGCAGTCCAGAAGG^ 
GGCACTGCCCAGTGGAGCTGTGGGAAGAGAGCCGTCATCCTCCAGACCCCAGAATGGTAGA 

6 5 CTGGAAAAGCCACAGACACTCAATGCCAGCCTGTGAAAGCAGCCXK3GAGG 

CCATGGGAATCCAT CTTTTG CATCAGCATGACCTGGATATG AG 

TAACTCACTGGATTTCAGACTTGCATGGGCCCCGTAACCCCTTTGTTTTGGCCAATTTCT 

TGTGACACCCCCCTACCCCCTGCCCCCCATCCCrCCGGCCCTTGTATCTGGAAGTAACTAGCTTG 

AGGCAGAAGAGACTTACTAGCCTTGTCTCAGATGAGACTTTGGACTGTGGACT 

7 0 GGGGACTATTGCXIAAC^CATGATTGGTTTTGAAATGTGAG^ 

GTGTCCCCACCCTAATCTCAACTTGAATTGTATGTCCCAGAATTCCCATGTGTTGTGGG 
TTGAATCATGGGGGCTG^TCrrTTCCrGTGCTATTCTC^ 
CAAAACTTTTGCCTCTTCCTCATTTTTCTCTTGCCACCACCAT 
C CAG C CATG TGGAACTGTAAGTCCAATTAAACCTCTTTITCTTCCCAGT^ 
75 ATACAGATGGCTAGTAAGGGACTAACCGGCAGGGAGCGTCTCCAGTGTGGATATGCTC^ACAAAGGG 



1874 



WO 03/008583 PCT/USO 1/5 1291 



ATAAGATTTCATTACTCAGAATTGCACAGAATTTAAAACTT^ 
TTGACTGCAGGTACCTGAAACTGTCAAAAGTGAAACCACAGATAAGTGGGGA^ 
CAGTGGATATGTAGGGACCTGAGTGTGAAGTGAGAGCAGCAGCATCAAAACCTGAGGGA 
AGTATACTGGCATGACAGAGAAACCAAAAAGTTCTCAAGTTAATG 
5 GGAAGGAAAGGGGTAATGTGGCTTTAAATCAGGTTGAGATTGGTTCTGAGG^ 
GACACAACTGTTCTGCATTTCCATTTGTTTTTATAAATGT 
TTAGTTTGAGGAACATTTGAAAATTTGGTCAAATGACACAGATCGTCAC^ 
GTCTGGCAATCCCTATTTGGGCAAAAATCTTCATTTGCAGGTC^ 

CATTATGTCATCAAG ACATCAAACTTCAGCCT ACAAAGT AGAAAG C 
1 0 TCAGTTCTGACACTTTATCATAGTGGAAAATGAAGAAGATTGCTTAAGA^ 
CCTGACTTTACTAGTTACTTAGATCACTTTAGGCAACTCAACT^^ 
CCCACCTCTTAGGTCTGTGGAGAGGATTAAATGAGATAATCTATACAAAG 

TT ATACCTGT ATTCTGATT AT AAT CTCATAAAT ATTT ACCATGTT AG CTGT CT CAGAG TTCTTTTGCAAAACAG ATAAAG AT AG AA 

AGTATAAATAAGAAAAATAAGTGAACATATACTGAACTTTGTACAAGATG 
1 5 CTAAAAG AG ATT ATTGATAG AAACAGGATACAT ATACATCAAAAGGT AACAT AGG ATCATCT CAAAGTG CT ATATGGCAGTG T 

TTT AGGAAGTCT AGAAG CTGT CATGGAT CAGGAAT ACCATGGTGGACACTTCAGG CAGGGAAAACAGATCTT AGCAAAAG CT ACT C 

CTATCAT AGGT ACTTG AT AAAT ATTTGT AGAATCCAGGATCC CTGT AGTG AT AAAGAAACT 

AGACATATGACTGGAAAAATAAAAAGACCAAATTATGGACCATACTGAGCTTGTACT 

TAATCATGTTGAGAAAAGAGTTTTAGC^^TGTGTGGGGGATAGAATGGA 
2 0 CCATTGCAGTGGTTAAAGCAAGAGGTGAGAGAAGGCATGCATTAGAATGGCAGCC^TCAGAGT 

CATAGTGTTACAGGGAGTAATAAATAGGATGTGGAAGATGGGTTAGAATTGGCAAAATC^ 

ATAGTGAGAGAAATTCAAATCTCTCTTTAAGAATCGAATAAAATATTTAGAAATAAG 

GGCATTTCAAAGATGCTCGAGATACCTTGTTGGAAAAAGTCAATAACTGCACTAT^ 

ACATCATGTTCCTAATTCTGAATTATGAACCATCTATTATCCTTGTATGCT 

2 5 CCCV3TCTTCATTCTAGAAATAACTATGCGATCAAAAAGTTTTTAGTCTTT 

ATTCAATGTACTATTATTGCTTCTGCTTAAAACTCGCATCCCCT 
GAAGGGAGTGCTTGGGGTCTCAAGACTTATTCTGTTTTTCTCCATC^ 
TAGGAAGTAAGTTACCCAAACACAGTGACCAAACTGGATTAATTCTTCCAATGAG 
AGACTTTAAGTAGCATAGATAACATGATTATATTCTCTCTACAAATAAATACACAGGACCT 

3 0 TTTTCCTCTCCACTTTTCCATCCCCAAACCCATC^ 

TACAGACAGGGAAACCAAGCCCAGAGACACTGAGTAGTAGGCCACT 

CTTGAT CTATTAATGT ATAGG TTTTG TTGCTGTAACCCT CTCCC CAAG AGGAG TG AAT AT AAATG ATGCAGAGTTTGG ATGAACT A 
TCTT AAT AAG AACCT AAAGTTG AAACCAATGCAAACCT CT CT CAAT AAATGCAAAGCAAAG AGAAT AAT CAGT CTTTCTTTGG CT T 
GTTAAATAAGATAAAATGTGTTCTGCTAAAACCATTTAACAGA^ 

3 5 TGAAAACTATTCCATAG CTT ATTATCAAACAAATCAGTAATCCTTTAGCTAATC C 

ACCTGGTGTCTGCAGCTGAGTATTTACATTTTTCCTAATGAACA 

CTTTTAGTGCTAGCGCTAGAGGGAAAAAGAAATTACTGGCTC^ 

TGGCTTCAGTTCTGTTTTGAATCTCTTTTTGGACTTGTCTTGAATGGACTGTT^ 

TT AAGCAT AG TTT AGGTTTT AAAATG TTTCTGGT CC C M lTlT"rr'rrrrr CTTTTCCA CTTT ATGTTG CTT AAAG CTTT ATGG C CAGG 

4 0 TTTTCT CATC CTCAGCATTATTGACATTTG AAG CTGGATACTTCTTTC 

GCATCCCTCGCCTCTTCTC^CTTAGATGCCAATAGCATTTCC 
TCCTAGAGAGCAAAATTGCTCTCTGTTGAGAACTACn'GTGTTACGGTGTTTGGA 

TGGT AGTTGTAAAAAATT AAT CCAGTTATAGCAG CTGTATTTCTGGAATTTTTTT CCAT ATTAACACTTG CTTTCTG AGGTGATAA 
TATCTTTGTTTTTTTTCTCCCAAATAGATTTCTTGCATTACACT 

4 5 ATCATGTCATTTGGGT AATAGTTTACCAACTCTG CCCCTTTCTCTGTCAGGGAAG CCrCrAATTTAGTAAGCG ATACTGTATCCTT 

TTGTCAGGTACATTACCATTCCTATTAGCAATAGGGCA^ 

ATTAGGTTATGAATTGTAGCCTATCTC^TTTAGAATCTTTACCTTACTAGTCTCCATAACA^ 

GGCCCTGGG AGT CTC C CCTT AAAC^GCAGACT ATTTT CACAG T AACACGT ACTTTATTTGCCATTT CATT ATG TCAG CATTTG CAA 
TAATGGTACAAAAGCAAAGATGAGTAAAACTGTTGGCATCriTAGTATACAGTAGT^ 

5 0 TTGAAGAAGCAAAAAAATTATTAATTACATTAAATT 

AAGGTGCATAATCCACTTATATCGCATATAGCATITGATAGTTGTCTaVAAGAAAAGTG 
CTTTTTTTCATGGAGTACCATGAGAGATAAACTCTGGTTTTC^ 

AACTTAGCACTTTAAGGAAAACAACTTAAAGTATTTGTTGCCAATTGATAAAATATAGGTTTCAA 
GACTTGTATCTGCCACTGTGAGCTTGACAAATGTGACTCTTTTATATTACATAATGAACTATC 

5 5 ACTCAGTGAACCAGTATTTTCCAGATGACTAATGCATGATAATACAAAATCAT^ 

AGACTGACIATATTTCAATGTAACAATCAAAAGTTCATTGATAACAGTTTTGGA 
GAAATTGTCCAATTTTGGTGTAGTAATCAGAAAAGGCAATCTATAATTACCTG 

CAGGCTCATGGGGAAG ACTGTAG CACTTCTCTTTCCCT AAGATCCTCCAG AAAGG AAGAAGGTAAT CCTC 
CCTATTGTGTGATGATCACCAAGTATGTAACAATGCTTTAT^ 

6 0 TAATAAGGGAATGGACTCAAAGAAGTTAAGTCAGCTAGCCACTGTCACAGCTATTAGAGCAC^ 

TTGTCTGT ATG T AAAG CTGATTCTCTTCGT AAT AGT ACTGAGACAC AAG AGGCGGCTACAAAATAT^ 
CAGAGTTTCAAC^TTCGTTATCATTTGTAGCATGATACTGGATCCT 

CCTGAATGAATGGGTGTAATTTACCTGCACATTTTACATGCTTCTCTAGGTGTGTGAATA^ 

CCATAATCCTCCITGT AGCAATTGCTTTGCTTGCAACAAAACTAAGT CTAG CTTTATGCATGGTTTTCTCTCTCTG AA 

6 5 CTCTAACATAAACTCAGCCTCAC^AATTATTCC^TTTCTACTACATTTG 

CCTGGAACAAC^CATTTTGTTCCAAGGGTTCCTCACTTA 

TCAGAATGTTTGGCACAGCTGTGAGTGAACACACATTGAGCAATCAATAACTATTACAGATAATGATG 

TTTAGCTTTCCCATTCAAAGGGGGTGAAATATGCACTCTTACTATGGT^ 

GATGTCAATTCCATATC^TTTTCTCTTGAGTTCTAACCATTT 

7 0 TACCATTTAGCATTCATCCATTTTTTCCAATGGTGTTCATTATAC^CT 

TTTCACCGATGTCTATGGTACATCTGGCAGCTTTCCATGTACTCAGTTCT^ 
GCCAAAAGCCTGATTGAGGTTCCAAATTTTCAGCrACTGTACT 
CTTACAGTTGAGTGGCCTGAACATGTTTTGCATACATTGTAATATCTAAGAA 
CTACCTTCCATTTATATAATACTTACTCATAAAATCTTCAGTGTTCCTG 
7 5 AGCAGGAACTTGTAGTAGATTTTCTATGTGTTACCTTATTTTCA 
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GAAACTCAGGGGTCAAAGTAGTAGATACCTACCCAAGGTAACAGAAGCT 
TCACCGTAGATAGGCTCCCTGATGAACCACCTGCCACGGCCCGTATGACCGC^ 
TTGAGAGCTAAAACTACGAAGTACTACAAACTATTATTTAAAATATAAATACATACT 
ATGGGGTAAACATTACAGAATACTGTCCTAACCTTTAAACAATGCACTCGTT^ 
5 CCTAAAAATAGATGTCAGGTGACAGAGACTGGCTGAGCAAGAATAGGA^ 
CCAACACATTGTCGCACC^TTCACTGTTCCAGGACCTTCCTACT^ 

CACAAGTTTCCATT AAT CAGGGGTGGCCTGTGCCGGAC CTACAGCAGAGTCATTTCAGG TT ATTCTGTT ACAGG CTTT CGACGTGT 

AGTCAGTCCACTCGCCCIAAATCTAGCAGGGAATGAATGCCTTGTAATA 

AACAATGTGAAACCCTGGGGCCTTTTCCCAGAATTAGGGTGGTGGGAATGCTGTCCT 
1 0 GTTGGCAAGATTCAGGAAGCTTCATTTGAAGATAGAATTTA 

TATAAAAGAAATTGTCATTCCATTATTATTACCTTTTAACTTTrATTCCT 

AATGT AATTTCT AAC CAG ATTG AT AACT AGAAAAAAATTTTAAGTT ACTTTGCT 

GACTTACTACTGTTATTGGCTGAAATAAATAAAAGCAAGAGATAATAAA 

TTCTAAGTTCCATCTTTTTTAGAGAAAAGGCAAATTAAGAAAAGTTTAG 
1 5 ATAAGCAAAACGGACTTTAATATGTAGAATTCCAGAATCAACAGGTTGCCAGCATCC^ 

ACCAAAAATGGAATGGGCAGTCTCTAATTACAAGCAGAAGGCTACAAAATCATT^ 

CAGCACGTAAGAGGAAAATTCCTTAGGAAAATAO^CATTGAAAACCATTG 

CTCCAGTAATAGAAAGATCACITGTTTCATTGGTTTATAAAAATA 

AT ATTGG TGAAAAGT AT AT ATCCATTGTACAATTCTTTC CAATTTTTTTTG A^ 
2 0 AAACAGCTAATAATCACTAGACCTGCACTCrrTGTGGTGAGACTATGAAAAATGTTAG 

TTCTGTCTTCTTCTTCAAGCCAAACAGTC^TAGAGTGGAGTGGGCAGAATGGAACT 

CTTACTGCAAGCCAGACIAGGAAGGTTATAGAAAATGTTTCTGGATC^ 

ATGACATTAGGAATTAGAGACATGGGACAAAAACTTTAAGATTGTAAAAAAATTT^ 

AATGACACTTGATTGAATTTTAAAAGATGCCTGTATAAGATCTTAA 

2 5 GGAGGCATCTTTITGGGATGATGGAAATATCCTCTCTCCTGATTGTGATAGTAGTTACATGAATATTCA 

ATTATAGACTTAGAAAAOVGTAAATGTTACTGTATGTGACACCTTAATAAACGTGATTATAAAAATAAATCC^ 
AAAAAAAAAAAAGAAGAAGAAGTGAACCAGAACCACACCATTCTATTTTGGAGACACT^ 

TGTTT AAAGAAGAAT ATAACATGATTTGAAT AT ATTT AG CT AGG ATATTTT AG TG CCTG CT AG CACTTG AAG C CAG AGTTCACTGT 
GAGCATTCTGA 

30 

HUMAN SEQUENCE - mRNA 

TTTTTAGAAAAAAAAAATATATTTCCCTCCTGCTCCTTCTGCGT^ 

GACGGTGGCGGGCGAGCGGCTCCTCTGCCAGAGTTGATATTCACTGATGGACrCCAAA^ 

ACCCCAGCAGTGTGCITGCTC^GGAGAGGGGAGATGTGATGGACITCrATAAAA 

3 5 GCGTCTT CACC CT CACTGG CTGT CGCTT CTCAAT CAG ACTC C AAG CAG CG AAG ACTTTTGGTTG ATTTTCCAAAAG G CTCAGT AAG 

CAATGCGCAGCAGCCAGATCTGTCCAAAGCAGTTTCACTCTCAAT^ 
ATGACCTGGGATTCCCACAGCAGGGCCAAATCAGCCTTTCCTCG 
CTCAATAGX3TCGACCAGTGTTCCAGAGAACCCCAAGAGCT 
AAAAACTCACTCTGATGTATtnTCAGAACAGCAACATTTGAAGGG 

4 0 CAGACCAAAGCACCTTTGACZATTTTGCAGGATTTGGAGTTTTCTTCTGGG 

TCAGACCTGTTGATAGATGAAAACTGTTTGCTTTCTCCTCTGGCGGGAGAAGAC 

GGACTGCAAGCCTCTCATTTTACCGGACACTAAACCCAAAATTAAC^ 

CACTGCCCCAAGTGAAAACAGAAAAAGAAGATTTCATCGA^ 

TACTCTCAGGCAAGCTTTCCTGGAGCAAATATAATTGGTAATAAAATGTCTGCCAT^ 
45 ACAGATGTACCACTATGACATGAATACAGCATCCCTTTCTC^CAGCAGGATCAGAAGCCTATTT 
CCGTTGGTTCCGAAAATTGGAATAGGTGCCAAGGATCTGGAGATGACAACTTGACTTCT 
ACAGTTTTTTCrAATGGCTATTCAAGCCCCAGCATGAGACCAGATGTAAGCTCTCCT 
ACCACCTCCCAAACTCTGCCTGGTGTGCTCTGATGAAGCTTCAC^ATGTCATTATGGAG 

TCTTCAAAAG AG CAG TGG AAGG ACAG CA CAATT ACCT ATGTG CTGGAAGGAATGATTG CATCATCGAT AAAATTCGAAGAAAAAAC 

5 0 TGCCCAGCATGCCGCTATCGAAAATCTCTTCAGGCTGGAATGAACC 

GCAGGCCACTACAGGAGTCTCACAAGAAACCTCTGAAAATCCTGGTAACAAAA 

CT ACC CTGG TG T CACTGTT GG AGGTT ATTGAAC CTG AAGTGTT AT ATG CAGG AT ATG AT AGCTCTGTTCC AG ACTCAACTTGG AGG 
ATCATGACTACG CT CAACATGTTAGG AGGGCGG C AAG TG ATTGCAGC AGTG AAATGGG CAAAGGCAAT AC CAGGTTTCAGGAACTT 
ACACCTGGATGACCAAATGACCCTACTGCAGTACTCCTGGATGTTTCITATGGCATT^ 
55 CAAGTGCAAAC C TG CTGTG TTTTG CTCCTGAT CTG ATT ATT AATG AG CAG AGAATG ACTCT ACCCTGCATGT ACG ACCAATGT AAA 

CACATGCTGTATGTTTCCTCTGAGTTACACAGGCTTCAGGTATCTTATGAAGAGTATCTCT 
TTCAGTTC CT AAGG ACG^TCTG AAGAG CCAAG AG CT ATTTGATGAAATT AG 
TCAAGAGGGAAjGGAAACrCCAGCCAGAACTGGCAGCGGTTTTATCAACTGAC^ 
AATCTCCTTAACTATTGCTTCCAAACATTTTTGGAT^ 

6 0 TCAGATACCAAAATATTCAAATGGAAATATCAAAAAACTTCTGTTTCATCAAAA^ 

GAAAGTCGAATTAATAGCTTTTATTGTATAAACTATCAGTTTGTCCTGTAGAGGTTTTG 
GTTGTTTTGTTITAAATACGCACTACATGTGGTTTATAGAGGGCCAAGACTTG^ 

AGTGATGGGAGAGTAGATGGTGAAATTTATTAGTTAATATATCCCAGAAATTAGAAACCTTAATATGTGGA 
CAAAGAAGGATGGCACCTAAACCACCACTGCCCAAAGTCTGTGTGATGAACTTTCTCTT 

6 5 G AAATTTT CT AGACT TTCTGTTGGTGT AT CCCC C CC CTGT AT AG TTAGG AT AG CATTTTTG ATTT ATG CATGG AAACCTGAAAAAA 

AGTTTACAAGTGTATATCAGAAAAGGGAAGTTGTGCCITTTATAGCTATTACTGTCTGGTTTT 
AACTACGCTTGCTCATTTTTTCTTACATAATTTTTT^ 

CCCAAATAAACTCTAAACATTAATCAATCATCTGTGTGAAAATGGGTTGGTGCT^ 
CACAAAAATTGACTCAAATCTCGAGTATTCTTGTCAAAAAA 

7 0 AATTATATAGGTTGTGCAAATTAACAGTCCTAACTGGTATAGAGCACCTAGTCCAGTGACCTGCT 

TGCAAAAGACTAATTTAAAAAATAACTACCAAGAGGCCCTGTCTGTACCTAACGCCCT 
GCrGGTAAACTATTTGTCTrTCAGGACCrrrTGAAGTAGTTT 
ACAGCTGAGAGACTTTTAATC^GACAAAGTAATTCCTCTCACTAAACrrT^ 
TAGACACCCATTTTCAGATTCCCATCTGTCACCAATTGGTT^ 
7 5 ATTTAGAACTGTATGTCAGACATCCATGTTTGTAAAACTACACATCCCT AATGT 
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TTCTTCACTGTTGAAAATTATTTTAAACAAAATAGAAGC^ 
AAACTTAACATATTTACTAAGCCACAAGAAATTTGATTTCT^ 

AATATTAAAAATATGGAACTTCTAATATATTTTTATATTTAGTTATAGTTTCAGATAT 
AAGGGAAGGGCTACTGCAGCTTTACATGCAATTTATTAAAATGAT^ 
5 GATGAGATTGTTTTATCATGACATGTTATATATTTTTTGTAGGGGTCAAAGAAATG 
GTACATGCATTCATACAGGCAGCGATGGTCTCAGAAACCAAACAGTTTGCT 
GCAGTGAAGGTTGCTGAGGCTCTGACCCAGTGAGATTACAGAGGAAGTTATC 
AACAGTGAGTCTtSTCAGCGCAGGTTTAGTTTACTCA^ 

GTGGTGCTTACATCCTTAAAGGCACCATCTAATAGCGGGTTACTTTCACATACAGCCCT 
1 0 CTT CAGAAGTTTGGCAAT AGTTTG CATAGAGGTACCAGCAATATGTAAATAGTG CAG AATCT CATAGGTTG CCAATAATACACT AA 

TTCCTTTCTATCCTACAACAAGAGTTTATTTCCAAATAAAATGAGGACAT 
TGTTATTTT CAGT ATTTTG G AGAAATT ATTT AAT AAAAAAACAAT CATTTGCTTTTTG 

HUMAN SEQUENCE - CODING 
1 5 ATGGACTCGAAAGAATCATTAACTCCTGGTAGAGAAGAAAACCCGA^ 

CTATAAAACCCTAAGAGGAGGAGCTACTGTGAAGGTTTCTGCGTCTTCACCCT 
AGCGAAGACTTTTGGTTGATTTTCCAAAAGGCT 

GGACTGTATATGGGAGAGACAGAAACAAAAGTGATGGGAAATGACCTGGGATTCCC^ 
GGAAACAGACTTAAAGCTTTTGGAAGAAAGCATTGCAAAC 
2 0 CCACTGCTXSTGTCTGCTGCCCCCACAGAGAAGGAGTTTCCyU^AAACrC^ 
CAGACTGGCACCAACGGTGGCAATGTGAAATTGTATACCAO^ 

TGGGTCCCCAGGTAAAGAGACGAATGAGAGTCCTTGGAGATCAGACCTGTTGATAGATGAAAA 

GAGAAGACGATTCATTC ClTrT GGAAGGAAACTCGAATGAGGACTGCAAGCCTCTCATTTTA 

GATAATGGAGATCTGGTTTTGTCAAGCCCCAGTAATGTAACACTGCCCCAAGTGAAAAC^ 

2 5 CACCCCTGGGGTAATTAAGCAAGAGAAACTGGGCIACAGTTTACrc 

TGTCTGCCATTTCTGTTGATGGTGTGAGTACCTCTGGAGG 

CAGGATCAGAAGCCTATT^TTAATGTCATTCCACCAATTCCCGTTGGT^ 

CAACTTGACTTCTCTGGGGACTCTGAACTTCCCTGGTCGAA^ 

TAAGCTCTCCTCCATCCAGCTCCTCAACAGCAACAACAGGACCACCTCCCAAACTCTGCCT 

3 0 TGTCATTATGGAGTCTTAACTTGTGGAAGCTGTAAAG^ 

AAGGAATG ATTGC AT CAT CG AT AAAATT CGAAGAAAAAACTG C CC AG C ATG C CG CT AT CG AAAATGTCTTCAGG CTGG AATG AACC 
TGG AAG CT CGAAAAACAAAG AAAAAAAT AAAAGG AATTC AG C AGG C CACTACAGG AGT CT CACAAGAAACCT CTGAAAATCCTGGT 
AACAAAACAATAGTTCCTGCAACGTTACCACAACTGACCCCTA 
TGCAGGATATGATAGCTCTGTTCCAGACTCAACTTGGAGGATCATGACT 

3 5 CAGTGAAATGGGCAAAGGCAATACCAGGTTTCAGGAACTTACACC 

CTTATGGCATTTGCTCTGGGGTGGAGATCATATAGACAATCAAGTGCAAACCTGCTGTG 
GCAGAGAATGACTCTACCCTGCATGTACGACCAATGTAAACACATGCTGTATGTTTCCrrCT 
ATGAAGAGTATCTCTGTATGAAAACCTrACTGCTTCTCTCTTOVGTTCCTAAGGACGGTCT 
ATTAGAATGACCTACATCAAAGAGCTAGGAAAAGCCATTGTCAAGAGX3GAAGGAAACT 

4 0 ACTGACAAAACTCTTGGATTCTATGCATG AAGTGGTTGAAAAT CT CCTTAACT ATTGCTTCCAAACATTTTTGG ATAAG ACCATGA 

GT ATTG AATT C CC CG AG ATGTT AG CTGAAAT CAT CACCAATC AG AT ACCAAAAT ATTCAAATGGAAAT ATCAAAAAACTT CTGTTT 
CATCAAAAGTGA 
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Table 90 

MOUSE NOMENCLATURE 
ICSGNM Dadl 
Celera mCG8692 

5 

HUMAN NOMENCLATURE 
HGNC DAD1 
Celera hCG40724 

10 MOUSE SEQUENCE - GENOMIC 

TTAGTGCTAAGAAATCCAGAGAGGGAGATGCGGTGTGTCTCTGGAGCAGGGCATGGCT 

ATGTTAGAAGCTGAGGATCTGAGTCCGGCTTGCTGTGGTGCTG^ 

CTCTACCAGGTTGTCACACAAAATTCACTCATGTAGCAGAGAAGTGGCACATC^ 

ATACAGATGAAGGAAATATATATATATATATATATATATATAC^ 
1 5 TATATATCAATCACCTGAAACCTCCAATACAGAGACATGGTTCIGATGAATGGT^ 

TTTAAGAAC CAGTGAGGATGACAGAGAT AGGT AGGTGCAC CAAGGTGAGTGG TGGTG AATTGGATGG CCTTGG CTT AAAG ACAG AG 

TCCTTCATCGGGAGGTGACTCACCCCAAGGCCCAGCCACTCGAAGAAC^ 

CCTTGACCCAGGTTGGAGGTGCACGGATCTCACTAGGGTCAGTTC 

TTAACTCTACATTAAACAGAAGATACACTGTATTTGCTTGGAGGTAGCATAG 
2 0 AAT AAATATTGG AACG TG T AGGAAGGCAGAGTCAGCTCATG AGTGAGAAG AGTAG CTT AAGTTACCTTTCAGGGTACTTGATAGAG 

TAAGAAGTGGCCAGGAATCCTCCCAAACTGTGCCCCAGGAGGATCATGGTGGGGATTCCCATGGTCT 

GAAGTGTGCGGCGGGCACTCAGTTGAATCCATGTTGAGGATCCAGAGGCCCACACC^ 
CrrGCGATCCTTTTGTTCTGGGCTCACAGTCACCGTCCAGATCrTG 

2 5 CTGGAGACCTGGGAAGGGAAGAATTCCTGGGTAAAGCATCAAGAATGCrTCCT 

TCAGTTTTCTCTCCTCITAAAGAGATCATTTTAGGAAGCCCTCCTAG 
AAGACCCTGTCTCAGAAGAAAAGGGTAATAATGATGATGATAATAATAATAA 
AGCAAGCCAGGAAGCCACATTGCrrCTATGGTCTCTGCTTCAGGTCTTGCCTCT 
GATTGTAACCTGTAAGCCAAJVTAAACATTrCCTCCCCGAGTTTGATTTGGTCA 

3 0 AACAGCCTCATCCATCTTCGGCCTCCTTAGCCCGGCCTTAGACAGAACCCTAAGTC^ 

CTGTTCCCCCATCTTTAGATCTGATCCAGTGAACCAACT^ 
GCTGAGTTCACTCAGAAGAAACTCCCCGTCTTGAGGTCTTTAGTAATCT 
CCCCAGCTGTTCCTGTAGCATGGGGAACTGGGAATGTTATCATCCCAGTGAT^ 
GAATACTGTGTTGTCTGTCTGTATCCTCCCTGCAAGCCAATGATTTTCTTTCCT 

3 5 GGCTCAGTGGACCCAGGTTGGGGAAGATCCTGTATCCTT^ 

TGG AGG ATCCTGGCTTCCACATTCTTCAGTTGAG ACATGGAAGTGGGGCGCCAAGTGGGG AG CCAGCT ACTCAGCCAGC CCTGAGG 
C CT ACAAG ACACAAAGGT ACTTACTGACAACACTGG AAAG AAG CCACTG TT CCCAG C CTT G CTG AT AG CTCCT AG AGTT AT CAGGG 
T ACAAGGTGTTTCCCAACTTCAGGAGGCCTG CCCTTCATGGTCTTCAATATCAGCAATGC CTAAGCAGTCCGATAGATACACTGTT 
ACTCTTTCCTTAGGCAAATGGCTATTAAGAACAATGGCCCTTAAC^ 

4 0 TTTATTACCTTCCTGGAGCTTTACAACTCCrCTTTGTT^ 

AGACTTTTCACTAATATTTCCATTATTCTTTGTCACT 

GTCATGCATGCTTCTGGAAGTTTGGGGTGCTGTTAGTGCCTGATTGGGGAATCC 

GCAAGGACCAAGGXK5TAAGATATCTGGATTCATGTCACCAAAAGACACAGG 

GAGAAAACTTGCCTATGTCCTAGAGTAAAACTGGTAACrTATCCTGCTTTCCC 

4 5 GAGGTGCCAAGCACTGGCTCTGGAGTATACAGAACTCCTGGGTCTG 

TGTGAACTCTGC CTAGGTC AAGGT AAGAGGATGGTTGCAGCAAGGGCTTGTGGGTT^ C 

GGAACCATCCCAGCTACTCATAGCATAGACAATGGGTACCCT 

ACAGATCTTGGCCAGTCTCAAAAGAGAGACITCCTCTTTGCA 

5 0 ACACATCTCTAACTCTGGACAGCAGAGCTCAAGA 

AAGGGGTGGGGGTTGGGGGACAGGGAGCTATGGGCACAGCTTGGCCTTTCTTC^ 
GTGCCTCTCACACrrGTGTGTCAGTGCTATTAGGGAGGTGAGAGCTGAGGTGG 
AGTCTCTGAATTTCrATGGGCATCCGCATCTGTTGCTTTATTTGTAAAAC^ 
CCGTTTCCCGCAAGTCIACrrGTAGGCACCATAAATTTCCTCITAG 

5 5 CGATTGTTTACT ATTGATACAAAGTTATTT CGTTGTTCTGCAG ATC CAAAAGGACAAATATTAGGTCAAGKK5AAGGCCCTGTTCCC 

GTGAAACCCTTGGAGCTGAGTCATTCTCCTCTAAGCAACTTTTTCCCG^ 
CTTGATGGCCTTGGGCGAGCAGTAACCACATCCCAATCTAACAAACCT 
AAGAACACTATCAACTAGGATTGCAAGCAGAACCTAAAGGC^ 
ACCGTAGTGGAGA C TTCTTTCTCCTACCCATGCCTTGTCCTTCAAGGGCTTT^ 

6 0 ACAAGCACCACGTCCGCATCGGGAAGAGCTAGCTTATTCTCGTGCTTTACCACTAATCTAACTGG^ 

AAG ACT AGTGG AAAG TCAGAGTCCCCAGG CTG AACG CAAAGCTITAACCCTCAACACTCT 

AGCGCTCCCGGGG CTCCTGAGCGCAACCCATCCCTCCCTGG AAG CCAACCTGAAAGGAAAGGCGGGGAGTCTG AAGCTGTCCG CAA 

GTACAGACAGTCCCCGGATCCTGGCCCGGACAGATAGAGACAGAAAGAGAAACGG^ 

CAGCCATAGTAAACAAGCCCTCCCGGGTCGAGCTGAGCCAGCCCATCCCTCCGCT 

6 5 TCGGATAACAAAGGGGGCGGGGCCGGAGGCGGGGCCTGGGTCGCTAGCGGGGTTCTAGCA^ 

AGCACATGGACAGCAATCAGTTGGGTCCTTCCTTCTGTTTGCT 

GTAGGCTCCTAATCCGGATCACATCCCTTTGGGGGACAGTGAGGGAGACAGGAAGACAGGTA 
'GGAATCCCACCAGCCCCGCCTCCACACCTCCTTCAATTAAAGAGTGAAGGAACTGCT 
CTCTGTATCTCAGGTCACAGACGCTTCATCTGCCTTGGTAGTC 

7 0 CCTATAAAAGGGCCTCGGATATTGTAACCCAAGATTTCCCGGTT 

GCAAAG TTTAAAGCT AAGT AGG CTC CCTTGT AGT ATTTGGGGCATCCTC C 
ATAAGCATTTCTCTGTTCTGGGGATGTCTTGCCTCAACCTA 
GGACCTCCCAGAGGAGGGCCAGTTAAAGGCCTAACGCATAAGGCTGGGCGTGATT 
ATGTGAAGTGACTTAACCCAGAAGTTTTGAGCACATTTQlGG 
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AATTCACTTGAAATGGCAAAGGGGGTGTGGCATCCCTGGAGCGGCTCCC^ 
AGGGGAAAGAATCTACTGAGGACTCTCTCTCTCTCTCTCTCTCrCTCACTCTCTCTCGCTCT 
CAOVCAC^CACACACACACACACACACACACACACACACACA 
ATCTGAGGATGATCTTGCTrCCGAGTCTCCreTTTCTATCTCT 
5 AAGATTGTGGATCXlAACCCAGGGCTTTCTGCTTGCTAGCTAGGCACTTTAG 
CTTTGGTCTTCTTGAAAGACCTAAGTGAACCTGAGATTTATTCT^ 
AGTTGAAGAGAACTGGAATTTTCAGAAATGTCCATCCATGATTTTAGCCAA 
TGGCTAGGCTTGATAAGAACGCAGCTTCCTGTTTCCCATCCTCA 
TTTCTTTTCTTTTTTTTTTTTTG 
1 0 TCTGGGTTAAGTTCAAGTTC1TTGTTTTCTCATTTAT 

TGTGTGGCTTGTGAGAATTCACTAAGGCAATTCAAAGAAGATTTAGCATA 

AAACATT ATGTT ACACAAAGAAT CCAATGAT AAAAAGAAAGG AAG T CATCTTTTTTGGTGGTCCC CCT ACAGACTCC C CAGG CTGG 

TTTTGGAGAGCTGGCAACTATGTAGTCAGAACTTAGTTCAAACTTCCCT 

GACTTTAGTTTATTTTCCTCAGAACACCATTTTAAAAGGCTCITGCATAATC 

1 5 TCAGGGGGGGAATCTTGTCTGTTTAGCACCCTGCTTTGCT 

ACAAG CAGG CTAGT AAG CATGCCTGTCTGACAGAATGAAG CATT AAGTG CTAAGAATG AAG TACT AAATTCTGAGGG TACT AAGTA 
CT AAGT AG CTGACAGAATGAAGT ACT AAGTT CT AGGACAACAAAAG CTCCCAT AAT CTG AT AACCT ATAAAATGTGGGT CCTTTTG 
GGAAAGGCTGATCAGGTG CGACTGGTATTAGGACTTTAGATT CTAGAACAT ATATG AGG AACACAGTGCAGAGAGCATTAGTATTC 
TGT CTCCACATG TT CCAAAT AAGCATT AGCAT AGTTGTTT AG AAGGAAG AAATTGAAGT AT ATGG AA 

2 0 G TTATC CT AAGG T ATTT C CAGAATTG AT AT AACT CAGG ATTG AGT AGAT CTTCT AG AATT AT AT ACAATG AAATGAACT CTTGGT A 

CCTACrCTAGACAGGCCTCTGGAGAGAAAAATGTATTCTCCCCTTCTTATCTTC 
TCTTAAAAACATATCTTTTAAAACATGCATGATTIGATACATAAAGTAT^ 



CTCIGCCTCCCAAGTCCTGGGATTAAAGGCGTGCACCAGTACKjCCTGGCT 

2 5 TCACAGAGTATCCCCCCTCTACTCTCCCATATCTTATATTGTGGAGCACCnTAGTAATGAAGCCT 

GATTTGACTTAAAACCTAATTTGTCTTTGCCAGTCATCAGATTTACAGTGCTAGCGT 

TTTGCAGTATGGCCGAGTTTTAATTATTTGGAGAAACAGAGGTCGTTCATGTCAGAATGA^ 

TCTGGGCATTTACCATGGGCTCTGGTCTTTGCT^ 

CAGGTATTGTTATTTCCATGTCATGGAGATGGAAAATGAGACCATAGTGACATATACCT^ 

3 0 CAGGAGG ATTG CAAATTTGAATC CATT CT GGG AT ACAT AG TAATTT CCAGGTT AGG CT AGATAAAG AAACCTTGTCAAAAAG AACA 

CZAAAGAAGG^AAGCAAGAATTAAATGTTAAGTGATCAGTCTCAAGTAATTAGTAGTATG 
TCTGACTCTGGAACTTGAACTTTTATCTACTATCCCTCTGCAC 
AAACCAATGACCATATGATTTCCTATTGAGAGTTTAGTTAAGTTGATATAC 
CAGGTAGCTGAGAAAATCTGGAGTTCAGAATGGCTTrGTCTCAGAATGAGTATXSCTT 

3 5 CTGTCTTCCTAGGGAGTTCCTATGAAGATGTTAATGACAAAGATCAAAGGCTAAGATGCGTCTGGAAG 

AGGAACTTCTGGAAGAATTCATTACTGGATAATTAGAAACAGCCAGTTAGCCC^ 

TT CT ACTTGTG CTG AGACAGGGAG AAGGGTTC AATTGCAG ATGCATTTCAT ATTATGTG^ 

CACATCCCACCCACTCTACAGTAACACAAGAGCACAGAGGCTACATTTCTAGGAGTG 

CCTCTTTTAACAGACTAAGGCTTTCCAAAGACACGAGTGATTCACTGGAACCT 

4 0 CGCTAAGTGTTTCCGGGCTAAATACAGATAACAAGCGACAGTAGTTCAGTTGCGGGTTTAGAAAT 

GGCAGTTGGAAGTGTCTTAAACGCTGAAATGAAGTACCGGATTCTATGCGGGTAGACT 

TAGAAt^TCGG^TTATTGTGAACTTGT AATGGCAGTG CAAACCCGAAGGCAG AACTTAAGTTTTCTT CAGCT AAAACGCTAAAAAG 

CAGTTAATTGTACTTGTTAAAACCTCAAACTTTCAATTCTTTTTCTATA 

TTTGGAGCCATTCAAGGAATATTATAACAATACCTGTTGTACGATGT 

4 5 GTGACT AT CT CT CTTTTT AG AC AG TGACTG ATT AACCT AAAAATT C CT ACAAT AT AACT ACAG CT ATTGG AC CT CCTTGG T ACTCA 

AGCTCAACCCTCTCCCGCCAGTGTAAACCGGCGCTTGGCCCGTATAAGCATAGTTCTCACrc 

GATTGGTCATAGGCATCTGAAGATAGAGCTTTTAAGAGAATTAGGGCCAGGCAGGT^ 

AGCAGATTCTCAAGCTAGACCACTGAGGGGGATCAAAGAGAGCCGGAACTCTCTGGGAAGCOT 

5 0 CCCTTGTTCCGGTCACAAATCTCGTCGCCAGAGCAACAGTGGGCGTCAACAAACGCAGGGCGGAAATAAGG 

CCAGAGGGGGAGATAGGCCCTGCGCAGGAG CATTAATTTATTTCCGTTTGT AGTTCTCAGTCTGGATCAGCAGCCTG TO 

ATT AATTGTGACGTACC AATG CTGGCCTTTATGCCGGAG CCAATTGGCCCGCCGGTGTGGGCGTGGCT AGGGCCCGCT AACG CG CA 

CGCGCGAGCCACATAGTCTGTGGTCGTCCGGTATCCGAAGTCCCCGTGTTCGTCATGTCGGCGTCTGTGGTGTCCGTCATCT 

GTTCCTGGAGGAGTACTTGAGCTCCACTCCGCAGCGGCTGAAGTTGCTGGACGCCTATCTCCTTTATATACTGCTGACCGGGGCGC 

5 5 TGCAGTTCGGCTACTCTCTCCTCGTGGGGACCTTCCCCTTCAACT 

CTAGCGGGTAATGATTCTGTAACCTGGTCATAATGTCCCACTTTGGCGCGCAGCTTCGTCCTCACGGTACCCT 

GTTGAAACTGAATATGTAAAGAAATAGCTCTTCTGGTGTTCGTCGCAATTAATACT 
AACTTTAACACAAATTTAAAGTTGGTAGAGCTTTAAATTAAAATGCCAAGCTGAGCCA 

6 0 TC71TCTAGGAGAAGACTTTAGTGATAGTATAGGAATATAAATATTAGTGACTGCTTGTGAG 

TTTAGTTGCCCCTACTTATTTCTGTGCATAATTTATGTGCTTTTTCTCT 
TTAGCTCTrCTTTCGGGTCTACGCATTTAATAACCTTACAGACTrCCAGGT^ 
TCTGACCTTTCTTGCCCCATGGAACTGAGAGAATACTGGGAACAGGTTTGGCTAAAGAATGTTACT 
TCAGCAGTGTTTTGGGGGTTTATTAGCAATTGTTTTCCTCAGTGA^ 

6 5 TTGTTTGGAGACTTGATTTAGAGTCTTAACATAAACCTCACGTCACATGTACACATTCACAT^ 

AATTTTACTTTTCAGAACTTAAATAGGTCCCAGCCGACTTGTGCCCCACTCAGAG 

CATCATATTGTCTTTGGGTTTTGGTCCAGTTTCTG^AGGCTACACC 

CAAACCTCTCTACTGTGCCCAATACTGTTCCCX3AC!AACTTTATTAACC^ 

TTAGCCTCTGGTGATGGAAGTGTAGGAACAGGGTGGGGCTAACATCTGATTTAAGTAAGGTCTCTCCTGTTG 

7 0 CAOUteCCTCTGAGTTAAGTCCTCftCTO 

TCTCTCTTTCATCCCATCTGTCTTTCCTrTCClTTTCT 1 1 1'CX I"l I X GGTAGATGACTTT 

XGCTTTTTTTTTTTTTT I' ll "! 1 C TTTT AAAG ATTTATTT ACTT ATGT G AGT ACATTG TAG CTG T CTT CAGACACAG CAG AGGAGTG 
CAG AAG ATCTT ATT ACAG ATGGT TGTGAG C CAC CATGTGGTTG CTGGGAATTG AATT CAGGACCT CTGGAAG AG CAGTCAGTGCTC 
TTAACCACTGAGCCATCTCTTCAGCACCAGCCCCAGCCCCA^ 
7 5 TATTTTTGTGAGCATCTTTGTGTGCATATAATGTATTGTAAAGTGTCATrATGT 
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AAGAGCAGAAGTATTCATTGGCTAGACTGAACTGGCTGGTGGATAGAGAACTCT 
TTGTCTGCCTCTCTCCAACTCTCTGCAACTCCACAGTTAAAGA 
ACCAACTGAAACATCTCTCCAGCTCCTAAGTCCTGTTTTTG^ 
TATGGCTTGTTATTGACTAGACTGCrTA^ 
5 GAGGCCATTATTAGTGAGGACATTATTAGTACTGAGGTCATTATTAGTAGTAAGATTGG 
TAGGTAACAGGGAATAGGGTGAGAGAAATAAAATTGGGTTTAAjGGAAAAGAAA 
GTTATCATTTGCATTCTCTTCCCTTCZAGTGAACTCAGGGTATTTTGTC^ 

CCTATCCATC CTTTCTTTTTTTTTTI 1111 rC CTTTTAATTTCTTTTITTTTTTTTGAGACAGGGTTTTAATGTCT 
TGCTGGAACTGTAGATTGGGTTGACCTTGAACACAGAGATCAGTGTACCT 
1 0 AGGCC^CTGCCATG lTrClTrr CACAGCATTCATTTTACT 
TTTTAGTGAAAAATTAGAAATCTGGCTGTTTGGCCTTCG 
TCTGTGGTCTTTGGGAAGAGCCTGGAAGGGGCAGCTTTGCTGAGGGAG 

GTGGCACT AATG CTGG AGCTTTGG CCT ATTTCTGT AGGAGATCAT CTG AGAATT AACAGTG ACAGCACAGG CAGAT AGATGGCT CA 
GTGGGTGAGGGTCCTTGTCACCAAGCCTGAGCATCACCTGAGTTTGATC^ 
1 5 CATTCTTCTCTCACCTCCACATGGACACACTCAC^ 

TTTTGGAAGTGAAGGGCAGACTTGAAGTTGATGTTAGTIt5CATG 
TTGGCTGGTTGTTTCTTTTGCATGTACATTTGCT 
CATTTGAATCCTGAAAAGACACAATGCTGTCTTCTGTTTCCTC^ 
TGAGAGGGCACGAGGGATGGGCTTATTCAGCAGCGGATAGTGTAGTGGGTTTTTA 

2 0 AGCCATGCTCAGCAGCTGTCTGTTTATCTTCCTGTTAGCTATCACTAGAGGA 

GTTGGTAGTCTTTGTAGTTTCCTTTTGTACCTAGCTGGGAGCTGCTCATCT 

TAAGTCTGTCTTTAAAGTAAGTCTTACTAGGCAGACACGCCACCACTGAG 

ACTTTACAACTTTGCTTC<5TCC<jTACTCAAGCTTATGGA 

CAGCGGTGAGTTCAGATACCTAGC^CCAATGTCAGAAGCrAGGCATGGTACCTATC 

AG<3ATGAAAGAGAGCAGGACACCCAATGTCCTCGGC^ 
ACGCCATACACACATATACCACATTCATGTCTCTGTCTCACACACACAC^ 
CAC^CACACACACACACACACAOSACGACATTTACAAAGTAAAGTTTTAGTACT 
TGAGAGACACGGAGACTAACATGGAGTGTTTTTTGTCTCAAGGAAGCCATTG^ 

3 0 GTGTCCTTTGATCTTTGGATTCCTCTCAGCTACTGTCCAGGAAATTGGGAGTTTGC 

ATGTCCTATCCATCACTrriTTATTCTCGGTGTCTCAG^ 

TCTGATTTGCACGCCTGATCATGTTACTTAACCAAATTTGTCTGTGGTTTCCAAAGTAGAG 
TTTTTCAGGACACAGGCTCTTTTGGGAGCTGACCTTGTTAACACCACC 

3 5 GGACCCAGGGCTCTCTTGTCCTTGCTTGCGTGTGTATGTGGGGGATGAGG 

GTATTTCTTGTACTTGTCTGCCCTCTGGGAGGCGAGGGCAGGCGCTTTCATGCCTGCATCCTTG 
GCTGTCTTAATTTAAGGAAGATGGAGGACTGTTTGCCAGGTTCTGTGCTGGTGTGCTGTGGATTCAG 
GGCTCCTCCTTAAGAGTCTTACAGTCTGAAGGAGGATGCAGACAGGT 
GCTCCCTGGCCCTTTGAGTGCTCATGCAGTGTGTC^ 

4 0 CCATTTCrTTGCCTGTATTCACAGTTTATCTTATATATTAGTTTTTTACTAG 

TAGTGGCTCA.CACCTGCTCCTGTGCCTCTGCTGTTTGA 

CTTCTCTTGGTGGCTGAACAGCTTCTGTGGTGATCCCTTCATTCAGCATTGTG 

ACTTGCTCTAATATTGTCTAATCTTACAGTTGAACCAGGCATGGTGGTGCATACTTTT 

GAGGATATCTATGAATTTGAGGCCAGGCTGGTTTACATAGTGAGTTCCAGGCCAACCAAGAC 

4 5 ATAAGTTCCAGTCGATCCCAGGCACTGCTAGCAACCATC^ 

AGTTGCTGTGTCTCCCTTTCCCCCACATATTTCTCAGTGCTCTTGTGCTGTGGCCT 

ATGTGGCCAGATGGAGTAGTCATCCGCCTCACTTTACCTGACTGCTCAGCATTAGTATTTAGCT 

TGAAACACTCGCTTCTCAGCACTCTTGTGACTGCCTCT^ 

GGCACTTCATTTTGTA' l T r CTTTCCATAGGTGAAT'ri T 1"1 1 IT I T T TCTCAACAGCGTTTCTCTGTGTAGCCCTGCTGTCCTGGAA 

5 0 CTCACTCTGTAGACCAGGCTGGCTTTGAACTCAGAAATCCGCCTGCCTCTGCCT 

ACTGCCTGGCACCATAGGTGAATTTAAGTTCTGTATCCTGTCCAGTTGGTTCAAAATACCA 

TTG ATGTCTCTTCAATCGCTCATGTTTACATATCCCAAATGGACATTCC CCTTACC CTTCTTCCCAGCCATCTGCTT CTTTACCAG 

CCAGTGTACAGTGTCAGCCCTGGGTAGGTTGTTCAGCCAGATATATAGGAGGTCT^ 

ATCCATTTCACCAGCAAGCTTTGTCTTCTCTGGAATATAG^ 

5 5 AG CTCTTCATTTCCTAAGTCATTCATCTTT AGTCCTGC AAACC ATTCTTTATGTTG CAGTCTGGCCTTTTAAAG AATGTGAGTCGT 

TGTTGCTTGGCTAGTCGAACCCTTGACTGTTGCTTCATTGCCCITTGGCTAAGTCT 
ACCAGATAGCTTGGAAGTGGCAAGACTGTTTTTACTATCTGTATTGCTCTGCCCCAGATCTTTATGTGC 
AAATGGTCTTTAGTAGCATTTTTAAAGACACGACTACTGGATACAACAGCCTTCTAGTCTC 
GTGTGTACCCCACACAGAATTTCCTTGTTGGTTTGCAGACrrGGAGACACTTGT^ 

6 0 AGAGTCCTGTATCTGGTCCTG CC^TCTTCCCAGCGTCTAGAGTGATGCTTGACTC^ 

CACATGAGCCTTTAAGGAGATGATGAGCAGAGGTGCTGGATGCGCAGTCAGGAGGCTGTCGGTTTCC^ 

GAGACGCTGAACCCTCATTTCTGGAGCACTTTCTITA 

ACAGGATTTGGAGGGTTGGTGGTTCTCATGCTCATCAAATCC^ 

TGGAGGCTGCTTCCAGCTCACCTGTCTCCCACCACTTCCTGGACATTGCTCTAGTTTGCTTTCTA 

6 5 AGTAGCAACCTGGGGAGGAAAGGGCTTATTTGACTTACATGGCATAGCCCTTTATCAAGGGA^ 

GACCATGGAGGACTGCTGCTTACTGCTTGCTCAACCCGTTCCTTACACAACCAGG 

CTACAGTGAGCTGGGTCCTTCCATATTAGTCATT^ 

TGCTTTTCTAGAGGACCCAGGTTCGAATTGAACCTGGCA 

ACACAAGTGATACACAGACTTACATGAAGACAAAATACCCAATAAATAAAATAAATTTTTTAAAAGGAA 

7 0 CCCCCAGCCTAGTCTGATAGAGGCAACCCAAAGCCCCACATGCAGTTCCCCAGCTGAGGTTCTCA 

CAAGTTGGCAAAAAGAAAAAAAGAAAAGAATCCAACTAGCATAGGCATTTTATGCAC 
GGCTCAGCAATTCAGAGTGTTGGCTATTCTTCCAGAGGACCC^ 
GTAACTTCAGTTCCGGGGTATCCAGTGCTGTCCAGTAACCTC 
GGCAAAACACTCATACACATACACTAAAC^GTTTTTAAAAATGGGGTGTTGATCT 
75 TGGGGTTGACTTGTGCACTGTAACTTAATTTGAGTTGTTATAGTCATTTTAAGGTGACTAGC^ 
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CAAAGCTrrTCT CTTGTCCC CT CATAGTT ACTCCAG ACACTAC CT CATTTT CT AT CCCAG TG CTGTTAACAG'l'TLTG'l'ATTT ATTT 
CTCTAGAGTTTTTGCTAGCTGAGAACTAAGTGATAGACTAGATGTA 

TTTTAGAAAGAGCAGTTTGGGGCCCACGG CAGAGCAG CTG CAGAAGCAGG CAG ACCTG AGTTAAGG TC CCAGCACT CCAGGTCTGT 
CACCC^USGGTGGAGGAAGGGGAAGGCAGGCAGACCrriXlAACCT 
5 TCTGTGAGAATCCATGAGTTCATAAGAACT 

TT CCTGCTTTCAG AGGATCTGAGTTTGATTC CCAGAAT CCACAT CCIAACCT CCAGGAATG CCCT CGTCAGCCTTTT AGGAGCACCT 
GT AT ACACAAGTCATAT AGACATGGCACACAGTGAAGTGCG CGTX3 CGTG CACATACATG CACACACAGATACACAG AGACATACAC 
ACATACACAGAGACATACACACAGATACACACACACACA 
TAGGTGGGATGTGAATGAGGAAGATTCCTGAATTTAACCrCraACCTC 
10 ACATATGAACATGTACATATACATACACAAAATACAGACATTTACACGCAATACT 
GTTAGTGATTTTGACTATTTTTTTCACTCAGTGAAAGTGAATA 

TTG CAT AGGATTTCATTGCAGGTGTGTAGTATGATT TAATTATTCCCCTTGGTGACATTTGGAG CATTT CTTTTTGTTTTCTTTCA 
ATTTTGCAAACCTACTGTGTTTTGTAAAACACATTCAT^ 
TTTCCCCCAGAGGTGGACTAGCCTGGCCTTTrACAGCAAGG 
1 5 AGTGCACTTC03AAGTGTGTCTGTGCTTAGGTTGATGGGAACT 

GTCTTCTCAGTGGATT CITTGTTAATGTTGGAGTGGATGCT AG CCTGAAGCAGGTG CAATAATTTATTGAAG CCAT AAAAGG TGTT 
TGAGCAGAGAATGCCCCTGGAGCTGCTTTTCA^ 

GGAGTAGAGATCACTGTCAAGAGTCTTGTGTGTTGGATCITCTGTTGTCTTT^ 

ACAGTGTATCT ACAGCAGGAGG C CCTTGTGGACCTTCAGCTGTACT CCTGATCT CTGCCGTG TAT CTT CTGGTAG AAG ATG ATT AA 
2 0 GATGCTATAGGGTCATCATTCATCCCAGTCTTCGTTACrGCTGCATCrrGTA 

CCAGCTTAAGAGTG CTT AGGCAGTTAGGAGGTGTG AGT AAGCTTACAGCTTT AG CAC CT CTGATGTGTTGTTTrCCCCCTTTATTG 
TT CCTTTAATAACGGAAAGTTAAGATTGT AAAG GTATATAATTTTATG AG TTTT AG TTTTCT ATTTACTTTTTTCAGTTTTT AGTT 
ATTGGATGTGTGTGTGCTGTC7VGGCTTGTCGGCAAATGTCCTCCCCAGAAAGCTA 
CAGTGTCTGATGTAGCTCAGGTTCCCCTTGGACTTGCTGTGGACCT 

2 5 CAGTGCCAGGCTGCAGCGGTGCAGCACCATACCTAGTTTC 

GGACAGAACGT TAT CACTACTGT ACAGG TTTCTTGCATTTT AAGCCATGATTCTCTC CTGAGAG ATGAAAGTCACTATCTGT CTT 
TGCTACTGTAGATTTACTCAGCCTATTTTGTTTATTTGTTITTGTTGTCT 
GACTGGTCTCCGAACTTGTGGCTGTCCTCCTTCAGACTCACGTTCT 
TGGTGTTAAGGCTCCTTATGTATGAATCATCTAGTGTGCACTCTTTATGTC 

3 0 ACTTGGATCTAGCAGAAGTTCCITTTCATTGCT 

AGACT TTC CAT CACGG GATAT ATTTG TGTATTGATT AGCAATCATTTG TT TTATATTTCTGAATAATG ACTTTTGTGTTTTGTT TC 

TTACCCAGTTTGCCTGAGAATACIAGATCAACCCCCAGAACAAGGCG 

TCCTCTTTGCCAGCACGATCCTGC^CCTTGTCGTCATGAACTTCGTTGGCT 

3 5 CGGAGCGTAAGTGGCCATCCCAGCCTTGTGCTGCTGTGCTGATTCCAGCAGGTGGATTAGATGGG 

CGAGTTCCCATGGTACAGTAGGGTGTCTGCAGGCTTGCCCAGCTT 

ACAGGTGTGCTCCCAAAGGCTTTTGAAGAAAGATAAACTGCAAGTTT^ 

TTGCCATTGATGTTAAATATTAATTATGGCCATTGTTGCCTGGTAGGTACT 

GAATGTGTG C C CTG AACTCAAATG GTAAAGTTGG ATTAACTTGTGAGAAG ACTTAGAAC CAGACGGG TAG CATTCGG TTTG TTTG C 

4 0 TTCATCCCTTGTTAAGCACGAAGCCATCTTGCCCCCCCAAACAAACTT^ 

CTCTITTGTAGAGCTGAAAATTCTCTCTGCCCTCCCTTCATA 

TTCATTAATGCTCTGACAAGACACCATGGCIAGAGGCAGCTTGTGGAAGAAAGTC CTGCAAAGAG CTA 

GACTCTATGGCTGTCAGGGTGGGGAGCATGGCAGCAGGCAGGCTTGGTGTTAGGGCl^ 

AGCTTGAAACAAAGCTAACTGAGAGTGGCATGGGCTTTTGAACCTGAAAGCCT 

4 5 CACCTCCTAATCCTTCCCAAACAGTTCCACCTACT 

TACATCC CAG AAGCAGAGTCTTAAGCAGTG AAACGGACATGATTTTT C CCAG CACAT ACCAATATTTTTC CTT TG ATGAGAGAAAA 
TGAGCCGGGAACAATGCIAGTTGCTTCCCAGATGC^C^ 

CATGCAACTCAGGGACGAATGCCTGTGCTTCTCCTTTCCCTTGCACAATAAGATT^ 
AGAATCTTAGGAAATAATAACCTTTTGGGAGAAAATGTCCCAGGTCCACCCTCCTTACIATGACTACA 

5 0 ACTGAGAGCCTGAGAAACCTCACACGAGAACCAGGGATGCATGG 

TTTCTTGAAAGTAGTAGGCTCCTAGAAAAGGTGGGGAAACGCTCTTGTCT 

CCGATACACGCTAGCTACCCTGTGGTCCCACACATCTGTCTGTCCCATCTGTGTC 

CACTAACTCAGAAGGTAGGATGAAAAGGAAAACAAAAACAAGAC 

GGTGCTGATGCCCTTATTCTGTGAAATGAATTACGGGGCTATAGGGCAGCACAGT^GGGGCTAGATTCTCA 

5 5 TT CTC CT TAT CTTC CAGTTGACAG ATGGGCACTCTAAAAAACTT AATGAATTTG AAGGTGGTATT^ CAGG AAG AAAAGGCA 

TATCAAAAGAAAATGGCATTGCG CTGTAAAAGTCAGTGTGACTTAGGGGGGCCAGTGAAGGG CAGTTGCTTCAAGTGGCGCT CACT 
GCATGTCCTACAAGTGCTTGTGAGTATCATTTGACTTTC^ 
CACTGAGAGACCCAGAGAGCAa3AGGAGAAATCTTTCATTTGTTTTTGCT^ 
GAAGGATGGCCTTGTGTTCTTGACCCTCCAACCCTTGCCTCCCACGTGCTC 

6 0 ATTTTAGATCCTTTTTCTAATTTAAAAATTTTTAATTTTAATTTG 

GGTTCCCATGGCGACCAGAAGTGATGGGTCTCCTGGAGCTAGTTATAGGCA^ 
CTTGGGTCTTTTGAAAGACCAGCTTATAATCTTAACTGCTGAGTCAT^ 
TGTTTTTCTAGACAGGGTTTCTCTGTGTAGTTCTGGCTGTCCTAGAA 
CCTGTGTCTGGGATTAAAGATGTGTGCCATAGTCTGACCG^ 

6 5 ATTGGGTTGCATTTGGGGAGTTCACATACCTGGCGATATGGGCCATGGG^ 

CGTTAGTCTGCTTGCCTTCCTGGATGGAGGTTGTTGCCAGAAGCCTG 
GGAGAGTCTCTTACTATCTTTAAAATTTCGTAGCATTTAATCCT 

AAGGCTAC CTTTGAGCTGACAGT ACTGTTT ATTAC T AGG CAATT CTCTTT CTTTCTTTCTTTAATTT ATCAT AATTTAG TAAT CTA 
GAGAATTACCTTGGGAAGGGACTCAAACTGTATTCCTTTTTTCCATCTCCATCCTTATTTTTAATT 

7 0 TT ATC C CAGGCTTG CCTGGAACTTG AAAAAAAAATTGTTG TTT AAAATT TATT AGATTTTATTGAGGGGGG AGGAG CTGGTGTGTG 

CGTGTAGGTCCTAGGGATCACACTGGGGCATCAGC^^ 
AAAGAAAAAAAGTTTrAATTTATTCTTTAACAAT^ 
ACCACATGCTCTTTTCAAATTATT^TATAACCTACCGAGTCCAT^ 
7 5 AGTCTAGATGGTACCCTCTGCACTCTCCCGCCATCTAGTGGATGCACCTTTCCTATGCTGGGAGTGGATC^ 
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GTATTTTGGTGGAACAACTAAAGCTAGTTC 
TAAATCCAGGATTTCACAGGGTTCCCTTTGTC^ 

AGTGCCCCCTGGCAATACTGTTCAGTGTTTAGTTGATGAGCGAGTGGGTCCCCT 

CTGGGAAGTCAGG TAACTTCTTTATGATTGGTCTTAAT AGTTCTTAAAAAG CATTAATTGATT CTTTT AATTTGTGG CTGAATTAT 
TCTGGTTGGTCTTTTCAAGGCAGAGAGCrAGACAGATGTAGTTAG 

GGGGATGGAGCGCAGGTTGTCAGGAGTAGCAGGAGTACCCACTGATCCA^ 



GCCTTCATTGTAATCTCTTACAGCTGTTAACTTTCACrAATACA 
TTCTCTCTCTCTCrCTCTCTCTCTCCCTCCCTTCCTCCCTCCCTCCCTT^ 

TGGGTGACT AAACATTAAAGTTCAAAAT AAAATGT C CAGTGTT ACAG AACTGTC CAT CTAT AAGATGCTTCGT CAG CAC CACATCT 
TTAGGATTCCTGTGTCTTCCACATGCACCTTATATCAGTG 

TGTATGTGCCTGCTTTGAACCTGAAGGAACTTGGCCTCCGCAAAATTTCAGTTG^ 
GCAGTTAATAGTGAATGTTTGGATCCrrAAGCTCTGATATTCATTG 

ATATATACACATACATATATATATATGTATATGTATATGAATGATTATATATGTATGTTTA 

GTGTTTGTT AAAATATTAAACATGCTT AAATGGG AAAGGTTACGAGTG G TTTAT AAGAAGGG AT AAAG ACAGAT AAC C CAGGTGTT 
T CTG CAAAGGCGG CAT CATT AC CGTGGTACAGG AGAGAAGT AGG AGCATGCAT CTTGGGAG AG CTG AG GTGACCTCACACAGCACC 
TGGGTCCTCTGAQTGTGTGTGCGTCCCACCCACAACAGTC 
GACTCACCAACCAAGGAACTAAGTTTTAATTTCCTTAATTTA 
TAGCCTTGGACTCAATCCATAGTTAAGGAAGACCTTGAACTTCTGACCCrrCCT 
CAGCATGCCTGACACTGTGGGCTCTGGGTGTTGTGATGCTACAGCATCCT 
" AAGG CGGTTTTAAC CAGAGTT AAAT ACATGTGTGTAG CTG CTGGGT AATGTG TTGG ACAGAAT AG CTGTGGAT AAAATT C CCAAG C 
T AC CAAGTTGT AG ATTT ATCATTT CTAAAGG7U3AGGATTGGC TT AGTTGCTTT CT TC ATTTTAG AAGTT ATGATCT 

GTGCCTTGTAGTATTAGAGTGAT7CAACAGTC 

CAGCTCTGATTGGGTGGTCAGGATCCAGGGAAGGGCZAGCATTTGATCGT 

AAAGAG C C CIACAGT CC CAGGAGCTGTTGTT ATTTT CAT CTAT AG AATG AAAT AT AG AAAATGTCAT TTTCATTATTTCTTAT ATTT 
TTTCTTGCTTTCTATTAGGGTGATTAGAATAGGTGGGTCA 

TCTCTCAGTAGGTGCCTTCCATTTTGCGTGTTTCAGTGACATCAGGGAGGAGATTTTGAG 

TGCTTGCAGATATAAAAGCATTCTTGTGAGTTCCTCACCG^ 

ACTTGGCAAGTTTGTCCTCACTGGAGAAGACAAAACAGNNNNNNNNNOT 

CATGTTTTTAGTATCAGAATCCACCCTGCTCrrTAGCAGCIAGC 

ATGAGACGTGAGAACTAAATTACCTCTTCACAGACCTGACT 

CTCAAACTGAACATTAGTGCTTTGAGGAATATGAGGGCCAGGAAGG 

TGGCTGCCAGTCrCCAGCACCTCAACGGCCi T rT T TTCCrri^ 

GAGGCTACTCTCATACAACGCTTGTGAAACACTGGCCATC 

TGCGGCAGCTGTCTTGTGGGTTGAGAGTTTAGATGGAAGGATTTGGGACATGACA 
GACGTCCCCTGGATCAGCATTCTTGAGCTGGCAGCTT^ 

TGTGAGGTGGCACGTCCAAAGAACTCCCTTCGCTCTATCGGACCAACTCCACAGTGTACGTCT 

TCACrGTAGCCCrTACTTTCCATATTAAAAACATTT 

GGCTCAGTGCGCTCTCATGACTTGCTCCCAGCTtSTGACTCTTCACCTCTG 

GCTGACAACCTGGAAAATGGCAGTCCTTTGTCCTTTGCTCGGTCAGGCAATCCT^ 

GTGAACACTTGTCAGAAACTGACCTGTGTGCTOTCCTC 

CTCTCGGGCAGTATAGAACAAGTGCAAAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 

AGAGAGGTGTGACTGTGACTGCAGGCAGGTGTCTCTGTGTGTATTGCATTCTGTGTC 

GCTGCCTGTGGCMGT'GACTGGCGTGTCTGCGCGTGCACGGGTGTT^ 

TTGGTCTCACGTCACAGTTGTCTTCAGAGGGTCCAGTTCTCCT 

TAGGTCTTC^CAGGAGATCACITTAAAGACAAAGCCGGAGTAGAGTAGCGTCTGC 

AGCTCTGTGTCTCTCAGGAGCAGTAAGTTTGCCATATATGGGGA^ 

GTGCTCCAGAAGCCCTCCCGTAGGACCX5CACGCTGCCATAGGCTGAGGAACTCA 

TCAGTCTTGGGTGTAGCTX^CTACCCCTATCCTGGATGATTGAGGCTrrGC^ 

ACTGTTGGAGCTGTTGGAAAAGGAGACTCCCAGGCCTTACCCCCAAGCGACAGAAGCAAAG 

TGTGGTGGTGTG CACCCGGAAGTCTGAG ATGCCCGCATTTC CAGCTCTGGGTGTATCCTGCIAACTACCCGGAAGCTTAACAGAGCT 
XGAGCCTCTCCATTCTTTATTAGCTCTCTGCTTAGCTAGACATACTCCAGAGTG 

AATATAATGACAAAACCTGGCAGGGGTCCTGTGGGCCAAGGCTGAGCACTGAGGAGAAGTTCACTAGGATC 
TGCTCGACACTCGTGCAGACATCCTGCGGCTTTCCCC^ 

T ATGAAAAGCAAT CGGACITTGAGAGCAAAGGAAG ACG TCACAACAAT AC^ C 

GACCTGGGCCGGGTACTAGTTTCCAGGCGCAQATCTGCTTTTGTCT 

GCGTAGCTTGCTGTGAGTCTGTACCATTTACAACAGTACAA^ 

GTTTGCTGTAGACTTTAGTTAGCITGCTCTGTGGCTCTCAGTTCCATAA 

AC^GGCTCCCCGGGCAGAGGGGTTGGTTTTGCTGACTCAGGTACrr 

TTCCAGCTCAGACCTCCTCTGTGTGTGTGTGTGTGTGTTTGGGGGGGGGTTATTTTTATTTCCTGTG 

AGCCATTTACCACAACTACTGTACATTGTTCCTTGCTCGGCA^ 

GTCACCACGGCATGCAAATGTTCCTCCTGCTTCCTCTCCCGGCAGCCTC 

ACAAAG CGCTCTTGTTATGATTGTTGGAACATACTT*GGGGGAAGTGACAGAGATTTG CCA 
GCCAGCATCAGCTAGTTTCCAGCACCTGGAAGCTGCTAGCTCAGTTGTAGCTC^ 

TTTGGGTGCCTCATGACTTAAGATTTATCT^ 

CATGGGAGGAAAGGGTTTGTTTCAGTTTACAGCTGAAATCCA 

AGCTCATGCTCAAACCAGGGGTGAAAGCTGCCTGCTGCCTTACAATACOT 

CCAGGGTGGCACCACCTACAATCGX3TTGGX3CTTT 

GGGGACTTTTTTCAATTGACATCCCTTTTCCCA 
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CTAATTAAAAAAAAAAAAAAAGTTTTCCCTGGGTATGTGra 

GAATTTCTQTOIGACTAGACAGAAATATTTTACTC^ 

GGCAGCCCGGTATTGTGGAATGTGCTTTGCATG^ 

TAGAGGAC CCTCTGTCTGTACAATTCTGTAAATAATTAATTAGGTGC CTGGG CTGTGC CCAGT AGGCATGGG C T ATAAAG AT AAGA 

AGCAGCCTCGCTGTCTAGGGGTGAGTAAACCCACTCATGCCCTGTGTGTC^ 

GAAGTTCTATGATCCCGTGAGGTTCTCTAA^ 

AGTGTTAGG ATGTGAATCTTCrri'CrGACT CAAAG CATTT ATTTG CTTTGGATGAG CTAATGACTAATGAGTCTGGGAGCGGTGAT 



CATATCCMCCGGACGGGGTCTTGCCACGGACCTCGCTGGCCAG 
1 0 TCACAGGGATGGACCACTTTGCTTTAAAAAATTAAATATTGCCGGGCGTGG 

GAGGCAGGCGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAGAGTGAGTT^ 

TCGGAAAAAAAAAAAAGAAAAAAAAATTATGTCATTTCCCCCT^ 

TCTCAAATTCATGGCCTCTGGCCTTTTTAATTGTGATAGGTC 

CACACACACGTATGATTTCAGAGCTGACCATCTGGTGTTGGATAGCTGGTTGGGG 
1 5 GTCAGTGTTGCTTGGCTGCTGCGCTCCTTGTTTAGGGTTGGAGAATCC 

CCTGTTTCTCCGGTTCTTACAGTCGTTGTGCCCCCAATCCCCTTCTAAGATGT^ 

CCAGGGCGTCACTAGGTGCCTTTGTATACAGAGTCATCCT 
2 0 GCAAGGTGGCTCAGAGAGTAAAAACCCCTTCTGCGCAGCCCCGGTG 

AGGTAGAGAACTGACTATTGAAGTTCTCTGACCGTCGCAGGGCACCGCGCCACAC 

ATGCATCTATTATGCTATGTATTCCCTCAGCAGGGGAG CCTTGTCCACCCATGCTC CCTGGAGTGGACAGGGCTCGGGTTTGCCAT 

TCTAATTCAGTCCTAAAGAGGAACCAAGTAAGTGCACACAATCACAGCTGAGTC^ 

AGGGGCCCTGCCTAGCTGATGGCTGCACCTAGCACTAGGACTTCGTGTTCGCTCTC 

2 5 TCCACCGTGTTTTTTTGTTTGTTTGTTTGT^ 

GTCCTGGAACTCACTCAGGAGACCAGGCTGGCCTCGAACTC^ 

TGCGCCACCACTGCCCGGCCC^CCGTGGTCTTATATAAGCCAGTTCGTATGTTTCCTAAGTTGTCAAAGAGG 
GCCTTGGTTCAGGTTTGTAGGATCGAAGGAGGTAATGTTTGCTTTGATGTTG 

ACG AGGATGTTGCC CGCTCATG TGATATCAT TGG AATAACT CAG C CAGGGTCAG CAAGGAGAAG GAAGG ATGCCCT AGAGTCG ATA 

3 0 AAAATCATGTCTTGCGCCAGGCATGGTGGCGCATGCCrrTTAATCCCAGCACTTC 

GGCCAGTCTGGTCTACAGAGTGAGTTCCAGGAC^CCAGGGCTACACAGAGAAACCCTGTCT 
AAAAAATCGCGTCTTGCAGTTTTGTTGACCACAGTCTTTTAGGTTTACCTTGGGTATTTCCTG 
CAGTTGCTCCCACAACCTAAACCATGTTATTTGATCCACTTCACTCT 
TTCCATCTGGTGTTTTCTTTTTTTTTCCTTGGTTC 
3 5 AGTGGATTTCAGGAGGGCTTCAAAACTGGAATCCTGTGGCCTCTCTTTTTGTTTTTGTTTATT^ 
GACAGGGTTTTTCTGTGTAGCCCTGGCTGTTCTGGAACTCACTCT 

TGCCTCCCAAGTACTGGGATTAAAGGCGTGCGCCACCACTGCCCAGCCCTGTGGACTCTGGAAGGGT 
GACAAACACACCCTAGGATATTGTAGGACACACTGGCTGGAGGAGGA 

AGAGATTGTATCCTG CT ACT ACTGG CT CAAGG AATCC TACAAT CAGGG AAG CCCCTGGGGTG CCAGAGAGGAGGGAAAAAAATGTG 
40 TCTCCTCACAGGTGATACAGTGAGAAGCATCAGGGGGGCCATG 
CAAAAGCTGGAAGTCATGGAAG<3AGTAACAGATGCGCGTGAT^ 

GCTGAGCGGGGGACCTGGCGGGCCCCAAGCCTATGTGAGCGCCGGCATCCTCACGCAGCAGGCTGCTTTTATCGCAGCATGGGGGT 
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ATTTCGGTTCGCGAAGCATTAGCATACCATTATCTCCTCAGACAACAGGGAGAGAAGGTGGGCATGCATTCCGGTCTCCGTGTGAA 



CCTGCCTGAGGACTGCCAGGAAGATATTAACAGAGGCCATAAAATTACAGCCATTACGTCTGTCTCCCATGCTCAC^ 
GACTGTCTGTGCCTTGACTTTAGGCAATCTTGAGTTTAATTTACACACAC^ 

ATTTTGAC^GGCATCTACCCCGATGTGCCCATCATCCAGGTTCAGATCAACCGCCCCGAATTCCAAAC 

TCAGTCCTTTCTCCCTAGAGAC^CCTAGCATGTGAGTACCTGTTATGGGCGACCCCTTTGGCCATTTCTTA 

GAGTCAGAGTCCCTGAAACCTGACACAGTGTCTGCCCITTAAC^ 

GAGAATGTTGGTACAAAGAGGAGAACTGAGGCTGCGGGTTTACAGGCTACAGAGTGAAGTCA 
ATATT CC ATCTCCTGG CTTCCAACAGGGTAGTTTCTTTCTGGGTTTCTCTTGTAGCTGGGTCTCCT 

CCCTGCTGCCTGAAGGCCTGACCCTAGACTCTTTGCAGGCAGAGAGGATCACACAAGTCTGGAGTCCAGTGACTGGCA.CAGGGCAG 
GACCCTGTGAAACAGAAGCCACTTGTCATTTTATTTTCCTTTGGAAAGAGTTTTT 
ATAAACACGCCGAGGGAGAAAGCCTTTTGGTGGAGTACTGCTGCCCCCGTGTGGCTACCTTAAGACA 
CCAAGACGAAAGTTTTTCAGGTTTCAGGTTCTGCGACTTCCTAGGACCCTGAGAGACAGATGTCCACTAG 

TTAAATTTCCTCCTGGAGTGGAAATGAGAGCGAATGGGGCTTCTTGTGGAAGGTATCTGTTGGGGAACGAGCTTTAGCTGTGTGTG 

TAGGCAGCTGCCTCTTTCACAACCTGTCCAGCACAAGCTGGACTGGACATAGACTTTCTCCCTTGAACTAAGTATATAGACTTGTC 

TAATTTGAGGGCAATAGAGGTCAGGGGAGGGGGACTTCTTGGGAGTGGAGCCGCAGTGCCTCCCCAGCCGGCTGCTGTGAATGGGT 

GTTACCACCAAGACCTGCAAGCCCCACCCAATGTGCTCTCCGTGGCCTTCTTTTCTGCACCTGTGGTTGGGCTGC 

CCAGCCTACCTCTTCTGGGCACGTGGCCCGGAGAGATCTTATCTCTAACTCTGTATTTTTAACTCCTCT 

TCTGCGGGAGAGCTTCAAAGGGGGACCTGTTTGCCCATGTCAGCAACCTGACGTCATGGAAGTGG 

CCTGGAGGGAAGTGGGAAACTTTTTCCATTTCCTGTTCTACTTCTGGCTATTGATTCACTAT CTGAC CAGCT TACCCATTTCAGGG 
TGCAGCTGTCTGTGTGATTCTTGGAGGAAATCCTATGCCTTGTGTGTCTGTATGAAAG 
TACCCAAAGCAGGGCTGGCGATCCTTCAGACTGCCCCCCACCCC^ 
ATTTGACAGGTCTCTCACACrCACCTCTGGGAAATATCCCTCGGGGACCCAGACT 

AT AG ACT T CCTG CT T TGT AAAC CGG AAGCCCTT CT T CC CACATG TTG CTCG AGGAACT AGGT TT CAGTGGGTT TGTGTGGGT CACT 

GGGGAAACTGAAATGGACATTAAGTGACCTTTGGCCCTGGATGCTGGTTCTCATGTAGGAC(2AGAGTOT 

AGTGT/TTCCTGGTAACTGTCCTGCAGGATTCAGTCAACAGTCIATGAGGCTAATGTGAGTC 

AGCTCTGATTATTCTAATAGTTTTTGCCAAGCTACT 

ACCACATTCCCTCTCCAGCTCAGGTCCCTCCCAGTGACTGTGTTGACOT 

TACTGCACTAGGTATGGAGGAGCTACTGGTGAGCAAGGTATT 

CTTGGTAATAAAACAATATTAGATTT'GATATAGACATGA 

AAACTAAACCCrTGAAGCCTOWSGGGGCGGCCTGAGTGGTCAGAGC^ 

ACTCCAGCTG<^CTGGACAACGTTGCAGAACACTTTCC^ 
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AG AG CTGG CCTGCAG AAAGCCCITGAGGGAGTAAGGG CATTT AT CTAGACTCTAGAGAT AC CCATTTTACAGCTTGGCCCTGGCAAG 
GAGTTTGCTGGGGTATGGGTGCCTTCTCTTTGGGAGTTGGTCCAGACT 
TCTTCAGGTTTCCTCAGTAGCCTCTTCCAGACTAAGCAGTGA^ 
ACTTGTTCAGCTCCTGTGGAGATTTAGCCT<IAG 
5 TATCAGTGGCCTCCTGTCTGCCTTGTCCAGGCCTCGCTTGAGTTGGGGCTGG 
CCCTTCAAATCCCTCCCACGTCACCTCACTCTGGTC 
CACTGGGCTAGCATGGTTCMCCGAGGCTGGGCCTCTTTC^CGTTTAG 
GCTTCTTACTCTGTCCCATTCCCAGAAGGTCCCCCTC 
GCCACCCACCCTTTCTTTGCCTGGAT TTTrTTTT^ 
1 0 AGACAAACTAATTTGCCCTAAGTTCTTAATGGTTGAT^ 



MOUSE SEQUENCE - mRNA 



15 



20 



CACTCCGCAGCEGCTGAAGTTGCTGGACGCCTATCTCCT^ 
TGGGCACCTTCCCCrTCAACTCGTTCXrrcrCTGGC^ 

ATCAACCCCCAGAACAAGGCX3GACTTCCAAGGCATCTCTCCX3GAGCGAGCCTTTGCT 

CCTTGTCGTCATGAACTTCGTTGGCTt^CTCC^ 

CTGGATCAGCATTCTTGAGCTGGCAGCTTGTTGC^CATG^ 

GGCACGTCCAGAGAACTCCCTTCGCTCTATCGGACCAACTCCAC^ 

G CCCTTACITTCCATATTAAAAACATTTCAGC CTC C 



25 



MOUSE SEQUENCE - CODING 

ATGTCGGCGTCTGTGGTGTCCGTCATCTCCCGGTTCCTGGAGGAGT^ 
CTATCTCCTTTATATACTGCTGACCXSGGGCGCTGCAGT^ 



GGCATCTCTCCGGAGCGAGCCTTTGCTGACTTCCTCTTTGCCAGCACGATCCT 



HUMAN SEQUENCE - GENOMIC 

AAGT AAAACATTCCC CTGG ACATTT CTTTCAAG CCCAGCAAGCT^ CC CCGG ACTTCCA 

3 0 GTTCTACAGGTCTGAAAAACAGGATTACTGGGTCGAGGAAAGGCCCAGTTCTGGTGAAACCAAGG 
TCTTCTCXaTAGCAACTTTTTCTTACTCCTGTCAGATTC 

CTTGAAAGAGTGGTAGCAAATCCCIAATAGCTCTAAATCTGAAACCATTGCATTAAAGATGTTGTTACG^ 

AACTAGGATCACATGCGGCTACGTGTCTCGGTGGGCGAGAGCTAAAGAAACTTGGAAGCCCGTTGTTCC^ 

ATGGAGACCTCTCCAGTCTCTCCAGATCTTCTCTTTGCC^ 

3 5 GGGCAAGTOSAGACGAGCACGCCTGGCTGCTACCTCCGCACCC^^ 

gaatctctccagaggccagaggagaatgaggggcactcgcx;aggactaaggtagaaagccaggatccccgggctgc 

GTCTCCCATGGCTCCCCAGGCITCAGCCGCTGCCTCTTCT 
GGCCGACCTGGAAAAGGAAGAGC^GGAAGGTGTCTACAACraTTCCT 

G AGAG AAAGAGAG AAAGGGACAGGATT AACT CACTG CTGCT C C AG AT CAT C GG C CAT AGT AAACAAGC CTTG CCAGGT CCTGG CGA 

4 0 GCCCGCCCAGCGCCTGACX5CCAGCACCAGCTGACCCGGGA 

CGGCGGGCCGGCGGTCGGCGGTGGGACCCTCAAAACrCTGCAACITGGTTTCAGCT 
GGCATGTGTCCCGGCAGAGATGTCTCTGACGGTCTC!AGAATGGGACAGTAATCCT 
CTCCGACAGCCCTCCCTTCTCTCATAACTAAAAAGTGGTAGAACrrACGCCTGCACGCTCGCACCCACT 
AACTGTGTCCTCCrrGAAATCGACCCTTGTTAATGTTCCTTCrAGTAAGAGGCTTAAATGTTTGTGGAGAC 

4 5 TGCTCCTAGAAAGGGCCTGGATGCCAGATTTCGAAACrAAAATAGAGGACACCTAGTTAAATGTAAATTCC^ 

AAATACTTTTTTAGAATAAGTATGGGCCATGCAGTATTTGGGACATAAGTAGGCA 
TATGACATCTGCCAAAATGGGCATATCTCTGCTCTGATGTTGACTCTGACCCCAACCTATTT^ 

CAGGTCACCCTCAGTTTGGAGGATAACAGTTGAAGGACTCACCAATAAGGCTAGGTCTCCTTTCGGTATGTAAATGCG 
GAGATAAGGCTGGCTTGGCCCAGAGGTCCTGAGTGCACTTCAGT^ 

5 0 TTGCTTGTACCGGTAAACAGGGCCTGACCTCCCTGGAACTGT^^ 

GGCGCGGTGGCTCACGCCTGTAATCCCTXSCA^ 

GGCCAACATCGTGAAACCCCATCTCTTACTAAAAATACAAAAAATTAGCC^GGTGTG 

GGGAGGCTGCGGCAGGGAGAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGTGA 

AGACAGAGTGAGACTCCATCTCAAAAACCAAAAACAAACAAGCAAAC!AA^ 

5 5 TGAGAGAATG CTGGTG CACAGGGGTTTAT CAAAG C ATTTTT CTTTAG CCTT CCTG CAGG ACCCAGATG ATT CTGAGATTT C TT CTT 

TG TAG CAACACAGTGTT AAG CTGATCATTAT AAAG ACTT AACGT TT AGGAG AAC GAG CATTT AT AG AGTTTAG GG G CTT AG GAATT 
TGCCAGATGTGGTACAAAGTAGTTCAACTAGCAGAGGACTTGTCTCrm 
GTCACCTAAGCTGGAATGCAGTGTCATCATCATGGCTCACTG 
TCTAAGTAGCTAGGACTACATGTGCCCAGCTAATTTTTTTTTTAG 
60 CTCAAACTCCTTGGCTCAAGCTATCTTTCTGCCTTGGGCTCTCAAAGTGCTGGGATTACA 

TT GAATCT T TCAACAAAGCACAGTGGTTCTAAACTTCAGTATC CAT CTG GTGGATTT ATT AG AAATTCAG ATTT CTGG 

GCTGGGCATGGTGGCTCACGCCTGTAATCCCATCACTTTGGGAGGCCGAGGCAGAT^ 
CTGGCCAACATGGTGAAACCCCATCTCTACTAAGAATACAAAAATTAGCCAGGTGTGGTG^ 
AGGAGGCTGAGGCAGGAGAATTGCTTGAAGACGGGAGGAGGAGGTTGCACT 

6 5 GACAG AG CAAGACTCC!ATCTTGGAAAAAAAAAATTCAG ATTTCTAGCAC CT AC C CT CTG AG ATT CAG ATT CAG CTGGTTTG G CTTG 

ATG CTAG AAAG C C G CATTTTT AGGAAGCAC CGCAGATACTGGTGCT C CACG AG ACT C ACTGG TTG GTTAGAAG TATAGATTCTGGA 
GTCATGAAGCCTGGTTAAGTTCAGTATTTGCTTTGCCACT^^ 

CTTCCTCTGGCTTGTATCTTGTGAGAACTAACCAAGATAATTCAAATAAACATGTGG 

ATTTTTGTTGTTG CTGTTATTCAATGCCAAGAATT AT GTTGAG CACTAAAAATACAATTATCCAG C CTGGG CATCAT AG CAAGACT 

7 0 CTATCTCTACAAAAATGAAAAAAATTAGCTGGGCATGGCAATGCACACCAAAAGTAACA^ 

TCACTTGAGTCT AGG AGT CAGAGGTT AG AGTGAG CT AT AATC CCACCACT GCACT C CAGCCT AGGTG AG AGT CAGACTCTGT CT CT 
AAAAAAAGAAAAAATAAATAAAAGCAATTATCAAAAGGAAATAAACAACCTGCTCTAGTGGTAA^ 
TACCCTAGGCTGTTCTTGGCrTTTGCAGAGCTGATAACTTTGTATTCAGAGCTCAGCTCAAACT 
GAC CACAGGAT CTAAAGAG AG CC C CCCAT CACTCTGTCACATTAC CCTATTTTTATTTTCT^ 
75 TTATCTTGCATTGTTTTTACTCATTTATTATCAGTTTCCTTCC!ACTAGAGAATAAA 
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CTGCTGTGCTCCCAACACCTAGAACAAAGTGTGGCAGAGTAGGCACT 
GGCAGAAGCATAAAG&ATTACCTATGAAACAATA 
AATTCCATACTGTATC)ACrCCCAGAOU5GACAATAAAATCT 
GGACX^CTACACATGAACTGGTATTAGATCTC^ 
5 GAAATCAGCTTTAAAGTAGAGGTTTCTTGGAAGTAGGAAACTTGAATGGATC 

TTTAAGGT ATCCC CAGTGCTGATGTATCTCAGGATCAGGATGGATCCG CTAGAATT ACACAGAGTAAAGAGGACT CTCGGAGCCTA 

CTCTAAGGCAGGCCTCTGGGGAGAAAATGTAAGTTATTTTTTTTCTC 

ACTCCAACAAGTGTGGATGGAAATTAATTTGAC7UVA 

CTTAACTGACTCCATCTCXKrrTCT AACCAGGTTGT CCTTGT^ CTGAACT AACTTTGGGAGAAACTT AGTT 

1 0 TATAGTTTGTGGTTTGTGACAGATAACAGCCCTTT^ 

AACATTAGCCATAAGATTGGAAATTATGQTTTGGGAGTC^ 
AGATCAGTGCTTGAG&TATTTGXjTGACCC^ 

TG(^GCGCCCCCCA.CCCAGGAACTCAGGGCAAGAAGACAGCTTGGAOT 
CACTGGCTTCCCCAACO^GTTGTCCrrTAAAAACTCTGATCC^ 
15 TCTCCTGCACAGCTGGCTGTGTATGAATTACTCTTTCTCCATTCT 

G CAAGGTG AACT CCTT GGG AGGTT ACAAACCCT CTTT CT ATGACT GT AAGT CATCACTGAGT ATT C CAAAAGT AGGTGGAG CTGAT 
AAACTGATAGCTCAGGCCTGGCAGGTATAGATTATTCTTGTTTCAAATGGGTATCA 

T ATCTCTTTG ACTGAACATAG AAATG AGTGG AATAG C TCATAAGTTGTTCTG TC CAATGACAG C CACTAG C CACAAGTGGCTAdTT 
AGATTTAAATAAAAGCCAAATAAAATTTCCT 
2 0 TAAAAAATCT CTCTTCTTTTCCCC CTT CTC CTCACTT CCT C CTTAGCTcJT'I~rAGAAATGCAATTGTAGC CTTTTACCTCCTCTTTA 

TCAGACACTCCCTATGGGGCAGGTTCATCTAACTATGTGCTTAGAAGCTCC^ 

AGAGACAACAGC CTATCTACAACC CAAAGTATGC CTG CTACAAAG CTTTCTCTT ACCTG G AGAGTTTTGG CCACTTTT ACAACCCA 

GTCCTGCCCACAAAAGGCACCAGCAGTCACTGTCCAGTAGATAAGGCA^ 

TCCCCTCGTGCCATTCATGGCAGCATTCGATGCCCCATTTCAAAAGC^ 

2 5 GCAGAAAACCTGTACTTCCTCCCCr AAGGTAGCTT^ 

CTCTGCAAG CAG CGAG AGAGTGAAC CTGCATTTGGGTTACAACACT AACT AGTTTAAGTG CTCAATAGCCACATATGT CT AG CAG T 
TACTATACTGGGCAGTGCAGAGACATTATCACAGAAAATTCT 

GTGGTGAGTGATCTGCTCTTGGAAAGGACAATATAGTGGACTCACTTTCGGGTGCC^ 

G ACT CCCAAACAGCT AGGAGG GAGGGAAAC CAGACAAAGGAGCTGAAAAGTTGTT CCACTTCATCAGAAACTGAAAG GATG AAGGG 

3 0 ATAT GGGTT CACIGAAGATATTTACTG CTTT ACTAGG ACTTG AT CAGAAACT TGAT ACTTT AAAAATTTGACTTTGTGCTGTTTG C 

AACTTGATTCTCCTTAAATTTTTATTTCTGAAAATAATA^ 
CACACTGGGGCCAAAGTTTAGAAGGTAGAGA 

GACAATGCAAGGGAAAGGAAATGGAAAAGACATTCTTTTCCATGCGGAGATTTCCT 

GAGGATTTGTGTTTC TTTTGG TAAATAATGTG ATTT CTTTTTGTAG AGGATGTATG ATGT AAAATC!ATGGACTTTGCC CTT AAGG G 

3 5 AACCCTCTTGCTCCTCCACTCTTCCATTGTTTTATGTAATGAAACCCCTAAAGTATACTCTGACT 

GATCTGCCCTTTAAGAGCCATTTGAGTTGCAGTCCCITCCAGTGCT 

CT AACTTTTAAT CATT CAG AGAACT AGG AG AAG CT^G ACATGGTTAATT ACT AGGATT AATG AT AGTT7AT AAGAG CAAACATTT AC 
TGAACATTT ACTTCTCTG CCAAACACT ATGCT AACAG C TT AATG CTTAACATTAT CTCATTT AATC CT C ACAT AATTTCAATGAG CC 
AGGTATTGTTATTTCTGTTTTATAGAAAAGGAAAATGAGACTGTCAGACrrGAGT 

4 0 CAGAATCAAGGTTCAAACCTAGGACTGTCTGATTCTAGAGCTTGCACTCT 

AAT ATTCTG ATTTAAACT AATTGAAATCAACAC TAG T GTT AGCAATTTCCTATTG ACAGAT CAGT TAAAAATG TTTCAATGATT AT 

AGAATCATAAACCGAATAGAGAAGTAGGCCTTTTCTTGCTGTTTTAAGTAGTTCAGAAAAGCC 

TAGCAGGTCTCAGAAACAAATGCATTCAATAAGTCCATTTGG 

T CTT AATGATGATAGTT GACAT GGAT CAAAGG C AG ACAAAACAT AAGGGAG AAAACACTG^ AAGGAG CCTGGCTG AT^ 

4 5 ACTCCCTGAAGAATCCACACCTGAATGATTAAAAAGCATCAGCCTAGCCTGTAAAGTTTGAAGTTGA^ 

CCTC CT ACT TGTTGATGAGG CAG AG AGAAAGGTTCAACTTGTCAGAG AAAT TG ATG CTTC C TG CCAT ATTATATTGAAATAAGAAC 

CAGCTTGTTATCCACTCTTTTTTTTTCTTTCTCAGCAACk 

GGGTATGAATATAAATCTATCTATTCAAATAGACTCTAAAGGTTTCT 

TGATCACCACTGATAATACATCCGTTAGTTAATGCGTGTAGAGTTTCCAGTCTAAAGGCAAATAAAAAGTGAAA^ 

5 0 AGTGGCTCACTCCn'ATAATCCCAGCACTTTGGGAGG CTGAAGCAGGCAG ATCACCTGAGGT CAGGAGTTCAAG ATCAGCTTGGCCA 

ACATGGCG^AAACCCCGTTTCTACTAAAAATACAAAAATTAGCCGGGCGTGGTGGCAGGT 

TG AGGCAGGAGAATGGCTTG AACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCGC^ CACTGCACTCCAGC CTGGGTGAC 

AAGAGCAAAACITCATC!TCAAAAAAAAAAAAAAAAAAAGTGAA 

CTCACGTACAG TT CAAAATATT AAG AAG C TG AAAATAAAGTAC CTGGT CATGTGTGGAA7VAACTTACAGCTAAAAGGTGAGATG 

5 5 CTGGGTCTAGAAGGTCGGATTCACGTGAAATTAAAAGGGCAACGAA^ 

GGTGCAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCTGGC^ 
CTAACAATGGTGAAACTCCGTCTCTACTGAAAATACAAAAATTAGCCGGG 

AGGCGGAGGTTGCAGTGGG CCGAGATCGCG ACACTGCACT CCAGCCTGGCCGACAGAGCAAGGCTCCGTCTCAAAAAAAAAAAAAA 
AAAACAGTTAAAACCCTCAAAATAATCCCACTTGTTAAAATACAAAACCT 

6 0 GAGAGTAGGGTTGAACCAGGCATGCTCTTTCGGGCCTTTCGACGCTGACATCCCGGGATATTACACCAGGTTCA^ 

GCGGAAAGAGAGCTGCTAAACTTGACCATATGCTTTTTGCACGACATTC^ 

GCAAATATATCTGATTC^CGTGAGAGTCCCTACAAGTTAGTACAATC 

AGGGCCAGATCCCX3AGTTCCAGTTCAGAACCCT 

CCCCAAGTAATTAAGTCTAAACTGGGGCCTGTGCAGTGATTGGCCGCGCCGGGTCTCACTGCGGATGAACTTGACTTG 
65 ATCTGTGACAGACATCTGCAGACAGCGCCCTTAACAGAATT^ 

AAAGTAGATGCAAACCGTTAAATCTTCCAGGACTCAGAAGGACCAGAGAGAGC 

AAAGCTTATTCCCCTCTTCCTCCCAGTGTGTCTTGTTGCCAGAGCAACTGTGGGCGTCACCGAGCTGACG^ 
AAAG AGTACACGTC CCCAG C TCATTG ATAGGG CCATTGCGCAGACGCATTAACTTTTCTGAGT^ 
TCCGCGTCTGCGCATTAGTTGTTACGCGCCCTGGCCGGCCrC^GCGCCTG 

7 0 ACGAGGGTGCGCATGCGCAAACAGCAC^TCCGGTGTGGTCGACGGGTCCTCCAAGAGTTT 

GCAGITATGTCGGCGTCGGTAGTGTCTGTCATTTCGCGGTTCTTAGAAGAGTACTTGAGCTCCACTCCGCAG 
GGACGCGTACCTGCTGTATATACTGCTGACCGGGGCGCTGCAGTC 

TTCTCTCGGGCTTCATCTCTTGTGTGGGGAGTTTCATCCTAGCGGGTAATGGTTCTATAAGTG 
TGGTGTGCTGCTTTTGTTTTCATAGTACTCTATATCAGACCACCGCCCCTTGAAGTTATGAG 
7 5 GTGAATGGGGAAATTGAGGCTCAGAAAGCTTAGATTATATGGCTAATACCAAACGGCT 
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GAAACCAC CTT CGTGC CG CTACTCOVCTTGGTGATCCACTT CACTT AATATTG G TC CX3GGGAAAACCCTATTCCAA.CTTTAACTT A 
GGTTTAACACAACTCT AAAATTAGTAG ACTTTTCAAATG AAAAAC C CAACCTTG AATCATCT CTGTCATCTTGG C CTAATCATT AC 
CAATCT C CGGTGAAACGTTTCA ATGA AAG TGCAAAGAAT ATAAAT ACTGGTG C lT 'r r L Tl 'GTTGAA.T CATCCTTT ACTTGATTTTG 
CCTTTTAG CTG CT ACTACTTTTTTTCTTGTTGAGGAAGCTGTTATTT CACTGTCCTGAACCCAATTTTTGTTCACATAACT CTAT T 
TTAGCCTTATTTTTCCTTTTGATTCCAGGATTTC 

GACTTTGAAAACATGCCAGCATTCATGGTTTCTTTTC^ 

GCATTATGG TAAAAG CTTTGGCACATGTTCTTT ATCCGTAAAGAGTTTATGGT CTAGTTGGGGTACAGACAAAATACAG CATTGGA 
ATTGTCTGGAGG CCTGGCACAGTGG CTCATG C CTGTAATCCCAGT ACTTTGGGAGGCCGAGGTAGGCAGATCA.CTT 
GTTCAAGACCAGCCTGGCC^CATGGTGAAACCCCATCTCTACTGAAAATACATAAA 
ATCCCAGCTACTCGGGAGGCTGAGGCAGAATCGCTTO 
CAGCTITGGGTGAGAGTGAGGCTCAGCCrCAAAAAAAAAAAAAAAAAGTC^ 

T AGWVTGCTTATTGT ACATTTACATTGTTC TGACACAC CAGAG ATATT TCTT AG AATTT AAAACAGG TCTGAG GATCTAATGACT T 
GTGTACTGAAGTTG CAGATTAGCCT TTTGATTGGATATTTAGTT ACCATCTTT 
CTGTGGTGGCGTCCTTGCCTTCAGGATGTGTTAACCATCATCCGAC 

CACCCACCGC C CTCTG CCACGGTACTATATGTTTTTATTAGC CAG C CCGGAGAAG G AACCTTGATGAGAACTTTTGATTTAATTTT 
AGAAGTCACCCTGCCTTCATAAGCCTCTAGGGTCTA^ 

CATGGACT AGAGTTATAGATTAT CAGTGT AATAAAG CCAGAAGGTTTTT TTGAG CAAATGACTTTCACTTATG CCAAGACCTGTTT 



AAAACCTAGTAAAAGGGCATATTAGTAATGGGGCCAATAAGGGGGATTAGGA x 
AT AGAAATG AAAGCAACTTCAAGAAAAGAAGGGATTCATGAT AATT CTTAGTTC T AC C CTT CTTTTGA 
CAGCATTGACITTCCTAAATCCTGCCrCTTCTGCC^ 
ACGCTTATCCTTTGTTTTGATTGTTGGCAAAGGTAGCCTTGTTCCTTT^ 

CTGTTGCAAG AGAATGG AGGAG TCCAGTGAG CTGGGAGAGGGAAATCTAGGCGAT ACTGTTG AAGGATTAG^ TGCTGGG TGCT 
AGATGTTTGAATGCCAGGCCAAGGAATTTGGACTTTTT^ 

TCTCX3GTTCACTGCAACCTCTCTCITCCCAGGTTTAAG CAATTCTCCTGCCTCAGCCTCCGAAGTAG CTGGGATTACAGGCATGTGC 
CACCACGCCCGGCGAATTTTGTATTTTTAGTGGAGACGGAGTTTCTCCATGTTGGTCAGGCT^ 
GATCCGCCTGCCTCTGCCTCCCAAAGTGCn^GATTACAGGTGTG 
GTAATAGTAAAGCTTCTGAAGATGCTTTTTTTTTTTTTTTTGAGACGG^ 

GGCTCACTGCAAGCTCCGCCTCCTGCGTTCACGCCATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGG^ 
ACGTCCAGCTAAGTTTTTGTATTTTTAGTGGAGAC^ 

CCCCCGCCTCGGCCTCCTAAA.GTGTTGGGATTACAGGCGTGAGCCACCACACCCGGCAC 

AGATTTAAAATTGGTGTTAGCATGTAGGATGAATTGGAGGGAA 

TTCTATTCCCTCTGCATGTGTTAGTTTAGCATATGCTCTTTGGAAATTGACC^ 

CCTTTAAAAAACCTGTTGTTATTGGGCCTGATGGTATGAAAATAGTCTGT^ 

TGGTTTCAAGAAGGAACCAGAGGTAGAAAGTTGTCATTGAGAGATCTTTTTT^ 

TGGCTTTAAAAGGACCTCAGAGGTAGAATATTTTTCTATTTGAGACAGAT^ 

TAGATATCTCTAGAAGAG^ACATCTGTCCCTTTTTTAGAGTTATATT 

TGAAAGCTGTGCTCATTTAAAGTCCTATTTAGGAGG^ 

AACAAGTTTTAATTCATTTCCCAACTTTCATCTGTTCTGTTCTCTAATCTATATGAGGTA^ 

TTTCCTAGTTTACAAATTCAAAACCAGTTTGAATCTGTAAGGCAA 

GGCTGTTGTGGACTGGGGTCCAGCATTTCTTTTGGAATTATCC^ 

CGAGAGvATTAGAAGTTTG C CACCCATG TG C CTGAT AGT CTTGT TT CT AAATAT CTTT CATATC ATTGCT ACTTTTT CATCCTCTGT 



TGTTTGCAGGCTTGATTTGACTTGCAAATCTGATTGTGTCACT 

CAACAGTATTGAGCTACTTGTCATTCTCTGAACACrCT 
ACTTTTTAGACTTCTCAGCTCAAGTGCAATTTCCT^ 
GGCTGGCCTCAAACTCCTTGGCTCAAGCAGTCCTGCCT 
TTATTGCCCTTTCGAACCCTTCCCTGACrCTCCAGCAGATTGAGG 

TGAGAGAGATATGGCATTGTGCTGTATTTGGGTGTCTGCCTTCTCTCTGGAAGGTGAGCCCTTG 
CTCTCCATCCTCAGTGCTGAACACTGTATATGTTAAATGGTGAGCAGGCC^ 

AGGTTCTCTGCCAAGGTGCTGGGAATTCAACTGTGAAGC!ACACAGACGTGGCTTCTACTCTCACAGAGTT 

TACAGACAAGTAAATATGGATATGGTGCTGATGGAGAA^ 

TCCTGCAATAATATTTCTCTTATACATTCGCCTTCCT 

AGTTGGTGATCTTGCTTTTATTTTTTCACTAAAAAAAAGAAGTAGATAAGAATT 

TCACCTACTTTTATACCCATACTCTCTGACTTCT 

CCAGTCTITCCCTTGCTCACTCAGGGACTCCGCTTCTGCAATTATTCCT 

ATTCCCATCATACAAACATGTTCTAATATTGTCTATTTAAAAA^ 

TCCTTTCCATCATTACAGAATTTCTTGAAAGAATAATTAATGT^ 

GTATTGGTCTGTCCTCGCTACTTCACTGAATTACTTCACTGAAACT 

TTGGCTTCTGTGTAAAAGGAAAGTTTTTTCTTGAATACAACACTCTCCTGCT^ 

CTGTTTTGTGCTGGGCGCTTCTCTGACTGAGAGCATGGTTGTAGATA 

TAAC^CCTGTATACCTACAACTACTCCTTTATATTTTTAGTTTTGAACTGTCCCCCGAATT CCAAATTAGTAAATCCAGTTG CCAA 

CTTGATGTCTTCACTTGATGTCrniATATGTACCTTGAACT 

ACCAGTTAGTACGTGGTGTCAATGTTGGTTGTGGTTTTTTTTTTTTTTTTT^ 

AGTGCAGTGGCACAATCTCGGCTCACTGCAACCT^ 

ATTACAGGCACGCTCCCCCATGCCCGGCTAATTTTTGTAT^ 

ACTCCTGACCTCAGGTGATTCACCCACCTCX3GCCTCCCAA^ 

TTTTAAATGAAGAAATTTAAGTTTCCGGGGTCCCTTGATTTCTCTTATCTCCTGCATCTC 

AAAATATAGCCTGTATGTCTTTATTTCTGTCCATTTTTACTACCATTC 

TTTTTATTTTTTATATTTTTGAGACGGAGTCTCGCTCTGTT^ 

CCGCCTCCCGGGTTCACGCCATTCTCCTGCCTC^GCCTCC^^ 

TTTTTTTTTTTTTGTATTTTTTAGTGGAGATGGGTCTC^ 
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GCCTTGGCTTCCCAAAGTGCTGGGATTACAGG 
CAGTCTC^CTGTGTCGCCAGGCTGGAGTGCAGTGGCGC^ 
TGCCTCAGCCTCCCTAGTAGCTGGGATTCC^GGAGCCTGCC^ 
GGTTTCACCTTCTTGUICCAGGCTGGTCTTGAATTCCTGA 
5 CGTGAGCCACTGCGCCCXMCCTTCTGCCTCTTTTTAACCTTC 
TGTTTTCTGAGGCTTACCATGCCCCTC^^ 
TTTTTTTTTTTTTTTTTTTTO 

GCTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCT 
1 0 GCCTCGACCTCCCAAAGTGCTGGGATTACGGGTGTGAGCCACCATGCOT 
TCTTACTGTCTAGAATGCTCTGCCCCCAGATTTTTACATGG 
AAAGAGACCITCTCTTATGATTGAATTTAAGAAAAG 

CT CTACCT AGT AGTTCCTGGTTTATTTGTT AACTTATTGATT^ CTTG AGAGCCAGAC 
TGTTTTGTTCTGTGCCATATTCCTAGCACCTGAAATGATGCTTC 
1 5 ACATT TAAGG<X5AT AGAGT AAATCAGAATTGCT AAAATGGAACAGAC TG CTTG AAGAGGTAGT AAATTTC CAT CTTTGGAAGTG C C 

CAGGGAGAGACTGAAGGATCATTGGTAGTGGAGC 

ATGTG ATATCCT T AT CAGT AACACAGG CTCTGG AAAGTTGATAATTCTCCAT CTGGAGGTGG CTTACCAG ATT CTGT AAGTGGTGA 
GTATACTTTGGAAAGCTTCCCACGTGATTAGTACGCCTCCCCAAGCATGGCCACCCCCTA 

AATGGGGTGTTGGTG TTT ATGACCTCTG AGGT Cu\L-ri C CAACTTTGTGGGACTCTGATT CATATGTTTTCATTGAAATAATCTGGT 
2 0 TACCATGAT G&CAATAAT AAT AACATCAATTT ATTAC CTTAT C CACCirCATGGATAA'ril"lTCiT"riTTTCCCTCATATTTCCTC 

AAAACTCAACACCGTGTTCTTTACTCCAATGACCACTATTAATAGTTTTCT 

TGACAAGATAAT TATGTAT AT CATTAT ATGTAAAT AAAATAT TTT ACTTT TTTTTTGAGACGGTCTTG ATTTGT CT C C CAGG CCAG 
AGTGCGGTGGCATGACTGTGGCTCACTGCAGCCT 

AGCTGGGGCCATAGGTGTGTGCTACCACACCTGGCTAATTAAAACATTTTTO 

2 5 TGGTCTCAAGCTCTTGGGCTCAAGCGATCCrrCCCAC^ 

CCAAGTGAAATTAGGAATGGGCTGGGTGTGGGGGCTCATGTCTGTAATCCCTACCC^ 

GCAACATAG<3GAGACCTTGTCTTTACCAAAAACA^ 

AGGGGGCTGAGGTGGGAGGATTGTTTGAGCCCAGGAGGTCGAGGCTC 

G ATGGAGTGAG ACCCTGT CTCAAAAAAAAAAAAAAAAAAAGGATTGG Cr rrri~l T CACTTAATAAGTCTTAGAT ATGT TTG CTGTC 

3 0 AATGTACAGAG AATTTAC CTTATTCTTTTGAAT AATTG CATG GGATTT CATTG CATGT CAGTTGT AT ACT TT AAT CCCCTTGATGT 

TTAGACTGTTTT7U\TTTTTTTTTCAACTTTACAAACATAOT 
TAAAATCATTTTAAATATTAATGGATATTGCTAGTTGCTCT^ 

ACTGGTTACCCrTGCTTCAGTCATTGTCTTTAATATTTTTAAGGGTCTGGATACCTGAA^ 

ACATGTTTAAGCATAATG AAAACTGTTGCCATGGTG ACAGG C TCTGGAG AACTAGTCTG AT AGTTTT CTGG C CTT TGGATT AGATT 

3 5 TGTTTGTTGCTTT'IT CTTAGTTAATG ATGGGATGATG CTAGCCTGAAG CGACTTTAGGTTTTCTTAAGCAATGTAAT AAT T TATCA 

CAGG CAT AAAGATACTT CGGAAGAGTG TATTCCTGGAACTG C CTGTTTTriTTTTTTTGTCAATG GTG ACTGTT AGAAT CTTCAAG 
GGGATGGAGTGTGGCAGTGGAGAGTTCTCATGAATTATCTCTCTTAGGTCITOTGCCTT^ 

ACTTCGAGAT CAAAGATAGTTT ATCGAG AACTG TGAG GTTCT CAAGAATGAGCGTGCTATCTTTATAAACTCTGCTACCCATTAAA 
TGAATATTOTATTAGAAAATAATTCACATTTTTATAACTTTTTATCAT 

4 0 CATCTAGGTATAATTAGAAACCTTAAATCAGCAGATAAGAGAGCrrTGTGGGTT 

CCTCTAGGTTTCAGTGTAACTCTGAGAAGTAATATTTTTTT^ 

TTAAATGT AC CTGTCTAAC CACT AT CCAG AT CAAGATG CAGGACATT ACCAGCAC CATAGATG CCTC CTTG CAT CTTC C CAGTTAT 
CCTCTCCCTGTGAAAGATAAAAACCACTATCTTGCCATA^ 
CATATACTTTTTTGTTTTGTTTTGTTTTGAGACGGAGTCTCT 
45 ACCTCCACCTCCTGAGTTTAAGCAAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGT 
AATTTCTCTAGTTTTTATTAGAGATGGGGTTTCCCCATGTTGGCCAGGCTGGTCTTGAACTCCT 
TTGGCCTCCCAAAGTGCGGGGATTACAGGAGTGAGCGA£^ 
AGTTTCGCTCTTGTTGCCCATGCTAGAGTGCAATGGCACGATCTT^ 
TTGCCTCAGCCTGCCAAGCAACTGGGATCACAGGCATGCGCCACC^TGCC^ 

5 0 TCCA.TGTTGGTCAGGCTrGGTCTTGAACTCCCGACCTCAGGTGATCCGCCTGCCTCGGCC 

AG CCACCGTG C CCGGC CTTTTTTTTGTTTG ri"i I"1'GGTCAAG ATAAAACATG ACAAAATTT ACCAT TTTAAC CATUTTT AAGTG T A 

AATTCAGTTCCCATTAAACAGTAACTCCTCATCCTCCTCTCTGTCAGT^ . 
TTGACTACTCTrrGATGTCATATAAGTGA 

5 5 GATTCAGCCATGTTGTAGCAGGTATCAGAATTTCCTTCCTTTTAA 

TCCGTTCTTCTGCTGACACTTGGGCTGCTCT 

TTGAGAGCCTGGTTCCATCCCCCACCOTCCCAACCrCCCAGATGGAGTTTCTCT 

TGTCGGCTCACHX3CAGCCTCCACCTCCTGGGTTCAAGCGATTCTTCTO 

CACCACGCCCAGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTT 

6 0 TGATCCGCCTGGTTCAGCCTCCCAAAGTGCTGGG ATTG CAGGTGTGAACCACTGTGCCTGG C CX3AGATCCTG CTTTCACTTCTTTT 

GGATATGTACCCAGAAGTGGAATTGCTGGATTATATGATATGGCAATTCTGTTTTTAATTTTTTTGAGGAAAC 
TCAC^CTGGCTACCCTATTTGATTTTCCCACTTGTTAGTGCACAAGGGTTTCA 
TGATACTAGACATCCTAATAGATGTGAGGTGATATCTCGTTGTAATTTTGATTTGTATTGCCC 
CTTTTCATATGCTTGTTGGCCATTCCTATATCITCTTTC 
65 ATTTTGTTGTTAATTGCATTGTAGGX^TACTTTAATATATTCTGA 
CCCATTCCXTTAGX3TGCCTTTTCACTGTTGTG 

CTATTTTTACTTTGTTGACTGTACTTTTGGTGTCATATCCAATAAATCATTGCT 

TCTT CTAAG AGTTTTTT AGTT TTTGGTCl'TTAT C CATTTCGG AT TT AGGTCTTTG ATC CGT TTCGAGTTAA'lTTTTGT AGT ATATG 
GTATAAGGTAAGGGTCCAGCTTCATTCTTTTGCCTAT 

7 0 TGAGTGGTTTTGTCAGTTTGGTTGAAAATCATTTGATTTTGTA^ 

TTGATTCATTTTTT AACTGTAGT AAATG TATTTTT AAAATAAACTTAT AATG G AAAACCTG AAT CATTATCCCTAAAT AGAAGGT C 
ACACTAAAAATACCATG^1A.GCTAGGCAGATGTGTTCACCCGTGCCTGTG 

ATTCCTCATCTAATTGAGGGGGTACATTTGTTCCCCCAGCATATACACTTAGGCTCT 
7 5 TTTTTTGTCTTTG^CTTCCTCTGGGTACCTGCCAGGTTCCAGCrGG 
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ACGGTGGCTCATGCCTGT AAT CC CAGCALTTIX^GGAAGC CGAGGCAGGTG GATCACAAGGTCAGG AGTTCGAGACCAG C CTG GT CA 
GCATGGTGAAACCCOnCTCTACTAAAAATA^ 
CTGAGGCAGGAGAATTGCITGAACCCAGCJ^ 
GCGAGACTCTGTCCCCGACTCCCCAAAAAATAAAGTTTTTG^ 
5 TAG CAGAAGTTCr mi' CATTGCAGTATAATATTCTATTGTTTGTATATGCTATAATTTAT C CATTTTACTGTTGATAGACATTAG 

ATTGTTTCCAGTTTGGGGCTATCTGGGTAT CATTTCTGTGT ATG CCC TTTGATGT AT ATATG CATT CATTT CTGTT AGAGTAGAAT 
TGTTGGGTCCTCGGGTATGTCTGTGTTCATTAGCATATTTATTTTA^ 
CAGTTTGCCTGAGAATACAGATCAACCCACAGAACAA^^ 
TTTGCCAGCACCATCCTGCACCTTGTTCT 
1 0 AAGAATGTAAGTGATGATCATAGTCCTTGTCCTATAATGTTC 

GGCCAGGCTCCCATACTGCT ATTG ACATTTGCAGAA CAAAGT ATGCATTAATAT CATCATAATT CATCAG T 
GCTACTGCTCTTGGTCATTGCCTTGTCTTTCTGlTCrT^ 

TTTCTGTAC CAAT AACT C CACGAAAATAACT CX^^GATTTTGGAAGAG ACAT AAATG GTAGGTTTTGCCTTT AG ATTTGTTAGTTA 
ATTTTTTTCTGATTTTGATCTCAGATGACTT 
1 5 TTGAATGAATGACTG TGGG CATTTGCTAATATGTG CCTTAAAGT AGAAAT AAATGGGGAAATTGGGTT AACT^ T 

AAAATATAAGATGGGATGCTGCTTGGTrCATTTGCCATTTTCACTTTTTTAGC^ 

TCTT ACAAAAAGAATCTTGGTATATAAAG CAGAAGTAGAATTGGAGTTTTTTTTGATGGACAG AAATTGAAAAG C CCTCT CTGCTT 
GGCCTTCATCACAT ATGGTACGCC CTCCC CTAAATCG CTCCCT CCCAAG CAGAGTTCTAAAG CTTT AATAAATATATGGCTGTTTT 



2 0 AGTGTTTGCAGTGAAGTGCCCTAGTCTAGCGGTGTTCTTGGCCCCGTACT 

TGCAATCTTTGCCTTTGCATTGCAAAATGTAATGTTCTGGGTGGTAGAGAAAAGAG 

AAGGATAT CCTAGGACCAAC CTCTTCCTG TCAC CACTTGGAAAGCTGTCATTTTTATACT AACCAGCAGAAAT CTAGTATAAG AG T 
TAGAG CAT CTATGTGCAAAGTGGATGCAAT AATGTGGAAGTGACT ATACTTCCTTTTCTG ACATG AA TCATTTACTT AT AAT 
ATGTTACCTAAATCCTTTCCTTTGATACTGCCTCAAAAACAGTTCre 

2 5 ACAGTGAGGTTTATTTCAGTGAAAAACrrrrc 

ACCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCTCACTGCAGACTCCGCCTCCCGAGTTCATGCCATTCTCCTGCCT 
GAGTAGCTGGGACTACAGGCGCCCGCCACCACGCCCGGCTAATTTTTTGT^ 
GATCTCGATCTCCTGACCTGGTGATCCX3CCCGCCTTGGCCTCCCAACAGTGAAAAACT 
AATACAATTTAACTCATTAGACACCTGATTAAGACTTC 

3 0 TGTCTTGTTGTTGGGAATATAAACAATATTGTTGGCTGGGCGTGGTGGTTC^^ 

GGGCGATTGCTTGAGCTCAAGAATTTGAGACCAGCATGGGCAAC^ 

CTATAGGTTTGAACAAAGTGCTGCAAAAATCCAGAGAAG 

AG CAATGG AT AACAGCTATTGCTAGTTAAAATGAACTGTTACCAGTGGATAAC CTTG AATG CAG GT AGAAAACTG AC CCACCAACA 

3 5 TTGACCTTCTCACGCTTGCAAATATGAGTAACCCTTGGGCTTCCTGATAGAGCCTTCTGGAAAGGTCA 

T CTAG ATTTCTGAAGTTCT TTTT TT CTAGATGAGTCATGGTGTGT ACTT AGG AAGGGGACACAG CAAACATTT CAAGG ATGTTTTC 
TAGGCTCAGGGAGATACCAAATCATGGAAAATTATTTCTTAGGGCTGTCATGATTGGAGGAGGACTCT 

TTT CAAAATTATGACAAAAG CAAAG ACAAAAAAGCAGT CTCAGCAGAAACTGTGTACTG CAG AC CAT ATTAACAAC CTAGG ACCAC 
CTGCTCAGCTGGGAATTATGGGCCCAAAGCATAGCAAAATCCT 

4 0 TCATAGAAAAATCTTTCTCTCTGGGGGATTGCCCAGATTGTTTTTC^^ 

CrCAGTTTTTAAAAATATACATTTATTTGCTCCTTTTATGTGCTTOT 

AT AGTTC CAAGTAAATTC TCAATATTTAT ACCAG CTCAG AAGGTAGAACAAAA 

TGAGAGTTTGATATTAAAAAAAAAATTCTGTAATATAGCTGATCrrACA 

G AG GCAGG TCAAAAGAAAATGGCATTGGGTT AAAAAAATAAACAATT CAAGGGGAAGT AAG GAAGTTAACAGAT AATGAGTG CT CA 

4 5 GTGTGTGC T ATG uT'rr AAGTACTT CTTGTTCTATAT CATTTT ATT TTCAG AATAATTTCACG AGGAT AGG CTTG CTGTT CT CATTT 

ACAGATGAGGAACTGAAGTTGAGAGAGCCTGGCTAGGATGGCTGAGCACGGTGGCTGATGCCTGTAAT 
CGAGGTGGGTGGATCACCTGAGGTC^GGAGTTCGAGACCAGCCTGGCCAACATGGTCAAACCT 
TTAGCTGGGCGTGGTGGCCGGTGCCTGTAATCCC^VGCTACTCGGGAG 
TTGCAGTGAGCCCGAATTGCACCATTGCACTCC^GCCTGGCA 
50 GAGCCCCTTGCTAGAAATCTTACAACTAATGTATCTATCCAGGTGATCAAGAAAGACTCAATATTTTTTAT^ 

AATGAAAATACAAAAGCAAAAATTCTCCCAGTGAGCTTCTGACCC CCAGAAAT CCATGCACACCTGAAATGATCAGTTTTTGGTGT 

GTAGGTGGCGGCATATTTTTGAATGAATCCATGACAGAAAGAAGCTCAGTGGATTTCA 

TTTCCATGTTTGTTCTTCTGTGTTTCTTTGTGAAATTCTTTTTTTTTTTTTTTTT 

GAGTGCAGTGGTGCAGTCTTX3GCTCACTG CAACCCTGCCJTCCCGGGTTCAAGCAATTCCTGCCGCAG CCTCCCCGAGTAGCTGGGA 

5 5 CTACAGGCACITGCCCGGCTAGTTTTTGTATTTTTAGTAG 

CTCGTGATCCACTCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTCCACCT 
AAAAAAAAAAAAAATCTCIAGTCCATCTCAGGTGTTCTGAAGGGTA 
AGAATTAAAGTTCCTGATGAATTCGGCCCTGGGTCAGAAGGAAGGGTGAACTC 
GCTCAGCTCCCAGGGAGATAGTTGCCAAATGCCTGTGCCTTGTTAGCATTGTTTTGTTrTTATA 

6 0 TCTTAAAAAATTCAGTGTTTGATCCTAGCTGGAACAAGGGGGCTTACAGGGTTG 

GTTTACAAACATGAAATTATGTTCGCTTTTCGAATTCAGTTAAAGTATAGTAATCTAGAAAATTACCT 
GTTATTCTGTGGGCTTTGCTGCCACCTAGTGGATGGTATGTTTCATGTGTTGAGCGGCTACTACAAAA 

GGGG AAG AAG CTACTG ATAC CTGTGTTAAAAAAC CT AGTTT CTTTC CAGTCATTGGGGGGCAG CAG AG G TC CAGG TGGGT TTAGAT 
GCTGGGTTGGGGGTTGCTTTACATTCTGGCCAGAAGTGGGCAATGTCGAOT 

6 5 TAGATTTTGTTGTGTGGGGGAAGACCAAGACTGTGCACATA 

TTGAGAGCTTCTT'rrriTGGTTTGTTCTACGTCrTTTATTCTCAATGGTG 
GAGGGGCCCATTTAGTTTTTATTTCATTTTCTCTTTGATATT^ 

CAGAGGGAACTGTCCTACATTTGTAGGAAATCTGAGGTAGTCACATCATAACTTTTTTCAACTC^T 

AG ATT AAT T AT ATA'rri'TlTTATGTGCTGGGGTCTCG CTATGTTGC CCAAG CT AGGAACTC CCTGGGGT CAAGTGATC CTC CTG CC 

7 0 TGAGGCTTCTGTGTAGCTGGACCTACAGGTGTGTCACAGTGCCTGGC 

AATCT ACTTATT TT ATTTGCT AAAAAGTG CTAATTGATGACTTT ACT 

CTACCAAAGAAAAAAGAATATCGAAATAGATTTATATTTA 

CCCAGCTTGGATGACTCCTATTCCAAGTCATTCCCACCCCTCA^ 

7 5 TTGAATGAGTGTATGAATGAATAAACCTTTTTTTTTTTTTTGAGACGGAGTCTCGCT 
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GATCTTGGCT CACTATAAC CTCCGCCTCCTGGGTTTAAG CAGTTCT CCTGCCT CAG CCTCTTG AGTAGCTGGGACTACAAGTGTGT 
GCTC CCACG CCTGGCTAAl M rri"l'G 1' ATTTTTTUjGAGAGACATG GTTTCACCATG TTGGCCAGTATGGTCT CTATCXTTTGACTTTG 
TGATCTGCCTACCTTAGCCTCCCAAAGTGCTGGAA^ 

GAAAAAGTTATGTTATTAAGTTACTTGGATGCTAAAACATGGAAAGTGGTACGAAA^ 
5 GAGGCTGTGGGGTGGGGAAGAGCATGAGACAAGAAGTCAG^ 
GGAGGCCAAGGTGGGAGGATCGCTTGAGCCCAGGAGTT^ 
AAAAGAATTT^GAGGGGCGTGGTGGCGTGTGCCTGTGGTCCCAGCT 
GTCACGGCTGCAGTGAACTGTGATCATGCCACTTGCACTCCAGCCTG 
AAAATCAGAAGACTTGTATTGTAGGCTTTGCTCTGTTA^ 
1 0 TCTGTGTTGTGAGAGGATTGGTCTGGATGTTCATCAAAGTCrCTTGC^ 
TTGTOTTTGAGGAGGATTCTACTGTTT^ 

ATATTTTCAG ATT C C CATTATCCATTT AT AGGAGGCAG AAAGTTAAATGATT AT AACAAG AT^ 
ACAAAAAGTAAACATTTGTAAAATGTCTTCTAGTTC^ 

ATACATCAGCTCATGTGATCCTTCCTTCCTACCCAGTGATGTGGAGACAGACAAATAGTAGTAT^ 
1 5 TTACACCTTGGTGAAACTTTAGTTGATGTATCCAGGCC 

CTCTAAAC CTTT AT ATTCCTT CCTGTTATGTTTAGGACTCT CAG ATGATGG CTTTAT ATATATG T CATGTTAG AGGGTTTTCATTA 

AAAATATTAAACTTTTCTTTTTAAGAAATTTTTATT^ 

ATGGGAAAAATTTGGCAGAGATTGATAGGAAAGGATQAAGGCCGGCT 

GTCCAGGAGAAATCGAGGAGAATGCAGGTCAGGAGACCTGACITTG^ 
2 0 CTTTTTGCAGTGGGGAAACATTCAGGGTCTGCACCGCCCATTATGGTAACCACTA^ 

GGCTAACAGACTGAAGAACTGAATTTTTAATTTAACTTTGACTA^ 

AG CACAG CTCTGG ACAG AT TTGT CAACACT ATCAGGTTACAGTTTTCTCATCTCTGTAATGAAAGGAT^ CTGATCTCTA 

AAGCTCCTTTCATTTTTTAGAAATACACAGTCTGTGCCTTCTAAAACT 

GAATGGTCITTGATTCACATTATTTCTAATTTCA^ 

2 5 GTATTTGAACATATCTGCCCAGTGCTAGTGTCTGAATAGAGATATTGCTGGA 

CTGG CATT ATCTTTTCTATCTCT ACAGCGT CATTTTTAACAGTT CTTTGTGTTATG CCITG CTTTCTTTTTAG^ 
ACTGATTAAATGG CACTGTCTTCTGCCCAGGTGAAATAGACTGAAGTCACGCTAACTTT^ TTGCATG 
TCTTG ATATCAGGGGAGGAGATT TTATTAGTCAG TTT CCTGAAACCTTTATG C CCTGGTTAATAGTTGAAACCATTCTTCTT AGCT 
GTTGAGT AGTCAATTGTGTGAG CTAATATTTCAAAG AAT C CTAGAACCT CTG CTTTTAG AATAGTGTCTGGT AAAT AG ATCT G CTT 

3 0 TAAAAGAGGGGAAGOIGATCITTGTAAACAGAATTTTCTGAAAACCACT 

TT CACGTGGACTGGCCT CAG CAT CTTAGATTT AAATGTTTT AAT ATAATAAAAT CTTT AAACTTTGATTTTG TACT AAACTAGGAT 
TT CTTTTT AT ATTTCTAAAAGTGTGAAGGCTTTGTT CACTG AAAATGG CATTATGAT CTAGT C CT CTCT CACCTTGGCCATTAG AAA 
TGTAATGTAGCCAGGGATGGTGGCCTGTGATCCCA^ 

CCAGCCTGACCAACATGGTGAAACCCCATCTCTACTAAAAATGCAAAAATTAGCTGGGT 
35 TACTCAGC^GGTTGAAGCAGAGAATCGCCTGAACCCGGGAGGCGG 

GGACAACAAGCACAAAACTCTGTCHTGAAAAAAAAAAAAGCTGTAATGTTATA 
TAAGAAATTTGATGGAATGAATAGGTGAGTTTGAATAGGTCTGTTTGGTTTTGTTTGGCACTGTGA^ 
TTGCTCTTTCTTTTATACATTTCTTATAATACCACTGATTCAGGAATGTTCTGCAATTATGTAGCATTT 
AGTGAAATAAAAGCATTCTGGCTGAAAGTCCAAAATCTCTCCTTT7^TTCTGATGCCAGAATCTC 

4 0 ATATACCAAATAAGGTAGTGCCCTTTTTCCTCCTGGGGCAGCCCGGTTCTTCTGAAAGGTAGTATTTCCCAGCCTCTT 

TGG TACACGTAG G AGATTG CAG TATTTTTATGG CACACTG GGAAAAT CAGTTG CTT AT AG CTGG AG AC ATTGGAGC CCCTAAGGCT 
GGAGGTCACTTTCCAACCAGAGCTGTGCTGGAGTCCAGTAGGTGGGAGGCTGTGCT 
GGTGAGGGTTCCACCTCACAAGTTACAGATCTCAGTT 
CTTCACCTTTTTCTTGCCAAAAATAAACTTATCTTTAAATGAAAACTAAAT 

GATTCCAGAAAGCTCTTTAGGTTAAACTACAACCCT 
CTTGCGGCTAATTAATGGTAGAGCTGGAGTTAGGAGACATGTCT 

GTTTTGACATTAATTTTT CTGGATACTACTCC CATAAAATGTTCTTTGAAAAATACTTG^TTCTT^CTAGTTTTTCTCGCCTGGTT 
TAAATATTGTCCTGAGTGTGGGAACCCCATAACTGTCTTGTGGGTTAGAATTTAGATGGA 

5 0 CAT AAGACATTTAACCTTG CTGCTTTTTT CTTCTAGGTTCACT CTTTGAATTTCCTGGATAAGAGTTCTGGAGATGGCAGCTTATT 

GG ACACATGGAT1"1"1'CTT CAG ATTTG CACTT ACTGC T AG CT CT G CTTTTT ATG CAGGAGAAAAGC CCAGAGTT CACTGTGTGTCAG 

AACAACTTTCTTAACAAACATTTATTAATCCAGCCTCT 

CCACTCCTCCATATGCCTCreGCTCTGTTTTTATTCT^ 

CTTTATGAGTTGGTAACTTACAGCTGTGAACCTGCCTCCrTGTTTTCATTAAGTA 

5 5 ACAACCCTGGGAAATGGCAGTCCTTTGTCTTCTTCCTGGAGCAGGTAATCCTTTACTAATCCTT 

TCTTATGACAGTGCATTCCTG G AACTGCATCATT ATGCACT CTGCTAGAAATTG AACTAAGTATGTG C TAC CACATTTACAT CTGT 
TTCAACATGTTTTAGTGGTAAAAGTGAAATATATTTCTCTTAGGCAGTGTAGACAGAGTGA 

CTT AT AAG ATACAAT ATTTT CCAAAGCACCTT CACAAAT ATT CTCTCATTTT ATTCTTG CAATTATC TT AG GGGTT C CTGTCC TCA 
TTAGTTAGGTX1AAGAAATTGAAACTCAGATTTGTGACT 

6 0 CTGGCTGAAGTCTGTCATGTTCGCAGCCCTGTTTTCATCTTGGGTGCTGTTGGGTT^ 

ATGCCACTACTTAAATTTAGCCAACTACCGAGAATGAT^ 
ATGTAAAAAGAAAATCCCAACGTCCAGGCCACAACCCACACCAATTA 

CTACC CAGGTG ATTCCAGAG CACAGCAAATTGG AGAAACACTGCT CTG GATTC TTGCT ACTCAAAATGT AGTC^ 
ATTGGCATCGCCTGGGAGCRlCAGGCCTCACCCGAA^ 

6 5 GCACAATAAGATTTGAGAATCTTCTCATCTACAGCAGTG^ 

AGCTTGGGCCTGTATGTTTCTTATTAGCTCTCTGCCTAACTGATAACACACTGAGT 
TGGGCATGGCTATATTCCACTATC^CTTTATTTATAAAAATC 
TAGTTCATTGAGATTATGACTGAGCAGTTTAATCTGCACTTT^ 
ACCCAGGCTGGAGTGCAATGGTGTGATCTCGGjCrrCACTC 

7 0 GAGTAGCTGGGATTATAGGTGCACACCAACATGCCCGGCTAATTTTTGTATT^ 

GCTGGTCTCAAACTCCTGACCTCAGGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTATAGGCGTGAG 
G CCTAAT CTGTTTT AAAACTT AT ATACAGATGTCACAGACCCTAT CAGGAGG CTTC CGG ATAATTTTTAGT CTAGAGAAATTGG AA 
TTTTGAATAAGAAAAGCATCTGGACTTTGAGACACGGGTTGAAAACTGGGAATAATA^ 
GCAGTCTTTAATACATAGGTTTTGTTTAATTTTTCATAATGATCTTGAGATACATGGGA 
75 CAG CT ATTGTCT CAG AGGTTGTGATTTAC C CAAG CCAGT T ACCATCAACAGG AAACATTCITCT T 
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GCTT CGGGGCT AAAACCT ACAGACTG CTTTTGGAGTG TTGGGATT AGA1TTAGTTAAAGGCT CATT ATAGGTAACTCTTGTTTTT A 

TGCTTCTGTTGATTTGCTTTATGGTTCTCAATTTGACATCT 

CCTGGGCAACAATGTAAGACCCTCTTCTTAAAAGAGGCCAGGCATC 

GGTGGGTGG ATCAC CTGAGGTCAGGAG AC CAGCCTGACCAAT ATGATGAAACCCTGTCTCTACTAAAAAT ACAAAA^ 
5 CGTGGTGGCATGAGC^CTGTAATC CCAGCTACTCAGGAGTCTG^ CTAGAAC CTGGTAGG CGGAGGTTGCAGTG A 

GCCGAGATTGTGCCATTGCACTCCACCCTGGGCAACAGAAG 

AAGCG CCAGGTGCTCAGTGGCTT ACACCTGT AATCC CAGAACTTTGGGAGG CCAAGGTGGAAGAATGGCTTGAACCCAGG^ 
AGACCAGCCTGGGCAACACAGTC^GTCCCTGTCTCTACAAA 

AG CTACTCGGGAGGCTGAGTTTGGAGAGTCACTTGATC CTTCAGTCTGGGC 
1 0 AACAGAACAAGACCCTGTCTTGAACTTAAAGACACACCCCCAAG 
TATGTTGAAGGAGTTGATTTTGAGTACTACTGCATGGAGAGTGM 
TATGGGGTGGTTTATTTTTATTTTCTGTGCTCTTTTATGG^ 
CCATTCTTTGCAGTGGTGGGAAGCCCAGCCCTGAGCACCTCCTGCCCA 

TTGTTTTCTT CT CTTAG CAGCCTAT ACCTTCGGCCT CTTGTG AT CACATCATTT CATCTTGGGCTTCAAAACACj."r i~rGTTGTG AT 
1 5 GGCTAGAAAATACTTGGGGAGGTCACAGAGGCGTGCTCCCrGGTGTTGGAGGTTAT 

CGGGAGACTCTGCTGCTCTCCTATCTTGTTGGGTTGCAGCTCAGC^ 

ATAACTACGGTTAATTTAAAAGTTTTTAACAAATATAAAAAATAGATTT^ 

TAGAGGAGAGAGTAGGAAAAAAGTGGGAGGAAGCTTATGAAGAAGGACTAAACAA 

GTAGGTCCAAGTTCrGTCAATGGGATTTTTAAAAAGTATTTC 
2 0 TCTCAAAGTGTTGATTCAAGGACTTGTTCTGCCACTTACTGTCT 

TAGG CTCTGTG CTAGGTAG TG CATGG CATT AGAAAAGATAAGACACAGCC CTATCCTGTCCAGGAGTGAGCGAGCGTGGG CCACAT 

GTCTGGCAAACTGTTAAGCGCCCTCCTGTTACTTGCX3TTTCCCTATCA 

TGAGGAAATGGAGACTCAGCAAGTGTTAGGCTTGAATCTTATTTCT 

AGTCATAAACT CAG CATGGGGATAATAAT ACITCATACTGTGGTT ATCCCAACAAAACGTTAAGTG 
2 5 CTCTACAAATGTATCATTCCTTTCAGTGCTCACCAGAAGCTCTGAGGGTTCC^ 
CGTTTCACTGACTTTCAGCTAAAGCACTGTGGTGACCAATGGCAGG 
GCCCCGTTTGAGACCTGCTATTCAGGCCTGTTAAATCCCAGGGATAAACAACT^ 

TGCCTGCGT ATACATG CACGT CAAGAG GC CAG CAAG CTCC CATATGT CT AACAG CTT CAATG ATTGT CAACATTT AACCC CAAT CT 

AGTGT CAGGGT ATAAAT AAGAG ACATCTGATGG CAGAG CTCATTCTTT CTGTTATTACTCCTTC CTTTAAATTTTTG CTTACCATG 
30 AATTAAGTGCT ACAGAAATGATCAGAACGGACAAGGCG CCT C CATT AGGGAAGAG AGC CAAATTAT 

TACATAAATGAAATAATTTTCTTGTGAGTGGAGTGAAGGAAAGAAACAGTAAACAATATAATGTGACAGTA 

GGAGCTGCTTAGGGTGGTTAGGAAGGACCTCTCTGACCAGGTGACCTTTAGGCTGAGA 

GGACCAGCCATGGGGAGTGTGTCTC^ATATTAGTTATCCCCATTGGCATATGCTTGAGTAG 

AAAGATCAATTTTTTTTTTTTTAGATGGAGTGTTGCTCTC 
35 TCC^CTTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 

TTTTGTATTTTTACrAGAGATGGGGTTC 

CTTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCG^ 
GAGCTCTGCCTTGCCATTCTGATCTCATCCTGAAGGGGGATCAAATC 
CAGGAAAC^AGACACTTAGTCTTAGfGGGGCCT^ 
4 0 TGGTATTTCTGAGTCCACAGAGCTTGGTGGAAAGCCTGGGATCCG AGTCTGG 
CAAGCCGCTCCATGTGCCTTCTGATTTGTAAAAGGGGAGAGAATACT 
TGATGTTTGCTTTGGTGTTGTACAGATGTAGGATATTGTTAATAAG 

AAATCACTGGAATAACTTGGGTCAGCAAGGATAAGGTGGACACCTTAGAGGTTGGTAAAAATAGAATCTTGCAGCCT 
AGAAGTCCCITTTAAGGTTTACCTACTTTCCTCCCATCAAGGCTGGAAGCATGAA^ 

4 5 CATATTCCTTCATCTACCTAGACTCTCTTAGGTCCTGCCA^ 

TTTTGTTGTTGTTGTTGCTAAATTTCATGTGGGTGGTTCG 

AGGAAGGCTTTGAAAATGGGAAGGATCCTCCCCTCAGATCCTATC^CCTTTGGAATGGGAA 

GACACTGGAGTCCTCTCTTCTCAAAGGTGGCGAACCGTCCTGGAGTGACGTGGCGTGTTCTGGCCTGCGAGGAGATGG 
AGAACCTG CCCGCTGGAGGGCG CTGCAGAGGTCTGGCTCCGTGCTTTCAGATGTGTTCGGCCAGGGG CTCCAGGGGCTCCTTCTGA 

5 0 AAGCAGAGAAGCCCCCAGAGGTTGAGAGGGGAGATAAGACGTGAGCCTTTTC 

AGCACTGTGGGGCCAGAAGTGTTGGCTTCTGTCAGAAACTCGGCCCAGCCTGACCGCTGC 
TCACGCGCTTCCTTCTGGAAGCCGGAAGCAGGAGGTGGGAAGCTGC^GAGGTAGTGA 
GGGCCGAAGTGACTGGAGACGTGGCCACATACTATGAGAAACACCAGrcCAC 
CTGTGTGAAGCCGCAGCCAGCCCACTTTTTTCTCAGTATGGCAGTG 
5 5 TCGTCTTCCTGGGCATTCTGATGCATTCCCACTTCATATTTGCACTGGGATT^ 
TTCAATCCCTTCAACAACACTGAGAATAGGTGCGCAGGTATTTTT^^ 

AAAGGCCCCAGGAGAGAAGGCTAGTGAGGGGCAGTGACGCTTAGGGACAAACAACTGGTTCTCCTGCCTCCTGC 
GCCTGAGGAGAATATAGTAACAAAAGCCTCTGTCTTGCAATTGCAGCCATTAA^ 



6 0 GAGG CATAATT ACATGAAAAT ACACT AATGATG CATT TG AT AT AT AT AT AG CCATCACCCAAGATAAAG ATATGG AACATTT CCAC 

CACCCCTGAAGGTTTCCTCGTGCCCCTTTTCAATCCATCCTTCTACCCCAGAGGCAAACAGTC 
GCGTTTTGCCTGTTCTTGGACTTCATCTGTATGGAGTCATAGTTTGCACAGAGACTGAATCCCCAGAGCACGAC^ 
ATGGCAGGTATACAATGCCAAAGAAAGCCTGGTTC!TTAAACTACCTGTGAGA 
G CGGTTAGTGACCTGCCAAAGGCACGAGTGTATGATGTGGTGGATCCCGGACTCC^ 

65 CCGGGCCCCCAGCCCTCCCTGGGCACTTCCrCCGGGGCTCCGGGTGGAGCGGGTCTGACGATCCC^ 
GCCGGCCCAGX3CCCCAGGCCCTCCTCTCCTTTCGGGGGGATGTGACAG 

AGGGCAGGGACTTGTGGGATAAAAGTCACCTGCCCTTTTATTTTCCTCGGAAAAGAGCTTTTCCTAAACITTGG 
TCCAGAATTAACAGACAAAGGGAGAAGAGGTCTTTTTGGTGGAGCGCTGCTGCC^ 
GCAGGCGATCAAGCCTGGTATTTTTGGGTTTCAGGTTCTGCGGCTCCCTGAGACCTGC^ 
70 CCGGACGTGTCCATCAAGGCGTCCATTAGGGCCATCACCTAA^ 



TAAGTCTCGAGGGTCTGCCCTGTACTA^ 
GAGGTGCGGTCAGGGAGGCGGCTC^GCGGTGGGGCCAGCAGCACCTCCCCAGCCTGCTGCGGGCCCGGTG 



75 



CGTGGCCTGGGGCGATGCTATCTCTAACrCAGTATTTTAAACTCTTCTTTCC^ 
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GTGCCCTACTTGCCCCTCTTGGTAACCATGACCT 
GAGACTTTTTCCACTTCCTGTTCTACTTGTGGCTACTGACTCAA 
GCTACCC3^GAGAGGCCTCAGGCCTGTGTGTCTCTGTGTGAAAGGAGGCCTGCAOT 
TTGACACAGATOCAGGGCGCTGGTGGTCCTGCAGCCTTC^ 
5 GACTCATCCCAGTCAGAAGCTCGTCTTCCCACACATGGCTCA 

GCGAACCCACATAAAGTGACTGTTGGCTCTGGACCrCAGTGTGGTCAC^ 

CCACAGTCCCTGTCCTCGASGCTTTGTTGGATAGA 

CCTrGTTCTGAAGTTCTTGACAAGTTACTTTAG^ 

CCCACTTTCCrrTCTCCTGTACrCCCCACTCCTCACACTCAGCTCCCCCCACC 
10 ATTTAACGGCATCTGTTGTGTGCCAGGCACrcTTCT 

CACAGG CCAGGGGACTTCCACACTAGGATGGT ACCTGAT AAT AAGG TGCTAT AGG C CTGT CTGTATACGTGATCCCAGGGG AGTTG 

AATTTTATGTTTGGACCTAAAAAATAAGCAACTCCTTAAGACTCAGATGGCT 

TGTCAGAGCTGAAGACOTCACCAGGATGCTCCTGAAGCACGTGCTTGC^ 

TGCACTCAGAGGTCAAGTCTTCAAGTGAAGTTTCAGAGAGTCCT 
15 TTCTTCTGCCTTTGGGACTGGCCTGTGCTTAAGTCTTTTGGGA^ 

TAGTAACATGTTCCCTGCTGGGATGTCGGCCrGAGGTTGGGTCCAGGCTGGTAGA 

GOTCGCAGGGTCTCTGGGTCITCATGGCCAGT^ 

AGCAATGCCCAGTTCCGTATAGCTGGGCCCTCTGCACTGAGGGGCTTTGCl^ 
GGGCACAGGAGCCCTTCTCCTGAGCCTCCTCACAA 

20 

HUMAN SEQUENCE - niRNA 

CATCCGGTGTGGTCGACGGGTCCTCCAAGAGTTTGGGGCXKX3GAC 

TGTCATTTCGCGGTTCTTAGAAGAGTACTTGAGCTCCACTCCGCAGCGTCTGAAGTTGCTGGACGCGTACCT 

2 5 GGGAGTTTC^CCTAG^GGTTO 

CTTTGCTGATTTTCTCTTTGCCAGGACCATCCTGCACCTTGTTGT^ 

AGGAGTAGGAGACTAAAAGAATGTTCACTCTTTGAATTTCCTGGATAAGAGTTCTGGAGATGGCAGCTTATT 
TCTTCAGATTTGACACTTACTGCTAGCTCTGCTTTTTATGACAGGAG 

ACAAACATTTATTAATCCAGCCTCTGCCTTTCATTAAATGTAACCTTTTGCTTTCCAAATTAAA^ 
30 AAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGTCGGCGTCGGTAGTGTCTGTCATTTCGCGGTTCTTAG 

GTACCTGCTGTATATACTGCTGACCGGGGCGCTGCAGTTCGGTTACTGTCTCCrCGTGGGG 
35 CGGGCTTCATCTCTTGTGTGGGGAGTTTCATCCTAGCGGTTTGCCTGAGAATACAGATCAACCCACAGA^ 

ggcatctccccagagcgagcctttgctgattttctct™ 
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Table 91 

MOUSE NOMENCLATURE 
ICSGNM Lck 
Celera mCG12243 

HUMAN NOMENCLATURE 
HGNC LCK 
Celera hCG41609 

MOUSE SEQUENCE - GENOMIC 

TCTATGGCAT AG ACAGACACAGTGAACTGGGTC!ATTTTGG CAT AGACACAGTGAG CTG GG CCTCAGTC^CACTT AG ATTGAGT AAG 

ATTTCTC CATCT AGAAG CTCCCAT AATG CAAAGTGGACCCAGAGTTCC C CAAGGCAGG AAC CACTATTAAGGTGCTGAG CACAAT A 

TATCCCTTACCAATGACAGTTGTAACCAACCAGAAGC!AGTTC 

TTTTTTGGTTTTCTOAGACAGGGTTTCT 

GAAATCCGCCTGCCTCTGCCTCCCAAGTGCTGGGATAAAAG^ 

GQTGTAGCTTGGTGGAAGAGTATGCACTTATTATGTGTGAGGCCTTGGGAAT 

GACGAGGCTCCCCTGGGCAACTGAGACTCTGCTTCGAACAAACAGAAAATGGAAGGGT^ 

CTCCTTTAACCCTGTACCCAGGCAGCAGGGACAGCT 

G C CAG AACACATGGCT AATTT ACCAG CTAG CAAGCCTTT C TTATG C CTGCCAG C CTAACGTTT TGGT ATCATG ATGAGGTTT ATGT 
GGGCCGCTCCCTTCTCGGAACACATG CAACTTGG AGAATACT CAAATCTTGACTACTAAAGAAAAAGTCAGG AAACTGTTGTG^^A 



GTATGTTTGTTTATTTTTTTGACATCTGAGACAGGGTTTCT 
TTCTATGTAAAGCAGTAGT 

TGCTGG CG CAGAAAAAGAG CTGATAG CCATGAAGTT GGCAACCATGGC CAAGT CT ACG AGAGT CGG AACTCAGTT AGATGTGTTTG 

ACAATTCTGGGGTCAAGCCGGGCXSTGGTGGCGTACGCCTTTAATNNNNNNNNN^ 

NNNKKNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNKNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NIINNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNb^ 
NNNNNNNNNNN ^^ 

ATTTTTATTTATTGTGATCATAATATTTTCATTACTATTATTT 
•GTATGTAGACTGGCrTTGAACAGAGG TC 

GGGCTCGGGGGACACACITGCTTCAGGGTCT^ 
GTCTGCTGGCCAGGTGGCATTGTACTGAAGAACACCAACCAAT^ 

GAAACACTGGGGTGGGGTAGCCCTITTGATAATGGGCAAACGGTG CCTAGTCGACAAGACACGGCrTTGJMxTTGGACCAAGGG CATT 
G CAAGTCCTACTGAAAGATGGATT AGAAG CCACACCACTGT GATTATTGAGCAAG AGGTGG AAAAAG AGGAGGTT CTTT ATT CAT A 
CCAGAGTCTGTCACCCGCCATTCTAAGAGTCCTGAGGACTGCCTTTTTTTTCCT^ 
GTTTTAATGTTGATCTCTGCATTTGTTGTTTTT^ 

AATCCCCATGGAAGCCAGAGAGGGCATTCGATCTCCTGGGCCTGGACCTAGAGTTACAGATGGTTATC 
GGGAATAGAACCCTGGTCCTCAGAC^GAGCAGCCAGTGCT 

GTTGAACCTGTTCCTAAC CTGTTTCGAC CAC^TCTGGCTACITG ACAACCTTTGTTTTAATCCTCGAC CAGTGAGGTGGGACTCAT 
GTTTTGTGTAAAAAGAAGC!ACCTTACAATTTAGAGACA 

ACACATGGTTT TG CATG ATGT AAGCAAGAT ACTGG GTT ATGT AC TCAG AC TTG CATTCTCTG GTCAGGTGAATGTAGATCTAAATG 



TGTGAGTGTGTGTGTGTGTGTGTGTGCTTATATGCGTGTGTGCGCGCGCGTGTGTGTGTGCGTGTGT 
TAAGAGGGGAGTTATTTTGTGACAGGGTAACACTATGTAGCrCTGACTGGTTTGGAACT 

CT CACAAAGACCCTCCCGCCTCTACCCCCACGAGCTGG AATTAATTA CAACGGTGGTCATGTTT 

CAGATGAGGAATCTTCACTTCCAGTTCTGTGTCCCTGTCTAGCTTCCTTTGTGATTTCT 

ATGC^TGCAGGGGTGAACAGTAGAAGACAAGGCACCTATCTGTAAAAGATGGATGCTCCTGCTGTCT 

GTTTGTTTGTTTGTTGCTTGCTTGCTTGTTTTATTTTGTTCm 

CTTGCTGTGCAGATACAGCTGGCCTCAAACTCA 

TGTGATTTCCTTTTGCTCTTGCCAGAACCCAGCAGTGTGT^ 

TACTTATCTATTGGCTGGTGCATGC^TTTTAAAAAACTATGTG 

AAGGAGGCAGAAGAGGGCATAAAATACCTTAGTGGTAGAGTTGGGAGGCAACTTGTTGGAAT 
GCAAAAGCAGTATGAGTTCCTAGTTGCTGAGCCATCTCT 

AGACG CTTCGGAATGTTTATGGGTGTGGAGGCCAGAGGTCAGG CGCCATGGGTTTTTT CTTTCCTTTTATTGAAACAGGATCTCAC 
TGTGTAGCTCTGG(ZTGTCCAGGAACTCACTGTAGGAGACCAC^CAGCTGGCCTCAAA 
TGAGTGCTGGGGTTAAGGACAGGCGCCACCACGCCCAGACAAAGTGTCATTCTTT^ 
TATGTAAGTACATTGTAGATGTCTTCAGACACTCCAGAA 

GCTGGGATTTGAACTCCGGACCTTCGGAAGAGCAGTCGGGTGCTCTAACCCACTGAGCCATCTC!ACCAGCCC 
TTAGGGAACTGTCCACCTTGAGTTTCAAGGCAGGGTACTTCATCTGTTGCTCT 
TGACACCATCTCGCCAGGCCTTCCACATGGAAACACIAGG^ 
CACCCCAGCCACTGTCTTCTCTTTAAAAATCTGTGTGCGTGTGGGTGTC 

ATGGTAGGCATT AGGACTCTCTCGAT AC CATGTCCCAG AG AT CAAGGTCAG AT C CTCG ATT AGGTGG CAAGG AAG ATT ACCT CCTG 
GGCCATCTCACTGGCTGACAACCCCACCCTCTTCCCTTTAACATGGAGTCTCACTCT 
TATCTTCCTTCCTCAACCTCTTGAGTTTAAAGGTGTATACCAACACTCCTGATO 
GATTTCTTTCTCTCCTTTTTTTTTTCTT^ 

AAGG CAT CCAATCT CATTACAAATGGTTGTG AGC CAC CATGTGGTT ACTGGG AATTGAACT C7VGGACCTCTGG AAG ATCIAGT CAGT 
GCTCTTAACC^CTGAGCCATCTTTCGAGTCCCCTGGTTTCTm 
TCTGTGTAGCCCTGGCTGTCCTGGAACTCACTCTGTAGACC 
TGCTGGGACTAAAGGCGTGCGCCACCAGACCC^GCTCrOTTTTCTTTTCCTT^ 

CAGACTACTTGATGAT AAG ATTACAATTTTTTTT T AAAGATTTAAT TTT ATTT AT ATGAGT ACACTGT CTTCAGACACACTAG AAG 
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AGGGCATCAGAGCCAATTACAGGTGGTTGTGAGCCACCATGTGGTTGCTGGA 
GCTCTTAACCACCTCGAGATTACAAATTTGTACCCATCATGTCCTGC^ 
TCTATCTAT CCATTT ATGTGTATGTACACCTGTGCATTC 
GTGfTTGTGAAGATTGGTTCTCTCCTTCCACCATT^ 
5 CCTCCTGAACCATTAAGCAGGCCATATACATACACACACA 

CATATATATATAATTTAAAGATTTATTrATTTATTTATTTATTTA 

AGACACACCAGAAGAGGGCATTGGATCT CACT ACAGATGGTTGTGAGC CACCATGTGGTTGATGGGAATTGAA 
TCAGTGCTTTTAACCTCTGAGCCATCTTTCCAGCCCCAGAGTCTCTTTATAT^ 
AGGCTGGCCTCAACTCACAGGGACCrACCTGCTTCTCCCrCTTGA 
1 0 CCCCAATCTGACAGATACCAGGTACTTTATGA 

AACCAACATCTTCTCCAGTCTCTGAAGATTCCTTCTGCTCCGTCTTCTCGATTTCCTTCACCG 

ATTACTATTATTATTATTATTATTATTATTACTGATTTTTGGCAGCAM 

TCACCCTCCTTAACCCCAAACTTATCTCTCGGTTTTT^^ 

CATGTAAACCAAGCTAACATAGAC CTAGTAGAAATCTAC CTGCATCTG CCT CCCAGGACTGAGAGCTCAG G CT AAAGG CAT AAATC 
1 5 ATCACACCCAGTCTCTCTTGCTITTTCCQ^CAGCTTCCAAAAATC 

ATCTTCTGTGGGTACTTCGGTTATAGGTTAACCAACCTGTGTTTTCCTAGGGCC 

C CCCACTCTT CTTCTGGGAGTTCCTAT CACTGCCAG CCG CTTT CTGGCCATAAG ATAACTGAG CTGAT AAG ACAG CAATTGGAG CC 
CTGACAAGCCGGCTTATGGCCrrCGCTGCCATGGGTGGAAGAA^AOT 

TTTCTTTcrTTC Tr rcrrrL y nvrrrc 

2 0 CTTCCTTTCTTCCrrTCITCCTTCCTTCCTTTCTTTOT 

TTCTTTCTGTTTTITGAGACAGGGTTTCTCTGTGTAGCCC^ 
AGAAATCCCCCTGCCTCTGCCTCCCGAGTGCTGGGATTTCTC 
GCAAGGTCTCTTTGTATCCCAACTAATCTTGAAGCCCT 
CATCAAGCCCAAGTCTATTCAGTGCrGGAGATTTGAAGCT 

2 5 CTCTCAAGAAGTGTCTTGATTTGGCTTATTTCTCCATGG^ 

ACCCAAGAGGAAAGCACACTGAAAGTCCCGGGACCCGTGAGATG CTCTG ACTAG CTCAGATT GTGAG CCTGAAGAGGGTTACTCTC 

CAGTCCCATTTTCrrCACAGTCCCATGCAGGGGTGGGTAAAGTTGCAGGGTACT 

CTAACTAACGTATGTAGGGTGGTGAAAAGGAGTTGACAAT 

TTGTTATCTTACTGTTTTATGTTTTCASGGGTTTCOTTCT 

3 0 TCTCGAACTGATCTTGAACTTTTGATCCAGCCTCTACCTCCCAAGTGCCAGGACC 

ACAGCTAATATCTGAGCATAGGGCCTTCAGATTCT 

TCCACTTGGACACAGCATAAATTAGCAAGTCTTGTTTTGTTCCACTGCA 

TTGATCTTCTAAAATCGAAAGTCAAAGCTCAGTAGGACACTGTGCATCCAGCTACCTAAGATG 

ACAAG AGTGCTGG ACCAG CCTGGG CAATACATGGAAACT CT ATCTGAAAAAT AAAC ACAT AAACAAACAAACAAAAC CAGAGGTT A 
3 5 GACCTCTGTAGGTGGGAGAATCTGTG CTTAGTCTCCAGGACCCAGTGATGGGTGTTGCAGG CGTCTGAGCAGGTGACACACAGTTT 

CAATTCTTGTTTGTTTGTTTGTITGTTTGTTTTT 
ACCAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCTGAG 
TTCTTTTTTTTTTTTTTAAGAT1TATTTTTATTTATTTCATGTATGTGACT 
TCTCAGATCCCATTACAGATGGTTGTGAGTCACCATGTGATTGCTGGGAATTGAACTCAAGACCT 

TGTGTGAGTGTGTGTGTGTGTGTTTGTGGTATGTTGCAAAATAGATTAAAC 

AACATG AT C GTGAAC CC CTT TATT AAAT CT AACACT CAG AGACAGG AG CATCTC TG C AGGTTTGAGGCCAG CCTGCTCT ACAGAAT 
GAATTTCAGGTCTCAAGGTC^GCTTGGTCTACAAAGTGAGTTTCAGATCTCAAGGCCAGCCAAGACTA 
AAAATAAATAAATAAATAAAATAAAATAAAAGTTAATCTTCGCTTGGCAAACCGATAATTGAGGACCAGTG 
45 CACGGGAATTCCAGAGGCTACAGAGGGAGCCTTOCTCrGACCTG 

GCTTGATTTCATTTGACAAAAAGTTTCCACAGCTAAACCAGGCAAGGGAGCCGAAGTAGA 
GTTTCTCTCTGCTGCTGAGAAGCAAAAGCCTGTTTC 

GGGGTTGGGCTGAGGCTGAGGGTTGACTCTAAGGAGCTGGAACCTCTCAGCTTCGGTGGCTAGGCAG 

GACAGGTACCCTCCTTGCTGGAGGAGGGTGGAATGAAACT 
5 0 GTAGTCCCCCAAAGGAGGAGGCTAGCAGAGCTG^GGAGGCAGGAAGTGGGTAACTAGACTAACAAAGATGCCTCC 

TGCCCATCCCAGGTGGGAGGGTGGGACTAGCCTTGGGCCTGGGCCTCCTGTGA^ 

CAGGAGCTTGAATCCCACGATTCAGCGCTTCTCTCTGCGX3CCAATGGG 

TCCAGAACAGGGCTCTAGGATGTCTGATGTTGGGGCGAGTGGCTTAGGGCCAG 

CATGGGCTGTGTCTGCAGCTCAAACCCTGAAGATGACTGGATGGAGAACATTGAC^ 
5 5 CGCTGGACAGCAAGATCTCGGTAAGAGGAAGATGGGATTAGTGAAGGGATGGGGGCCAGGAGATAGGATTGGTGGAGGGATGGGGG 

CGGAGGACATAAGATTGGTGAAGGGATGGCATGGGGTTGAAAGGTTGTGTATCCAG 

GGCTCTGGAAAATAGCAGCACTCCTATCTTCCCTAGCATACATC^ 

AATGTACACATAGTTGGTG AATTCAGAGACAAGGTT CAAC CAGG CTG GAGAAACGG AAAG AAGAGGAGAGG AGAGGG AGCTAGGGT 

CAGATCCTGTCTTTTTTCTTTGACCCCAGCCTCATT^ 
60 TGGTCACCTATGAGGGATCTCTCCCACCAGCATCCCCGCTGCAAGGTGACTTCATC 

GGATGCTTGGTTGTTCGTGTCCATACTGTCTTCATCTTCTCTCCACCTTGTCT 

T CAG CCC CAGG CTGGCTTTTCTTGACTC!TTCTTATCTTT ACAG ACAACCTGGTTATCGCC CTG 

TGGAGACITTGGGCITTGAGAAGGGTGAACAGCTCCGAATCCTGGA^ 

GTGTGTGTGTTTGTGTGTGTGTCCrrCAGCCCTCCTGCACCCATTACAGGCTCTGGCCT 
65 AATAAGTCTCCAGAGTGTTTGATCCAGGGGAGTAATGATCTGGTCTAACTGGGGACTCAGGGGTTTCCAG 

GATGATGGTCTCTGTGGTGCMGAGCGGTGAGTGGT^ 

TTCGTGGCGAAAGCAAACAGCCTGGAGCCTGAACCGTGAGCAGGAGACCTCCGTGGGAATGGGGTTCACATCCA^ 
CAGGGCGTGGTCCGGGGAGTAGGGGATGTTTGACTGAGATGGAGGAA 

TCGGGAGGGAGGGGCAGGCAGATCTCTGAGTTGAAGGCCAGTCTGGTGTACAGAGGGAGTTCCAGGA 
GAGCGGCAGCTT^TCGCGCC 

GGCGGAGCCCCGGTGTGGGCGTGGCTGTGAGGTGAGGGCGGGCGCCTCCGGGCGAGAAGGGGGCGGGGGCGGGGAGCTCTGGGA^ 
CGGGGGCGGTGACCACCTGATTTATGGAAGGCAATCCGCCACATTTTCTTTTTCCTCTTCAACTTCT 
TCTCCAACCTTTCTTCAGGGTCCTTTTCCCTGTCGGTCAGAGACTT^ 
75 CGTAACCTAGACAACGGTGGCTTCTACATCTCCCCTCGTATCACTTTTCCCGGATTGC^^ 
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G^TCCGGCGGAATGCGTTC&CCTGTGCCCATC 

GTCTTTCATTCCCCTTCJIGTGGATGAGGGGTATAfiAACCTGACCCTAGACC 
CCCATACCAGTCGGTTTCCCCTGCGAGGAAAGTGGAAAGAGCGTAGGGGCTGGAGGGAGCA^ 
ACCTTCTCTTTTGTCTATGCAGACGCCTCTGATGGGCTGTGCACAA 
5 ATGGTGGGAGGACGAATGGGAAGTTCCCAGGGAAACACTGAAGTTGGTGGAGCGGCTGGGAG CTGG CCAGTTCGGGGAAGTGTGGA 

TGGCTGAGTGTGACCCTCGGGACTGATTGGAAAGAGGAGAG 
TAGTTCGTCAGGATCTTGACCTCTGTAACTTCT^ 

AGTCTGAAACAAGGGAGCATGTCCCCCGACGCCrTCCTGGCrGAGGCTAACCTC^ 
GCTTTATGCAGTGGTCACCCAGGAACCCATCTACATG^ 
10 TGAGGGCGCTATTCTGGTCCCACTACCTT^ 

GTCCCTAAGCTTTGGAAGAACGTTCCATCTGATAGTCCCTGATCrTCAGTTTCT 

GAATGCTTGACCTAAGAAATGGTGTATGGTGCCTGAAGAGACCAGAAAGGAG 

CTACTGGGGATCAAACGCGGGTCCTCTCGAAGAGCAGC 

GTTTGTTAAGGATCCAAGAGTCTGACTGCCTGGATAGAGTTGAGGCTTTATAAT^ 
1 5 TCTGCTCCAAATAGAACTATTTTAATCTGTAAAATAGCGGGTTTAGGGCCAGCAAGATGGCTCAGCAGG 
AAGCCTGATGAGCAGGAGTTCAATGCCTGGAATCCACAGGAGA^ 

AXACCTGTGTAATGG CTCATG CATGTGCCGC CTC CTATGAAT AAATACATGTAATT AAAG CAT AG CTGCACT AGTGGAAGAACTGC 
ATCTCTACATAGTTGCTATGGGGATGGAAGAGCAGTTCCTCT 
TTCCAGGAAGGGAAACAAACCATTCATTGGCTAATGATTCCTGGAAAT^ 
2 0 TTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTT^ 

TCCTGGAACTCACTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCTCCCAA 

GCGCCACCACGCCCGGCTCTTTTTTTCTTTTTGAGACAGGGTTTCTTCGTGTAGCCCT 

AGGCTGGCCCTGAAGTCAGAAATCCACCCACCTCTGCCTCCTCAGTGCTAGGATCMAGGTGTTTGC 

ATGGACACTGAGAAAAGCAGTTTGTGACCT AGGGATGTTCC CAAATAC CCCAGG AATG TAGACAGTG AAGATATTTTCCTTT CT CT 

2 5 TTClTTClTrCTlTCTTTCTTTCTTTCTTTCT 

ACAGGGT1"I'OT CTG TAT AG CCCTGGTTGTCCTGG AACTCAC"i''rrGTAGACCAGG CTGG C CT CGAACTCAGAAAT CAGCCCGC CT CT 
GCCTCCCAAGTGCTGGGATTAAAGGCXSTGCGCCACC^CGTCTTT 
TGTCCTOTGCCrrccCTAGCACAGGGATTAAAGGCATGTGCCACCACCACCAGG 
GTCTTCTCTGCTCTAGAACTCIACTCAGTAGACCAGGCTGTCrrTTGAACTCA 

3 0 GGGATTAAAGCTGTGCATCACCACTCCCrrGGCCrCTTTTTGTTGTTATTG 

GGGTTTCTCTGTGTAGCCCTGACTCTCCTGGTCCCGAACTCCAAGATCTGCCTGC^ 
TGTGTCACAGTTGCTCTGCTAATTAGCTGAAGGAATTCATGACAGCrGGTAGGAACAOT 
AAAGGAAAGGAGAAACTTCTAGAAGACAAGAAATGAAGCAAAGAGACT^ 
CAATAATCTGTGATAGAGAAAGTCAGGTCTCTGGGAAGAAGACACAGGCA.TG 
3 5 TCTGTCTTAACTGGAGGCAAGAAGCCTCTGGATCCTCTGCTCATATCCCAACCrrAGCCC 

" TTGTACACAG(ZATCCCCCTTGTGGAGAGCACCCCCCT^ 
CTCCCTTATGCACAGCATCCCCCTTATGCACAGCATCCCCCT 
CACAGCATCTCCCTTATGCACAGC^TCCCCCTCTTATC 
40 , ACCTCCTTGTGCACAGCATCCCCCTTTGTGCACA 
TGCACAGCATACCCCCTTGTGCACAGGA^ 

ACAGCATCCCTCCTTGTGCACAGTATCCCTCCTTGTGCACAGCATCTCCCTT^ 
CCTTGTGCACAGCIATCCCCCCTTGTGCACAGCATCCCCCTCTTGTGC 
45 ACAGCATCCCCCCTTGTGCACAGCATCCCTCCTTGTGCACAG 

TCCTTGTGCACAGCATCCCCCTCTTGTG CACAGCATTCCCCCTGTG CACAGCATCCCCCTCTTGTGCACAGCATCCCCCCTGTGCA 

CAGCATCCCCCTCTTGTGCACAGCATCCCCCCCCCTTGTGC^CAGCATCCCCACCTTGCAC^ 

C^TCCCCCCCTTCTACCCCCAAGCAC^CACTTT^ 

GTGGATCTAGGTGGATCCTAGGTGGATCTAGGTGGATCCTAGGTGGACCTAG 
5 0 GGTCTACGGGCTTCGTCCTG<3GCAATTTAAGTCAACAACT 

TCGGGC^TCAAGTTGAATGTCAACAAACTTTTGGACATGGCAGCCCAGGTAAGGAGC 
CAGTGGGGTAAGGACTGGTGACCTATTCCTGGCCTC 

GGACCTGCGCGCCGCCAACATCCTGGTGTCTGACACGCTGAGCTGCAAGAT^ 

ATGAGTACACGGCC (X^GAGGGT AG GTGTGGGATAGAG AGGAGGTT CATGGC CAG CAGGAGAGCAAG CACACACAGG GGGACTAAC 

5 5 T^TGCTTCTCTCTAGGGGCCAAATTTCCCATTAAGTGGACAGCACCAGAAGCCATTAACTATGGGACCTT 

GTGTGGTCCTTCGGGATCTrGCTTACAGAGATTGTCACCCACGGTC 

TCTAAGTCTTCAAGGAAGGGGAGCCAAGCCATACCTTCCTGCTCACCTCTACCCAGGCTCAGACTCTGAAACTCTGAGTTGCTTCT 
CTCCCATCCACTTGGG CCTGTGACTCCGAAATCTTCTGTTAGATGCTTCATTCCAT CTCTGCCTCTGAGTCCCCACACTTCTCAAA 
AACTCCAGTTCTTCCTTGCTTGGTTATGGCAGAGGCCACCTAGTA^ 

GGTTTCTCTGTTTGGCCCTGGCTGTCCAGGAACTCACCATGTAA^ 

TCCAAAATGCTGGGATTAAAAGTGTGCCACCACTGCCTGACCTCCAGTTTCTT 

AAGCCATTACTAGTCATTACTCTGGGTACGTTATGTTGCCCTCTGAGCCTCAGTTTCCCCATAC^ 

ACACTTGCATGG GAG CTGCTT AG TAAG CACTGGTGCTTT CCTACTG AGATGTCACAG CTAAAG AG AGAAACCATGT AGAT C CCT CC 

6 5 TTCTAGGAAAGGCCCTTATC^CGATTTTATTGAATGTTTGTT 

ACAATGTCTTACATAGCCTTAAGCTGACCrCTAACACTCTATGAATCTGAAGATGTCCCCTGAGCT 
TCACTGGGAGGTGCTTGTATTACAGCAAGCATGTGCCTATGTACCTGGGGATGA^CCCA 
CTGCCCCTGAGCCATGCTTCTAAGCCTGGGGGTGGGATTTTTCTAGAAGGAATTTGATAACCTCrr 
AAAGAGTAGGAGGTTATAGAGTTCAGAGGTCAGATTGGCAGAAACTTCCCACAGGCCCA^ 

7 0 AG GAATGTAAAGG CACTTTGAAT CT CACAGTTAG AGGCTAGAAGAG TTCC CAGCTCTGGG CTTGTTCT ATCTCACAGCC CAGTATT 

CATCTCCTGTGCCTAGGCCCAAAAGCTAGAAATACCTCAACAC^ 

CCCAT CTTTGCCAT CAAAT ACG AACAAC CC GAGGTACACCTT ATAAAAG AACACC CCTAGTCAGGT AC CTCT TATATACACTGATT 
CCTTCAGTCCTCGTTAAAACTCGGTGATTCACTCrATCCCAGAAGAGGAAGCCGAGGAGGTGGCCC^^ 

CATGTATG CTATTGAGTGTGTGTAGGCACTATAGGTGTGGGCCTGAC^TTCATTCAGCAGGAAATG CTAGCGTG CCTTACCTTAAT 
7 5 G AAGACTAGCAT CAGGTTGGAT TG GTAGTTC CAT ATTCTGAAC CTT ACCTGTGTTGGT CAG CATTC CATTACTGAAAACAAAAT CA 
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ACCAAAGACTCCCAAGACACTCTCAGGTGACCCAAGCTTGGGTGACT^ 

TTCCTGGAAAGGTAGAAGAAGAGAA^ 

TGTATAGTCCTTGGCTCTCTTGGAACTCACTTTGTAGAC 

GTATTGAGATTAAAGGCATGGTGGCCTGACTGAGATCCCATACTT 

GCTTAGTAGCTACGAGCACTGGCTGCTCTTCCAGAGGACCTGGG 

ATTCCAGTTCCAGAGAATTTAGTGCCCTCTTCTGGCCTCCTTGAGCACT 

TAGAAATAAAATITTAAGCTGAAGATGGTGGCACACACCTTTAATCCCAGAAAC^ 

TTACAGAGTTC CAGGACAG CCAGGGGTACACAGAGAAAC CCTGTCTTGAAAAACAAAAAAAAGAAAGAA 

ATAGATGGATGGATGGATGGATGGATGGATGGATGGATGGATAGATACATAGATAGATAGATAGA 

AGATAGATAGATAGATAGATAAAATAAGTAGGAAAGAAAGAGAACTGGAGTCAGT1T 

GATTAAAGTCTTGATGTTTTTTCCTCCATCAACGTTCCAGGAATGACCAACC 

CATGGTGAGAC CTGACAACIGT CCGGAAGAG CTGTACCACCT CATGATG CTGTGCTGGAAGGAGCGC C CAGAG GACCGGCCCACGT 

TTGACTACCTTCGGAGTGTTCTGGATGACTTCTTCACAGCCAC^ 

CAAGATCCTCCCAACCACTGTC 

AT CTGTAGTTATGTGGGTTCT ACACA 



CATAGAGAGGAGAAGGCCTGCGATTGATGCX3TGTCTCTGTCTACCACTGCCTTTCAGAGGGTCTCCA 

CTGCCATCCAATCTTATGTCTCTGT 

TGTACATTTTTCTACTGCATACTTTATT^ 

TTGGATCGCCCACACrAGATTCTCCTGCCCTGAGTTGCCACTATCAN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNIWNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNN^^ 



NNNNNNNNNNNNNNNNNNNNN^ CATGGGGAG CCG TTGGCTTACT AG ACAAAGTATT AT CAG CTAACCTTTAAGC 

TGATTGGTTCTGAGTAAGGTAAGGGGTGGAGTGGGGGGATGGGTGGTGAACAGGGGAACrrTCCAGATCA 

GAGTGTTGTAAC CAAAACTTTTT AG CTTATT AG TAGCATTCCT CAGTGCCAG CCAGAAAAAT AAATAAATAAAT AAATAAATAAAT 
CTACCAGGCAGTGGTGGGACACACCTTTAATTCCAGGACAGGAGGCAGAGGC^ 

ATAGGGTAAGTTTTAGGACAGCCGGGATTACATAGAGAAACCTTGTCrrCATACACCAAGGGAGAAAAAAA 
GGTTTGTGGAGCAGGGTGTGGTGGTGCACGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGAC 

AGG CCAGC CTGGTCT ACAAATT AAGTT C CAGGAT AGCCAGG GCTAT ACAGAGAAAC CCT GTCTC GAAAAACCAAAAAAAAAAAAAA 

GAAAAAAAAGAAATGGTTTGTGTGGTCACACAAAGACAGCACTTGAGATAAGGTCAACTCTCT 

TAAATGTTTTGCCTGCAAGTCTGTATGTGAGTACCATGTGTGTATCTAGTGCCCTTGGAAGCAG 

TAATTGTGTGCCACCATGGGGTGCTGGGAACAGTCTTCTGTGCGAGCAGCAAGTGCTCCTAACAGCTGAGACATCAGT 

CCATGTCTACTACTAAAGTGTGTGTGTGTGTGAAGATCATATATATGCTCTA 

GATGAGTCACAGCTTGTGTCCTATCTAGAACTTTCTCTCAGATGCTCCTAGCATA 

AGCITATGGCTTTATAGCCAATCTAG 

TTAAAGTAAGTGCAAGGACTGTTCTTTGTTCTGTTGAC1T 

ACTCCTAAAGGACATTTCACCCGAAAAACAAACAAATAAAGAGAG 

GACCETTTTAC^AAAGTGTGTATATATGTGTGTGTCT^ 

GTAAAACTGCATATCCCCAGAGTCAGAGGACAACGTTAGAGAGTCAGTCCTGTCTT^^ 
GTATCCTCAAATTTAAGACAAGAACCTTTACCAACGAAGCCT^ 
GAGACAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGGAACTCACTTTGTAGACCA^ 
TCTGCCTCCCGAGTGCCCAGGCCAGCATTCTTTACCCCTCTGATCCAACTTCAOT 

TCCCTTTTAT ACACGGCCATCA<3T^TTTTT CT CXTTCXTTTTTC TTTTXC TTXX TOT CTTXCTTTXTCXT CTTTT C T XX CTTTC TT*T 
CTTTTTTTTTTTTTTTTTTTTT^ 

AGACCTTGAACTGGCTGGCCTTCAACTCAGAAATCCATCTGCCTCTGCC 

CCGGCTTCAGGGCCTCTTG AAGAACAGTCAGTG CTCCTAAC CG CTGAACCATCTCCC CAGCCCCAGTGGTAGAGATTTTTGTGCTA 
AGGTTGGAGAACCTTAACCTTGGATAAAGCCTTATGTATGCTTAGCC 

GATTACAGACATGCCCAACAGGGTTCAGTATTTTCACCTGCATTTTAAAA^ 

AAGATTTTATATGTAAAACTCTAGATCTGGGCTGTCTCTTCAACAATGAGAGGTTTGGTTTCCTGGGTT^ 
CAAGTACTGGCCATCCAGTTTAGTTGGGACATGAGTITTCCATCCCTGCCAG^ 

GTTAATAG CTATTTAT CAC CAAAATTGGTG CTG CTGTTCTCGGGATAAAG AGG AAAG T AAAG AAAGAAATTTTTAAAAAAAT CAAA 
AATTAGGGGCTGGTGAGATGGCTCAGTGGTTAAGAGCACCGACTGTTCTTC^ 

GTGGCTCACAACCIATCTGTAACAAGATCTGACGCCCTCTTCTGGAGTGTCTGAAGACAGCTACAGTGTATT 
AAATAAATCTTAAAAAAAAAAATCAAAAATTAAAAAAAAGTCTGAG 
AAAAAGCAAGAGGTCATATATTCCTTTAAGGAAGTTAAAAGCTCCAAGCTTTAA 
AATATAAACTCATTCATTTATACCATGTGCTTGGCCTATGCGGGCATC 



AGATGGGAAAGGAGGCCTAGAAATGATGAGGAGACCCACCTAAAGCCAGAAGAGTAGCAGGGGTGTAGAAATGGAAGTCTGTGTCC 
TGGCCCTCTGTCATCACACAGCCTCCCTGTCTCTAAAGGCCCTGTC 

GGTTGTGGTGCAAGCCTCAATCCCAGCTTTTCAGAACAGGGCTTCATTGTGTAGCTCTGGCTGTCTTGGAACT 

AGGGTGGCTTCCATTCAGAGAGATCTACAGTCCAGTGCTGGTATTAAACTCGTGTGCTACTATGTCCAACTGT^ 

CAAACAAACAAAAAAACCCCAAAAACCAAAAAACTCAGCACAGAGCCTT^ 

TTCAGGTTTTGTAGGAGCATGGGAGATGGTGGTTTTCTGCCATTTAGACACTCCCAGACATTTGTCAACCTGTAA 

GGTGTCACAGTAAGGGCGTGTGTGGGGTGTTGCTACTGGTATAGAAGAAGGCCAAGCAGTCAATTAACATCT 

CAGCCCCTACAACAAAGAGTGTAAGTAATTCAAGGTTAAGAAATTTTGCTGGCA 

TTTTGTTGTTGTTGTTGTTGGTTTCTCCCCCCANNNNNNl^NNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNITON^ 
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g c ctggt ctacaaagtgagttccaggt acag ccaggg ct atacagagaaac c (hatcttgaaaaacaaaaaagaaacaaaaaag ca 

aaaacaaacgagagaaacaaccccgctccccccccaccccccgttccccccaaacaac^c^ 

agttctgggtaggtgagatggttgctgccaccatgcctcac^ 

catagaaacactcccttcaacactcccrtccagtaaattaaaaac^ 
5 ttggattttgtcttttaaaaaaaaaaaaatatgtacaattggagacgga 

attgttttgttattttttgagac^tgctttcact caagttggcct cagattacag agat ctt cctgactctg c ctcct 

gagtgctggg at caaagacat agcaac cacg cccaactg aaaacgagcittg acattaaa 

gcacataaaggtcttgccccactacagtaggcctcr^ 

ccagaccaaacctagtcttttagctgcgtgaccrrc 
1 0 aagttggcctctgatctttgcacacgctcccaactgtgaag 

tggtacggcccco^tctcccccacccctcc<3cgccggactttggtacag^ 

ttgcccccc^cgagttccgggcccgaccccgcgcatgccgattggttagagtgagacccgggc^ 

ggggtgggcggggcttcgaggccggcccctcccccgggtcggacgccgtaaggagccgco 

agggccgggcggcgagcaagatggcgcagactcagggcaccaag^ 
1 5 ggcgcgggccgggc<;ttggggaaaggctgggggaggaggctgggaggaggctgaagccccgggg 

GAAGGAGGAGG TTGGAGAGAGGAAGTCGAGG CTGAGGC CGGAGATGAGG TGGGG GTCGTGG AAGGGGTAGT CTGCGGTAATC CCG G 

TGGCX3GAGATTCTCGGGGAATCCGACCTGAAGTTGGAGTGGGATTCCrGGAG 

GATGGAGACGGTTTGCCTGGATATCTTCCCCTGCTG 

CTTTCTTTTGGTGAGTTCTGCAGGACTGGCCACTCATCCCCTCCCCCACTCCATCCCCC 
2 0 TAGTTGGGAAATGGGGCTCCCCGGGGATGCAAGGAGAGAGGTGCATGAAACTTC 

GGAGCTAGACAATGTCACAGGTATCTTTCGGAGTTAGCGCTGGGACAAGGGAATGTGGGGTGGA 

GCCCAACCTGAGAGGAG CAGGTTGTGTGTCAGTGCTTAGGACCCAAGAACTGGAGTTACAAGCCAGG CCCTGG CAAGTGGAG CCAA 
CAGCAGGGGGAGGGGACAGCACCTTTATCTGAGAAAGGGTCT(^CAGTGTAACTCACGCTGGT 

2 5 ACTCTTGACTCCAGGCTTACACCACTTGGTCCAGCATTTGTGTT^ 

CAGCCAGTAGGCGGT T GG CTG TTT AGAT CTG AAATTTAG AAGACAG ATATTTTTTGGTAATTGTCGG TATAAAAT AATTC^ 
TAATTTTAGCTGG ATTAG AAAAGGG TGTT AT AACTC CAGTTGTAG AGGGTG AAAAG CT ATGTTGAT ACTTT TTTT T TTTTTITGAA 
ATGGCCTCTGTGT AAC CCTX5GCT AGCCTGGAACTCACTT^ CAGGTTTGCTT CCTACTGGAAG AG ATT TAT CCTACGCC C C 

TCCCGAGTGCTAGGATTAAAGGCATGTGCCACAACCCTG 

3 0 GATTGCTCTTTCACTATGCrrGTCTTTCAACTGTTAGCCACATTCCrrGT^ 

AAGCCTGGTGTCTTCTTGTTTCTGTTCTGTTTTGTTTTTAAGGCAGGGTCrrGTCTCTAA 

TGTAGACCAACTAGGCTG^CCTCTGAAOTCAGAGAGATCCCCCTGCCT 

GTCTAGCAGCCTCAl"I"l"rTTGTTTCGGTTCTTTTTGTTTGTTT 

G CGTAG CCCTGGCTGTCCTGGAACT/TACTCTCTAGACCAGG CnX3GCCTCAAACTTAGAAATCCGTCTGCCTCTGGCCACC AAGTGC 

3 5 TGGGAATTTITTTTCTTCTTGTTTTTTGTAGATAATACA 

AGTATAATAGGCATAATTTTTTTAAAGAATTATTTATTTTACGTATGTGAGCACACTGTAGA 
TTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGAAACGAACCO^AC 
■ CCATCTCTCCAGCCCCATAATTTTTTTTGAGATTAT 
AGGATTTCTCTGTATAGCCCTGGCTGTCCTGAAACTGGTGCTGTAGACCACTAGATTC 

4 0 TCTTGCCTCTCAGGTGCTGGGATTAAAGGCCGGCCCTAATACTTCCTTGCAGAGGTTTTC 

GGCCCTGAGAGATGGCTCAGTAGTTGGTAGTGTGTACAGCCCTTGCAGAGGACAAAGCT 
TCACAGATGCCTGTGACTCCAGCTACAGGGACCCAGCGCCCTCTTCTGGCTTCCACACCGCC 
CACACACCAGTTTAGAGCATTTCCTACTGAGATGCTGCGCTCAATTCCCAGGGCCCCCCACAGCCACT 
GAATCCGGTGATCTCTTCGCGCTTCAAGTGCTAGATGGGAATGTGGTGCTTGGACATACATGCAGGCA^ 
45 AGACTGTCCTAGAACTGGTCTATTAGACGAGGCAGCTGTGAACT 

TG CACTAATGGGGAAGTGCTGGGAGGGATTGGGGCGGAAG CTAGATTGGAGAG AAGAGGATACTTTAGCTTG CAATTTTTTGATTG 
ATTGGTTTTTTGAGACAJGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGGAACTCA 
. ATTCACCTGCTTCTGCCTCCCAAGTGCTGGGGCTAAAGGTGTGGGCCACCACT 
GGGTCTCTCTCTGTGTATTCTGCTTGGCCTGGAACTAAATGAGACCC^ 

5 0 TACCACTACACCCTTCTAAGAGAGGAAGTATTGTTTTTTTGTTTTTTTGT 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
. NNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNTTCAGTCTCrrTTCCGGTTGGGACTCACTTCACT 

TTTTCTTTCTTTCCTCTX3TGACAGGGTTTCTCTGTGTAGTCCTGG 
TCCTCCTATCTCTGCGTCGCAAGCACTGGCATCAAGAGCTGTGTGCCACCA 
60 CTACTGAGAGCAGCAGTGAGAAAGAACA 



MOUSE SEQUENCE - mRNA 

GAATTCCGGATTCAAGCTCCTGACTGCGGCCAATGGGGGCCTCTGAGCT 
GGCTCTAGGATGTCTGATGTTGGGGCGAGTGGCTTAGGGGCAGCTCCTTC^ 

6 5 GTCTGCAGCTCAAACCCTGAAGATGACTGGATGGAGAACATTGACGTGTGTGAAA 

CIAAGATCTCGCTGCCCATCCGGAATGGCTCTGAAGTGCGGGACCC^CTGGTCACCTATGAGGGATCTOT 
TGCAAGACAACCTGGTTATCGCCCTGCACAGTrATGA^ 
ATCCTGGAGCAGAGCGGTGAGTGGTGGAAGGCTCAGTCCCTGA^ 
AGCAAACAGCCTGGAGCCTGAACCTTGGTTCTTCAAGAATCTGAGCCGTAAGGACG 

7 0 CGCATGGATCCTTCCTGATCCGGGAAAGCGAAAGCACTGCGGGCTCC 

GAAGTGGTGAAACATTACAAGATCCGTAACCTAGACAACGGTGGCTTCTACA 
TCTAGTCCGCCATTACACCAACGCCTCTGATGGGCTGTGCACAAAGTTGA 
GGTGGGAGGAOSAATGGGAAGTTCCCAGGGAAACACTGAAGTTGGTGGAGCGGCT 
GGGTAOTACAACGGACACACGAAGGTGGCGGTGAAGAGTCTGAAACAAGGGAGCATGTCCC 
7 5 CCTCATGAAGCAGOTGCAGCACCCGCGGCTAGTCCGGCTTTATGCAGTGGTCACCCAGGA^ 
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TGGAGJUVCGGGAGCCTACTAGATTTTCTCAAGA 

ATTGCAGAGGGCATGGCGTTCATCGAAGAACAGAATTACAT C CAT CGGG AC C TG CGCG CCG C CAACATCCTGGTGTCTGACACG CT 
GAGCTGCAAGATTGCAGACTTTGGCCTGGCGCGCCTCAT^^ 
AGTGGAC^GCACCAGAAGCCATTAACTATGGGACCT^ 
5 GTCACCCACGGTCGAATCCCTTACCCAGGAATGACCAACCCrGAAGTCAIT 

TGACAACTGTCCGG AAGAGCTG TACCACCTCATGATGCTGTGCTGGAAGGAG CG C CCAGAGGACCGG CCCACGTTTGACT ACCTT C 
GGAGTGTTCTGGATGACTTCTTCACAGCCACAGAGGGCCAGTACC^ 
AACCACTGTCCCTGGCTAGGGCAGATGGAATTCCTGTG^ 
AACCTGTGCGTGTGTCACACATAATGCAGTTTCTGTCTGTACAC^^ 
1 0 GTGGAGCACGTGAGTCTAAGCCTTGGATACCTCCTAAGATGTCT 
GGTCTGCGATTGATGOGTGTCTCTGTCTACCACTGCCTTT 



MOUSE SEQUBNCE - CODING 
15 >gi | X98763 | gb | M12056 . 1 | MUSLCKMouserearrangedlckgeneencodinglyinphocyte- 

specificproteintyrosinekinaseATGGGCTGTGTCTGCAGCrCAAACCC^ 
TGTGAAAACTGCCACTATCCCATAGTCCCACTGGACAG 

GGTCACCTATGAGGGATCTCTCCCACCASCATCCCCGCTGCAAGACAACCTGGTTATCGCC 
ATGGAGACTTGGGCTTTGAGAAGGGTGAACAGCTCCXSAATCCTGGAGCAGAGCG^ 
2 0 GGCCAAGAAGGCTTCATTCCCTTCAACTTQ3TGGCGAAAGCAAAC^ 
TAAGGACGCCGAGCGGCAGCTTTTGGCGCCCGGGAACACGCATC 
TTTCCCraTCGGTCAGAGACTTCGACCAGAACCAGGGAGAAGTGGTGAAAC^ 
TA(^TCTCCCCTCGTATCACTTTTCCCGGATTGCACGATCTAGTCCGCCATTACACCAACGCCT 
C^GCCGTCCTTGCCAGACCCAGAAGCCCCAGAAACCATGGTGGGAGGACGAAT^ 

2 5 AGCGGCTGGGAGCTGGCCAGTTCGGGGAAGTGTGGATGGGGTA 

GGGAGCATGTCCCCCGACGCCTTCCTGGCTGAGGCTAACCTCATGAAGCAGCTGCAGCACCCGCG 

GGTCACCCAG G AAC C CATCTACATCATCACGGAAT ACATGG AG AACG GGAG C CTAGT AGATTTT CTCAAG ACTCCC T CGGGCATCA 
AGTTGAATG T CAAC AAACTrTTGGACATGGCAGCCCAGATTG CAG AGGG CATGG CGT T CAT CG AAGAACAGAATT ACATC CATCGG 
GACCTGCGCGCCGCCAACATCCTGGTGTCTGACACGCTGAGCTGCAAGATTGCAGA 

3 0 TGAGTACACGGCCCGGGAGGGGGCCAAATTTCCCATTAAGTGGACAGCACCAG 

CAGACGTGTGGTCCTTCGGGATCTTGCTTACAGAGATTGTCACCCACC^ 
ATTCAGAACCTGGAGAGAGGCTACCGCATGGTGAGACCTGACAACTGTCCGGAAGAGCT 
GGAGCGCCCAGAGGACCGGCCCACX3TTTGACTACCTTCGGAGTGTTCTGGATG 
CCCAGCCTTGA 

35 

HUMAN SEQUENCE - GENOMIC 

NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNl^Nl^NNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN 
50 NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNIJNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNW 
65 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNTCCTTCTTCITCTTTCCCATCT 

GCCTTTTTCTAAAGGCTCCAAGAATGTCAAGCCTGGCAGGGA 

TTTACAACACAACCAACTTGTGGTTTCCTAAGGCTCATACGACTATCAC^ 

GCTTATCTGAGACCAGTTCC^GGTTGCTGTTAGTCCTTTACTGGGGGCTCTTGTTGCTAATTACTCTCA 
7 0 AGTCATTTGCAGTCACAGATGGGTGCACTCTGGTCTrA^ 

GAAAGAGTGAATACGTACCCTTTTCCTGGAAAGTTTCTTGATTTGATTTAAA^ 

GGTCATGTTTGTTTAACTTACATGAATAAGGTCAGGAAACACTCATGCTC 

CATGGGAGGCTGAGTATCTCAGTTTAAGAAACAGGAG 

TCGAGGAGGG CAGATCAC C TGAAGT CAGGAGTTCAAG ACCAGCCAG ACCAACATGGTG AAACC CTGTCTCT A 
7 5 • TTAGGCCGGGCATGGTGGCTC^TGCCTGTAATCCCAGCACTTTGAG^ 



1897 



WO 03/008583 



PCT/US01/51291 



ACCAGCCTGACCAACATGGAGAAATCCAGTCTCTTAAAAAATAC^ 
ACTTGGGAGGCTGAGGCAGGAGAATCACrTGAACCTGGGAGGTC 
GGGCAACAAGAGCGAAACTCTGTCTCAAAAAATAATAATAATAAGGCCGGGC^ 
GAGGCCX3AGGCAGGAGGATCACAAGGTGAGGAGATTGAGACCATCCTGGCAAA 
5 AAAATCAGCCAGGTGTGGCGGCACGCGCCTGTAGTCCCAGCTACTA^ 

GAGGTTG CAG TGAGCTGAGAT CG CGCCACTGCACTC CAGC CTGG GAGACAG AGTGAGACTCTGT CTCAAAAAAAAT AAAAAAGAAA 
AAAGTAATGAGAGAAATGCAAATG CAGAAAGAGAAATGCAAATG CCTGGCAGTATGTTGGGCrTCTTTTCAAAACTTAAAAAAATT 
TATTATTATTATTCATTTATTTATTTACTTTAGACAAGG 

1 0 CCCAGCTAGTTTTTATAATAATAATAATAA^ 

CATTGT ATG CC TCTATCAAAAT AT ATCATGTACCCCAT AAACAT AT ATACTTACCAGATACCCAT 

TAAATAAATAAATATACAAAATTAGCCGGGCGTGGTGGCACATG 

CTTGAACCTGGGAGGCAGAGGTTGCGGTGGGCCGAGATCGTGCC^TTGC^ 

AAAAAAAAAAAAAAAGAAAAGAAAAGAAACAGGAGTCTAGGGCTGGGCACAGTGGCTCMTGCCT 
1 5 GCTGAGGTCAGGAGTTOSAGACCAGCCrGGCCAACGTGG 

AGCCAGGCGTGGTGGTGCACGCCTGTAGTCCCACCTGCTTAGG^ 

ACACCACTGCACTCCAGCCTCGGCMCAGAGCAAGTCTAAAAAAAA 

GGGCATGGTGGTTCACGCCTGTAATCCCAGCACTTTG^ 

TGGCCAACATGGTGAAACCCCCATCTCTACTAAAATTACAAAAATTAGCTGGGCOT 
2 0 AGGGGGCTGAGGCAGGAGAATCACITGAACCTGGGAGGTGGAGGTT^ 
GACAGAGTCAGATTCTGTCTGr i T lT rri TC T lT rGAGATGGTAT 
AGCTCACTGCAACCTCCACCTCCATGGTTCAAGCAATTCTCCTGCCT 
ACGCCCAGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTCACCATGT^ 

G CCCTC CTCAG CCT CC CAAAGTG CTGGGAT TACAGG CATGAGCCACCG CGCCCAGCCGAG ACTC TATTAAAAAAAGAAAAAAG AAA 

2 5 GAAACAGGAGTCTAAAGCAAAGGAGATTAATTACCCAGTCCCATTCTTCC^CCTT 

CACTGGTG AAAGTGTTTC TAAGCACT AAACT AAG CT AACATT CATGTGAGCTTATGAGAAATGGATG AAG AG CCCCAT TTG ACAG A 
AGAGGAAAAGTGAGGCTTAGGAGGAATGATTTCAGCAAAGTCACACAGTAAGTGTCAAAATTGC 

TATX3GGGACAGTCTTCTGAGTTTTGCTAATGAGAAACTTCCACT CAG CATGAATTCTCTCTCTC 

TGTCTCTCTTTTGTTTTTTCTTTTTAGACAGAGTCTCTGT 

3 0 CTCACTG CAAC CTCGACCTCTTGAGTTCAAG CAATTCT C CAG CCACCATGC C CAGCT AACTTTT ITT^L'lTTriTTTTTTTTGT ATT 

TTTAGTAGAGAC1AGGGTTTCA.CCRTTGTTGGCCAGGCTGGTTGTCTTGAACT 
CAAAGTGCTGGGATTACAAGCATGAGCCACCGTGCCCAGCCT 
CIAGTGGCnTGATCTCGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAGTGATT 
CAGGCGTGTGCCACCATGCCTGGCTAATTTTTGTATTTTT^ 

3 5 CTGACCTOTTGATCCACCCG C CTCAGCCTCCCAAAGTGCTGGGA C CGGACTTTTTTTTTTTTTT 

TTTTTTTTTTTTGAGACGGACGGAATCTCACTCTGTTGCC^^ 
TCCTGGGTTTAAGCGATTCTCCTGCCTGAGCCTCCTGAGT^ 
TTTTTAGTAGAGGCAGGGTTGCACCATGTTGGCCAGGATGGTCT 
GTGCTGGGATTACAGGTGTGAGCC^CCGCGCCTGGCCTTTTTTTTTTTTTTT^ 

4 0 AGTGCAGTGGCACAATCGTAGCTCATTGCAGCCTTGAGCTCCT 

ACTACZAGGCATGCCCCATCACACCTGGCrAATTTTTTGATTT^ 
GAACTTCTGAGTTTGAATGGTCTTACTGCCTCAGTCTCCCAAAGTGCCGCGAT^ 

TCCAGCTTTGCTCTAOTCCAAATAGGTTGTGTGACTTTAAGCATA CTGTGGGCCTCAGTTTGCT CATCTAAAATCAGAGGTTT 
AGCCAGGTATGGCAG CT CATG CC TGT AAT CC CAGCACTTTGGGAGG CCAAGAAGGAAGGATCACTTGAAGC CAAG AAT^ C 

4 5 AGTCCGGGCAACATAGCAATACCCCCATCTCTAGAAC^ 

AGTTAGAGGATCACTTGAGCCCAGGAGTTTGAGGTTCCAGTGAGTTAGGATTGGGTCATTC 
CACTTTATCTCAAAAATAAATCAACAAATAATAAAATTGGAGATTAGGCTTGCT 
ATGCATTAACACTGCATGTTCCAGTTGACATTTATTAACTAATTTTGGGTC 
CTCTTGATGAACCCCCACTCTCATCAAAGGTATGGGAGAGGCAGTGGGACCCTGGGGCTG 

5 0 CTGTGTCTAAG TTGATT ATTGAGGTCTGAATAGAATG TT GCAGG AG ACTGGG GTG AGAGGG AAG GAAAGAG CATTGGGAAGT CTT C 

AGACTAGGTGATATTTAGGCCAGGTGCCTGTAATCCGAGCACTTTGGGAGGCTGAGGCAGGAGAATTGCTTC 
AGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAAT^ 
AACTACTCAGGAGGCCGAGGCAGGAGAATTGCTTGAACCTGGGAG^ 
AGCCTTGAAGGTTGAGTTAGAATTCAGCAGTTGGACAAAAGAGAGCA 
5 5 ATTTGG GGAG GC CAAGGCAG GAGGATTGCTITG AG CCCAGGAGTT CGAGATCAG ACTGGG CAACATAGTGAGAC CTC^ 

AAAAATAATCAGCTGGGCATGATGGTGTGCACCTTTAGTCTCAGCT^ 

GGT CTAAG GG G ATTG CGCCACTGCACTTCAG CCTGGGAG AG G AAGGG AG ACCCTGTCT CAAG AGAGAGAG AGAAAGAGAGAGAG AG 
AGAGAAAGAGAGAGAGAGAGAGAAAGAGAGAGAGAGAGAGAAAGAGAGAGAGAGAGAGAGAGAGAGAGCGAGAGAGCGCACGCAT^ 
T CAAAGCAGGGAG AACCACAAGT ACAAAG GCAT TAAGTAAG AAAAT AG CATGGG GTGTTTGGG AAAG GCAAGAATAGTTCAGAACA 
60 CAGGCTGTGTGTGTGGGGGTGTGTGGGGGTGCCTGTGTGTGGGTGTTTGTGTGTGTGTGAGTGTGGGTGTGTGGGGGTATCTATGG 
AGGGGATGCCTGTGTGTGTGTGTAAGGTGAGTGTGTGGATGTGTGCATGGGTGTGGGTGAGTGTGTGGAGGTGCCTGTGTGTGTGT 

GTGTCACTGTGTGGGTGTGTGGACGAGTGTGTGTGTTCCTGGGTGTGGGAGCCTGTGTGT 
GCATGAATGTGTGTGTGGGTACGTGTGTGAGAGTGGGTGCCTGTGTGTGGGGGGTGAGTGT^ 

65 GGTGAGTGTATGTGTTTACTGAGTGTGAGTGTGGGTGCCTGTGTGTGGGAGGGTGAGTCTGTGTGTGAGTC 

GTGAGGGGTGAGTGTGTGTGTTTATGTGAAAGTGTGTGTGTGTGGATGCCTCTGTGGAGGTGGGATAGGGGGTGCCTCTGTGTGTG 
TGTGTGAGAGTGTGTGTGTGTAGGGTGTGTATATGTATAGGGTGTGTGTGAGTGTGTGT^ 
TAGACTGCGGAGGTGGATTTCATCTTGATATGAAAGGTCTGGAATGCATGGTACATTAAACTTTG^ 
TCTGAGGAGCAGCCXITAGAGAAGGAGGAGCTGCAGGGACrCCGGGGGCTT 

7 0 GCACACATTTATCACTTTATCTATGGAGTTCTGCT^ 

AAAAAAGTTATGGCCAACAGAGTCACTGGAGGGTTTTCTGCTGGGGAGAAGCAAGC 

GTGGGCTGTGTGAGGGGAACAGCGGGGGCTTGATGGTGGACTTCGGGAGCAGAAGCCTCTTTCTCAG 

GAATTATAATAGGAGGTGTGGCGTGCACACCTCTCCAGTAGGGGAGGGTCTGATAAGTCAGGTCT 

GTGT CATCTC TAGGAGG TGGTCCTCC CAACACAGGGTACTGG CAG AGG GAG AGGGAGGGGG CAGAGGCAGGAAGTGGGT AACTAGA 
75 CTAACAAAGGTGCCTGTGGCGGTTTGCCCATCCC^GGTGGGAGGG 
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CCTGAGGGCTCAGAGGGAGCACCGGTTTGGAGCTGG 

TCACAATCTCAGGTACTTTTGGAACTTTCCAGGGCAAGGCCC 

CAGCCCXXTCTTCCATTCCCTCAGGGACCATGGGCTGTGGCTGCAGCTCA 

TGTGAGAACTGCCATTATCCCATAGT CCCACTGGATGGCAAGGGCACGGT AAGAGG CGAGACAGGGGCCTTGGTGAGGGAGTTG GG 
5 TAGAGAATG CAACCCAGGAGAAAGAAATGAC CAGCACT ACAGGCC CTTG AAAGAATAGAGTGG CCCT CT CCC CTGAAATACAGAAA 

GGAAAAGAGGCCCAGAGAGGGGAAGGGAATCTCCTAAGATCACACAGAAAGTAGT^ 
TGGAGAGGCTGAGAGCAGAGCAGGGGGGAAGGGGGCCAGGGTCTGACCCAA 
CCACAGCTGCTCATCCGAAATGGCTCTGAGGTGCGGGACCCACTGGTTACCTACGAA 
AGGTGACCCCAGGCAGCAGGGCCTGAAAGACAAGGCCTGCGGATCCCTG 
1 0 CCGGTCTTGCCTTCCTTGTCCCCCACCCTGTAACTCCAGGCTTCCrc 
ACAGACAACCTGGTTATCGCTCTGCIACAGCTAT^ 
CCTGGAGGAGTGAGTCCCTCrCCACCTTGCTCTGGGGGAG 
CCTTGACCAGCTCGGGGTGGCCGCCCrrrGGGAClAAAATTCGAGGCTCA^ 

GGTGGAGG GGTCITTGAGGGAGGGTCTCAGGTCGACGGCTGAG CGAG C CACACTG ACCCAC CTCCGTGG CGCAGG AG CGG CGAGTG 
1 5 GTGGAAGGCGCAGTCCCTGACCACGGGCCAGGAAGGCTTCATCCCCT^ 

CGTAAGTGGGGACCCGTCGTGGGGGTGGGTAGGAGCAGATCTAGGGATCCTGGAGCAGGGAGTAGGCCTC 
TTGAGGGCCACGGAGGAAGATCCGAO^CAGCCGACG^^ 

CGCAAGGACGCGGAGCGGCAGCTCCTGGCGCCCGGGAACACTCACGGCTCCTTCCTCATCCGGGAGAGCGAGAGC^ 
GCG^GCGGCGGTCTCGACCGGGCGCGGGGGTGCCCCGGGGTGTGCCCGAGGGGGGG CG CAGGGTGAGCCCGAGGTGGAGACACGGG 

2 0 GTGAGTCGGAGGGGGACGCGGGATGAGCCCGAGGTGGGGGCGCGGGATGACCCGGAGTTGGGGGTGC^ 

GGCGCGGTGGCGGGCCAGACTCACTGCGTTCTTTCG 

GGGA CTTCG ACCAGAACCAGGGAGAGGTGGTGAAACATTACAAGATCCGTAATCTGGACAACGGTGGOT 
ATCACTTTTCCCGGCCTGCATGAACTGGTCCGCCATTACACCAGTGAGCCCG^ 
TCCCCTCAGTCCCCCTCAGGTGTCCCCCATCCATCTTTCAATGCCCTACTCCATTCCCCGCAGTC^ 
25 CTACGGCCCCAAGTGTTTGGGTGACAG CCC CACACCCCCTTGCTAGTCCACTTCACCTAGATGGGGGCTTGGAGAAGTGGGGG AGG 

TGGTGTCAATACGAGGCCTGCCCTATTGACAGCCTTCACCCCTCCCTCGTCCTCGCAGATGCT^ 
GAGCCGCCCCTGCCAGACCCAGAAGCCCCAGAAGCCGTGGTGGGAGGACGAGTGGGAGGTTCCCAGGGAGACGCTGAA 
AGCGGCTGGGGGCTGGACAGTTCGGGGAGGTGTGGATGGGTGAGTGTGGCCTCCAG 
CCGCTGAGGTGATGAGAGTCCCAGGACAGCTGCCTGGCGACTTTCCCACTCCTTCCCTTCCCCGACCCAG 

3 0 CACGAAGGTGGCGGTGAAGAGCCTGAAGCAGGGCAGCA1GTCCCCGGACGCCTTCCTGGCCGA 

AACACCAGCGGCTGGTTCGGCTCTACGCTGTGGTCACCCA 

TACCCGAGTCGGCTACCAGGGGATACTGCTCTCCCTGCTGTCCCTGCC^GAGGGTGGAAATA 
CAAG CAAGGAGGGTTTCTTGAGCTTTCCTGCCCCCTGGAGACTCACCTCCAGCCGGG 
CTTTGGGAGCCGAGGCGGGCAGATCACGAGGTCAGGAGTTCGAGACCAG 
35 TACAAAAAGTAGCCAGGTGTGGTGGCACATGCCTGTAATCCCAGCTACTCAGAAGACTGAGGCAGGAGAATCCC^ 
GGTGGAGGCTACAGTTGCAGTGAGCITGAGATCATGCC&CTC 

AAAAAAAAAAAAGACTCACCTCCAGGCATGAGAAAAAGGTGTATCCTCTGATCTGATCCTG 
GAATGTTCTGGAAGAGCATTCAC CCCAACCTTCCCTGACCTGCCATTCT 
GCTTGATCTCAGAAAGCAAGTGGGTCCCTAGCCCTGTTTCTCTCCCTTCTGCCCCTGAG 

4 0 GCCCTCTTTTTCAGTTCIGAGTGAGACCCCCACCCCTGCACCCCTCrTGGTCTGTTA 

AATATCAGTTTCTAGCTTCAGCTTTGTTCACACCGATTTGCC^ 

CCCAAGTAT AACAG C AG AAATCT AAATGGG CT CACCAGAAAT AAT ACAGAGT ATTCATCAAGG CCTCTAG AGTCTGACT G C CTGGG 
TCAGAAGCCTGG CTTGATGATTTTATGGCTTTGTGATAATT AT AT CACCTCTTTCTG C CATGGTTTC CT ACTATAAT AACG CGTT A 
GTAACTTCCTCACAGGGTCTTGTGAGGATGAAATAATTAATGCATACAGAGCAGTT 

4 5 TCAAGCCTATAACTCCAGC^CTTTGGGAGGCCAAGGTGGGTGGATCACTTGA 

CGAGACTTCATTCCTACAAAAAAATGTTTAAAAATTAGCTGGGCATAGTGGTGTATX3 
GGTGGGAGGATCACTTGAGCCCAGGAGGTTGAGGCTGCACTGAGCTGTGATTGCAC 
GACCTTGTCTCAAAACAAAAAACCCACAAAACTATGGCTACCATATA 
ATTCTTTGGGAATGTAAGATCTATCCTTTTTTCAGGAAGAGAAACAAGCTGTTCAT 

5 0 TCATGTGTATCTTT/rTTTTTTTTGAG 

CCTCCTGGGTTCAAATGATTCTCATGCCTCAGTCTCCC^GTAGCTGGGACTACAAGCATGT 
TATTTTTAGTAGAGACAGGTTTTCGCCTTGTTGGCTAGGCT 
CAAAGTGCTGGGATTACAGATGTGAGCCACCATGCCCAGCCTATCATC 
GAAGGATTCTGAGGTTCATTGGAAGTGAGTGCCAGGAATGAT^AGTAACATCTGCCATC 
55 TATGTOTATGTATTTGCTGTGTGTAGGTGAGGTCTTTGTCTTTGGTAACTAGTT CAGCACT 

TCATCCTTTTACAGGGTTTTAATCTCTTGGGAAGGGCC!ATTGGTCACTCACTGCATCCT 
CTCCACTATTAGAGAGGTGGTCCCTCAGACCCCCAGACCCCCTCT^ 

CTTACTCTCGGCTGGGCACCAGGAACATGGATGGATGAAACAAGGTTCCTGTTGTCAGGGAAACCTG^ 
GGGAAACCAGAGTTTGTGCCATGGGAAAGTTCCAGCATG 

6 0 CT AG AG AG G C TTCCTAG TGG GG CAGTCTAG CAGGGCCACG AAAGAAAAGTAGAATTTTTT TT AG CAGG CAG AG ATGAAACAAAG AG 

GCITTGGGGTGAGAGCCTAGTGGTCTGAGTGCAGGATCrCAAAATTGCTAAACTGTT^ 
AAGAAGCTACCAGTGTGGACC^TGCCAAGCTTAGGATACTAGTTGAGGATT 

GATCCCTCCCAGCATGCTCTAC CCTAG CCCCTCTGCAAAAAAACCTTTACATATCCTGGGCATCOTTACCCTGAAC^ CT 
CCATTCTCACTCTTCAATCTTCCTTTTCACATGTCAGACACACTTCTAGACT 

6 5 CTGGGGCAACTTGGGCCAGCAACTCTTGCTTCTGCCCACAGGGAGTCTAGTGGAT^ 

CCIATCAACAAACTCCTGGACATGGCAGCCGAA^ 
CATCTGGCTCAGGACCGCTGATCTGTGTTTGGCCTGCAGATTC 

TGACCTTCGGGCTGCCAACATTCTGGTGTCTGACACCCTGAGCTGCAAGATTGCAGACT 
ACGAGTACACAGCCAGGGAGGGTACGTCTGAGATTTAAGGGTGGTC 

7 0 CACTCTGCCTCCTCCTTAGGGGCCAAGITTCCCATTAAGIGGACAGC^ 

AGATG TGTGGT CTTTTGGGAT CCTG CTG ACGG AAATTGTCACC CACGGC CG CAT CCCTTACC CAGGTTAG AGC CAAGG G CAGGAAC 
TGAAAGGGTGATGGGAAGGGCAAGTCCATGTCTTCCCATTCAAT^ 
TCTATCTTCTCAGGGGTATGAGTCCAACGTCCCCAGGCAGATGCCACATCTCCCAGGGC 
ATAACCCCAGCTCTTTTCTTTTTCTCTTTTTTC 
75 CTGTCTCCCAGGCTGGGGTGCAGTGGCGCCATCTTGGATCACTGCAACCTCT 
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CTCCO^AATAGCTAGGATTACAGGTACGTGCCACCATGC^ 
' AGCCAGGATGGCCTCAATCTCCTGACCTCGTGATTGGCC^ 

CX3GCCCCAGCTCTTTTCTTTAGTTCTGGCCCAGGCAGCCTGGTATGCT 
TATCAGCAGATCTGCCACTGTTGCTAACTTTCTGTTTGGCITTC 
5 CT CTAAG CCT CAGTTTCCTGATACTAAAAATAAGAGACTAGGC CAGGT A 

CCAAGGTGGGAGGACTGCITGAGCCCAGGAGTTCXSAGACCA^ 
AGCCAGGTTTGGTGGCATGTGCCTCTAGTCCCAGCTACTCAGA^ 

GCAGTGAGCCAAGACCATG CCATAACACTCCAGCCTAGGG CAATAGACCAAGACCCTGT CTCAAAACAAGAAAAAGAGAGACTCCA 
TCTCIACAAAAAAAAAAAAACAAACAAACAAAAAA^ 
1 0 TTTGGGAGGCTG AGGTGGGAG GATCATTTGAGC CCAGGAGGTTG AGGG TGCAGTGAG CCATGATTGTTCTATTGCACT CCAG CCTG 
AG CAACAGAGAG AGAATTTGT CT CAAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAAGAAAAAATG AG^ 
GCCTGACTCATAGTAGGTTGTCAGTATACACTGACTCCTCCCACCATCTCCTC 

CAGGGATGGAATTTTTGAAAGG AACTTG CT AAGAAACTCTG GGGAAAGGAAATGGAG CAT AAAGACAGGAGTAGGGGGGT CATGCG 

G CACAGTGG CTAGATTTG TAGAAGCTT C C1ZAT ATCCCCAGAGCAAGGTGAACAGAC CTTTGCTC CAAGGAAGGTAAAG AC^ 
1 5 GAAGGCCGGATGCAGTGGCTCACACCTGTAATACCAGCACTTTG^ 

CAG CCTGACCAACATGGTGAAACCCCGTCTCTrACT^ CTGGGCATGGTGGCACGCGCCTGTAAC 

CCn^AGATACTCAGGAGGCTGAGGCAGGAGAATTGCTTGA 

CCAGCCTGGGTGACAAAGTGAGACTCTGTTTCA^^ 

CCTGCCCTGGGCTTGCCCCAAACCAGCGTCCAGGGGTAGTGCTGGA 
2 0 GTCCT CACTGCTCCTT CCCGGTTCC CACAGAGG CAAAC C CATAGGG AATGT ATCT ATTAGAAG AAAAT AGG CAGTGAAAG CATGAT 

G AT AAATG G CAACTGACATATAGCATTAGG GTCAGGGAGACAATGGGG AGAAATGGGGAC CTTG GTAAACTTGCAG C TGCTC CAGC 

AGATGG CTGCAATTAAGG CGTGGGGG AC CCAAAAAAACAGAT CT TTTGTTTTTTT AAGAG AAG C C CAAACTTGGAGT TTT AGG AAA 

ATCrcTTCATTTCCATTTATTGGCGATTGATTCAAATTGTT^ 

CCAGCACTTTGGGAAGCTGAGGTGGGTGGATCACAAGGTGAGGAGTT^ 

2 5 CTAAAAATACAAAAATATTAGCCAGGCGTGGTGT^ 

AACCCTGGAGGCGGAAGTTGCAGTGAGCCATGATCACGCTGTTGC^ 

AAATAT AAAT AAATT AAATAAATAAATT AATTAAATGAAT AAAT AAAT AA^ CAATC CAAAGTGGAT CTG TGGG 

CTATACTCAGGCTGGATTTAGCCTGrGGGCTTGC^ 
/ CC CCAG TTTTAAAATGTG AGAGT C C CAG C CAGGCGTGGTGGGT CACG C CTGCAACC CAG CACTTTG GGAGG CTGAGATGAGCGGAT 

3 0 CACGTGGTCAGGAGGTGGAGACCAGCCTGACCAACATAGTGAAACCCCATCTCTACTAA 

GCATGCACCTGTAGTCCCAGCTACTCGGGAAGCn'GAGACAGGGGAATCATTTGAACCTGGG 

CTGTACCATTGCACTCCAGCCTGGGCAACAGAGCGAGAC TCTGTCTCAAAAAAAAAAAAAAAAAAAAAAAATTGTGAGAGTC CCAT 

AGGCAGAGGCTCCCTTGTACAAGTCCTGGGGAAAAAGGGTCTCTGTTGACCAGGCTTTAGTCCCA^ 

TGATTTTGTAAATACCCTATAAGGTTTTGGTCTTTX3GGAACTGCTT 

3 5 ACTACATGAGCAATTTAGAAATAGGGAAGGAGGGAGCTATGTTACTGCTG 

TCTACATGCCAAAAG CCTG C CAG AGGGG C CAGG ACCAGCAT AAAGG GAG GG AAC CAGC C CAAAGACT ATTTCATCTGGGTCATGGA 
GGGGCCTGGGGAACCTTCTCAAGTCCAAGCAAGGTCTGCTTTGCTAGGCCCAGGGTCCA 

TGCCTGGTCTTAAAACTGGAGAGACCTCACC^CTGTCTTGGTTGTGGGTGCC CATC CTGCCCTGGG CCCTGACCATCTCATCATTC 
CTACCAAATAATAATAATCCCAGGTGCCCTTTATTATTTATTTATTTATGTATTTAT 

4 0 GAGTGCAGTGGCATGATCTCAGCTCACTGCAACCTCTACCTCCTGGGTTTAAGCGAT^ 

AATTACAGGTGTGTGCCACCACGCCCAGCTAATTTTTGTATTTTTT^ 
CAAACTCCTGACCCCAGGTGATCCACCCGGCTTGGCCTCCCAAAGTGTTGG^ 
TGCCCTTTATGAAGCATCTCCGATCCAGACTTTCTGTGCCrGGATAGATTTGATGCTT 
GTGAGATATATATATCTGTGAGATATATATATATCT 

4 5 TTTATTTTTAATTTTTTTGAGACAAGAGTCrGGCTTGAGTTGCTCAGGCTG 

CTGCACCTCCACCTCCCAGGTTCAAG(ZAATTACTGTGACTCAGCCT 
GCTAATTTTTGTATTTTTAGTAAAGATGGGGTTTTGC 

CTTTGGCCTCCCACCATGCTGGGATTACAGGCATGAGCCACCATGCCTGGCCTGTGCA 
TCATTTAATCCTCATAACAATTCTATGAGGTAGKSA^ 

5 0 CCACAACTAACATGTGTGTATTATTGAGCATTTAATTTACAC 

TAGGAGGCTGAGGTGAAAGGATCACTTGAACGGAGGAGTTCAAATTTGCAATGTGCTATGATTC 
TCCAGCCTGGGCAACATAGTGAGATCCCTTATCTAAAACATTTTTTTTAA 
AATCCAGCACTTTGGGAGGCTGAGGCGGGCGGATCACCTGAGGTCAG 
TCTACTAAAAATACAAAATTAGCTTGGCGTGGTGGTGCATGCCTGTAATC^ 
5 5 GAACCTGGGAGGTGGAGGTT^CGGTGAGCCGAGATCGCACCATTGGACTC 

AAAAAGAAAAG G AAAT AAT CT AT ACC AG GCACT C CAAGTGG TGTG ACT GAT ATT CAACAAGT AC CTCTAGTGTG AC C TTAC CATTG 
ATGAAGACCAAGATTCTTTTGGATTGGTGCTCACACTGTGCCAGTTAAATA 
CCTGACCTTGATGTCCTTTCACCCATCAACCCX^TAGGGATGACCAAC^ 
GTGCGCCCTGACAACTGTCCAGAGGAGCTGTACCAACT 

5 0 CTACCTGCGCAGTGTGCTGGAGGACTTCTTCACGGCCACAGAGGGCCAGTACCAGCCTGAG 

GGGTTCTCCCCCTTTCTCTCCAGCCTGACTTGGGGAGATGGAGTTCTTGTGCCATA^ 

T ACATGTGTAG CCTGTGCATGTATGT CTTGG ACACTGTACAAG GTACCC CTTTCTGG CT C^ C CCATTTCCTGAGACCACAGAGAG A 
GGGGAGAAGCCTGGGATTGACAGAAGCITCTGCCCACCTACTTTTCTTTCCT 

6 5 TTATCTAATACCTCTGTGTGCTCCTCCTTGGTGCCTGGCCTGGCACACATCAGGAGT^ 

CATCTCTTTGCTGTCCACTCTTTGTGGGTGGGCAGTG<3GGGTTAAGAA 
GGATGAGTGGATGTCTGGAGGCTCTGCAGACCCCTTCAAATGGGACAGT^ 

TACCTGGAATCCCTTAGGGAATGGGTGCGTCAAAGGACCTTCCTCCCCATTATAAAAGGGCAAC^ 

GCTATATTTGACC TCAGAT TTTGTTTTTTT AAGG CTAGTCAAATG AAG CGG CGGGAATG GAGGAGGAACAAATAAAT CTGT AACT A 

7 0 TCCTCAGATTTTTTTTTTTTTTTGAGACTGGGTCT 

CCTCAACCTCTCCAGCTCAAATGCTCCTCCTGTCTCAGCCrCC 

AGAGTAAGATCCTGGTCTCCAAAAAAAGTTTTAAAAAAATCAAA 

AGATCAGGAGGCTGAGGCATGAGGATTGCTTGAGATATGATTGTGCCACT 

X CT CIAAAAAAAAAAAAAAAAAAAAAG G G AAGGCTTCATTGAGAAGGT AGGTGGCTTTTGAG CAAAGATTC CAAGGTGAAGGAGTGA 
7 5 GCCTTGTATGGAGGAGGGGAGGAG CAGTCAGCCAGAGGGAACAGGGTGAGG CAGGCGTTTACCGGGCATGCTCTGAGAGAGGGGGA 
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TAGATCCTAGAGGCCTGGGGCCITGTGGCCT 
CTOAATTACATTTTATTTTArrTATTTATTT^ 

GGAGTGCAGTGGCGCGATCTCGGCTCACTGCAAGCTCTGCCTCCCGGGTTCATC^ 

GGACTACAGGCGTC CACCACCAAG C CTGGGTAATTTTTTTGT ATTTT TAG TAG AGATGGGGTTTCACCGTGTTAGCCAGGATGGTC 

TGGATCTCCTGACCTTGTGATCCGCCTGCCTCGGCCT 

TTTTATTTTTAATTGAGATGGGGTCTCACTCTG 

CCTGGGCTCAAACAATCCTrCCCACCTCAGCOT 

TTTTCGGTGGAGA03GGGTTTCACCATGTTGCC 

AAGTGCTGGGATTATAGGCGTGAGCCATAGCGTCCAGCCCTGACTTACATTTTAAA 

GACTGAGTTAGTCAGTTTATAAAACTGGGG7VGATTTTGCATAAAACT 

CGTTGGCTCACTCCTATAATCCCAGCATTTTGGAAGGCCAAGGTGGGCAGA 



GGCAAGCTTGTATTCCTAGCTACTTAGGAGGATAGTTTGAGCCCAGGAAG 

CCAGCCAGAGCAAAAAAGAGAGCGAAACCCCATCTCAAAAAAAAGGGAAGAT^ 

TGAATAATCGCCACCCCTCCAGGTAGGGCCTGAACTCTACCTTTGCCAGAGTCCCCTCCACTC 

CTGAGTGTTAGTAGCTATTTACCACCAAGCT 

AAAATTATATTATGGTATTATACCATTTAATAAATGGTATAATAAAG;^^ 

GAAAATTAAAGATACCTGAGGATGGCAGAGTGTTTG^ 

CCAGCAGTTTAGGAGGCCGGGGCAGGCGGATCACAAGGTCAGGAGT^ 

CTAAAAATACA7^AAATTAGCCGG^3AGTGGTGGCAGGTGCCTACAAT CCCAGCTACTCGGAAGGCTGAATGGCGCGATCTCAGCT C 

ATTGCAACCTCTG CCTCCCAGGTTCAAGCCATTCTTCTGCCTTAGCCT CCCTAGTAG CTGGGATTACAGGCGCCTGCCACCATGCC 

TAGCTAATTTTTATATTTTTAGTAGACGTGGGGTTTCGCCATGTTGGCCAGGCTC 

OSCCTCGGTCTCCCAAACTGCTGGGATTACAGGAGTGAGCCACAGTGCCCGGC^ 

ATTTTTTTAATTTTTATTTTATTT^^ 

CCCAGCACTTTTTGGGAGGCTGAGGTGGGTGGATCACGAGGTCAGGAGA 
CTACTAAAAATATAAAAAAATTAGCCGGGCCTGGTGGCAGCT 

GTGAACTCGAGAGGTGGAGCTTGCAGTGAGCCAAGATCGCACCACTACACTCCAGCCTGG^ 

7^UVAAAAAAAATT ATATATAT AT ATAT ACATATAT ATATGCAAACAAAGAG CATCTG AGTCATAATAATGT AAATCTAT CAC CTG A 

CTGACCTGCTGCCACACCTCATGATCTCATCTGATCCCCACACTCCTTCTCTTTGGGATAC 

CTTTGTATTCCTATCCTCCCCATTTTTGTTATTTTATTTTATTTCTTAT^ 

TAATGCAGTGGCCTGATCTCGGCTCACTGCAACCTCCACCTCCCGGTTTCAAGCGAATCTCCTGCCTCAGCC 

GACCTACAGGCACACACCACCACGCCCAGCTAATTTTTGCATTTTTAATAG^ 

GAACTCCTGACCTCAGGTGATTTGCCCACCTCAGCCTTCCAAA^ 

CCATTTTATAACAAGGGAAATGGAGGCCCAGAATGGTTAAGTAAACCCAC C CAGGGCTAGCTGAGAATTAGCAACAGAG AACTGGG 

AGTAGAATTTGTTCCCTGGCCCTTTGCTGTTTCTATTATAA 

AATGCAGTTTCTCAGTGAGAAACACAAGTC 

ACAAGGCCTGGCTCTATTG^CCAGGCTGCAGTCTG^STGACATG AT CTTGG CTCACTGCAACATCTGTCrCCTGGGCTCAAGCCATC 
ATCCCACCTCAGCCTCCTGAGTAGCTGGAACTACAGGAGCGCGCCCCCACACCTGGGTAATT^ 
TTTTGCCATGTTGCCCAGGCTGGTCTCGAACTCCTGAGOTCAAGTGATCCACCCATCTTGGCCTCCCAAAGTGCT^ 
CGTGAGCCACTGTGCCCAGCCTCTTOTTGACTGTAGGACCACAAGTTGGCGGCTAACr 

CTCATT AAAT ATTGACATTATTGT AGTTATTTTAAT AGC C CAG CATTG CACTTTTT AGGTCTTTCAGCTTT CAGTG ATGAT C AGTTG 

ATAATTGATGATCTGGTGGAGTGGTTCTTAATGGTAGAGTTGGGGGCAATTTTACACTCCCT^ 

ACCCAATGGTAAAGCTATTGCACAGTACTTGGCAATGTCTAGAGACAATTTTGGTTGTCA 

AT CT AGTGGG TAG AGG CTAAGG ATG CTG CTT AAT TTTTTTriTTTTT GAGACAAG AGTTTCACT CT AGTT GC C CAGGCACAG AACA 
GCTT CACAG AAG CTGTT AATG CACAGAATAG C IT CCTACAAAAAAG CATT ACCTGGC C CAAAATGTCATT AG CT ACC AG G CTGAGA 
AACCTGCCCTCCTAGGCCGGGCGCTGTGGCTCACGCCCGTAATCTCAGCATATTGGGAGGCCGAGGTGGGCGGATCCTGAGGTCAG 
GAGTTCGAGACCACCTGGACCAACACGGAGAAAC CC^TCTCTACCAAAAATACAAAATTAGCCGGG CATGGTGGCACATGCTTGT 
AATC CCAGCTACTCGGGAGGCTGAGGCAGGAGAACCGCTTGAACACAG AGGCAGAGATT CATTGCA 
CTCCAGCCTGGGCAACTAGAG CGAAACT CTTGTCTCAAAAAAAAAAAAAAAAAAAAAAACAGGGAAAG AAAAGAAAGGAAACCTGC 
CCTCCTATCATAGGATAATCCCATTTCCTCCTGTCTAAAGAGACGCCTACTTAGTCATCCTGGGTGACTGC^ 
TTTGGAGTCTGAAAGGCTGGGTTCTGTCACTTrGTTACAGTGCCTCTGGTG CAAAGAAAG CATTTTAAAAACCCTGTACAATTAAA 
AAATTGAATTTAATTACTTTGTGTAACTTTGAA 

AGCCCTTGGCCGAGAGCTGTTTTGAGGAAAAACTGAAATAACATTGGTAAAGTGTCGC^ 

ACAAACATTAGCTTTCTTTCCCTTTCTTGTCGGTTTCTTCCTCTCCAAACCCGCGTGTTC 

GCCCTCCTTTGCGGCCACAAACTC^CTTTCTAACCCAGGTTCAGCCCTTTTATTGGCTGAGTGACCT 

CTGTAGGCCTCGGTTTATTCTCCGTAAAATCAGAAAGTTGGCCTCCGATCTCCAAG CACG CTTTTCACGACGAAGTGGGACTGTTA 

AGTTTACAGAGCTGCTTTCCTCCCCCGGGACTGATGGTACGGTCCCCGGGCGGCTCCCCACCCATCTGTCGCAGACCTT 

GCCCAGGGGGCCCTCGGCX3GCCTCTCCGGGCTGCCCTTGCCCCCTGCCGAGTTCCGGGCCCGTGTCTGCGCAAGCT 

AG CGGTG CCCGGGCTGCGCGGCTAT AGGTGAG CCCAGG AGGGGACGGGCGGGG CGTTATAACAAAAA.CTTC TTCTGAAATACCTTT 

TCCCCTCATATATGT1X5TGATATCTACATTCGTATATTGATATCCATTTTTATTCTC 

T ACTT ATTGGTAG CG ATAT AGTGCTCACAC CCAACTTTGATG AT CT AT ATTTTATTCTTAATC CT TAATTT ACTTAAT AACT T G TT 
TGACAATTTATATTTATCCTTCTGATGTGTGTGTCCTCTATGAGTCTTCGTGTTTCT 

G CAC CTT CGT AAGTCT AGAAACACTCT CC T AC TACT ATT C C CT AGTTCTTT CT AG TATAT ATACACCGCGTAATG C CCATCACAAA 

TTTCCACCAAATATTCACACTrATCATAACTTGATCA^ 

AGATCATATCTTATTAGGGCGTAGGTATAACATCT^ 

TCATCTCGCGTCTGCTGTGCTCACTATTGCATCT 

ACAT T AACT AACAATT AACTCGTCGAT ACATCTG ACGCT ATT GTTATTT AT ACCT G ATATAGTCG CATAACTGC TCTCTCCTCTG 
AATCTACTCTTCCTTACTCTCTAACATCACTTTTCACTCCGCTC 

NNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNN^^ 
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NNNNNNNNNNNNNNNNNNNNNN>^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
5 NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNI^ 
NNNNNNNNNNNNNNNNNNNN^ 




15 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
2 0 TGGG ATJATAGG CATGAG C CACACTTCCTGGCCTCTGATGTGGGCAATTTGATGG TGAG TG TTTGG AGG TTTGGAGAATGCCATGT 

CAGTTGATAACTTTTTTTTTTTTTTTTTTTTGAGATGGAGTCT 

CACTGCTGCAAGCTCCACCTCCCGGGTCCATGCCATTCTCCTG 

CACTGGGCrGATTTTTTGTATTTTTAGTAGA^ 

GCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGCACCCGGCCCCAACTT^ 

2 5 TAGAGATGGGGTCTCAATATGTTGCCTGGGCTTGTCTTGAACTC^ 

AGTGCTGGGATTACAGGCCTAAGCCACTGTTTTTTTTGTC 

GTTTAGGGGACATCCAAGTAGAGG CATC CAGTAGGCAGTTTAGAAGATAG AAATT TTTTGGTAGT CATCAGTATCTT AGTCTG CTA 
TAACAGAACACTGTTTATAAAGAACIAGAAATTGCTGAATGT^ 

ATGGTTCT C TTGAG C C CAGG AGTT TGAGACCAGC CTGGGGAACTTAGGGAGACTCTGTCT CTGG 
30 GCATGATGG CATGTGCCTGT AGAGAGGCTG AAGTGGG AG G ATCAC TTG AT CCTGG G AGGTCGACT CTG CAGTG AGC CATGATCG CA 

CCAGTGCACTC CAG CCTGG GG G AC AGAGTGAGACCHTGTGTCAAAAAAAAAAAAAG AACATAAA 
GCTGAGAAGTCCAAGGCTAGAA 

HUMAN SEQUENCE - mRNA 

3 5 CG CCTGG ACCATGTGAATGGGG CCAGAGGGCTCCCGGGCTGGGCAGGGACCATGGGCTGTGGCTG CAG CTCACACCCGGAAGATGA 

CTGGATGGAAAACATCGATGTGTGTGAGAACTGCCATTATCCCATAGTCCGACTGGATGGGAAGGG 
GCTCTGAGGTGCGGGACCCACTGGTTACCTACGAAGGCT^ 

CACAGCTATGAGCCCTCTCACGACGGAGATCTGGGCTTTGAGAAGGGGGAACCACTCCGCATCCT^ 
GAAGGCGCAGTCCCTGACCACGGGCCAGGAAGGCTTCATCCCCTTC 

4 0 GGTTCTTCAAGAACCTGAGCCGCAAGGA03CGGAGa3GCMCTCCTGGCGCCCGGGAACACT 

AGCGAGAGCACCGCCGGGTCCTTTTC^CrcTCGGTCCG 

TAATCTGGACAACGGTGG CTTCTACATCTCCCCTCGAATCACTTTTCCCGGCCrGCAT 

CAGATGGGCTCTGCACACGGTTGAGCCGCCCCTGCCAGACCCAGAAGCCCCAGAAGCCGTGGTG 

AGGGAGAOTCTGAAGCTGGTGGAGCGGCTGGGGGCTGCAOIGTTCGGGGAGGTGTG^ 

4 5 GGCGGTGAAGAGCCTGAAGCAGGGCAGCATGTCCCCGGAC^ 

GGCTGGTTCGGCTCTACGCTGTGGTCACCCAGGAGCCCATCTAC^TCATCACTGAATACATGGAGA^ 

C TCAAGAC CCCTTCAGG CATCAAG TTGACCATCAACAAACTC CTG G ACATGG CAG CCCAAATTG CAGAAGG CATGG CATT CAT TGA 
m AGAGCGGAATTATATTCATCGTGACCTTCGGGCTGCCAACATTCTG^ 
TAGCACGCCTCATTGAGGACAACGAGTACACAGCCAGGGAGGGGGCCAAGTTTCCCATTAAGT 

5 0 TACGGGACATTCACCATCAAGTCAGATGTGTGGTCTTTTGGGATCCTGCT^ 

AGGGATGACCAACCCXSGAGGTGATTCAGAACCTGGAGCGAGGCTACC^ 

AACTCATGAGGCTGTGCTGGAAGGAGCGCCCAGAGGACCGGCCCACCTTTGACTACCTGCGCAGTGTGCT 
GCCACAGAGGGCCAGTACCAGCCTCAGCCTTGAGAGGAGGCCTTGAGAGGCCCTGGGGTTCT 
GGGGAGATCGAGTTCTTGTGCCATAGTCACATGGCCTATGCACATATG 
55 GCACCTTGTGTCTGTACACGTGTCCTGTAGTTGCGTGGACTCTGCA 

ACACTGTACAAGGTACCCCTTTCTGGCTCTCCCATTTCCTGAGACCACCAGAGAGAGGGG 
TG CCCIACCTACTTTTCTTTCCTCAGATCATCCAG 
GTGCCTGGCCTGGCACACATGAGGAGTTCAATAAATGTCTGTC 

60 HUMAN SEQUENCE - CODING 

ATGGGCTGTGGCTGCAGCTCAC^CCaSGAAGATGACTGGATGGAA 

ACnXBGATGGGAAGGGCAGGCTGCTGATCCGAAATGGCTCTGA 

CTTCCCCACTGCAAGACAACCTGGTTATOSCTCTGCACAGCT 

CCACTCCGCATCCTGGAGCAGAGCX3GCGAGTGGTGGAAGGCGCAGTCCCTC 
65 TCTGGCCAAAGCGAACAGCCTGGAGCCCGAACCCTGGTTCTTC^^ 

CCGGGAACACTCACGGCTCCTTCCTCATCCGGGAGAGCGAGAGC^ 

AACCAGGGAGAGGTGGTGAAACATTACAAGATCCX3TAATCTGGACAACGGTG 

CCTGCATGAACTGGTCCGCCATTACACCAATGCTTCAGATGGGCTGTGCACACGGTTGAGCCGCCCCTGCC^ 
AGAAGCCGTGGTGGGAGGACGAGTGGGAGGTTCCCAGGGAGACGCTGAAGCrGGTGGAGCGGCTGGGGGCT 
7 0 GTGTGGATGGGGTACTACAACGGGCZACACGAAGGTGGCGGTGAAGAGCCTGAAGCAGGGCAGC^ 

CGAGGCCAACCTCATGAAGCAGCTGCAACACCAGCXGCTGGTTCGGCTCTACGCTGTGGTCACCC^ 

CTGAATACATGGAG AATG GGAGTCTAGTGG ATTTT CT CAAGACCC CTTCAGG CATCAAGTTGACCATCAACAAACT CCTGGACATG 
G CAGCCCAAATTG C^GAAGGCATGGCATTCATTGAAGAGCGGAATTATATTCAT CGTG ACCTTCGGG CTGCCAACATT CTGGTGT C 
TGACACCCTG AG CTGCAAGATTG CAG ACnTGGCCTAGCACG CCTCATTGAGGACAACGAGTACACAGCCAGGGAGGGGG CCAAG T 
75 TTCCCATTAAGTGGACAGCGCCAGAAGCCATTAACTACGGGACATTCACC^ 
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ACGGAAATTGTCACCCACGGCCGCATCCCTTACCCAGGGATGACCAACCCGG^ 

GGTGCGCCCTGACAACTGTCCAGAGGAGCTGTACCAACTCATGAC^ 

ACTACCTGCGCAGTGTGCTGGAGGACTTCTTCACGGCCACAGAGGGCCAGTACCAGCOT 
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Table 92 

MOUSE NOMENCLATURE 
ICSGNM Git2 
Celera mCG19236 

5 

HUMAN NOMENCLATURE 
HGNC GIT2 
Celera hCG38510 

10 MOUSE SEQUENCE - GENOMIC 

GGGACTACATGTTGGAACCTTATCrTAA^ 

AAAGCAGGGCCTGGAGAGGTGCCTTAGTGTGTAAAGAAAGTGCTTGCATQA.GGTCCT 

TAAAAGC CAGGCATGACACACACCTGT CACCCCAG CACTAGG GACACAG AAT CAG G AGCAT CCCGGGG AAC CAATGGC CAGCCG CT 
CTAGGGGAAATCATGAGCTGCGGGTTCAGTGATAAAGCrrTGTCT 
1 5 ATCAACCTCTGGC CT CCCGG CTCC CTCACAGGCAAG C^CAACCAGGAAC^ACACACACACACACATACATACAC CAT ACAAACACAA 

ACCACACAGCCACACCACACCCACACATATGCACACAAACACA^ 
TTTCATTTCTCTTCTTTTCGTCITTCTCTCCAGAGCGAGGGGA^ 

CCAGTTT CAAATGACGGCTCAGGCACAG GCAAAGCATGCAAGTGAGAGGCTCTT CAGGT CTCTAGG ACAGG AAG CACAGCGAC CTG 
ACCTACCTCAGAGAGTAGGCAGAACTAAATTAAAGATCTGAGACCCATGAGCGCT 
2 0 CACACAATTTAG AC CTAAAAAAAAAAAAG AACAAC CCCACAATTTT AT AAG AAAAAAAT ATTT AG CAAAAAT ATTT AG CAGAAG CT 
GG C CACACCGTCTTTT AATTCCAG C CCT C AG AAG ATTGACTG AG G CGGGAGGACAGC CATGAGTTCAAGG CTGG CCT AGACT ACAT 
AGTAAAGTTCCAGGCCACCCAAATTCACAGGTGATCATTCA^ 
CTGGTACTGACCACTAAACTCAACTGCCTCCCGGTCGAA^ 

CACTT C CTGCTGTGTGG CTT AAGGAT ATT CT AG CAAT ATGGATT AT CTTGAC CC CAAAC CAGTCAT AT AAGTT ACAAT AT AATGAA 

2 5 AGTCCAAGATGCCCCTTATTCAAAATAGTGAAAAATTTCATAACCGTC 

CCTGCTGAAGTTCAACTGTATGGTTATGACCGGCAAGATGTCCATCT 

GGACCCATTGTGT TAAATTAGGG CAGTGTCAAAACCAGAGGACCAGAGG ATCGG CAAGTATG AGCATT CG CTCAACCTTGAAGTCA 

AGGGCAGCAATGACCTTAAGTTTTCTCCCATAGAATCTACTGCAAGACTT^ 

ACTGGGGAGGTTAAGAAGGGAGGCCACTGAGGAGGTTTGAGGCCA^ 

3 0 ACAGTAACACCCTGTCT CAAAATTAGAAGATTT TAATTC CCAAATTTT AATT CCAAACGT C CT CAAGATTTG T CATGGTCAAAGAC 

AGGAGAATGTCCCCGGGTACCTTTATCGCCATTTTCAG^IAAATGGTGACT 

ACCTTCCTATCAAATGCGATTTAACCTCGTGACTCTGTACTTTCAACGGTGCTC 
GAACCCGTGCCAAGGCTGGATGGAACCGGAACCGGGACCCTTCCaCGTCCTAGCCCCG 
3 5 CTG CATTTTTTTTT CATTT TTTTAAAAGATT T ATTTATTGATT ATATG TAAG TAC AC TGTAG CTGTCTTT AG ACACC CCAGAAG AG 

GGAGTCAGATCTCGTTACGGATGGTTGTGAGCCACCATGTGGTTGCTGGG^ 
CTCTTACCCACTGAGCCATCTCGCCAGCCCTTTTTTTTCATTT 

CTGTC ATTCTCCTG CCTCAGC CTCCACGACAGCTGTGGGCCACCGTGTCCAGTCAAAACTGGTTACTG CTTCTTTGAAATGTCCAG 
AGTGGGCCTTGAGGATAGGCAGAAGAGCTTCTCGCTTAGATTC 
40 TCAGCAGGTATACTGGGATGCCTGAATCATCCTCTCTGGGGGCATGTGTCATATCTCTTTAAATCCTAAGTCGT^ 

TAATTAAGAACAATGACTCTT ACAAAACCGTCTTCTGTCTGGACT CCTC C C CTGCCC CACACACAC CTG AGGATG AAACC CAGG 
GCCTCTCACACAATGGACAAGTGTTGGGGC(^CTGCGATA(^ 

AAGTTGCCCAGGCT AG CCTCCAAATCTTCCTGCTTCIAACTTCCTGAG TGT CTTGGGTT ACCCGTGT ACAC CG C CACACCCG C CATG 
GAATCOTTTTTTTAAATGCCACTTCTGACITTCCCATGCTGCCCA 
45 . CACTTAGAACCGTTCCAACAGCGCITCGATGTGGGGCTGGCCCAATGTT^ 

GAGTCAGTGGTTAAGTCATTCAAAAGCAAGGCrCTGACACACTGGCTAAGACGAAT 

G ATTTCGGGAGTAATT ATAAATCCTTATCTCATTTGCTACAGG CTAGAGAG CAACTCAGGG CACC CGTGCCT AATGAAATGACTG C 
AGTGC(^TTAATCGCAGGTGTCTCATTCCTGGTGGGCGG CTCTGCCTTTCAG CAGGCTACCCCTCCCCCAAGTTAGAAGCCAGCCT 
GAGCTACACAGTGAATTCTAGGCTAAGC^AGGGCTATACAGTGAGACCCTGTCT 
5 0 CAAATGAAAT AAG AATT CAT ATT AATG CZAAAGATTCTT C ACCACGG CTCT AACT AACATTGG AGT CTG AT AACACTCAGTGG TGGG 
TGAGTGGCTGCCITGAACACTGTAAGGCACAGCGAGGT 
AACCCAACCCAAAACGTCTTGTCTCCrrTGCATTCCCAAATATTCACTAT 

AAGAAAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAG CACTT CAGAACAGGAAGGTCCTAAGGAGTCTG 

ACCAACCACCCrGGGGAGCTCGTTAAAACCCCGATTCCTAGGCCCAG CCG CftGCCCCATCAATTTAGACAMTACTGGGTTTTCCG 

5 5 TCGTCACTGCCTCCATCCTCATCCAATGGGTAAGGAAAAGTGGGGTCCCCCCAACG^ 

TAGAACCAGG CCACCGAAGCCAACCAAGCTCTTCCCCAGCACCAGGG CGGGTGGAGTAAGGTGTGTTTCTTCCGTCTTCAAGGCCA 
TAAAGCTTAAAGGTGAATTCTGGGGCCAGCGAGCCTGAGATAAACGATCTGGACGTC 

AGGAGAATGGACCAAAAGCCCCCGGCaCCGAGCCCGGTCTACGGTAAGCGCTCTATGGATCCCGGGGCTCAG 

6 0 CCGACTCCGCGCCGCGCACCTCACCTGGTCCCGCAGCTCCT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 
65 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 
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NNNNNNNNNNNNNNNNNNNNNN^^ 

ACCTTGATTTCCTCCGGGTTTTTTTTCCCCCCCACCCACTGCTGGAAACTATGGCCAATC 
CCCGGC^TCGGCTCCTTT^ 

TCACTCGCAGCGACTTCGAACOSCCAAAAC^ 

GAACCCAACTGACAGGTCTCGATCCTGATTTTGTGTGTGTGTGTGTGTGTGTGTGTCT 

GGACTTTTCGGGTTTGAAGTCAAATATCAGATGACTTTCCTGGTGTAGCAG 

TCAGCGGCTTGCAACAGATTTTTCTCTTCGTCTTGTCATTTGGAA 

G TT ATT TCATGG CCCTGGGTC CT ACAGGTTAACAAATGGAAG ATAAGG AG AGG CAG AGGCACTTTT AAAAGAGATG CTTTGT TTGC 
AAGCCGCGTTTATACAAACTATTATTTGGCTAACGATGTTGTGTGTTC 



CACCCCGATACTGTCTTGTCCATTITGTGGTGTTTTGGGGCCTTTT 

AAAGGCTTGAGTCACTTCCTGTTTTGAATAGAATGAAGCTACCTTGAAGGCTGAGGAACAG^ 

CACAGCTCAGGGGAAAGGTTACCTGTAGGAGTAGGCTCCTTTGAAATCCTCATGCTAA^ 

ATTTGACTCTCCTGGACGTTTGCAGAAGATGGGTGGG^ 

AAAAGTCCTGCCTTTCCGGGCGTGGTGGCACTCGCCTTTTiATCCAAG CACTCGGGAGG CAGAGGCAGGCGGATTTCTG AGTTCGAG 

GCCAGCCTGGTCTACAAAGTGAGTTCCAGGACAGCCAGGGCTATACAGAGAAACCCTGTC 

A(^CAAAACAACAACAACAACAACAACAAAGTCCTGCCT^ 

TAAGTTTTATGTGTCGTCTGTATTC^ 

GTTAAAATGTTGTCATCCTGTCATCCTTTTCTTGACTAGTTTGCCCCAA 
CGCTTTCAGTCTCAATATTAATAACTTGCTGCAGAAGC 

TCCTTAGAATGGCTGTTAGCACCTATGGTAAAGCTGGCTGAAACCCTTTCTGT^ 

TTGTGTGTTGGGGATCAAAGCCAGGCCCTGACAATTGCTAGGAGAG CCACACCCTCAGCC CTGATTTCTTGTTTGTTTCT71ATGAA 
TCTAGTTCATTTTAAATGCATCATTCTTGTGCCAAGATGACACTCTAATT^ 

ATCACTATTCCAGTTTGGTCCTCAGTGAGAACACTCCAAGACGCAGCTTGCCrrCAGATGATTGCAC^ 

TGGGATTCAAACCATGGACTGTTTGTCTCCATAACCAGCCCTCTTAACCACTO 

AAATGGTTTGAAAGGGTTGGAATAAACATAAAAGTGGTAAAGATGCTCTACATGCATTTC^ 

TTTAAACTGAACCCTTAGGCCAGGCAGCCGGGGTAGCATCTAATGCCACCCCACATGAGTCTGA 

AGTTTACTCACACAGATCCGTCCTACCCQCCCAGTGTGAGTTACTCCTGTCTGCAGTTT^ 

GTACTCAATTAAAAATACCATCATTTTTGTTTTCCGTCTGTTCGTTTGAAGTACC^ 

GCAGTGCTAGCATAGAGCATTGTCTCTCTGAGQVGAA1X3AGCCACCCTTGCTTGCCCGGG 

TTCrCTTCTGGCTTCTGTTAGCACCTGGGTCTGGACTGCCTGCTCTGGGCAGCAATAAG 



TGCTTGCCTAG CCCAGACAAGGCCCCTGG CT CAGTCTCCAGCACTGGAAGCAAACAGCGGAGCAACCAC 



CTGCTGTGTGCAGCTGCCAACCTAGTTAGCAGAGCGGTCAGTCCGGGTGGGGGGATCATO^CCTCCTAGG 
AACAACAAAAATGTGCTCTGTTTTTCTTAAAAACCAAACATCACTCATC^ 
TGCCACTGAGGGGTTGTTCTTTTTCCAGATCOTCCTGGGCGTCAGTAAATAGGGGAACm 
CCATCG^AGCCTGGGGCGCCATATCTCCCAAGTGAGGCATCT 

G GAAGAGTT CT CATAATTGGAAAATTATATTTT AGTGATTGTTG CTT AAG^CTAAAATACTG CCTACTATACATCTTAAAT CTTA 
TAAAAGTTTACTCrrCTATGTATAGTATAAAGTGCTGCCCTGTGTTGAAGTCATTGTCTGTCCTTT 
GTGTTCCCCACATTCAGTTATACATAGACAGATATATCAATAGATTGTGTCGATAAAAGCTTAAATTCTCTAGTGCG 
GTGGCTTTCTAGAGAGATTATCACCAGAAATTGTTGTGTTGTGCTC 

TAGCCCTAGCTGGCCTTGAACTCACAGACAATCGACCTGCCTCTGCTAAGTACTAGGATTAAAGGTAAT 

TCAGAAATCATTTTATTTTTTTAGATGTATTTCATATATATGTCTGTGTGTGTGTGTGTGTGTG 

TGTGTGTGTGCATGTGTGTGTGTGTCTGTGTGTGTTTGTGTGTATGTTTG 

TGTGTGTCTTTGTGTGTGTGTGTATGCCTGTGTGTGTOTGTGTGTGTTTGTGTGTGTGCTCA 

TTTCCTGACACTGGAATTACAGAGCGTTGTGAACCACACCGGTGCrc 

TGGTTTAACACTGAGCCATCTCTGCAGCACTCAGACATTTTTTAAAGTCTTTCTTTCCTA 

TTTTCATT AACAGG CAAATTTCT G AGTAC CTG CCTATATTTTT AGAACTTTTTTTTTT C CTG CTTTGATAG CAG CAT AGACTGG G A 

TTCCGCAGAAATTGCTTCAACAT/TTGAAGTTTTTAAAAAGCTCTTGOT 

GCCTAACACTTGTCAGGGGTCTTTGTCTTATTTGGGATCTGATGTCAT 

T CTCCTGGAACTCAGGGATTAAT CCTGTGTTTTGCGATATTTGT CTTAAT ACACATCTTACTAAAG CTGTAACAG CTG CACAG CAG 
TACTTAAACTCCCAGAAACGAAGGGACCAGAGCAAGTGGTTCGGGTTACTGATGA 

AGCTCTAGGAGTCCTGATTGG AAAGGAAGTTGAG AGAGATTTATTAAAAGTTC T ACTTAG GGG CTGAAGAG ATG GCTCAG CCTTTA 
AATGCTAGCTAACAACCATAAATCCCCGAGTTCTGTTAGAAGTGTTCAAAGGCCCCTTCTCGTGA 



CCCTTCTCGTAAAAGGGTTGGCTAGCGCCTTCCACCACACTGTGT^ 
TGATTGAAAATGTTAGCAGGGCGTGGTGGCACACGCCTTTAATCCCAGC^ 
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GCCAACCTGGTCTGCAGTGAGTTCCAGGAC^CCAGGGCTACACAGAGAAACCCTGTCT 
ACCAAAAAACAAAAAAAAAGAAAATGCTAGGGTTGACTTTAAGGC^ 

AC CACAGTGGGGTGTG AGAGAG CTTAGCACX^CAGACGTGTG CTTATAG CATTACTTTAATCATGTAC7UUIGAAAGT AACATAATTA 

TTAACAAGATATTTTAAATAGGTGAATTTAATGAAAAGCATTGAGTGAGTG^ 

GTGTGTGCCAAGGAGCCGGGATTGGGCTATCCTACACACAAAGCTGGCTCCCATGTGGA 

ACAG TGACG^CCGOTCCCC3^CT 

CTGrrriTTCATTTTTGTTACTGGTTGTTTTTGACTA 

ACCTTGTACAACAATGGTGCTAACTCTATATGGGAACACTCTCTGCTG^ 

CCCACAGGACAAAGTTCAGTG AGTGGAGTGGTTATAACTTCT ATT^ CAAG TGGCCCTGTGGAAAGGAGTGTGGCTATT 

TTGATCAGCCTCCAAAGGGGCTGTGGGCTATGC(^GGCATTTGCTC 

TTGGATGCTCACCCAATTGAAGTTAGTGATTTCTCCAGTTGGAGTCCAGGGOT 

GTTGAGAAAGGTAAAGGGAAGAGAGGCGCTGGGAAGTTC 

C CAGCTTTGCC CT T C"r rACTGTC CAGTGGGGGGGGGGGGGTGT CACTT CCTCTTCTTACAG C7VCAAAGAATGAGAAATGTGTCAGA 
GCTGTTAAGTGAACTGTGTTCTTAT1TGTTTCCAAACCAGCCCCAA 

TT CGTC CACCGATTG CCCTG CCG CGAGG ACGACAG CGTGACGGC CAAAGATCTCAGTAAGG TGGG CG CG CT CTG CCTTCTGTGGGG 
CCGTGAGATGGGCCGCTTCCCTTAAAGCCATGTCAGGAGGAAGAAGCAGCCTT^ 

CATTTTCATTTAATTGATTAATTAATTTGTACTTTTTCGGTACAGT 
AGCCATTTCAAAGAAGGTTTTCTAGAGAACCTAGCATC 

AAGGCTGGAGGGCAAAGTGCACCTAGAGCTGTAGCACAGCCGTGTCTGTTACACCGAGAGC^ 
TTTCTACGAGATGCCGTAAAAATGCACATCACAGTTTTTTGAG03GTGCTTTGAACTAGATGCT 

ATT CAGAATACT AATT CTG CACAGATGTAAGTGATTTT C CTAGTGAGTAAAT AGTTTC CTTTTTTTACCAAATAAAAAACATGGCC 
AGGTGTGGTGGCACACATCTT/TTTTTAGTCCCCGGTATTC1AGGAGGCAGA 
TCTATATAGTGAGTTCCAGGTCAGCCTGGGCTACAGTGAGTCCT 
AGAAGAAGAGAGCGATGACTTGGGCTGGAGAGGTGGATCAGTGATTGAGCAGACACTG 

ACGGCACATCTAACGCCAG CTCCGGGGACCTGACACCCCCTTCTGGCCTCTGTTGGCAC CTGCACTCAGTGTACATTGATGTACGC 

ATACIATACAATTAAAAAT AAAATTAAAAAAAAAAAACAAAGACTGG TTC CGT AGG ACTCATCTG CCCTGTGAG CCAGT CAGGACAG 

TTGTAGAGTAGCCCTGAAGGAGAGAGCCATTTGTTCTTTGGGAGTTTAAAAGCGTT^ 

GCCCACrrCCCACTGAGCGCAGCCTGGCAGAATCGCTGTGTGGACAAACAGC^ 

GTGGCTGGAGCCGCTGGGATCCGCGCTGTGGCAGCAGCTIX1AAGACTGATAGATGOT 

TGTCTCCTTCAGTACACACCATAACCOTCCTGGTAGAGGGCACCXSTAGCACCC^ 

ACAGCTGGAGGGAGGTAGACCTTGAAATCCAACCAC!ACACACAC!AC^ 

GCTTCTTGCrGGGTTTATGGTTGTACCrcrTACrrGGCA^ 

TTAAACTGACAGACCCACCTGCAGCCTGAAGCCATTGTAAACAGCAAAGACCCCACCAACCCA 

TTCCTGAGCaVAACAAGACCCGTTACrGGTGTAAAGTGTGTTAGAGTGTC 

TGACAAOTACTCTGTTTCTCTTTGTTTTTCAGCAACTC(^T^ 

TCTTTAGGAGCCCAAG CCAACTTCTTTCATCCTGTACGAAAATGTCAATTATTCAGAT AT^ CTGAGTTATGTTTGGA 

TAGTCCAGGTCrGCTCAGTTTGTTCTTTTTGTCra^ 

TCCAAAGCAGGAC^GATCTTACAGGCTGAATTGCTGGCCGTGTATGGAGCT^ 

TGTTGACTATGCAAGGTGAGAGGCTGAGGTTCTCACTGCACrGCGTGCT 

GGTATCGCTTCTGTTTAGCCCCTTTAAAGACTTCCATTCrrATTATTTTG 

GACAAGATGAAAGGACTGCCGAGTCCTAGCTGGTGTGGTTAACACCCGGTCATCATAGC^ 

GATCAGGGATCAGGAACTGGGAGGGGGTGGCTAGAGAGTGGGCTCAGTGGTTCAGATTGCCATCTGCTC 
CAGGGTTCAGTTTCOU^CCACGGTGCCT 

CAGGCAC!ACACACGGTACAC TTATG AGCAGACAACIATTGTATACG T AAAAGTCAATACAT CIT AATTTTTAAAATTAAAT AATTAA 

AGAAGGCTAGGGAAATGGCACACTGAGTCGTGTGCTTCCCTATAA 

AGCCCTGGCCTGCCTCCTGTCTGTGGGCAAGGTTAAGACAG 

AGCTGCATGAGGGGACTTCATGGGGCCTAATCAATGTTAAAACCTGCAGATTTAACT 

C C CTGTTATTTTATT AACT ATGAAG ^TCACACTCTGTC&TAAAG AATGCAATGGGAAATGACAAAAGTCT 

AACTTACTTCTCTGGAAGAGATGATTAATAAACTCTTGAAAAGTTTGAGGCAAGATTATATAT 

CATX5TTTTGTTTTGTTTTCCTTTGGAAGACTC 

TGCCTGCCCTCCCCGTGCTCTGATCACACGTGTGAACTACCATGCATGGCTGACAGTGGGTCTTAA 

GAGAAAAGCCCCGCTGCCTTAAGAACCTTCGTGTGGTCCTCAGA 

CCAGCTGGATGAGAGGCAGAGAGGGGAGTCATAAAGTG 

CAGCAAAAGCCCACTCCAGAAGTCGTCTCCCGAGCCACACACATGCC^TGATACACACTC^ 

AGGGCATCTGTTAGGGG CAGTGGGGACTTGCTGTGCAAACATGAAG AT CAGAGTTTAGATCCTCAAAAC ACACAAATGCCGTAGAG 



AGACTCTAGTTCAGTGAAGACCCTGCCTCAGTGATGGGTAGAGAGAGACTGCTGACGTA^ 

CACACACACACACACACACACACACACACACACACACCTGCAGAAAATGA^ ' 
GAATCCATTACTTGAGATGGTCTTTGAGGAGCTGGTTGAATATGGACCTCT 



CCCCTGAGAGAGAGAGCTCTGCCAGGTGCACGGACGCAGGCTGAGACGCTGTCTGG 

GAATTGTG TGGAGAGAG AAC CTG GCT AAAGCTAG ATTG ATGTTTGG AG AT AAG CAG AATG C CTC CACTTCCTTG AG CACTTTGCAT 
GGAATAGCCTGTCTCAGTCACAATGTCTCAGCGTCAGAGTTTACTTCT 

AAGGCAGCCCTGTTTCCTGTTGATGCAGAGGCAGGAGTAGCCGCTGGGGAGTAGCCGCGGGGACAA 
GCCTGAAGCCCITC^GTGATTTTGGATTTCTGGACTGATATGAGGCCAGCTC 

TCTCCAGGGTGTTGACGCCTGGGGCCGTCCTGTCCTGACTGACCCAGAATTCCCAAGGGTCTGTATTCTGG^ 
CTGTTGAGAACCTCTTCCATCACCTCGTTAAGTGA 

CTATGCCATAAGGAAAGAAACTATCTTGGGTGGGAGCCCCAGCTTCCAGAGGGACTGAAAGTGA 
GTATTCCAGACAGCCGTGTTTGATAAGCTGCTCAGGTATCTCAG7UVCGCT 
TGTCAATGCTGCTTTCTTCTCTTGGTAGGCAAGGAGGGC^CCATGAGTTGGC^ 
GATAGACTGGCCTTCTACCTCTGCGGCOMAAACC^ 
TTA G^CCGCATTGCTACTTGGAAAGACAGAAA 

T CTTT TTAATATATATTATTTATTGTTATTTCTATGATTG CACGTAGC TGT CI^CAG ACACACACCAGAAGAGTGC 
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ATCAGATCCCATTACaGATGGTTGTGAGCCACC 

CAGCTTTGCCTGCAACTTTCTGTGTACCTTCAGGTGGTTTTA 
CCCCCCCACCCITTTCCCCTCCAAACCCTCCTG(^TACTACA^ 
C3AGTGCTGAGAACAGGCTCGGCCTTCOTCACTCTCT 
ATCTGTAAG^CC^CACTAATAATGGAAAGAGGGCCA^ 

AGATTTGGTTTTATAAGCTG CT CAGCTGTCTCAGAGCGGCTTGTCAGAGTGAGGTGGC C CrTTTACCTIXKS CXH'GGTTTGTGGATCT^ 

GAACAGGAGTCAGTAAACTGTAGTCTGAGCACAATCCAGCCCTCGACCA^ 

CTOCTGCTATCTGTGTGTC^CTCT 

AAGCCTAAAGCATTGTCTCTACA6CCCTATGCAGAGACATTGCCAACCCCATTTAA 
GGGATOTGTGTGCACGCACACACGTGTATATGTGT 



AACTAAAAACTATGG ACAAAATG CTT AGCTTTGAAAAG CAATT CTTAG CTAAGAAT AACAAGACCCAAAGTCCC^TCAAG CTTGTA 
GCATCTT C CTTCTTTGGATG CAGTGACGGC CTGTG AT ATGGAG CTGCAGG T ACAAAGAGG CCTT CTCTTTCTGATTT CAG AAATG A 
GACCTAAAGCAGACAGACAAGAAGAAGGTTCCTCrGGGTTC 
CCACAGCMTCGCACAGATGTATAAATCTCTGGCACCAGCATGAA^ 

AAATC CCAGCCTG CCCTGT TGCCTC CTAAAAA TTGAAGAAG CCCAGAGT CTTGCAAT ACAAT 

GTTT AG AAT AT CCCAGATT C CGTCCATAG TG ACTCAGGAG AG AAAAAG CTGGGAAAACCAAGGG CTG CATAGAGAAAGCAGGCACT 

AGGTGAACAGGCGCTAGGCCTGGOSCCAGATGGTGGTCAGCAGGTAAAGAGAATGGGCCACCCA^ 

GCACAGGATAGA^AGCCAGCTTCCATGAGTTGTCCTGTTC 

GAGCAATGAACATTCTTT'CAAAAGAAAAAAAAAGCA.CATAAGACAG^ 

GGC^GTGGTGATATACAGCTTTAATTATGGAGGCAGAGGCAGACCCTGAGTTCGAGGCC^ 

ATAGAGAC CAT ATG T CGAAAAACCAAAATAT AAAAT AAATT ACAG T ACTT AAAATATT AACT CTAGATT AATAG GT TTAAT AG AAG 

GCTAGAAATATTCAAAACATAAATTGAAAATAGGCGGCTAGGAAGTGATGTAGCTCATG^ 

GCCCAGTCCCCAGCACAAGAGAAAGAGCCAGTGGCAGCCTC 

TGCAGGGATAAAGGACTTGCTCTGCAAGCCTJ^AGACCTGTGCTTGTTCCAGCATCCATA 
GGCGCATGCCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCGGATTTCTGAGT^ 
CAGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCTGAAAAACAAAAAACAA^ 
TAAGTTCCACC^CTGGGAAAGCAGAGACACTGGGATCACTGGGGCTTGCTGGC 

AGAG GC CCTGTCTCAGAAAG CAGTGTTGG T AT AT ATATCTGGGTTAAACAG CAATTAATG ATAAAAAGAGG CT ATG AATTTG AAAG 
AGCAAGGAGGGATGTCTGAGGGCTTGCCAGGGAGAAAGAAATGAT^AAATGATGTAATTATAATCT 

AGTGG AAGTG AGTTTCCT AC CTTTAG ATCTCAG CACTTGGAAG CAG AGG CAGAAGGG CAGAGGGTAG AGGG ACTGGAGAG CTGG CT 

CAGCAGTGAAGGGCACTGGATGCCTGTGTAGAAGATCTAGGGTTGGTTTTCCTCTC^ 

CAAGTTCTAGGAGATTCGAAGCCCTCTTCTGCCTTCCGCAGCCACCAACTACAATTG^ 

AGATGTGCACATAATAGAATGCATACTTTATGAAAATAAGAGTAACGGTAGAAAGGGGACTCGTTCT 

CTGGTGCTTCTCAGAAGTCACTGAGAGGAAAAGGAAATTAAATGGAATTTA^ 

AGAGT TGCTGG CC CAGGGCT CC ATATCCAGCAAAGCC TTTTTAG CATGAAGACAAATTAAACACATTGCTAAATGAGG AAAATG AT 

AAGACTTCATTGCTCTAG(^GCAGAAGTAGAGAAGGTTCCTCAGGTCAGCGTGGGCCTGAGGACAG 

GCCCAGCAAGAAGTGCATGGAAGCAGAGCAGCCCGGGCTGCAGGCGTGTGGCCGCTGAC^^ 

AAGACTCAGTGTGCAGC^CAGTGCAGAC<3AAGGGGAAAGAGTGGGCAGAGCAGGAC^G 

TAGTTCCTGTAACCCATAGCCATTGAAAGACAGAGACMCAATGTGAGATCAAAAAA 

TGTG AAGCAGTACTTAAATAAT CGAAAG CT CTTTATTGAAAATATT CAAAAACAGAACAAAAAGGGAAAAAG CCTGACACTTAAAC 
AGAAAATAGTAAAATGTCATAAATGCCAAACACATAAATCCAAAACTGTGTTTC 

ATGCAGACATCGAGGCTAGCATCGCTGTGATGGCATCAGGATGTGGCTGTGGCTGCACC^ 

CTGGGTCAGAGCAG ACCTCAGCAAAGACAGAGAGGGTGGA^AGACAGTGACAAAGTG CCTG CATAGGCACTGG CAGCACTCCAGGT 

CACATACCCCGGTGTCCACAGACCCAAAGGCGAGCCTCCAAGGAGGCGGCAGGCAGAGACACCT^ 

GGTCCCACCTGGTGGAAGGGGAGGACTGACTCTTGATGTCTTCCACC^ 

TCCGCATGCACAAAGTACATGAACCCTGCCTGGGGAGCITGAAAGGAAGGTGAGAATGC^ 

CTCTGTAAGTTCACCTTCAGCAAGOTGAGCCATGTTCTGAATGATW 

TATT CT CTGACT AATAAAACTAAGCTAG AAATTTATG AAAATCAAAAT^ C TAGAAG CTT AG CAACACATTTT TAAGT 

CATCCATC^GTCTCAAAAGAAAATCTCAAGGGAAATTAGTGTCATACATATTA 

TAGCATGTGGTAGCCCAAGCTAGCCTCAGACTTACC^^ 

ATGCCTGGATCAAAGGTATTTTCAACTAAATGAAAATGAA^ 

ACCATCAATCCCATCACTTGGGATGCCTGTGAATTCAGAGCCAGTCTAGTGCACATAGTGTGTGCC^ 
GTGAGACCGTCTCAGAAACAGAAAGGGGATGGGGGAAACTTTATGGAGTTTATTTAACA^ 

TAAATACTTGG AT ATGAAAAAAGTAT AACTCTCAAATCAATCTT CAGTAGAAACAATG CGAAG ATAATAATACAAC CATGTTTAAA 
<^TGTATTTATTTCATGTATGAGTGCT^^ 

TGAGCCACCATTTGGTTGCTAGAAATTGAACrCAGGACCTCTGGAAGA^ 
CCCATATATGAGTTTATATGTATATTCCTTTGCATACTTA 

TCAATTTGTACCTCAACACATCCCATTAATGAAAATGTCAGGTTCAAGAAATAGGA^ 
AGAGCACTGACTGCTCTTCCAGAGGACCTGGGTTCAATTCTCAGCi^ 

GGAG ACAGACACCCT CAC ACAGACATG CATGCAG ACAG AACAC CAATGCTCATG AAATAAAAATG AAT AAATT AAAAAAAATAG G A 

AGAGGGGCTGGAGAGATGGCTTAGCACCAGCATCCRCATGGCAGCCAGTT^ 

GGCATCAGGCACATAAGAGTGCACAGATGTGTACAAGCAAAAGACAA^^ 

C^GTGCTGTGGTTTTGTGTAAGTGGGTTAGTGGGGTACO^CCCAGCAAGAGGG^ 

ATACACAGAAAAGTCCTC^GCGAACAGTAAATCCAGCAATATGTAAAATGGAGAAATGCGCCATAACCA^ 

GAAGTCAGTCACTGTGACTGGGGCG^GGCGCGGGCGTCAGAGACCTGCTGCTCTCACAGAGGACCCAGOT 

CAC^GTTGGCTCACAGCCATCTGTAATTCCAGTTTCAGGGGATGTC 

CATACAGGCAAACACTCATATACATAAACTTTTTAAAAATTTTTAAAAGAAAAT^ 

AAAG G CAG CATAC TCAG ATAATAATTTCAAGAAGAT AATT ACAAACG CTAAAG CTATAAACTAAGGAGGGG CTGTG ATCTTGGAAT 
AAGTTCTTAGATACGTGTTGTTAGATTAATTCTC 

ATGCTGCCCTCTTTGCCCCACCCGCAAGTCCTGGATTAGAGGTGTGCAGCATCACCCCAGC 

AAGAACTGACCAGGTTGG AAAGAGAAGT AG GGAGGAAGTATTTGCCTTTAATCT CAG CAC TTGG GAGG CAGAGCCAGGAGG ACCAG 
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15 



25 



GACAGCCAGAGCTACACAGAGAAACCTGTCTAAACAG 
TGCTCTCGGAGAGGACCTGGATTCTGTTCCCAG<^CCCACATGCT 
GCCCTTAGGCCTCTGTGGGCACAAAGCAC^CATGT^ 
AAGAAGAAGAAATGAAACAGGATCTCATGCAACCTAGGC^^ 
5 TCCACCTAAGGGCTAGGGTTATTGGTATGCAATACCATGCCTGTGCGTGTGTTTACA 
TTCTCTCCATTTTCTTGAGCrTGAATCTTGTCTCrrcCA 
TTCCCATCTCCCCGAGGGAGTGCTGGGATGCAGGTGTTT^ 
TCTCCAGGCTCTGGTACATTTCCTCTGCCCTAGAGTTC^ 

TAATAAATAGCAACAAAACAGACAAACAGGCAAGTAACAATAACACCCGGGCAC^ 
1 0 ACACAC^CTGTAATGCTGCCCAGAACCAGGAGTATAGCTTGGGA^ 
ACTGGGGGCGCTGGGCAGGGCTGCTTTTAGTGTTTGTGTCM 
CCCCAGTGO\GTGAACTCTGCGCTCTGCTTG 
TAAGATGCATGC<3CGCGCTCACAAACACACACACACACACCAC^ 
CCCTGCGCTCCTACTCTCAGGCATTGCCTCG^ 

ATTGGGATGGACATCCGTATTCTGAAAGCTAGGGAAGGACACAGAACT 

CAG AGCAGG CC TGCTCAGCCCnXjGCTGCCTTCCTCTCCCAGGCTCAGGG CAGCTCTGCCATCTG CCACCAGGAGCTGACAGTGACA 
GAGCTTGG ACTGAG CTCCTCAGCAGTTTTT ACTGAGCGATG CCTGGAACAG GG CGGG TTT CAGG GAACTCG CAGC CAATCTCGAGA 
CTTTTCTGTTGTCCCACGTGGCACATGTTTATGGAAAATGGCTAATGGGAAAACTT 
GATGTGAGCATATCTGGTTTCTGTTTCAGAT 
20 TTTGGTTTATGTGCCCACTATCGCCTTGGTCTACAGTATTCIAAGTG^ 

TTTTATGGCTACTCACTTTGGAAATAAGACGGCTGTAAGTAAATTACCGTGTGAAAC^ 

GCACTGATGCAGAGTCGTAGCCAGGGATACCT CC^ 

TGTTTCTGTGTCAGGAGACAGTCTGCTTTGCCTTTGA 

GCTGCTCAGCATGGGTTACAGGCTGGATGAATTTGTACATTTTGAGAAATGAATC 
AAACATAAAAGCCACCCTTAGTTCATGAGCTGAGCTAAAAAGTCCTTAGCCGTGTCCCCAGCCTGTA^ 
NTTGTAGCTGTGAGCTTGTCTTTCTGCCTTTTCTGTTTCCTCTCCGTTTTGTCTTGA 
ATGCAGTGGCTCACGCGTTGGACACAGGCTTTCCTACTGAGCCACACCCCGAGGCCTAGAGAGGAAOT 
CCACCCTGCGGTGGGGCAGTTTGCTCTTACCTGACTATTGGGTCTCTAAGAGTCCTGACCAA^ 
GGCTACAGAATCGTTCCATAAGCAGGGCTATCTCCTCCCTCGGGTGT CTTAGTCTTCTAGTGAGGT CACGG CAGCTTCTGTGCACT 
3 0 TG CTT AATGGTTTTGTGC'nTl'l'CITAG CAG C CTGGATTTG TCTG AG TTGG CAAAAGCTGCGAAGAAGAAACTT CAATCTGTGAG T 

ATCTCTTCCCTAGGAACCCCAGTATGGAACCCTTGTT^ 

AGACAC^TGCTGTCTGGTCTGAATGCAGTAACCMGACCAAAAGCACTTTAGTATATCCAGGCGGCT 
TGATCTTTTCTGTTAAAAATGCAGCCCTTGCCTGTGATGGTAGAGAACCAAAGTTGTGGGTGTGTCTT 
GGGCACCCCTTCTTGGCATTTGTTACGTTACTCACATGGGATACCGAGGACTAATAGTAGCTGCTGGTAGGGCAGT 
ATCCGTCCACTGCAGTGCTGTGACTCCTCCCCATCAATC 

GGCCTGGACCCATTGTGTGTGCTCAGGGTTTACCCAGCCCTGCAGGTCACAAGATCA 
CCTTTGTCTGTCTTTCCCATGATGCCTCACCAGGTGCTTCCTGCTCACCTTGT 
CTGCATCAGAACCAAGAGCCTTGACTTCTCTCTGTGACCTTTAA 

CGTGCATTGTTGGAGCAGCTCTGTGAATO^CTCTCTTC CCCTG CTTCAGGACGTGTGCCTG 

CATGTTTGCCTTGTCTGGTTCrrGGCTCCTCTTAGACTTAGAACT 
GGTTTCCCTCCTTCCCTTTCACAGAGATTTTTAGAGCCT 
TAATTATACTGTGAAATAAATGTTTTATGCAATTTC 

GCCTTTCTTAAGG AAGCCACCTAGTAGATAAATAAGATGTCACGCTTTCGAGGCAGCG CAAGAGTGTGTGCTTT CCTTGTCCTTAC 
TGTAGTTTGTAAGCAAACTTGTACCTGGCATCAGACT^ 
45 GTTATGGGGTGGG CATTAAATACATACCTTGGGTATAGTTTCCTATGCCCTTAAGACCTTGGG CTGTCTGTGTGTCAGG CTTGCC C 

AGCTCACGGCAGCCCCCAAAGCACAG CCTTCCAAG AGGAGCGCTGGGAAGGGGACACTGTCGGGCA CATCCCGGC 
CCTCCTCCGGGTAGGTGTTGCTCTTAGCAGCGC 

CCrTCCCTTCCTGAGTCTGTCrCATACrCTCTCCTTCCTCCCTAGTGAAAAAC^ 
AATGTATTCTTGG<3TTATGAAAATGTCTGTTCCT 
50 TGCTGGCTTTGGATGGTGTCCTTAGCACTGTGTCTAAGATGCCACTGGCCCTGAGCAGTTCTGCCATGTGG 
AAACAGCTTTCAATCCAAACCCCTGCAGGGTCCTCAGTGAGCGCCTTCTGTGGGGTTTA 
ACTAGGGAACTTAAAGAAAGATGGGACC^TCCTCAGTCACCTCGCT^^ 
CCTTTATTTCCTACGCATACCATTTTTTTCTTTCTCGAAACTG 

GG CAGTCAGTAAAG ACACAGG ATGCAAAG CAATAAAG AGG C CAT AGT C C TCGAGTTGCAGGAT G ACAGAGG AGCCAGAC CC CAAGG 

5 5 CTGGGGCTTCAGCCTGCCTCCGTGGGGAGACCTGGATAGTCCTCCA^ 

GCTATGAACTGGACCCCAAGCGGCACTGGGGTGGGGTGGCGGCCACGCT 
TTGTTACACAGTGGCACCCTTCCCCACCCCAGCAGTG 

TGAGCCCCGTCGCCATCCCCTTCCTGTTGTAGAGCAGACATAAGTTCTCCGGACAGCAGACTTCT 
AGCAGGGGGGACTTGTGTATGAGAGGGTTTGTAACAAGTTACTCTTG 

60 CCTGAGAAATGCTGTTCTAGAGCTGGCTTTGAGTAGGAACACCCCCACCCCCACCCCACCCGCCCGGAGATGAAGG^ 
CAGAGGCATGCTCCCATGTTAACTGCTGTCCCT^ 
CCATGGATGTGTACGACGAAGTTGACAGGCGAGAG^ 

CCAGCTGCTGGCCTGGGTTTCTTCTTGTCCAGATGCCCCTGTCCTCATAGCCCATGTGCAGA 

6 5 CCGGCTAGTCTGGCTTGCCACGCAGAACCACAGTACCTTGGTAACCGAGACGACCGTCG^ 

ACTC CT CCACACGAAACCAGG CAAGTGGCT CTGT ATG AAGAGTTACTAATAGTATTATTTGGGGAGTG ACTGATGG CGTG ATATTT 
GGTGTCCTTGTGGACATACACGGAAACAGA.CGGTGTGAGAATTGGTAAGAGAAGTACAAACGACACA 
TAAAAGATCGTAAAGGAGACTCCAAGTTGTCGGGGAGCATGGGGGTCCCATGCCACTGCCTA 
TGGTGGGGTCCTGTTGCTCAGGAAGCTGGAACAGGGGGACCTCA^ 

7 0 GATGGTGGAGAGAACGAAATGGGGTAATTTGTTTTCC 

TCTGACAGG AGG G AGAGAAGGAGGAG AG GGGTTGGTG CTGAATAATGG CTGACTGT AAGAG CAG ACCCACTTTATCCCTAAAGTCA 
GAGATGTCCCTCCAGATTCACTTCrrCACGCACrGCAAATTAAATCA 
AAAGTCAGGGCCTGATCCTTGCTTGTCAATTGTCCCAGCAGT 
75 CTGCTGTAGAAATGGAGATGTTAAGTCCTCTGCCACA^ 
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TGAGCTGGGAGACCCTTGCGTCMCTGCAGTC 
CCATTCTCTTCTCTCCCTTAGGGCAGACAG^ 
TGATGCCAAGAGGAGACAACAGGGCA^ 
CTTCTGGGATGCACTGTGGAGTCCCGAGCT^ 
5 CTATGACAGAGAAGTACCTGTTGGAACCCTGAAGAATAAGG 

Ai i a i<^TAATCACTTATTTTCCCCAAAAATTGG CATGTTGAAATTAACCCAAAAGCCTAACTCTTAAATTTATTAAGGACATA 
AAGTGAGGCGATGGAGACTCTTGATGGCAGCTGTCACTTA^ 

TTAAATGTCTGTAGCGAGAAGCCTTGGGATGTCCGCTGAGCATCTATTTCAGAATGGC 
GCCCGCTTCCTCCTAGCCAGCCCTTAAGTTTGTTCrrGTTrri'l'TlAT^ 
1 0 AACAAGATGTTTGTAC CATAAAAATTAAGGTTC CTGTCAATAT CACACT C CC CTAAGGCAGGAATTATTAAGTAA.CAGTAATTTAT 

GTTTCTT CrrGATTGTTGAGGAATTTGGTCCTATCAATC CAGCCAG CAGTAA^GATGCT CTGAGAAGTGG CATT AAR.TTGGTGAGG 
TTGCAGTCTGGAGAGGTTTTGTATAAACTTGAAACCTC^ 

TATAATGTGAAAAACZATTAGAAACAACT AAATAAT^ CT CCTGG AAAT 

ACTTCATGGTGTGGAGTAAAACAATACAGTCTTTAAAGGTTTGCTATAGTCTCT 
1 5 CTGTACAGGX3AAAGGTGCCTTCCCCAGATGAATATGCACATGTGGTTGCTCCT 

GCTGAGGCTGAGGCTGGTACCCTCCTGCAGGTTTCAGTTTCATACCCTGTCACTGGCCTCGCG 

CCTGTGTTCTCATACTTTTTGTTGGTGAT^ 

CATGCAAGAGGGCTAAGGAGAGGTTCGTGGGAAGCT^ 

GTGGTCATGAATCCACGGTATGTGTCACACAGGGTGTCCT 
2 0 CGGCTCTCAGGAACTGTAGTCCTTTGTCTCAGTGAGTGACAAGCTTTTA^ 

ATATG CTTAG AACAGTAACTAGAAGAAG CAGGCCAG CGTGT CTCAGCCTGGGTGAT CG C CACTGTCCCTCACGAGTGGTTACAGC C 

TCTGTATTCAGTCTT TGTACCGTATCTCTGTATTGGCATGCTGGTGGGrri"rGTTTTGTTTGTTTTGGTTTT^ 

AAGACAGGGTTTCTCTGTATATCTCTGGCTGTCCTGGAACTCAOT 

TCTGCCTCCCAAGTGCTGGGATTAAAGGCGTGCGCCGCCACCACCCGGCCA 

2 5 AAATAGGACC^OTATATCTTAAGAAATCAGAATTCACTGTCCAGATGAAGCTC 

TTGAGTTGCCTCAAATAAACCAAAAATGTTT 

CGTCTGTGGAATGGACACTACTGATAATGTAGACACGTGTGTGTCTACTTGGTCCAGT 
GTATATGAGTXjACCACTGCTGTTCTTGTCTTTC 

GATTTTTTGAGACAGGGTTTCTCTGTGTAGCTCTGGCCGTCCTGGAACTCACTGTC 

3 0 CACCTGCCTCTGCCTCCTGAGTGCTGGGATTAAAGGCGTGCGCCACCACT 

GCCCTGTGCATGCAAACCTGTGACTTCCCCCCT 
CATTGGAACTGCTCTGTGTGACTGTCCCTTAi\GCCTOT 

CAGAAAGTCAC CTTAG T AGTTTTTTAGAAGGAAAAATATATTTT AACT CTGAAGT AGTTTTT CTCCT TCTGTG AACAG ACAATG TG 
GAGCT CATATTG AG AACAGT CAGT AC C CAGCACAGCACTGAGAG C CAAGACAACGAC CAGC CAGACT ATGACAGTGTGGC CTC CG A 

3 5 TGAAGACACAGACGTGGAGACGAGGGCAAGCAAGGCAAACCGGCAGAAGGTGAGA 

TCCTAAAATGTGTTCTTGCCCTGGGGCTGTAGCTGAATGGT 
GCCACCAGAAGACACATACAC^TGGCAGAAAAAGTAAAACTGGCAGAAGGGT 
TAGGCTCACTGCTAACTCAAGCCAAGCAAAGATTAAATAAATGCTCTTAA 
GTTGACTCTAG^OTGAGGTCAGGACTGGAGGCTTCTCTGTGTCCCAGTGTGCTTAGCTCCTCCC 

4 0 AGCAGGGGACAGGAGACTATGCACTGCACAGGTGGTGCG 

CATCAAGGGAGGAGCTCITGACAGTACAGGGCrCTGAAACAAGGCCrCATAGATGTGCTG 

AAGAAAGCCTGGGCTGGAACAGCCTAGAAGAGCCCCACACTGTGCAGAGACTOT 

GC^TTGCAGCACGGCCTCAGGGCTCAGGCCAGTGGG^^ 

GGGGTCTG CAGGCTGAACCCTCAGGACAGACTCAGATAGGCACTCCAC^ CC 
45 GAGGTCTGCIAGGTTTGAAGCAGGGAGATAAGCTTAGATTTGTCCAGCA 

GGACTGCCCCGTGCCAGTCCACTCAGTTACAGGCAGGGT 

CTGCTGCCCTTTCAGCCTCTCGCTGAGCACT 

GCTCCTCCCC^GGCAGCCaVGCCAGCTGCTGCGTrc 

TGGAACACCTTAGCCATCCCCTTTCCTGTCCTGCnX3GAGTGTGAGCX3 
50 . TGGAGACIGACTGATGCTCACTAGGTCATGTGGAGCGAGTGAATCCATAGAGTAA 

GGGCAGCGCAGAGCCTTGGCAGCCAGTGTCCACA^ 

AGTCCTGGAGTGCCCCTGCCTTTCATATGGTCCACCTTTGCAGTCCAGCGGGCGTGGTGGAGATGGAC^ 
TTTAACATCATCATCGOSGATTGTGAACATT^ 

TGCAGGAATTTATGGAGGTCAAAAGCG CCCTAGTGG CTTCTG AC^CCAAGAGACAGCAGCTAATGAAGGTGAATAACAACTTGAGT 

5 5 GACGAGCrTGAGAATTATGCAGAAAAAGGTAACATGTTAATAAATGCCAGGATGGTTGACTCGTTGATCC 

GTCAGAGGGAAGAAAACATTTACTATAACAGACATAGA.CATACTGTCCTCTTGAAAGCAGGCTGTGTG 
CGCTCTGATGAGAAACGTGTTCAGTGTAGCACAGAGACTATAGCCCAG 
GGCCAGGGCCTGTGCTTGGCTGGGATTCTAATTCAGTGTATG 
^ ^ GTAAACTCCCCTAGGAGAGACCTAGCAGCCCACTATTGGTGTGAGATAGCAT 

GTGCTTCTCAGCGTAAG CCCTGGCACGTACCATGTGCCGTGCTTTCTCTG CTGGCACTGGCTCTGACTGCTACCCTGG CAACCCTC 

GCCAGCATCTCAGCCTGCTGTCACCACCTCG^ 

GGCAACCCGTGTACGCTGCACTATCCTAGTCATTC^ 

AGGG GTCGCC TAAGT CCATTGGAAAATG CAGAATTACAGATAGG AAGTAGTAACG AAATAATTT^ C 

6 5 ATGAGGAACTGTGTTAAGGGGACACAGCGTCAGGAAGGTTGAGAACCCCTGCTT 

GTGGGGAAGGCAGTGCITGGGATGATGATGGAGAGCGACAGATGGGAA^ 

GCTTGACTGTCGTGAGAAAAAGCTCGCTATGCACCCCACGCAGAAAAAAATAGGCTG 

GCGATCTCIAGACTCAGAGTATTGTAAGGCCTGGTCACAAGGTACATCrT^ 

AGCACTCCCACTGGCTAGGGAGCACTCCCACTGG CTAGGGAGCACTC CCACGGGCTAGGGAGCAC?TCCCACGGGCTAGGGAGCAAG 
70 GTTGTGAGAGCTCTGCAGTCAGTCAGTGGCTCTTGTGCTGTAGTGCAGAG^ 

CCTGCCTTGCAGCCCCTTCTAGGCTAGTGCA CTGACATCTCC 

TCGGCCTTTTCTGAGCAGGGTCACACAGATGTC 

AACTTGCCTTTGATTGTTGGTCTGCGGGGCITCACCTTTT^ 

G AGATGGC CCAGGT CAAACATTG AAAGACAAGAAG AG GACTG AAGTCCAGACCCCCAG7VG CCCATGTG AAAGACCAGTTGGG CACA 

7 5 GTGGTCCACCTACAGTGCCGTGCAGCAAGAGCAGT 
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TCAGG CTCACAGGGAGAG TGATTGTGGGAGATACCAGTGCT CCACT CGAG CCTCC CATCACATGTGTGT ATG CACGTGTATG CTT A 
AGATTAATAAAG AAAACrrCAGTGTTGGCTGAAGAGATGGCT CAG CAGGTAAAGGAACTTGACAGCTGCAGAAG CTGACAGCTTGAG 
TTCAAGTCCCAGAGCCCACAGAAAGCCGGAGAAGAAGCCTCAGAGTTGTCCTCTGACACCACACGC^ 
CACAGTCAGACTTCATTTTTATTGTAATTGCTAGAGACAGCTC 
5 ATTGGGGAGAGACTTAGTGGGTAAGAGACGAGTACCCAGCCTGATGG 
GTACCGTGGTGTG<^CTCACCCTGACACACATGCACATACACT 

ATCATACAGATG C CTGTGATTTTTAAACT AGTGAATTATCTGT AACTTTTT ATTTC CCAGAGTACAG CCTGG 

AT CTTTTTTTTTTTTTTTTTTT^ 

AGAAGAGGGAGGCAGATCTCATTACGGATGGTTGTGAGCCACCAT 
1 0 CCGGGTGCTCTTACCCACniAGCCATCTCACCAGC^ 

TAAATGCAGAGTAATGAATTTGGACACAGCCCTGT^ 

GGCTAAGAACTAGTTCAAGGCCCGCCTGAGTAACTTGGACTCTGTCTCAG^ 

GACCATGTGCCTCTGTGCTTGAGGCCCTGGGTTAAATTCCGTGTGTAGGGGAGAGG 

ATCTCAGAAGGCAGCTGGCCAGAGTO^GACTGGGTGTTTCAG 
1 5 TCAG AATAG CACAGAGAGCTTTTAGCTCAG GCAGACATATGAGCCCT G CATCTAGGGATATAGTTT CTTG TATTATT C C CTCTCTC 

TGGACCCCACCCAGTTTCCATTCATATCCCGACAGCCCACACACTTGTCTGTCCrrGGAA 

TTTG CAAAGGAGGTTACATAT ATAGGCGGGGAAGC CATGCTTGTGT AGAGGTAGG CACTACAGATAG AGCATGGGG CTTGTTCACA 
AAG CAAGGGTGT ACTGTTTTTATTTT AAAGTTTGTAATAGAGTTGG CAAAGT AGT GTGAT TTATATCTG AT CTTTCACATTT AACA 
TTTG TTTGGAAACG ATGCAAATTAATG AAG AGG GT CAGCCACT AGrGATCTATTTCCCACAGArTG GTG CTTT CTAAAT AAAACTT 
2 0 GAAAGTTAACTCCCAATACTGCATGACCTTGCTATT^ 

TTGGCACTGGATGACAGCATGGTCAAAGTGTACGTTCTCTCAGAGAAGGCAGTTGGACTCATACCATCCC 

AATG CGGGG C TGAGGCAG CACAG ACACGGT CACTTGTATT T AAGATG C CAGC CT CATAATT C CT TGCTGATCT CTG AT CT CATGAA 

GCAGCTCACGTGTGAAAATTGCAAAGAATACTCAGGGTCTCTGTGTGTCA^ 

GCCCACTCAAGGCCAGCATCACTAACTGCATGGGTCAAAGGAGAGCGAGGTCAGGTCACAGCCCT 

2 5 AGGGAAGCCAGTGGACCTCGGGAGTGCCCTTCTCGTGAGGACACTAACCTAGTTCAAGAGGTTGCCGGTTTCTTGTGATACGTTAA 

ATTAGCAAACTTACCATGGTTTGGTCCCT^^ 

AAAGACATTTTTATCCTAATTTTTCTTGTTGGCGTTTTGGAAAGCAA 

TTGCrrrTGCACTTTGGGCTGCTTCCATGGCATCTCCrrGTCACT^ 

CAATGTTCTGAATCTCTTCCTCGAGAGAGCTTTAGAATGACCCAGAGTACTGCT 

3 0 AACTCTCTAGGCGGGTAGACTTGAATTTCTCATGAGCCTG 

GCCCCGCTAGCTTTATGTCCTCACAAGGTCGTGTTACGTTGGTGTACTTGACTTTGC 
CCCACTGCCAAATGTTTCTTTCAGCTTCAAACTCTCCAG 

GGTG CAGACTGGCTCTGACCACAAGGACACTGCAAGCCACTCGTCCCT CAGTAGACCCATGT 
CCATGTACGAGACTGGATCAGGCCAGAAGCCATACCTCCCAATGGGAGAAGCCAGCCACCCTGAGGAGAGCCGGACGAGACTTCAA 
35 CCCTTCCCCACGCATGTGAGTAGCACTGCTGTGTGTCCCATTTCTGCTGTGCAGTCTGACT 
GCTTGAGGACTGAAGGGGCTTGGCTAAAGTTGGGCGTTCTGGCTCTCGCTCAGCC 
TCATTCCCAGGCAGCCTCTGTTGGCATGTGTTGTGTAGAGCTTCCTTC 

GGCAGAAGTGTCTGTCTTCATCATAGGCGGAGGCTGGGGCTGGAGACATGGCCCAGTGGTAACCACA 
CCACCCAGGCGGCACATAGCCCACTCCCAACCAGCTTAACT 

4 0 GCATG CACACG G CACACAG AC AAGCAGGCAG AACACTCAT ACGTAAAATGATCTT TTT AAAAACAACAG AGT AGG CAG AGG CATGC 

CCTCCTTATGAGCTGGAACTCCCCTGTGACrrGGATACATAGAGTTGAGAGAGTOTGGGCT^ 

T AGG CT AGT AGCATG ACTTGG ACATTTCATAAAGATGCAGTGT AAG CAG C CATTAT ACCAGCACACT CAGG AAGCTG AAG CAGG AG 
GATTGCCTCGACCACATAAGATCTTATCTCCAAAAAAAGGAAAA^ 

AGTGTCTGCTCTGTGGTAGGCTCTCGCCGACCACAGTGGTGCACGTGT^GGGAAGCCAAGTT^ 

4 5 TTTGGACTGATCTATTGTAAAGATACCGGTTTCCACAGAGTGCTCCAGGCATA 

GTATTCATCCTTTGGAATGTCAAAATTATTTCTTCAAA^ 
ATAAGATCTAATACATTTGGAGACTTTAGCAGTGTAAGACTGCACT 

ATGTGGGTTTAATGCTTGAAGACCGGTCACTGACCTGACTGATGTTCTCAGTGAAAGGAGCCTCGTTTTCA 
AACTTAGGCAACCTCACCTTGGTAGCTTACACAGTGCACTTGATCCTAGAGGGAGTCGGTGTGAGG 

5 0 CTGACCTGAGGACCAGGGTGATGTGCACACCTGTTCAGAGTCGTCC^TAGCGCTGTGTGGTTTTGTCCCT 

CAGCACTGTAGGTGCTAACAGTGAGTGCCAAGTGTAGCTACGATGGC^ 
GCAGAGTGGTTTCCAGCAGGCCACTTGCCTAATTAGGGATC^ 

AGTGGGAGGTGGGGACGTGTTTGTTTTATGTTTTCCCAAGCGTGTTTCTCTCTCCACTTCCAGATTC 
CTCCTCTTCGTCTCTGCCCTCCTTCCCCTCra^CTTTCCTC 

5 5 GTCTGCCCTGGAGGAGGGCCAGCACCCCTTGGCTTTGTGCCCCCGCCCCTTCCTAACTGCC 

CTCCTGX3GTTCTCATAGCAGCAGGCACATGCAGCCGTGCACATCCGGCCTGGAGAGAGCT 

GCGCCTTCCACGCTGCCTCTCTCACCTGTCTAACCACCTCCCAGATC^ 

ACTGAACTTGAGCTGGCTACTGGTCACAGAGAGCAGGGCAGTCCACC 

T TGGGTTAGCGCAAAAGTATCTTAGCTTTTTTCTT CTAACTGCTGACTTACTTTTTATTAGCACC^TTACCGCTGACAAGC 

6 0 CTCAGTTGGTACTGTCATGCAGAGGAGAGGTGGCTACAGGCTTAAAGCTGTCCACTTGGAATC 

GTGTGACTCACGCCAGTGGACTCCCTCACACCCACCTTTAAGAAAGGCAGCTAAAGGGTG 
GTTCCCAAACAGACATTGTGTAAGAAGAAAACTCTTTC 

AGCCCTAGTGTTAGAGGCCTG CCCTTCGGCTCCTAG CTCGGTCTGTCTGTAG CAAGGCTGAAAAAAAGTAGGGAACTGCAAGCAAG 
ACACC^CCI^CCCCAGCAAGCCCCCGTGTGGGGAAGAGGACCCrGTGCA 

6 5 ACCAGGTCACTCTGAGTTTCCTTACAGCCCTTGTGAAGTCAGAGGA 

GGAAGGGC^CTGTAGTCTTTGTTCCAAATGGTAGTTACGGAAG 

TGAG AGT CAGGT AGGC CGGTCTT ATCATACAAT TTTGT AGrTT T TT TG ATTTGT AGTGTGGTAGCTTTGAAAG GG AC CAGTCACTG 

AAGTGATTCTGTCACAGTGAGCAGTGACGTCATCCAGGGTCCTCTGAGCTCCTCCTGGGGG 

GTTTGTCTGTGCAGAGAAGTAAGAGCAGCTACAAGGTAAC^ 

7 0 AACTTCCATAAACTTAAATGCCGGCTCATTGAAGTGCTGTGC^ 

ACTTGAGGAGCGTTTCCTGAACGCTGTTAGTGGAGGTGTGTCACT 
CTGGGTAATGTATTCAATCTCGAACTGTGACCGTGACATC^ 

CAAAAGAT TG CAAATTT AAGAT AAAC CTGG C CTACATTAG CAACACTGT CTCAAAACCCAAAATGTAAGCT ATGG AGGGTGTT AG C 
ACTGTCTTCTAGATTTCTOTCCCCACTGGAGCAGTTTCAATGAA 
75 TGTGTGTGTGTGCGCGCGCGCXaTGTATATGTCTGAAA^ 
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ATTCCTTTACAGGTAGGAAAGGTCTTAGGGGGGCAGATTTCT 
CTGTGTCCGCTCTCTCTAGTGTCAGTTACCCGCTG 

AAGCCGGGCGTGGTGGOGCACGCCTTTAATCCCCTGCTGGGTGAAGTGCMTCT 
AGCCCCAGTGCTCAGTATCX3TCCATAGAGTGGGAAAGAGGATGTCCAGGGGACAATOT 
5 CGGAGCAGGTCTCAAGACGCCACTCCTCAGCCGGCTGAACCCCAGGCTCGTC 
TTTCTGGATAAGGCTGCACATCTCATGTGACTTAGCAAAGCOT 
TACCTACCCGTGTACTTTAATAACCATGGCTTGAATGCTGGCT 
ATCGGCTTATGCTGCCATCCAGGTGAGGGATTCT 

CTCCTCCAGAGGATGCCATCCGCGCCACACCCCCTCACACTGAC^GACCCCA 
1 0 TTGTAACCTCCACAGTTAGACCTAATCTCTGTATTTTCAGAG 

CTTCCAGGGTTAATCCGTGTGGTTCACAGG CAG CAG CATGGGG GCTCACAGGCAGCAAGG CTT C CTTGAGAGCT CATG AAAACCCT 

CCCTTGGGTCAGGTTTTAGCAGGAGCAG 

TTTCGCAGACTGTGCCTGAAGGTGGGCAGCrrAGCTTATCCr 

AAGTCCAGCrrCTGAAGTGTGTGTTCAGATAAGTGTGTTTGTTACTGACTGC^ 
1 5 GTCAGACTGTGGGAGGCACCGGGGTCTGTTTAGGACTTGTTAGTGA 

AGTGATTATGACAACACTGCCTGTGACCCTGAGCCAGACGACACGGGGTACG 
CTCAG CACCCX3 CX3GCCCTGTGTCTGTGGTCATAAACT 
2 0 TGACATGCAGGATCCGCTGTGCCACACGAACGTGAGC 

GTGTAGCTGTGCCCCTCTGTCAGCCATCCCCTCC^CCTCCTTCACCCTGTCCACAGCAGC 
GAAAGCTCCTAGGATGGCTGTAGGTGCCGGGTAATAAGCAGAGCCCATGAGCCTCCCACTGA^ 
CAGCCTGCTTGAGTGCCAGCGGTGTGAGTCACAGACGCA.GGAACCG 
CTGTTCGCAGGTCAACCCGGAAGGGACGGCAGCGGAGCATGCTCrGGCAAGGTGA 

2 5 TCGCTCCCCAGCCCCACCCTCCCCAGCACAGAAGACGTC 

AG CAG CCCAGGAGAACAAG CATGACAGGTAC^GGCCGCCGCTGCG CTCTGGGAGGGGGTTCCCAAACCTAAGCATC^ 

GGTCCTGTACTGCTTTCATCATGTTACAAAGAAATCTGGGGGCTGGAGAGATGGCT 

GAGGTCCTGAGTTCAATTCCCAGCAATGACATGGTGACrCACAACCATCTGT 

AGAC^GCTACAGGGTACTTATGTATAATAAATAAATAAATAAATATTTATAAGTAAATAAATAAAGTAATCTGAAAGTAAATATAT 

3 0 CTAGAAACAAAACTGACTGCAGTGTACCAGGGGCAGGAGAGAA 

TGCGCCAAGATTCCCHXaGGCCAAAGCTTGCAGAGCT 9 
TTGAGCATGTTAACATGGTATAAGGTGACTGTAC 

GAGCTCAGTAAAGAAGTCTTAAACAAGAGCCACGCCTCGAGGGCTCCCAT 
GAAACACTGCTGGGCTACAAGCCGCACTTCACTGATCGCTCTCTGATGCTG 

3 5 GCCTCCAAAGATTACAATATGGCATTTGCTTCGATTTGGGGAGACTT^ 

CGCCTTTGCTTTGAAGGCCAGGAGAAAGCGGTGATCTCAGAACCATCGGCTTATGCTCCTGAAGCTC 
ACTCACTCTCTATTAACGTCATGGGTTCrrGTACCrrCCCCTGGGACGCGTGCTGAGCTGG 
CACACGTGTCTGTGTCGTGTCACTCACGGTTGCTGAGGTGCAGGCTTGCCAGGGCCCACCTGCACCT 
CCGAGTCGAGGAAGTTCCCOTCTVCTGCCCGTCATCGCrrCCTGCAGCCTG^ 

4 0 ACACACACAC^CACACACACACACACAC 

TTAATTTTATAATCTAAAAAAACAGGGAAAGA 

CCCAGGCATTGGGCAGTCCTGCACACGGAGCCATGGCTAGGTGGCAGTGTTGCCTTA^ 

GTCCTCTAGCACACAGT CAGTCCTCAGGCTTCAAAC CTGCACG ATAGGCTACAGTGAGTCAGCCACCTGCTG CTCTGTCAGTGCAG 

4 5 ACGGTCAGAGTCCTTGCAGAGCCCTCTCAGCACACGGCAGCCACCATGCCT 

TTGCTGCTGAGATCAGATTGTCTGTGGGAACAGAATTG 

CCAACTGGGATTTAAGGAAGTGTCCTCAGCCCTTTAGTTGTCATCT^ 

ATCTGGATGGCAGGCACCAGGCGTAAAAGAGCCATGGCTCTCTCACCTCTGT 

5 0 TCCCTTCATTAACCAAGTAGAGGTGGGGC 

CTCCGATATCTTACAGCTATATTCCCTGCTCAGAGAGGATACACGTTGCTGTAACCGAG^ 
CTCTTCCTGTTTATCTTCAAATGAGCTGCTAGATGCTGCTTCCAGGTGCTCTG 
CXH'GCACACCGGGCTAGCGCAGCACCGCAGAGCTTCAGCCCCrrAAC 
CCGTCTCAAGGGGCTTTGAGATGTTTTCACTGTGCAGGAACA^ 
55 AACGCAATGGAGTTTAGGATATAGCACCTAAGCTTCTCTTTOTAGGCCACGAGTTGCAAAGC 
GCTGGGTGGTAATGGGGCATGCCTTGGGAAGCCGAGGCAAGAGGCTAACCTGATCT 
CACAGAGAAACCCTGTCTCGAAAAAAAAAAACAATAAAATA 

TTGGGAACTGGCTCACTCCATGTTGCTGCCTCCACACCCTGCAGCCCCCAGCAGACGGCT 

60 CCAGCTTACCCACAGATGTG<^GCTGGTCACAC 

ACAATCACCACCAAAG AGAACAG CAGCTGAGCTCGGGTG CTGTCTGCCTGCCCATAGACTATACACGAACTCTTCAGATTTTTACA 
AAACAT TTAATGATG ATT TCAG ACTTTTTAAAAG AAAGCTCAGTATT ATTTTT TG CATGTTTTCTAACAGATTTTTGAC TGTAATT 
TAACCCTTGCCTTATTTGTGCCTATTTGCAACrrTAGTTCTGTCCTGTCAGATTGTCACCAAG 

TCAAAAGTCATTGTTCCAAGTGTGT T ACAAC TTT CCTGT CCTTTACAATCT CTAC C CATG CCTAAAG CTAT AATTG ATATTTTTAG 

6 5 TAAAACATTTTCTTAAGAGATAAGAAAGTTGGATTTAAAAACAAAAACAAAAACCCC^ 

AGCATCGTGCAGTCAATGAGGAAGCTTCCCTTCCATCCTGTGGGGTCrCTCCATGT^ 

AGAGAGCGCATCTCTTGGGAGTGACAAGGAGCGGATCCATTTCTAACTCTTGTATCATGTTAAGTGTTGT 

ATCTTGATTTTCrTATCTATAAAGATTTTACCATCATGCT 

ATCAGCTTCCAGTCGGCCA(^TTCAGAG(^GAGC^CCGCCCACTGTCAGGGGCCTGCTTCCCCGC 

7 0 GAACCCTGAGTCAGTTTCACAGGCCGTGCTGCT 

GTGGC^GCTCCCCCACAGCTCCCCAAGGGCAGAGGTTGAC^ 
GTCGATCGCTTCTGCCTGGGTGGAGCTCTGGCCCTGACCTGAGGGAGCAGGCAG 
CAAGGGCGGCTAGGGACCCGACCCAGCCTGTGAGGAC^ 
75 GTGGCTG CCAGTG CTTTGGTT TTTCAAGGAACCAGAAGTTTG AGTT ATTTCATG ACATT^ CTTTTCITTTTTTTTTTTT 



1911 



WO 03/008583 



PCT/US01/51291 



TTGGTTTTTCGAGACAAGGTTTCTCrGTC^ 
TCCACCTGCCTCTGCCTCCTGAGTGCTGGGATTAAAGGCGT^^ 
TGGACAGTCAATCCAGACATGCCCACTG 
TCAGCCAGCCCrGGAAGGCAGCATCAGTCTCCCATAG^ 

CT CTTACAACATGG CCAATGCATCCTGGAGACAG CAT CACCT CAACGTG AGG CTCACA.GCTCX5GAGGAAAG GCTCAGCG CT CCAGG 

GGCCTCCCCACTCATCXiATACACACAAGCCTAAGTXX^ 

TCAGGCATCCTCTCTGACAAGCTTTTCnXSCGTGCTCCAAGGAT^ 

AGCTGGTTCCCAC^TCCCCAGAGAGAGCAGGCCTCTC 

CGTTAAGTCACGAAGAAGAATGCTGTCCTTGAGAAATCAAAG^ 

AGTCAGTCCCATGTCCTCCTCAGGCTGAAGGCCCTGTGTGGAGGCCTGAG 

GACCTGGCTGGGGTTAGAGCATACTGTTCCTGTGG<5CAAATG<^ 

ACGTCGGTCTCATCGCCTCGCAGGAGTCAGCTCTGCATCCAATCTGGCTGCTCTC 

TGTACCCCACTTTCTAAGATGGAGAGGATGAAGACA 

GGCCTCAGCACCACGGTCTTTACTTGCCAAAGACGAAGTCC^^ 

CTGCCCTGACCTCAGTGACTTCCATCATGGAATGCAGCCrrGAAAGC 

GCAGTTTCCTTT CCTT CTTTGTCCTrAGCTGG CTATCACCCACGCTCTTACAAAG CAAAGTX3CAGCATAGAGA CATTG CAT 

ACCTGGTGTTAAGCCCCCTCGCTCCTCCTCCTCTGGTGGCTTCT 

GAATCTGTGCCCCACTGGTTATGCAATAAATCGTCTC^ 

TAGGTGGGCTGGGGGGTTGAGATTTGGTCTTTTGTTAT 

CAAGAGAGT CC CCACTG AAGACA.TTAGGTGACCXTGTC CCCATC T C TGATTGG AAAATAG AG ATTCCAATAACGGGG CAGA 

AGAGACATAGGTGAAGTTTGGGTTTCGAGGGCTTGGGGTTGGA^ 

GCAG CCATGGGT T AGG AGGAAAAAAAAATGG CTC CG TGGGCTGG CCAAT TGG AGTT AAGAGCAG C C CAGAT AG C CGGG CAGTGG TG 

GCGCATGCCTTTTGACCCCAGCACTTGAGAGGCAGGGGCAGGT^ 

AGGACAGCCAGGGCTACACAGAGAAACCCTGTCTCGAAAACAAACAAAC^^ 

GGGTTGT CAGT AGG AAATAGAT C CTAACAG CAT AG ATGGG TAGACAACGGC CTGACTTT CAT ACTGTTT AAGGT TATGGTGAATAT 

GAGGGTTGTTTGTGTCTTTCATCCAGGAACCTAAACAACAAAGTCA^ 

AGTGGGGGCAAGCTCCCATGTGGCTGATCTGTAAGGAGACATGGTTTAAGGAGACIATG^ 

TCTGTAGCCTCTTTTGTTTTGTATTTGAGGCAGGGTCT 

CTGAACACTGGGATTACAAGCATGTGCCACCATGCCCAGCTCT^ 

TTTTAGGTCGGCCCAGGCTGG CCCC GC^CATGCGCCCTCCTTCCCAAC^GCAGTC^ 

AAC CTTAGC C C CACCAG CATGTTTTGAGAAG TGACTTGGT ACTT CAGATGAAAGG AGG AGG AGACAAAAGAAC CATTCCTGGG C CT 

CTCCAAGTGAAGTCTGAGCAAGATGAAGTGTTACATATACCCTGTGTGGCCCAGCTTGCC^ 

TTGGAGCCAACAGCTCACACCCAGAGTGCTCCTGAGCCAGGTTCCCAAG^ 

CCTGATCTGACTGCATGATGTGGGCGCTAGAGCTGTTCCCACCACCGCCGC^ 

ATTGTCCTGCTGCAGGCAGACrCroTGAGGTCCATGCACCACACAGCACT^ 

CAGTTGGCTGTGTCAGGCGTTGCTCTGTTCTTTCG^ 

TTGCCTCGCGGCCTGACAACCTGAGTTTGACTCCCA.GGTCCCACTTGGTAGAGAA 

ACGAGTGCACTGTGGCATGTGTACATCCA.CGGACAC1ACACAAAATAA 

ATCTCCATTTGACAGGAGAAGAAGCCAGCACACAAGGAGGCAAAOT 

TACCTGAGG CAGAGAAGCCACTGCGGGGGCACCACTAAGGCTG CT CAAACAAATGGCTCTCAGCTGTCAG CCTGGGGCCTCCCTAA 

CAGGGTCTGGGCTGCAAGCTCTCAGGCCGGTGCACC^GACTGCCTGGTGTCAGCCA^ 

CCACTGGGTCAGCACTGTGCAAGCATGCCCTCCATCTGTAGGGTGAGCAC^GTCCATCT 

TGCTACCCAG CAAG CCAAGGGTGTGAGGTTGTTCAGGAACGACCTCTGCCCT CTGACCTTCAACCAGACTGCAGGTCCTCACATTC 

ACTCGCTTTCTGTATACAGGCATCAGTCATGAAGCAGGGCTCCAA 

CTGTGCACACCTGGCCCTTGCTCCCCATTCT^ 

AAA71AATGTTTTAAG AATTATTTATTTTGGGGCTGGAGAGTT AG C TCAG CGGTTGAGAG CAC TGACT AT TCTTCCAAAGGT C CTG A 

GTTCTATTCCCAGCACCTACATGGTGGCTCACAACTGTCTATAATAAGAT 

ACCAGAACCACATAATCTTTTTTGTTGTTGCTTTGTTTTTTGAGAC^ 

CTGTAGACCAGG CTCGCCTCAAACTCAGAAAT CTACCTGCCTCTGCCTCCCAAGTGCTGGGATTAAAGGCGTGTGCCACTACG CCC 
AG CTCCGAAC CACATAATCTT AAAACAG CAAACAG CCCTG AT AG AGT CTTTAATGT T C CCT CT AATATGTCACAAG C CAGG C CTCA 
TCTGCACTGCTCTCAACATTCITATCTTCCAGGCrCCrrc 
GTTCCAAAGTCCTTCCACAGTCCTTCCTCAAAACACATGGTTCCC^ 
CTCCTGGTATCAATTTGTTTTAGTTAGGGTTTCTAT^ 

ATCTGG CT CACACTT CCTGATC CT AGT C CATCATTGG AGGAAATCAGGACAGG AACC TGGAGGCAGG ACCTG ATG CAG A^^ CATG 

GGAGGGGTGCTGCTTATTGGCTTCCTCCTCATGGCTTGCTCAGCCTGCTTTOT 

TCTGCGCACAACXSGGCGGGGCCCTCCCTTATTGGTCACTGC^ 

GGGGTTGGGGAGTAAGTCCACMCTCCTTCATCAATATGGGTGAGGCG^ 

GCATCGACTTAGCTAAGAACACAGAGAGCAGACIAGAATTCT 

GGACAGAGGCTCCACGGTGAGGGCATGAGAGTTGCAGAAAGCCTGCGGCCCCGTGAAGTACCAATC 

AGCCTGCCTCATGTCACCCTGAAAGGCCTGGGAAGTGTCAGGATCTGAACCTTCAGGGCT 

AGCGACGCTGCAGGTAAGCCAGAGGTGCTCGCTTGCTCTGTGGGCCTGTGTG 

AACACACAGGAATATCCTAAGAATGTGCTTTCATTTTACCCACAGGGGTAGGGGG 

TTTGCCTCCTGCTCTAGCAGAGGTATGGTTTCGCCAGCTGCAGATAGTTTCCACGAT^ 

AATGCTAGGGGCCCTGAGAAGTATGGTCAGTGGTTGGTGTCATT^ 

ATGTAAGAAATTAGATTCAGGGATCTTCTCTTCTTTCTTT^^ 

TTGGGAAAAAGAACC(^GAAAGTAGCAAAGACCAGCTGTGTC 

CTCGTGTACTTATTTACAAGTGTCTTTGTACTCCTTCTACTGT 

CCAACCCTTAAGCCATCTTGCGAAACTAATTTCTATTCACAGCCCAGAAAGGT^ 

GGTGGTGTGGCCAGTTAGGTCTAAGTGCTCAGTGCGGTCTTGGGAGTC^^ 

GCACACCCTGCCTACGGTGCCTAGGGAAACGGAGCGCTGGGCGTTT 

GCATTGTAGCAACTGCTCACAGAGCGCACAACTGTG 

CTCCACATGGCTGACCCCAAGCCAGGTCATGGAGAGCGTCGCAGTCCTGAAAGCAACT 
AAACCCGTAAGACCCAGGTGGCGGGAAAGCTTTCTGTCCATAAAGGAACAGAGCTATGT 

ACTGAAGAG AGCAG CC TAGAGGACAGG CTGTCTG CACAGAGCT TC CATT AGT CCATG CCCTTAG CTG CATCCCATCTCTATACAGA 
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ATGATCGTTAGAGGAACGGACTCAAGAGAT CTTGG ACCACTTTAAGT CAAACAGGGTC CTGTG C TACATGATGACATGC CAGT AAC 
ACCCAGCGCCCACCAGCCTAACACACTAGAAAGCAG CAGTGACAG CCTGAGGTGATACTGAGATAGAGCTTTACTTCAGTTAGTGA 
GCATTTGACGAGGGCGAGTGCTCACAAATGTGTAGTGGAAACAGAGCTGGGGGCC^ 
GGCTGAGAACCGGTTCCGTGATGGAAGGGAAGTGATACAGCCCTCAGAGCCC^ 
5 ACCGTATATTCAGCTAGAGTGGGCAGAATCTGGAATATAGGGGTGAAAGCCAACTCGAGGAAGG 

ACAAC CCGACTGT CG CCATGAGGACC CATGGCCACTTGTATTGGGATCTAACAGTGGGGACAAG CTCAT CTGTG ATTCCGAGTCTC 

ATGCCATTTCTCCAGGGTCACATGTCTCTCTGGGCACTTAATTCCCATGGGGCT 

CTAGTCAAGCCTCTGCCTT/TCTCGGCTAGCCAAC 

1 0 AAC^CCTAGGGGTAGCCCTCACAACACTTCCAAAC^ 

CACATGGGCCTTGGGGCT ACACAG CT CATAA.CTGGCAAT AACTG CAC CAAGACCTATGAG CCCTCAATAGAAT AACTTCTG AACAG 

CAATTTGAAAACCTAAACAGCTCCCTCCTCTCTGTGGCTGGTCCT 

TATTCCTGTCATGGTACTCCTGTCTGTCAGGGAAOiTGACTrCTT 

CTCTGGCGGCACTGAAGCCAAGGCCTGTCGGGGTGAGAAGGGCTCT 
15 CC CATGATGTGCCTTGACAGGTATGGCTCAACTAGCT AGCAGCACCCTGAG CAATGGC CAACACCGTACAACTG TCTGGAG 

CGGTAGTGAGTCGGTTTG CITCTCTGTGCTCAACGTCAGTCCCAAG CAATTCTGAGGTGTCCGTGAAGCTGAGGAGAGAAAGCACA 

TAACATTCCCTCAGGTCAGCAGTCTGGAACACTGGGAACCGACTCCC^ 

TGAGAAAGCCTTGACTCTGGGCCATAAATAAGGCTCTTTATGACCTGCC 

ACACACTGATGCATGCTCGGGTGACTTTGGAAATGCTGTTCCCTGTG CTTAGAGCCCTCTGTCTATAGGG CTTTGAGGCTCTACAG 
2 0 CAAATGGAAGACCCACTCTAGCCTTACGCCTCCCAGGCCCCG 

CCCTCAGAATGCCAGGGGCCAGGCGGCCCTAGTTCCAGAGAGCCrAGTCAGCA^ 

CTACATGCTCCATGGTGCCCTGCTCAGGGCTCACACATAAGGATGCTCAGAAAGAGACAAAG 

TGTAGATCTCTTGGTCACCrGGCTrCACTGGCAGCTGGGACCTGATCCGGCrrGT 

2 5 CTATCTGAAGTCAAGTCAAGGAATGACAGCCTTTGAACAGAGGCTAT 

GAGCTGAGGATGGCACCATGGGGGCTCGACACCTGGCAGGTACCCACCTTGGGCTGATAGCCCrTCT 

MOUSE SEQUENCE - mRNA 

3 0 ATGTCGAAGCGGCTCCGGAGCAGCGACGTGTGCGCC^ACTGCAACGGGCCTGATCCTTCCTGGGCGTCAGTAAA 

CATATGTGACGAATGTTGCAGTGTCCATCGGA^ 

CAACCCTGCTCCAGATGGTTGAGACCTTGTACAACAATGGTGCTAACT 

ATGAGTGGGAGGCGGAAAGCTAACCC^CAGGACAAAGTTCACCCCAATAAAGCAG 

GTTCGTCCACCGATTGCCCTGCCGCGAGGACGACAGCGTGACGGCCAAAGATCTCW3TAAG 

3 5 GGAATCTTGAAACCTGTCTGAGACTGTTGTCTTTAGGAGCCCAA 

GTCGCCTCCAAAGCAGGACAGATCTTACAGGCTGAATTGCTGGCCGAGTATC 

G ACT C CTGTTGAC TATG CAAGGCAAGG AGGGCAC CATG AGTTGG C AG AACGCCTCAT AGAAAT ACAGT ACG AG CTG ACGG AT AG AC 

TGGCCTTCTACCTCTGCGGCCGGAAACCAGATCATAAAAGCGGACAGCACT 

TCTGAGTTGGCAAAAGCTGCGAAGAAGAAACTTCAATCTCTGAGTAATC^ 

4 0 AGTTGACAGGCGAGAGACTGACGCAGTCTGGCTTGCC^ 

TTCCGGTCAACCCTGAGTACTCCTCCACACGAAACCAGGGCA 
GTCATCXlACATCCTCAGTGATGCCAAGAGGAGACAACAGGGCAGTCCrCTGTCCA 

AACAGTCAGTACCCAG CACAGCACTGAGAGC CAAGACAACGACCAGC CAGACTATG ACAGTGTGGCCTCCGATGAAGACACAGACG 
TGGAGACGAGGGCAAGCAAGGC!AAACCGGCAGAAGCTTCAAACTCTCCAGAGTGAGAACT 

4 5 AGTGCCTGTCAGGTGCAGACTGGC TCTGACCACAAGG ACACTGC AAG CCACTCGTCCCTAAAGCGACGG CCATCTGCCCGGGGCAG 

TAGACCCATGTCCATGTACmGACTGGATCAGGCCAGAAGCCATACCTCCCAATGGGAGAAGCC^ 

CGAGACTTCAACCCITCCCCACGCATATTGGGAGGAGTGCACTTGTGACCTCCT 

TCCTGGTCCAGGGACGAAAGCGCCCXaAAGGGCCTCCAGACTG^ 

CTGTGACCCTGAGCCAGACGACACGGGGTCAACCCGGAAGGGACGGCAGCGGAGCATGCTOT 

5 0 ATACAGCAGAACCCCACTCGGTCCCCAGCCCCACCCT 

ATCCAGGAGCTCCTGCGAGCAGCCCAGGAGAACAAGCATGACAGCTA 

GATGGCAGCCCTrGTTCCCCAAAAAACCCAAGTCTGACACGGTGAGGACT 

AGTCGGAGTGCAGAAAGGCTCTCCCAGGAGACTCCAGCTTACCCACAGATGTGC^ 

T ATGACATCGC CAAGG C CG C CAAACAGCTGGT CACAATCAC CAC CAAAG AG AACAG CAG CTG A 

55 

MOUSE SEQUENCE - CODING 

ATGTCGAAGCGGCTCCGGAGCAGCGACGTGTGCGCCGACTGCAACGGGCCTGATCCTTCCTGGGCG 

CATATGTGACGAATGTTGCAGTGTCCATCGGAGCCTGGGGCGCCATATCTCCCAAGT^ 

CAACCCTGCTCCAGATGGTTGAGACCTTGTACAACAATGGTGC^ 

6 0 ATGAGTGGGAGGCGGAAAGCTAACCCA(^GACAAAGTTCACCCCAATAAAGCAGAATTCATCA 

GTTCGTCCACCGATTGCCCTGCCGCGAGGACGACAGCGTGACGGCCAAAGATCT 
GGAATCTTGAAACCTGTCTGAGACTOTTGTCTTTAGGA 

GTCGCCT CCAAAG C^GGACAGATCTTACAGGCTGAATTGCTGG CCGAGT ATGGAGCTGACCCAGGCACACAAGATTCTAGTGGGAA 
GACTCCTGTTGACTATGCAAGGCAAGGAGGGCACC^TGAGTTGGCAGAACGCCT 

6 5 TGGCCTTCTACCTCTGCGGCCGGAAACCAGATCATAAAAGCGGACAGCACTTTCTAATTCCTC^ 

TCTGAGTTGGCAAAAGCTGCGAAGAAGAAACTTCAATCTCTGAGTAATCATTTGTTTGA^ 
AGTTGACAGGCXIAGAGACrGACGCAGTCTGGCTTGCCACGCA 
TTCCGGTCAACCCTGAGTACTCCTCCACACGAAACCAGGGCAGACAGAAATTAG CT 
GTCATCGACATCCTCAGTGATGCCAAGAGGAGACAACAGGGCAGTCCTCT 

7 0 AACAGTGAGTACCCAGCACAGCACTGAGAGCX!AAGA 

TGGAGACGAGGGCAAGCAAGGCAAACCGGCmAAGCTTCAAACTCTCCAGAGTGAGAACT 

AGTGCCTGTGAGGTGCAGACTGGCTCTGACCACAAGGACACTGC^ 

TAGACCCATGTCCATX3TACGAGACTGGATCAGGCCAGAAGCCATACCT 

7 5 TCCTGGTCCAGGGACGAAAGCGCCCGAAGGGCCTCCAGACTGGAGAAGCAGAACAGC^ 
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CTGTGACCCTGAGCCAGACGACACGGGGTCAACCCGGAAGGGACGGCAGCGGAGCATGCTCT 
ATACAGCAGAACCCCACTCGGTCCCCAGCCCCACCCTCCCCAGCAC^^ 
ATCCAGGAGCTCCTGCGAGCAGCCCAGGAGAACAAGCATGACAGCT 
GATGGCAGCCCTGTTCCCCAAAAAACCCAAGTCTGACACGGTGAGGACTTCCCT 
5 AGTCGGAGTGCAGAAAGGCTCTCCCAGGAGACTCCAGCTTACCC^ 

TATGACATCGC CAAGG CCGC CAAACAGCTGGT CACAATCACCAC CAAAGAGAACAG CAG CTGA 

HUMAN SEQUENCE - GENOMIC 

ACAAAACAACAGATAATACCIAAGTGTTGACAAGGATG TGAAGAAACTGGAATCT CATACACTG CTGGTGGAAATGTAAAATAGTG C 
1 0 AGCCACTTCGAAAAATTG CTGGCAGTTC CGCAAAAAGTT AAAACATAGAG TAACTACATGACCCAG CAATTCCT CTCCAAGGT ATA 

TACCCAAGATAAATGAAAACATATACCCATACAAAAACTTATAC^ 
TTGGGAGGCCAAGGTGGGCTGATCACTTGAGGTCAGGAGTTCAAGACCAGCCT 

ATACAAAAATTAG CCAGG CG TGGTGGTG CACACCTGT AATC CCAG CTACT AG GG AGGCTGAGACAGG AGAAT CG CTT AAACCCAGG 
AGGCAGAGGTTGCAGAGAGCTGAGATCATGCCACTGCACTCCAGCCTGGGCAAC^ 
1 5 CAAAAACAT AT ACACAAAT ACAC^CAGCAG CGTTACT CATAATAG CCAAAAAGTGGCGACAACC CAAATGTC CATCAACTGACAAA 
CGGAGAAACATTTTCGTCTAACAAACAAAACATAGTACATC CATG CAATGGAATATATTCAGTGATAAAAAG GAATT AAG CACCTG 
AAACATGCCAGAACGTAGATGAACCTCCAAAAC^TTATGCTAAGTAAAAGAAG 

AAATGT CCAGAATAGGCAAATCTAGAGACAGGAAGTAGGTT AG CAGGTTG C CTGGG CIGGAGGGGTTGGGGAGAAATGAGGAGTGA 
CTACTACAGGTATGGGGTTTCTITTTC^ 
2 0 GGCAGGCCAAGGCAGGAGGACTGCTTGAGCCCAGTCrGGGO 

CCAAGCGTGGTAGTGCTTGCTTGCAGTCCTAGCTATTTGGGAGGCTGAGGTGGGAG 
AGTGAGCTGTGATGGTGCCAGCCTAGGCAAGAGAGTGAGACCCTGTCTCAAAAACMCAAAAAAAC^ 

AAAATTT CTGAAATTTG ACAGT AGTGATAGTTG CAT AACT CAG CAG AT AT ACT AAAAACCATTG AATAACCTTTAAATGTAGATTA 
TATAAGTG AACT AT ATCG CAATAAAAAT CTTTTGAAAAAAT CAATCTGTATATGTTTATACATAATACAT AAAACGAATCT CTCAT 

2 5 GTACTGTTTGACAAGTTGCTTAATTCCTCCTGAGCTT 

TTCT TACAAGGGGAAAGT C CTTAACTTAG CACGGTG CTTG ACAC CACAGT AAGTGCTGAG AAGAG TTACCTGTT ATT ATCATACG C 
CCACACTATGTAGTTTCATATAGCTGTCACTTTCT CAAATTGCATT ACAGT TATTAGTG GGG CTT AG A 
ATCTTCAACTAATAAT ATGAG TTCAGAGAACAAGGAGTC CTG GAATT CACTTTTTCTTTTGAGACAGAGTCTCC C 

TCTITGTTGCCCAGGCTGGAGTGCAATGGTGCIAATCI^A 

3 0 CAATTCTCCTG CCTCAG C CT CCCGGGTAG CTGGG ATTACAGG CATGCACCAC (ZACG CCTGGCTAATTTTG T ATTTTTAGTAGAGAC 

GGGGTTTCTCTGTGTTGGTCAGGTTGGTCTCGAACTCCCGACCTCAG^ 
CAGGCCTGAGCCACCACGCCCGGCTGGAATTCACTITCGAAACTGATCAT^ 

AT AC CCAAATAAAAATACCCAT C CTCCTT T AACT AGGAGTTT C CTT CCT ATTTG CT CATTT CAG CT CATCCACTTTT CTT G CT CAA 
AAGC CTTAAAAAAAATTT ATTTTTTT TTTG AGATGG AGTCTCAT ACTGTTGC CGG G CCT AG AGTG CAATGGTGCGATTTT G G CT CA 
3 5 CTGCAACCTCCGCCTCCCAGGTTCAAGCGATTCTCCCGCCTC^ 

AG CTAATTTTTCATATTTTTAGCAAAGATGGGGTTTCACTATGTTGGCCAG 



CTATATTGGCCAGGCTGGTCTTGAACTCCTGTCCTCAAGCACTGACCCCACCTCAGCCTCCTCAATCAAAAGCC 
TTGCCATCCTCCTCCCGAGCTCTGAGGCC^GGCTGCAGAACCAGACAACTGGACACCCCT^ 
4 0 AATT AT CAGTT C CC CTGTGC CTTGGTAC CTGGCATG CAGG AAGTGCTT CATAAACT CACTGT CACCTTTAGGGGG ACT CCAG CAAG 
AGGCTGCTGGCTAGTCXrrCCTCTCTCTCAAC^CAGACAACC^ 
CCCCACGCTGAGTCAGTGGAAACAGGAATCTTGAGGCCAGTTTCCTAAGGTGA^ 

GGAGAG CAGGCrCTAGGCTCTTCAT CTTGGATGCCTAGCCCATGG CTC^GCG ATTTGAATCTGTGCCCGGGGCCTCACCTGGCCTG 
CCCCAGGCTGGAGTGAGAACCCCAGGAATGTCCTCCTCTTACCTAGGTGCTATACTTTTCACCTGCTGGCTTCCTAAAC^ 

4 5 CTGGCTCTAGAGTCAGGCCTCIATTTGCCACCCTCCACCTC^ 

TTTGCATCTGTCCCCTTAAGTGTTCATGTCTCACCATCAGTAAGAAGGCCAAAGAAGTTTAA 

ACG ACAT AAG GAG AACACAAACAGTTTCAACTTCAG CAAC CAACTTCAAGGCC C CACTG AATAT TAATGAG CTGATGGGTT AAGAG 

GCACTTTATTAAATGCCTCTGCTGATCAAATGAAGGCCTGAAACAATCT 

GAGACAGTGGGATT GGG GCTGGCTCACACACACAGAAACACGTG CTT ATCCTCTT CATC 

5 0 GCTGTTCCrGGAACAGGAAOUUUUVACCrGGAAAAAAATTCTTACAG 

ACGGAATGAGGAAACTGACAGCAGAAACTCACCAGCAAACAAAACATTAAG 
TTTTTGGGTTTTTTTTGTTTTTTGAGTCGGAGCCTCGCTCTGTCTCCCAGCCTG 
CCTCAGCCTCCrTGGATTCAAGCGATTCTCCTGCCTCAACCT 
TTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTGGTCAGGCTGGTCTC 

5 5 CCTCCCAAAGTGCTGAGATTCCAGGCATGAGCCACCGTGCCCGGACTTAAGAATTTTTTGTTTTTAAT^ 

TCCTTGCAGAAAAACTGGAATATAGAGCAAAAAAG CT CAATC CCCAGGTTTACAAATGATG ATTATAATAGCAG CAGCTACTAGTA 
AATATTGTTTCGAATTACCATATGCCTGCACTTTCAGGTCCA 
TTTACTGAAAGAAGTATAGTAGGGGTTAAGAATACAGGATT^ 
AGGCTGAGGCGGGC!AGATCACTTGAGTACAAGAGTTCAAGACCAGC^ 

6 0 GTCATTTTTAAAGTGTCCTAGCTGAGTCTTACAGTTTTTACCTCTGTACCATGTCTAGTAATAGAAAAGGAATT 

TACTAAAATGCCTAATGGTGAATCATTTAGTCTGAGGGCATGTGTCATATCTCCTTAACTCTTAAATAGTTGGTCT 

TAAGAACGATGACTCTCACAAAAGTTTATCGTATCTGGAATTCTTTTTAAATGAAAA 

ATCCCAGCACTTTGGGAG<5TAGAGGTGGGCGGATTGCTTGAGGCCAGGGGTTGAAACCAGC^ 

T CCACTAAAAATACAAAAATTAG CCTGTAATCCCAG C T ACTG AGGAGG CTGAGGCAGGAGAATCGCTGGAACCCGGGAGG CGAAGG 

6 5 TTG CAGTGACCTGAGATCGCGCCACTGCACTCAAGT CTGAATGACAGJ^CGAGACTCCATCTQVAAAAGAAAAAAAAAAAAAAAGA 

AACAGAAAAGAAAAG AGGGT AATTTTGGACTTCG GTGAG GG CT CAACCAC CTTAAATATGG CAGGTTGCAAGCACACAG CTTG CAG 

CIACCCATTCACCAAATGGAGTAATGCTAACCTTCTCA^ 

TTGACAATTATAGCACCAATTTTGAGCCATTCAAA 

CATACTAGATTTCAGGGGTAATCACAAATCCTCATCCTAACTGCTAAGTTTGTTAGCT 

7 0 GAATTGTTGGGAGACITTAAAAGCAGAGTTTCT^ 

CGGGCGGATCACCTGAGGT CGGGAGTTTGAGACCAGCCTGACCAACATGG AGAAAC CCCATCTCTACTAAAAATACAAAATTAGCC 
GGGCGTGGTGGTGCATCCCTGTAATCTCAGTTACTTGGGGGGCTGAGGCAGAAGAATCGCTT 
TGAGCCGAGATCGCACCATTGCACTCCAGCCCGGGCIAACAAGA 
AAAGAAAAGCAGAGTTTCTCAACCAAGACGCTGTCGGCTTTGGGGGCCGGAGAAG 
7 5 TAGGATGTTGJU3CAGCATAACCGGCCTCTACCCACTACCA 
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TGCCAAATGTCCCCC^GGGGTCGTGGGTGCAAAATCACCTCCAACT 
ATTAAAGCATC^AAGCACTTTAGAACAGGAAGGGATTTTAAGGAGCCT 
TCCCTGG<3GAGTTTGTTAAAAATGCGGCACCTGGACCrrCACCC^ 
CAA.CATCCTCATTCTACAGCTAAGTTAACGTCGGCCAGAGAGGGAAAATGC 
5 CTCCAAAACCAACCAAGCCCTTCCTTACAACAA^ 

GACGGTGCATTCCGGAGCCAGCCGGCCCGAGTTAAAATCCC^^ 
GTCTGAGGACCTGTTTCCTTCTCrcT^ 
ATTTAAAAAGCCCGTCGCACGGTAAGCACTCGATACGT^ 
GGATCCCGGGGACAGAGG<jCTGGAAATGGGCTCAGGGAGGC^ 
1 0 CCGCCCCGCCCCTCACCTGGCCCTGCAGCTCCI^CTC 

TOSCCCGC^TCGCAGGCCGAGGACATGGCGAGGAGGCCXSTAACCTGGTCCTCGTCGCCCGCCC 
TCCCXSCGCCCGCCTGCCTGCCCTGAGGCCC^ 

GCGCTGACAGCTCGCGGACAGACTTCTGCX3GACCTAACT 
1 5 GGGATAGGCGACCTTGGGCACCGGGTCCGGGGTCCGCGCCATTGCACCCACCTCCAGCT 

AGAACAGAGCTGAGGCTCATTCTCTTGGCCAAAGATCTCCACGCGGAAAGTTCCGTCTACCTG 

COVGTAGCCCCATGTGGTTCGGTCCACAACTGGGCCTCTTCCATTGGCTCTGAAGTCAGGGGGT 

CCATTCCAGCTGCCGACACCCTTGCAGCCTTTTTGGA^ 

AACACCGGACAATTTTTGAAGGAGGAACGCTAAACCCT^ 
2 0 CTTCATGATTTCTCAACCTCGGCACATATTGAACT 

GCCCTGGTCTTTCrCCCCACTCAATGCCAGTAGCACTCCCCAAAACATGT 

AAATCGTCCTCAGTTGAGAACCGCTTCTATTTGAACTCCGAACCGGGTTATGATTCAA 

TGTATGAATGTCATTTTTAAGCTAGCTTTTTGGATTTGAGGTTATGG^ 

TCTG CG AAT ATGAAGTTT ATTCITTG AGTTTCAGGCACT^ CAATGAAGAACCAAAT CAACTT ACAG CTAGCTA 

2 5 CrrCATGGGCTTGAACTTTCTGCrGCTGCTTCAGCTCAG 

TGAAAACATTCCCTCTGCCCTCTGAGTGTATGCACAT^ 

AT CCATTCT CCACAGATGTGG AG CCCAG AAGATTTGTGAGTTGTT AAAC CAAAACAGG ATGCAGGAG CAC CAGGATGGCAC CCGGA 
GACTC TAACCATGGAATG CGTGGG AATT CTGTGATT ATTTCTCCATCACACATTT AT CACACTGTAG AGATGGACTCAAGACCAGT 
GCTTCTGAGGCCTTTTCCAGCACTTCTCATTCTTTCA^ 

3 0 TATTGAAAATCTCCCTATCTCATTAATOVGAAAACCCCATAATGACAGGAACCACATC 

AAAT AGTACCTGG CT AACAACAGGCATT CAATATTTG CTG AATGGAGAT CATAG CTGTGTTTATATGTAAGACAG AC AAT AAGTAG 
TCATTGGTATTAGGGTCACATCTCCTTTCTCCTTCACACTGATT^ 

ATCCCATGTTAGCTATTGTTATTCTCAAATTCCACGCATATGAAGAAAAACAGTAATTAGAAAATAAT 
GGGAGCTACTATTTACTGAGGGGTTACTATCCAGC^GGTGTATCACATTTTTCCCATAATTTA 

3 5 GAATGTTTAACTTTGCTGGTCACATACAAGTTTTGTCGCATATTCTTCTTCCTC^ 

TTTCATCTCG CAACC CTT T AAAAATGTAAAAAT CATT CTT AAGTAG CAAG CCATACAAAAACAAAT CG TGG ACT AGAAGTAAACAA 
TGCATTATCTTAATTTTGAACATOLAGATTAAAACTGAAGC 

TGTTGAAGTTGTTTTTGAGACAGTCTCGCTCTGTCCCCCAGGCTGGAGTGCAGTG^ 
CCCAGGCTCAAGCGATTCTCATGCCTCAGCCTCCTGAGTATCTAG^ 

4 0 TTTTTTTTTTTTTT TTG AG ACGG AGT CTCACTCTGT CGCCC ATG CTGGAG TG CAATGG CG TGATCT CAG CTCAC TG CAACCTC CAC 

CCCCTGGGTTCAAGCTCCTGCCTCAGCCTCCCAAGTAACTGAGATTAC^ 
TAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTG^ 
TGGGATTACAGGCGTGAGCCACCGCGCCCGGCCAATTTTTG 
TCGAACTCCTGCCTCAAGTGATCCTCCTGCCTCAGCCTTCCAAAGAGTTC 

4 5 TCATATAATATATTCTTATTTTACTTTACCCCAGTAAAATGCAAGCTTCATAAGGAOT 

TCTCCAGTGGCTCGGACAGTATATAAAGCATGGTAAGTGCTTCATAAACA 
ATTGATTGCCTCAGGAAGGTTGTGGCTGCAGGGTTCTCATCACAAGATATTC 

GGAATCTCTGCCAGAGAAGAAGACCCAAATTCAATATTTAAAGGTAGGACAGATGATACTCGGGC^ 
ATGGCGACAAGGCAGAGTGCCCTTTCCGGTCACATGACCTGACACTCGCCGAAGGGC^ 
50 GCTGCAAGGCGCTGAGGGGAGAGGCGCCGCCGTCAGCGCCGCC^ 

CCTTTGCTGTCCCCCGGCOTCTAGrrCCCCGCAGGTGGGAGGTGGGAGCCATGTCGAAACGG 
GACTGCAGCGGGCCGGGTGAGTCGCACCGGCOSGGCCCGCT^ 

TCAGGTCAGG CTGCATCCCG CGGCTAGGGTCGC!AACG7WiGAGGTCCCTGGG CX5GC CI^GGGGTGGGAACCGTCCCGGG CCTGTCAG 
AGCCGGGCGGGC^GGGGGCCGCCGCCCTCCCCGTCCCTCGCGGCCCT 

5 5 CCAACTGGGGAGGTGACX3G<K^CCAGGGCTGCTCCCCCAACAGGAAGGGGCITGG^ 

AAATGTGAGCGTCCTCCCGGCTGCAGCCTCCIAGAGGCTGCAG 

CGGGGAGGATGACACTCCCTGAGACTCGCCAGAGGGTCCCCGTAGATGACTTCCCCGCTC 

GGAGAATTGC CAGGAGATCGGGGTCraCCAZ^ACACAACCAG CACTTAGG AG<5CAGTGCTTGTCTC^GTTTGAGGATCTGAACGG C 
CTGGGTTCAAACTAGATTTTGCITCTGGCTGACCTTGGG 

6 0 TGAGGATAATCGGGAGTATCACGCTTAGTGGTGAGGAAGATTCAATGAATACTCGTAGAATCA 

TG CATT ATGCTGTCTACAGTAGAACTCTTGTAGT CC CTT AAAGCTAT ATTTG A CATACGTTGATT ACAACGGAAAC 
AAACG CAAAATCAC CAGCAATATAG G G G CATTG AAACGCTAAAG C CATTCCTTCATAATG ATGCATAACTGTTCAT ACAT AAT ATT 
CACATTTAGAGATCAAGTGTGTCTTAGCAATTGCAACTGAATTGTCAGGTCTCT 
GGATGTATTGGCTTTGAGTTAATTCGAAGGTAACATGG 

6 5 AACACCACTGGCTTTGCAACTGTTCTGATCTTAAGCTTGTCA 

ACCTCATGTGGTTGTTTTATGGCCCGTACAGTGCAGGTGTTACCAGTTTCCTTTC 
AAGAAGTGTTTTCTAAACCAAGTTAGTGTGAGTTATACAAACTCCATCATTTTCTCTA 
TTCTTGCCTATTTTCAGTTGAACTTACCGTGATGTATTTGATTCACATGTGT CTAGCTA 

70 AGAACA.TTTTCTCTTCCATTGTATCGCTTTAGTTGTTGTCTCATACGCCTAATCT 

GAATATAATTAGGCTACCTTGAAATATTCTGTG AAATTGAGCAG CTGGGAATTTGTGCTG CCTTTTCTGCACAGCCCTACTGAGAG 
AATTACCTGTAGGGTTAG GG AC CCCTTTGAGATTCTGTTG AATG CTAAGGCCTTCTCCC CTAAAATG CACGTG AACGAATTTACCT 
AAGATTTGTCTGGGC^GAGTCACAAACATTTTTGAAGTTCATTTGTGGT^ 
TTACCCAATCTCATTCATACAAACATTTTTATT^ 

7 5 AAAGTGGTTAGTCGAATACTTGTCTTAATTAATATGTGTTAAATACTTGCTAAAAAAAGAA 
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AAACCAAAGCTATGCTTCAAAACAGTCTTTT^ 

TGTTAATGCATCTTTGTAGAGCTCCTTCATCrGGACATTT C CAATCTCTGCTTTTTAGGATGATACAGTAATAGCGACAGAGTTAT 
ACTAAATTGAATTATTTTGATTCIXCT CATGAT CACATCTTCTTAAATATTCTACTTTTTTTTT 

TTTAATGAATCTGGTTCATTTCAAATACATC^ 
5 AGTGGTCACATGAAATG TTCAATGTTTTAAGTATTTCCCATCAAAAAC1TTGTCCAG TGATCATGAATTTCACGT CT AGAAAATAT 

CTGAATACTGAGAAAATAATTTAGAAAAGATATAACAACTAACATTACTACT^ 
ATCCTTACTATCCCX1AGATTTGTCATTGTTCTCATTTGAACCATGGTGA 

AAGTGGCAGAGTTGGGGTTCAAACCCTGAACrrC CTTGTCTCTAGLA^ CCTCTT AAC CACCATACTCACTGGCTCC CAGGATT 
CGTCTTTGATTG AAAG CCCAT AACTTTTCATTATATGGl^ CTTAAAAAATGGTAAAGATGGTGTAC 
1 0 ATGCATTTTAGAGATTTTTGTGG ACATTGAAGAATTT^ CTATTGT CAC C 

C CATTGTGAG T CIGACG AAGTATAGCT CTTTCTTTATTTGAGGT^ C 
CAAGTATGAAATTATTCCTTGTTTCAAAGGACAGCTAATGAAGCT 
AATATTATTCATTTTTATTTTACATCTrTGCCTCTGTGAAGCTC 

GATTGGACTTGGTAGTTTATGT AAAACAATTJATTGG CTG GG CACAGTGGCT CAAG C CTGTAATC CCAGCACTTTAGGAGGTCGAG 
1 5 GTGAGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGTCTGGCCAACATGGTGAAACCT 

GCCGAGCTGGTGGCACACACATGTAATTCCAGCTACTCGGGAGG CTG AGGCAGGAGJ^TCGCTTGAACCCTGGAGGCAGAAGTTG C 
AGTGAGCCGAGATTGCGCCACTGCACTCCATCCTAGGGGAC^ 

ATTCATGG AGATCAT AG AACATTT C CTAATGGAAGAGGC CAGGT TTC C CTGTC CTCAG AACCATAG CAGCTG AGTG ATTTT ATG AT 
T CTGT AACTCG CAG CTTTCTG CTT^AG C^TATTG ACATGTTGATTCATAGCAGTGAATTAGGACAGCAGTCTACAACATTT ATO 
2 0 GAAAATG CTAT CTG CAT AATTCACTGTCAAAATT AT ATT ATCTTGG CC GGGCACAGTGG CT CACGC CTGTAATC TCAGCACTT TGG 

GAGGCCGAGGTGGGCG^aATCACCTGAGGTCAGGAGTTTGAGA 

AGAGGTTGCAGTGAGCCAAGATTGTGCCTCTGTACTCCAGCCTGGGTGACAGAGTG 

AAAAAAGT ATTATCTTGATTTTT ATGG AG ATGG ATC CAAAATCkT^ CAATTTGCTTTCT TAG CTTG AATG AA 

2 5 AAAG C^TGCAT CTG AAATG GTAGATCCACATTAGGGCTACAA^ CCAAGG C 

CATTGATGAGCATACTGAAACAAAAATGTCTCCTTTTTCTGAC^ 

CCTGCTGGTATCCCTGCCACTGAGATTTCACTCTCTTTCCAGATCCTTCCTGGGCATC^ 
GAGTGCTGCAGTGTCCATCGGAGTCTAGGGCGCCATATCTCCCAAGTGAGGCATCTGAAA 

3 0 TCAGGTAAAGGATAAGAATTCTCCTGACAGTTGGTAATTTACATT^ 

ACTTCTT C CAT AAGACAAAACTGTT TCAT AATTGAATAGAGATGG CATGC C CAT C AATGCTATTTACAAATATTAT CTGT ATTGAA 
TT CAGTAAAGACTTTT CTTCCT ACTTGAC ATAAAT CTTGTT CAACAAG CT CAATTAT AGGTAGATAG ATG AATAGT CATTAAAAT A 
TCTTGGTACTGTTGCTCIGTTGACTTTCTAGACTGTCAT^ 
CCAGGCTGGAGTGCAGTGGCX3TGATCTGAGCTCACTGTA 
35 GTAGCTGGGATTACAGGCACCCGCCATCATGCCCGGCTAATTTTTGTA^ 
TGGTCTCGAACTCCTGAACTCIAAGTGATCCACACCTCAACCrCTCT 
CAGTAGTCCATCTTTTCTATGCATGATAATGAATGTTTCT 
GTACCTGCTAATATTTTTAGAACATGTTTTTCTGC^ 
GTTTTCTTAAAGCTCTTGTTACTAGTCAGAGAGACCAA^ 

GTTAAGTGTGTGTrTTTGCCTTCATTTG 

TGTGAACC^GGAGCCAAAACAAGTGGGTTTGTATCTCrrGATGTGAAAATAACTACACT 

AGGAGTCATGCCTTGAAATGAAATTTGGAGATATGATTTATTACTAACTTTTGCTTAGAAG 

AAGGGGCTGTTTACTGCTCTCAGGTCTGTCTGTGTTTGCAAAGCC^ 

4 5 GTTTAGAATG CCAGGGTTG ACCCTAAAG CAACAATTC CCCAAATGTGG AAC CAC CAGTGGT ATT CAAG ACAG ATGTAGG T AAT ACA 

TGTTTTTTTTAATGGTTACATTATTTTACTGTGTATGGGAA^ 

GG CTTT AAAG ACAAAT ATT AAGTG AATAATAT AGACAGTACATG GGT(^TT AAGAATGTTGAAAATAGTGTTTAATT AG GTT TG GG 
AAACCCTGCTCTAAGGTGTC^GAGTACrTTTTTTTCITTCATA 

CTACTAGTCAAAGCTGAATCCTGCATATAGATTGATCATCTTTCTCATTCTCAAATC^ 

5 0 A(ZAAATGAGTGTTTATAACAGGACTTTGTTTACTCTGTTCAGTCACTGATCCCCATTGAATATTTGTC 

ATGTCATCCTGCTTGGTTTTAGATGGTTGAGACCTTGTATAATAACGGTGCTAACTCTATATGGGAGCATTCT 
CGTCTATTATGAGTGGAAGACGTAAAGCTAATCCACAGGATAAAGTAC^ 
AACAAATGGGGCCCACTGAAAGGAGTGCAGCTTATTTTTGGTCAGTCTCCGC^ 
ATCAGGTTCTTTGCTATTAGAAATGCAGAATGGGGTGGGGCAGGACCATCTGTTTAGC^TGGCCAC 
5 5 CCTCAGTTGGAAACCATGGCTTTGGAGTCTGTAAATGTAATCAGATT^ 

TAGGTCTGGGCAC CCGTCTGCCTGTTGCCTCACrTATAAGATGGG ACTAAATG CTGCTTTCTCCCAAAGTTGCTAAATCAGTTCAG 
ACAATT ACAC AGAG AAGAGGGCT ACT ATAAAAAG CAGGT ACT CTGGAGT CCCAG CCATT AT AATAACTT AATGTATGTTTT CAAACT 
GCAGGTCTTAAACCTTGCTGAAACTTACTTGAGCATTGTTCATCT 

ACAGTACAAAGAATGGGGATTGTTGTATCTGAACGAGTGAACTGTTTTCTTTTTTGTTTTTGAACCAGT 
60 CATCAGAGCCAAGTATCAGATGTTAGCGTTCGTCCATCGCTTGCCCTGCCGGGATGACGATAGTGTGACT 

AGGTGGGTCACIACCTTTTAAAATCCCTAAAAGAGCGGGCTGAGACAGCATATTGTTTTATTTG 

TGTGGGCrTTTTTTTTTTTTTTTTTTTTTTGAGACA 

CAGCCTCTGCCTCCCGGGTTCCAGCGATTCTCCTGCCTCAGCCTCCTAG^ 

TAATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGT^ 
65 TCX3GCCTCCTAGTGTTGGGATTGCAAGCGTGAGCCACTGCAC^ 

GGTCTTGCTCTGTTGCCC^GCTGGAGTGCAGTGGTGCAATCA 

CACCTCAGCCTCCCGAGTAGCTGGAACTACAGGCGCATTACC^ 

CTATGTTGCCCAGGCTGAAAAAAGTrTTTAAAGAAA^ 

CAGTCGGTTTCGTCTTTAATCTGAATTCAAATGGCAGCCIATTCA 
7 0 AAGGCTGTTTGTGTTTTTCCAATATGGCTTTTAAATTATATTTTGATAA 

CAGAGTTGTTTCTAGAAGGGACTAGAAAGATTATAGGACAATTGTTTCTTTTTAATAAGGACTAGAAA 

ACTTCTTCAGGTTTCCAGGTTGAACAGTAAGTATGATC 

TGATACTTGTAAAACGAGTGAGGTGGTTTTCTTTATGTGGGC^ 

CAAAG CGCTG CCTTGG ATAAATG CTTACT CACACAGAG CATGAGAACTT ACT TATTCAGAACTTTTTTTATTACTTTTTGG CCGGG 
7 5 TGCGATXK3CTCACTCCTGTAATCCCAGCACTCT 



1916 



WO 03/008583 



PCTYUS01/51291 



GCAACATGGCXSAAACCCCGTCTCTTVCAAAGAATATAACAAT^ 
AGGCTGAGGTTGGAGGATGACTTGAGCCCAGGAGGTGGAGGTTGCAGTGAG 
AGAGCAAGACCCTTGTCTCAAAAAAAGAAAACATTT^ 
TATTTATTTATTTATTTATTTTTATTTAT^ 
5 CTCACTGCAAGTTCTGCCTCCC^GTTCATGCCATTCT 
CCCGGCTAATTTTTTTTGTATTTTTAGTAGAGAC^ 

TCCCGCCTCX3GCCTCCCAAAGTACTGGGATTACAGGCAGGAGCCACCGCGC 
ACATTTATTGATTTGTGTATGTTGAGCCAGCCTTGAATCTCA 
TATTTTTATTTTTTATTTTTTATTTTTTTGAGA 
1 0 CTACAACCTCCG CCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCC CAAGTAGCTGGGATTACAGGCACGTGCCACCAGGCCC 

AGCTAATCTTTTGGTTTTTTTTTGTTTGTTTC 
AGTGGCGCAATCTTGGCTCACrTCAAGCTCCGCCTCCCGGGTTCA 
AGGCGTCCGCCACCACACCCGGCTAATTTTTT^ 

CCTGACCTCGTGATCCGCCCACCTCGGCCTCCCAAAGTGCTGGGATTACA 
1 5 TTAGTAGAGACGGGGTTTCACCACGTTGGCCAGGCTGGTCrPCGAACTCCTAACCTC^ 

CCGGGACZTACAGGCCTGAGCCACTGCGCCCAGCCATGGTGGATTAACTTTTGATGTGCT 

TAAGAGTTTTTTAAAAAATCTGTGTTCATCATGGATTTTGGGCTGAAGTTTTCCTTTTTTOT 

CAGGCTGATGTTGGCTTCACAGAATGAGTTAAGGAGGAGTOTCTCCT 

TTGTATGTCTGGTAGAATTCAGCTGTGAATCTCTTGAACTGGGGAGACGAAG^ 
2 0 AGCCTAAACGACAGAGTCTCACTCTGTTGCC 

CTCAAATGATTCTCCCACCTCAGCCTCrCAAGTG^ 

GTTTTGTTTTTGTTTTTTTTAGATAGAGTCTCTGTTACC 

TCCTGGGlTTAAGCGATTCACCCGCCTCAGCCTCCTGAGTAGCrTGGATTACAGGTGCCC^ 
GTATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCO^GGCTGGTCTTGA 

2 5 CCAAAGTACTGGGATTAC^GGCATGAGCCACCGCGCCTGGCCAGTTTTTG 

TACTGGTCTGAAATTCTTGGGCTTAAGCAGTCT^ 

GGCCTTTTTTCCCCCCCCrrri'TTAATCCC CAG CAGAATCTGGCTG AAGAATT CAG AATTTGAAATTTTG CATAGAAATGCATATA 
CTAGCTACTTCCATTGATTAATAAGACTTGGTATGTGTTGGGCT 

TT AGG CACAGTGGCACAT CCCTGT AATACT AG CTACTTGGG AGG CTGAGGTG GAAG GGTTG CTTG AGCCAAGGAGTT CAGTCCTGC 

3 0 CTTGGGCAACATAACGAGACCCTGTGTTTAAAAAACAAAAAAACTATTTAAAAAATTAAA 

TCCTGTGATAAGGGTTACTGCATCTTGGACAATTCTAGGTTAGATTTGAAGGA^ 

TCTG CAGT AGTACAG AATT CGGTAT ATCACGGGCCTTTT CCTAC CCTG AAAG ACTT T ATT AAAGCATATT ATACAGG CCAAGAATA 
ATGTGGTAG AGAGGG TATT GTATAAGGCAGGATCCTGT AGCTT ACAC^GTGTAGAATT^ CTAAT TACAGTAA 

CAACTAATAATAATAACTrGTTATTATAGCATTTGTATT CTTTGAG CAC TTT ATTAT ATCACTAAAT C CTTAT AATAACCCTGTGAA 

3 5 TAGGGCCAGGCACGGTAGCTCATGCCTGTAATCCCAACACTTTC 

ATCAGCCTGGGCAACATAGTGAGACCTCATCTGTATTGAAAAATAAAAAGTTAGCAGGGCATG 

CTGTTTGGGAGGCTGAGGTGGAAGGATTGCTTGAGCCCAGGATACTGATGGCTG CAGTGAG CCATGATCACTCCACTGCACTCCAG 
CCTGGATGACAGAGC^GCGAGACCCTTTCTAAAAAGAAAAATGCCACCTATTTAA 

CAGGGTCTCGCTCTGTCACCAAGGCTGGAGTGCAGTGGCATGAT CACAGCTCACTGCAG CC TCCACTGC CCAGGCTCAGGTGATCT 

4 0 TCC CACCTCAG CCT CCTTAGTAGCTGGGACT ACAGGTGCX5 CATG C CR.CCACAC CCAACTAATTTTTGTATTT TTTG TAG AG C CAGA 

GTCTTGCTATGTCGCCCAGGCTGGTCTCAAACCCCTG 
GCATGAGCCACTX3CATCTGGCCATTGTTGACATTT1TGAGATGAGGA 
ACACIAGCTTGAAGTGGGTATAGTTTGAAATTCAGGTCTGTCGTATGTCA^ 
CTCACTCAGGTTCTGGTTGTGCCACirCTTAGATAAGTCAT^ 

4 5 AAAGTGGCCTCACTCCACCTTACAGCCCAGGGACAGTTGTGACTGGCAAAGGAGAGACTGGTCCA 

AAGATTATGCGTTACrTGATGTAACATATATTAAAATGGTCTATCAGCTGCCCTAGGACATCATTCTTTACAC 

CCTTTTTTGTTTCTGCGTTTTTCAGCA 

GAGCACAAGCCAACTTCTITCATCCTGTAAGAAA^ 

TATTTAATGTGCTCTTTTGCCTACrTTTTATTAATCATTCTTAGGA 

5 0 CAGATTTTACAGGCTGAATTATTGGCAGTATATGGAGCAGAC^ 

AAGGTAAGAGACCTTAGATTCTCGCrACTTCTCTCTGGGATTTTTTTGTTG 
TTGATATTGAACCCCTTTAAAGATGTACGTTATTTTATTTTACAGATGAAGACATTTC^ 
TTGAGGAAGGCTAGAATCATGAACC^GTGTGCTAAGCTTCCAGTCTTTTACTTG 
GTCAGATTCCAAATCAAGTATCAAGAATTACAGAATGTACT 

5 5 AATATATACATAGATATGTGATTAAACAAGTATAGTAAA 

GGCCAGGCATGGTGGCTC!ACACCTGTAATCCCAGCACT 

AGCCTGGCCAGCATGCTGAAACCCTGTCTCTACTAAAAATACAAAAAAAAAAATTG^ 

CGCTACCCGGGLAGGCTGAGGCAGGAGAATCACTTAAACCTG 

ATCTGGGTAAGM3GGCAAGACTCTGTCTCA 

6 0 ATAAAGTCCTTG AAAAATAAATCT ATAGGTTAGTCTTGAAT ATCAT TTAAATGGCCAAAGAT CAGT CAT CTTT CCTT CTTTTTCCA 

AAGTACTTTTATTAACTGTGAAGCACAATACTCTGTTCCACAGA^ 

TACT (rTCTTGGAAAAGATG ATTCATAAATG CCTG AAAGAGTT CAAGGAGG CAAG ACCAT ATGGGAT T CATTGC CAAAAT AGATAAT 
TGGTCTTAAAAAGCATATATAGGGTTTCTTCATTGTGGTCCAAATCAT^ 

GATTTGAATGTGTGAAG GGAGAG CTGAAGCATTCCAG GTAAGAGG AAAGGCAGGAGTCAAG ATCAGG AG G CAG GG AGGG ACAAAGT 
65 GTGTTCAGCAC^GCAGGAGACCTGACTGGAAAGAACAGAGTGGGTCTG 

TTATTTCACAGAGCACT AAAAAAG AAAAGAGTTCAG CCAATT AAATTATG GTGT AT CATCAATTG T AGGACATT TCCTGATTT CAC 

AGATACTAAAAAATGAAAATAAGATGTGCCTCTCACAAATCA 

TGGGGAGAGAAGCTGGCTGAAACTATCAGGTTGGTGCAAAAGTAATTGC^ 

ATATACTTGATCTTGGACGAGAGCCCTGGGTAAATGATACTTATTATCACTTACCTTGGAATTTTCGCA 

7 0 ATAATCTCSATATAATTTGTATCCTTGTTGCTGAAAGGATT'C 

TAATATTTTCTGTTGTGCAAATTGAAGATAAGGAGGCAGAAGTAGCTGCA 

TGATTTGAAGCCTTTGAAGTGGTTACTAGACTCTTATCCAGAGT^ 

CAACATATTGGCCCCTGGCCCCATCCTGTCCCCA^ 

GCGC TAGT T AATTTTTG AG AACTT CTACCGTAAAC CATTG ATT AATTT CATTTT TTT ACCT CATTAAATTAAGTCATT CTGTATC C 
7 5 TTCCCTCTCATAAGTCTTTGAATTTCTTATTAAATATTCTATTTTG^ 
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AGTTTATAAAAGTGGGATGCAAAGTAGACTTC^GAAATCTA 

ACTGCT CAGGTGTCTCAGGAGACGTCTTTGTACrCCATTTTAAAGTCAGTG AATAG C 
TTGGTAGG CAAGGAGGGCACCATG AGCTGGCAGAGCG CCT CGTGGAAAT ACAGTATGAGCTAAC^ 
TGTGGCAGGAAACCAGGTGAGTGGATGGAGAATGACCGTTTCACATCT 
5 ATTTGTTATTACCTCCTCTGCrGTAACATATCACTGAGAAAGCrrATA 
CAGTTAC^TGTAAAAGATATAAAGTTGGACAAA^ 

T CAGACTG AATAATCAAAGCAAAATTGAATGGGT ACCACAGT TCTTG AACATGGTGGGAC TAGGAATGTAGGAGCAAAG CAATAAC 
CTTTGTAGTTATTTGAAAAATTCTTCTTAAGGAATGGAAAAA 
ATAGGGAACAGGGGTCAGCAAACCATGGTTTGT^ 
10 ACTG CACTTCAG C CTAGGTGACAG AGTCAGACTCTGTCTCAAAATAAAT AAATA 

CGGGTGCAACGGGTCATGTCTGTAATCCTAGCACT^ 

CTGGGCAAG AGGGGTG AG ACT CCGTCTCCACAAAAAAG'i.'T ITTl'AAAAT^ATCAG CCAGGCATGGTGGTGTGCG CCTGTAG T CCCAC 
GTACTTGGGAGACTGGGACTTGAGCCCAGGAATTCAGGGTGGCAGTGAGCTA 
GACAAAGCAAGACCCCCAACTCrCTAAAAAATAAATGAATAAAGTTTTAT^ 
1 5 T ATGGCTG CTTTCACATAACAACAGCAG AGTTGAAT A C CAGATGG C CACAAAGCCTAAAAT ATTTA 

CTATTTGGCCCTTTAAAGAAAATATTGGCCAACCCCTGATC 
TATGATAGAGTTAAGAGCCTCTGGTATAGTTTCAGTTCT 
ACTAACAGACCTGGACCAAATGCTTGGAGTAGCTAGCT 
GTGTCATCAGCTGGTGGTGAGTGTCCTGTTTGTCnTACTTCTGATA 

2 0 GTGCAAAAAAGCCTTTTTCTGCGGGGTGCGGTGGCTCACGCCT^ 

GTCAGGAGATCCAGATCATCCTG<3CCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAA 
GCGCCTATAGTCCCAGCTACTCGGGAGGCTGAGGCAGAATTGCTTGAAC^ 
ACTGCACTCCIAGCCTGGGTGACAGAATGAGACTC 
ATGGGAC3TCTACAGGACAGAGAGGAAGGGGATTCCCCT 

ATCCTGATTGCTTGTCITTTTCTCrCTATTCTCTCACCATCTG 

AAAATATGAACATTTTTGCAT AAAGTTTAAAGAAGG C CCAGAGTCTCAT AACAAAACATTCACAGTG TCT AG GAT AT CAT CCAT AA 
TGACTTGACATCTGAGTAATTCTCATGTACTTCCCAGGG^ 

T ATCAGACAAAG ATTTTAAAGCAG CTAGT AAGAC CATGCT AT AAGAAGT AAAAGCAAACATTC TT G AAACAAACAGG AAT AT AG AA 

3 0 AGTC C CATCAGAGAAAT AG AAGATACAGAAGAG AAC C1AAATAGGAAT ATTAGAACTTG AAAAAT AC AAACACTT AAAAA^ T AAAACT 

T ACTGGATGGG CTT AATAG AAG AAT AG AG ATGACAAAG AAAG T CAATGAATTGAAAACAGAT CACTAG AAATTAT CTGGTTT AAGC 
AACAGAG AGGAAAAAAGAGTAGG AAGAAAAG ACTG AACAGAG T CT CAGGGAC CTGTGG<5 AAAAT ACTA7y\ATGTCTGACATTT ATG 
TCCTTGGATTCCCAATAAGAGATX3AGAAAGAGTGGTGTGTGCAGAAAAC 
TTTGGGACAGGCTCAATGGCT<2ACACCTGTAATCCC^GCIACT 

3 5 GACCAGCCTGGCCAACATGGTGAAACCCCATCrCTACTAAAGATAC^ 

GCTACTCAGGTAGCTGAGGCACAAGAATCACTTGAACCTGGGAGGTAGAGGTTGC^ 

CCTGGGCAACAGAGCGAAACTCCGTTTCAAAAAAAAAAAAAAAAAAAA 

TCAAGAAACTTAGAAAATTCCAAAGAAGATAAACTCAAATCCAC^ 

AAGAAAACATCTTG AAAGCAG CCAG GAAAAAGTGACACATAACAT AT AT AG AAAAACAGTGATT TGAATGATTGTGGATT TCTTAT 

4 0 CAGAAACCATAGAGATGAGAAGGAAATGAAATAATACTTTGTTTGTTTGAGATGGAGTATCTCTCT 

GTGGTGCAATCTTGGCTCACTGAAACCTCCGCCTCCCAGGCT 

CGTG CCTGCCACCATG CCCGGCTAATTTTTGCATATTTAGTAGAGATGGGGTTTCAC CATGTTGG CCAGGTTGGTCTCGAACTCCC 
GACCTCGTGATCCGCCCACCTCGGCTTCCCAAAGGGCrGGGACTGCAGGCATGA 

T ATT ATTATTATTTTG AGACAGGGT CT CACTCCATCG CCT AG GCTGG AGTG CAGTGG CGTG ATCTCGACT CACTGAAACCTCAACC 
45 TCTTGAGCTCAAGTGATCCTCCCACCTCAGCATCCACAGTAGCT 
TTTTTTGTAGAGATGGGGTTTCACCATGTTGCCCAGGCTC 
ATTACAGGCATGAGCCACTGCTCCTTTTTTATTATTATTATTTTT^ 
GCTAAAAGAAAAGAGCTGTGAACCCAX3AACTCTATATCCAATGAAAA 

G AAAAAAAAGAATT ATT CTTTACTTTAACAGAAATG CT AGJ^ G G AAT CTTG 

50 GAACTTTAGGAATGAAAGAAAAACAACAGAAATGTTAAAGATGTGGCAGAGTAAGACCCT 
AATTGTCATACATATCTTAAAGCTGTAACATTGTGAGCC^ 

G CAGG CGGATCAC CTGACAT CAGG AGTTTG AG AC C AG CCTG ACCAACATGGTGAAACCCTGTCT CTACT AAAACTACAAAAATTAA 
CCGGGCATGGTGGTACATGCCTGTAAACCCAGCTACTCAGGAGGCTGAG^ 

AGTGAGCCGAGATCGTGCCACTGCACTCCTGCCTGGGCGACAGGGAGACTCCATCTCAAAAAAAAAAA 

5 5 AGGCGCG<3TGGCTCACX3CCCX3TAATCCCAGCACT 

GCTAACACGATGAAACCCCGTCTCTACTACAAATACAAAAAATTAGCCGGGCGTGGTGGTGGG 

GAGGCTGAGGCAGGAGGATGGCX3TGAACCCGGGAGGCGGAGCTTGTAGTGAGCTGAGATTGCACCACTGCACTCCAACCTGGG 
CAGAGTGAGACTCCGTCTOtf^AAAAAAAAAAAAAAAAATTATCTAGTAGGGTTTTCA^ 

TAT AATATAAAAGGGG ATAG GGTAAAAAATAT AATGGTAAG ATTTCT ACCTTCTACTGGAAATGTAAAA TTCT AAGTAGA 
60 CTGTGAAAAGTTGTGTGTGTGTGTGTTTGTGTGTGTGTGTGTGTATAGTAATTTCTACCATAACCAC 

AGGCACGGTGTAGTGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGG^ 

CAACATGGTGAAACCCTGTCTCTTCTAAAAATACMA^ 

GGTGAGGCAGGAGAATTGCTTGAACC(X>GGAGGCAGAGGTTGCAGTGAGC^ 

AGTGAGACTCCATCTAAAAAAAAAAAAAAAAAAAGACTACAAAGATATC 
65 T AAAAATT ACT CAAATCAT CAAAAG AT ATTTTGATT CAAAATAT T CAAAAGAGG AGG GG GAGAACAAAAAACTAAGGACCAAATAG 

AAAATAAATAATAACATTATAGACTTAAGTCCGTACATATCAATAGTTACATTAAATGTT 

CAGAGATTCTCAGAATGGATTAAAAAACAAAAGGGGCTGGGAGCTATGGCTCACAC 

GCAATTC TCCTAATCGCTTAAG CAT AGGAG TTTGAGACCAG CCCAAG CAACAG AG CAAGATCCT T TCACT ACAAAAAATTTTAT AG 
AAAG CAG CCAGGCATGGTGGTG CATGCCC^TAGTCT(ZAGCTACT(^GGAGGGGCTGAGGTGGGAGGATCACTTGAGCTCAGGAGG^ 

7 0 CAAGTCTGCAGTGAGTGCIX3ATCATGCCACTGCTCTCCAGCCTGGGCAACZAGAG 

CAT AAGG CCTAATATAT ACTGT CT ACAAGAAAC CCATTTT AAATATAACAAT AT AGGTAATAAG TTAAAAAC CAAAAGTAT AAAAA 
GAT AT AC CATGTAAAG G CTCAG G AAAAGAAAGCTAG GGTGG CTATATTAATATCAG ACAAAGTAG ACTTCAG AGG AGGG AATATT A 
G CACGG ATAGAGAGGGACATAAT ATTATCATAAATGAGT CAATT CATCAAGAAG GCAT AATAGTTCT AAATGTGTACACATCCAG C 
AACAGACCTTCAAAATACATG AAG CAAACTTTGATAGAACT AAAAGGAG AG AAG GT CAAAT CTACAATT ATTGTTGG AG ACT CTAC 

7 5 AC TCAT C TTTT AGTAATAGATAG AAC T AGTAGAT AGAAAATCAGT AAGGATATAGAAC TAAACCAGACCAAAACCCAACTAGAT CT 
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AGTTGATGTTTATAGACTCTTCTACCTG 

ATATCCTGGGTCATAAGACAAACCTTAGCTAATCTAAAAGAACTAAAACT 
CTACAGATAGGTAATA(2AAATAAAACTGG^AATACCCAAGTGCTT^ 
CTCX5CGCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATCTTO 
5 CTCAGCCTCCCT AGTACTGGGATTACAGGCACGTGCTACCACACCCAGCT CACCA 
TGTTGACCAGGCTGGTCT (XAACTCCTGGCCTCAGGTGATCCACCCAC CTTAGCCTCCCACAGTG<nXlAGATTACAGGTGTGAGCC 
ACCGTGC CCGGCCCTGAACAACAGACTTCTAACTAATCCATCGAT CT AAGAGGAAGTCTT AAG GGAAAAAATATT TTCCACTG AAT 
GAAAATGAAAATG CAG CATATCAGAATTTCTGGGATGCAG CCAAAG CAATG CTTAGAGGAAACTTTATAGTATTAAATTCTrTACCT 
TAGAAAAGAAGAAAGGTCTCAAATCATTCTCATCACTATTTT^ 
10 TGTGATCrTGGCTCACTGCAACCTCCCCCTCCCAGGTTCAAGTGATTCTC 
TGCAGCACCATGCCTGGCACATTTTTGTATTTT^ 

TCAAATGATCTG CCTGCCTCAG CCTCCCAAAGTGCTGGGATTACAGC^GTGAGACACCGCGCCTAGCCT 
CTCCTCCCTTAAGAAACTTTAAAAAAAAAAAGAGCAA^ 
AATGAAACTGAAAGAAGGAAACACTAGCGAAAAATCAATGAAACTC^ 
15 TGGTTTT TCAGAAGGACT AGTAAAACTCATAAATGTCTGGCAAT ACTG^ CACT ATAA 

ACCC CACAGACATGGAAGGAT AATTAAAAAAAAAT ACAAAGCAGACC C CATGTAAT CC CAG CACTTTGG 

GAG G CTGAGGCAG ATAGATCATTTGAGCTGGGGAGTTAG AG ACCAGC CTGG CCAACATGGTGAAACTCCGTCTCTACTAAAAATAC 
AAAAATTAGCTGGGCATGATGGTGGGCGCCTGTTATCCCAGCTACTT 

AGAG GTTG CAGTGAGCTTGAGATTGTGCAACTGCACT CCAG C CTG CAGACTC CATCTCTAAT AAT AACAAT AATAA 

2 0 TAATAGTAATACAGACCAGGCATAGTGGCTC^CAGTGGTAATCACAACACTTTGAAAGGCCG 

AGAGTTTG CGACCAGCCTGGG CAACAT AG CAAGACC C CAT CT ACAAAACTGCAACAACAAAAAAATCAGC CTAG C CTGATGG CACG 

TGCCTGTAGTCCCAGTTACTCTGGAGGCTGAGGTGAGAGGATTGCTTGAGCCCAGGAGGTCAAGGCTACAGTC 

CCACTGT ACT CTAGTCTGGGTGACAG AGTGAGACCCTGTCT CAAAAAAAAATAG AG AAAAATGCAACATAAC CAAGTGGG ATTT CC 

AGATAGGCAAGACTGATTTAATATTTGAAAATCAGGCCAGGTGCAGT 

2 5 TGGGTGGATCACCTGAGCTCGGGAGTTTGAGACCAGCCT 

GGGCATGGTGGTGCATGCATGTAGCTACTTGGGAGGCTGAGGCAGGAGAATCAC^ 
TGAGATTGCGCCATTGCTCACCTGCCTGGGCGACAGA 

AAGTTAACAGTAAAAACGAAAAACTATGTAATCATATCAGTTGATGCAGAAGAAGCATTCAACAAACT 
TTAAAAAAACTCTTAGCAAACTAGGAATAGATGAGACTTCCTTGATCTGAT 

3 0 ACGTGCCTGTAGTCCCAGTTACTCTGGAGGCTCAGCTGAGAGGATTGCTTGAGCC 

GCACCACTGTACTCTAGTCTGGGTGACAGAGTGAGACCCTGTCTCAAAAAAAAA 
TCCAGATAGGCAAGACTGACTTAATATCTGAAAATCAGGCCAGGTGCAG 

AAGGGGGTGGGTCACCTGAGGTCGACTTCGATACCAGCCTG<3TTAACATGGGGAAGCCCTGTCTCTACT 
CTGGGCATGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGTTAGGAGAATreCTTGAACC 

3 5 AGTGAGCCTGAGATCTTGCCACTGCACTCCAGCCTGAGCGACAGAGTGGGACTCTG 

AT AC TTAG GT AT AAATC TAATGAAATACACAAT ACATGT AG T ATC TATATG C C AAAAACTC CTGATGAAAG AAATCAATGAAGAC C 
TGAATAAGTGG AG AG ATGT ACTGTG CTT ATGGAT ATG AAG ACT CAAT AACATGT CAATT CTC CACAAGTTAATAAATG T AT AAATT 
TACTTCAATTGAGGATTTTTTATTAGACATAGACATGCTGATTTT 
ATCCCAGCACTTTGGGAGGCCACGGCGGGAGGACTACTTGAGCCCA 

GCCCAGGAGGTTGAGGCTGCAGTGAGCCATGATCGTGCCACTGCACTCTAGCCTTGGCAAC^ 
AATAATAATAATTTAAAAATTCAAATGGAAGGGTGAAGGAACTAGAGTAGCC^ 
GTACAGCACCCAATTTTTCAGAATTGCTATGAAGCTGTACAGTTCTCA^ 
TTGAAACAGTATAGAGAATCCAGAGATAGATCCACACAAATATGTTCAATTGATT 

4 5 TGGTGGAGGAAGGATAGTTTTCTGTTGTTCTTGTTGTTTTCTGTCTTTTTAAGACGGAGTCT^ 

CAGTGCTG^GATCTTGGCTCACTGCAACCTCGGCCTCCCGGGTT CAAGTGATTCTTCTGCCTCAGCCTCCCGAGCAG CTGGGACTA 
CAGGCGCACTCTACCAGACCCAGGTAATTTCTTGTATTTT^ 

CCTGACCTCGTGAACTGCCCGCATCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCC^ 
TTAATGCATTGGGA(MCCGGACATCCATGTGCAGGAAAGTGAACTTCAACCT 

5 0 TCATAGATCTAAATCTAAAACCTGACACIGTAAAACCTCTAGAAGA 

GCCAGCACTCTGGGAGGTCAAGGCGGGTGGATCACCTGAGGTC^^ 

CTACT AAAAAT ACAAAAATTAG CCAGGCATGGTGGTGAATGC CTGTAGTC C CAGCTACT CAGGAGGCTAAGGCAGG AGAATCG CTT 
GAATCCAGX3AAGCGGAGGTTATAGCGAACCGAGATCACACCACTGCACTCCAGCCT 

AAAAAAGAAAACATAAGAGAAAATCTTTCTGATCTTGGATGAAAGCTTCTAGCTATCAGCTGAGTATGGTGGCTC^ 
55 CCTAGTACTTTGGGAGGCCGAAGTAGGAGGATCACTTGAGCCCAGGAAGTCGAGGCTGCAGTGAGCT 
CCAGCCTGGGTGACAGAGCAAGACACTGCCTCTGCAAGGGGGAAAAAGAAACC 
AGAAAATATTGATAAATTATACTTCATCAAAATTGAAAATTTTTGGCCAGGTGCAGTGGCT 
GAGGCCAAGGCAGGTGGATCACCAGAGGTCTGAAGTTGAAGACCAACCTG^ 
AAAGTTAGCCAGGCATGGTGGTGCATGCCTGTAATCCGAGCTACTTG^ 

6 0 GAGGTTGCAATGAGCCAATATTGCACCATTGCACTCTAGCCT 

AAAAGG AAAAGTTTTGTT CTGAGAAAACTAGCTAAGAAAGTGAAAAG AC CAGCAAC AGACT 

ATCACAT AT CCAG CAAATTACT CATATTTAGAAT ATTC AAAGAATTTT AAAAACTCAGG CC GAG CACGGTGG CT CACGC CTGT AAT 
CTCAGCACGTTGGGAGGCTTAGGCAGGTGGATTACCTC^GCCCAGGAGTTTGAGACCAGCCT 

CTACT AAAAATATAAAAAAATT AGCCGGGTGTAGCAGTG CAC G CCTG T AG TC CTGGCT ACT CAG GTGG CTGAGG CATGAGAAT CG C 

6 5 TTGAACCC!AGGAGGTGGAGGTTGCCAGTGAG CCGAGATCCACGCCACTGCACTCCAGC CATGGGGGGCAG AGTGAGACCCTGACTC 

AAAATAAATAAATAAGCTGGGCACAGTGGCTCACACCTGTAATCCCAGCACTCT 
AGGAGTTCAAGACCAGCCTG^CCAACGTGGAGAAACCCCATCTTTACTGAAAAAACA 
GTAATCCCAGCTACTCAGGAAGCTGAGGCAGAAGAATCGCCTGAACCCAGG^ 
GCACTCCAGCCTGrGGCAACAAGAGTGAAACTCTGTCTCAAATAAATAAATAAGTTAAT 

7 0 AAACAACCCAAT AAAAACTGGG CAAAGACAAGATATGG GATGGCAAAAAAAAAAAT CACATGAAGACAAG CGTGGCATG ATT ACT C 

ATTAGC^CAATGAGATGCCACAGCATACCTACTAGAATGATTAAACTTTTAAAAACAAAAA 

CAGGAAG CAACTGT AACTCT CACATTG CTGTTTGGAAAACAAAGTG AT AGCG CTACTCAAG GG AAAAGGT CTGG CAGTTTCTTAC C 
AAGTTAAACATACATTT ACCATAT AACCCAGCATTTC TACT C CTGGGT CTCT ACC CAGAAG AAAGATAGACTT ATGT TAAAG AAAA 
ACCTACATAGGAGTGCTGATAGTGGCGTTACTCATTGGCACATG^A 



1919 



WO 03/008583 PCT/USO 1/5 1291 



GCAAGACCTAGAGCTCATCCACTGTCCCACACCGTGGG 

CATGGCCAAAGATCGTCAGTTACACTGAGGGCAAACTAATCGTCGTGATTGGGG 

CCTTTGTACCAGAAAAAAACAAGG CTATGAG CCTTACTACTAG CTGGGTCnT CAGG AATTTTT AGCTAGTCTAG CAACATGACTCG 
TCATTATG CACAAATACTAATCG CAAAACTTAGTTTGAGGAAAT AATGGTG^ 
5 ACAAAACTTTGAC TTGGTT TCTGTTTCRGATCACAAAAATG^ 

ATACATTTTTCTCTTTTATGGCTATACCATTTGGAA^ 

ACAAGTAG CAATTTTACTAATAGGAAAATTGAAACCAGTGATATTGT CAC TGTTTTAG CCAGTTGTGACAGTATTTTGT ATTT ATT 
TATT TACA GAA TACT TCTGTATTATAGGAGATTACATTATCTATGA 
10 . AATATTTTAGACTTTGTGGGTAATATGGCCGCTTT CGCAA^ CATCTGTGGACAC T CTA 



TTGTTTTTTCTTTTTG CCCCCAAACATTTTAAAACGTAAAAACCATTCCTAGCTCAACTGAACA CCATAATTTTTCT 
GACCX!CTGGTCTATAAGAGCAGATGAGTTCATTGCT 
CTOT CA&GAGTTTATGAA ATGACTAACTG^ 
1 5 TTTG i"rrTTl"i~I"l'GCTTTGGTAG AGACAGGG T CTTGCTCTGTTACCCAGGCTGGTTG CAAACT CCGGGACTCAAGCAATCC 

ACC TGCCCTGG C CT CCCTATGAGGAACT ATAT CTGGCCTTGTATTTT^ 
NNNNNNNmJNNNNNNNNN^ 
NNNNNNNNNNNNNNKNl^^ 

NNNNCTTCGCCAAGATGGTGAAACCCTGTCTCTACTA 
2 0 ACTCTGGAGGCTGAGGCAGGAGAATTGCTTAAACCCTGGTGGCAGAG^ 
GGGTGACAGAGTGAGACTCCATCTCAAAAAAAAAAAAAAAAAAAC^^ 
GCTGGGATTATAGGCGGGAGCCATCGAQCCCGACATATTTTCTTATATC^ 
CACATAAGGGCCTGGAAGGTAACCTGGCAATAGCITTTGTCGTTTAGT^ 

ATTG TGG TATTT CAGAAGGAGTTACTTTCACCAG ATTGT^ CTGTTT AGAG CT CCTCACTGACTCTACTGGG CTGTAG AATGCTTGG 

2 5 TTCTAACATG<3TCCATGAACTGTAATTAATAT 

ATTTGCTTATTAGCTTTG TATG TTTTCTTAGCAGCCTG 
ACTATTTCTCTGTACrGGTTTTCTCTAGTATGAGAA 

TGATGACTTTATTT CATACT TTT ATTTG TTAAACT AGAAG CAG ACTG TTGAGTAT ACAGCAG TG ATGACCACAAGG ACTATT AAGT 
TCAGTATATTCAGTTCATTGGTTCTGCTTCAGATAATCTCATTTT^ 

3 0 GGCAGTTGTT CT CCT CAGGAGAGTT CC CTAGTTCCTCT CAGTGTCATGAGT AAG TAGGGCACG CGTTCTTG GGGTG ACT TAG CCAT 

GTAGGGTTCTAGAGCrGGTAAATGCTGATGCTGAAATGGTCGCGAGTCCCATAGATTCACT 
TGTAAACCTCTTTCCCATCCCATGCCATTCATCAGTGCn'TTTTAA^ 
GCTCAGAGTTTCCCGGCTCTGTGCGACAACCTCCCTTTAGCCTGTGTCTTTCT^ 
TAAGGGCCCAAGGCCTCAAGGCCCTGGGAAATCTACTTCT 

3 5 CGGGGCACTTCTCTCCACTTTAACCACCAAAGTATACTCCT 

TCTTTTTGTAACCTGTAAGGGCATCATTCAGCTCTCACCT 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTTTAGATTAACACCCTCTGAGGCCATGA 

TTCCAGGAATCCTCCITTGCCTACAGCGTTTGCCT 

GTAGCGGCTTTCCCAAAGCCTGACTCTGACATTGCAGCTTCCCT 

4 0 TTTACTAAAACITCATATAATACTCAGTATTAAATATTACA 

AATGTGTTTTATGCAATTTGTTTGGTGTCTCTGTGTTCTGTATCT^ 
TGTAAACTAAGATCATCAAGATGATAAGCT^ 

TAAGT AAATTTT ATTAT AAAT AATT CAAACCTAG ATAAAAAC CCACGTTTT CTT AGACTAT C CT ACATTTG CTTCACTG G AAG CT G 
CATGAAAG CTGAG ATTATGCTTGT AAT ATCTCTAG CACG TGGCAGGCATGAAGTAAAG AC CTG GAACAG AGTCTAGT ATAG CT CT C 

4 5 CAGAG CTG CAGCTCTGAGTCAGACTGGTCAGTACATATCTCAGCCCTGCCAAG CATCCTCTC 

ATAGAAGAGGGAGATAAGGATCCTGCAGGGGCTGCCGGCCAGA^ 
TGTGGTCTCCITCTCATAGGTG^CCATCTTATCAGC^CCAGGT 

CCTCCCTTCATCCTTTCTTTCCTTTCTCTTTTCCTTCCCTC CCTCCTGATGAAGAACTGAGAGAAATATAACAAAGG CTAATGTAT 
TCCTTGAATGAGTAAATGTTAGCATTTTCIAAAGATGTAAACTTGGATCTCAGGT 

5 0 CrATCTTCTCTGCCTC^GTTTTCTTGTCTTAAAAATGGGACAATAGTA^ 

TACTGTTAGCACAGTGCCTGGG ATAT AAG CCCAT AATCAGCG TTGGCCATTGT AG TTGTTG CTAGT AATAATAACATTAG CTTG TA 
GTCCCATGGAGGAAGAAACTAGATATTTATGAAGCAAAGTCAAGCTCCCAAT^ 

GTTTGTTTTTAATTATAGTGCGACCACATTTGGGA ' 
GAGTCAGGCTCAATATTTTGGTACATTTCTTTCAGTTCTTGTTTTATATGCATATCT^ 

5 5 GT AGACGT TT AT AACCTG CTTTTTT CACTCAGTAAT AT ATGGTACCCATTTTAACATGTTC C CAG CACT 

TGTTTTTTCTTTTTTTTTCGAGACAAGGTCTCACTCCGTCACCC^ 
GAACTCCCAGGCTCAAGTGAGTGATCTTCCTGCCTCTGCCTCCCAAAGTGTTGG 
TAGCCAGCATTTTAAATTGCTGCATTAAATCCCTTCTGGCCAGCCAGTGCCT 
GTGTGGATGTAAGGAACCCAATAGGGAATATAAGAGATTCAGCATC 

6 0 ACTACCACGTGTCAAGGTGTTGCATGAGGCAGAAAGTGCAGAGAGAGGCGGAGCCACCCCAGAAGA^ 



ACGTTTTTGTGCAAGGGCACATTTTATCCTCTCAAAACAGCIATGTCTTI^ 
CTGTCACCCMCCTGGAGTGC^GTGGTGTGATCTTGGCTCACTGCAGCCTTGATA 
65 CTCCTGAGTAGTTGGCACTACAAGTGTGCACCA 

CCCAGGCTGGTCTCGAACTCCTGGGCTTAAGCAATCGACCTGCCTC^ 

GCCTGGCTGTTTTTGCAAGGTTATAGGTTCCCTTA - 

GATGATATCACAAGAAOAGTAGAGAACTCGAACCTACT^ 

CTCTGTCTGTGTCATAAAGTAGTGAAATTGCTGTGGTC 
7 0 TTTAACAGTGGGTATGTACCAAGGACrrGGGATTGGATAAGACGTAGGCAATAAAA^ 

CCTGGCATTAATCATTGTCCCTTGTGTTCCTGATATTTT 

TGTACGATGAAGTTGACAGGCGAGAGACGGATGCAGGTGACTTGACXaTGG^ 

AAAAGCTGCCAGGCTCAGTGTTTTACTAGTCCAATTTCTGCT 

CTTCAGAGTGTGTTTCCCCTCGTAGTGCGTGAGCTCTGC 
7 5 CTTTCCTCT^CTAGTCTGGCTTGCCACGaU^CCACAGCGC 
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CCT^ 

TTTTCCC^GAAAACTATTCTAAATGTTTTGATTGATTAAAATTTA^ 
ATAAGAAAAACAAT7U5ACTATAAATAAATG^ 

AAGAAATGTTTAACCTTGTCAGTAATCTAAGAAAGCAACTTAAAAGCATTAG^ 

AATAAAGGATACTCCAAACTGGTAAGGATGTGGTAAGCTGAGCTTTCTC™ 

GAAAAGCAATTTGTTTACCAGTTTCAGAAGCCTCAAAATTAA^ 

AATAAAATAAAAATAGATTTATGCACGTGATTTTCATCAAAGCACCATTGTATAC^ 

TCAAGAGGAGAAATGTTGAATGAA^ 

TATGATTTTGTTTAACTTAAATTTTTTTT^ 

CAGCTCACTGCAACCTCCACCTCCCGGACCCAAGCX^TTCTCTT 

CCATGCCTGGCTAATTTTTGTACAAAGTCCTGACCTCAAGTGATCCTCCC^ 

AGCCACTGCACCCGGCCTTAACTTAAAATTTAAAAATTCCCCTCAAA^ 

ATTAATAGAAG AAAAGCATT CAGT AGACC CATTTTATTTTTG CTTACCAAAATAGA.C CTGCATTGTGAG AACAATGCATATG TG AG 
AAATGCCATAACATTTGTTATGAAATTCT CTT ATAT AAATGGTTTTAAACT CATT AACTTGG TTTG AT 
TATGG CTGGGATTAATTGT ACTTTGAT CATG CCAGAAGAAAGTATTTG CAGAAGT CTGTCAAT AGTCTG ACTAGT TTTATGTGACT 
ATTTATGTATATATTTTTACACTGGGAAATA 

CATGAGTTTGCCACGCTGGTCATTGACATTCTCAGTGACGCCAAGAGGAGAC^ 

CTGCTCAAACCTCCAAACACTTGCATTTTTTGGCCCT^ 

AACATGACAGTTCTATAGGTTTTTGGTTTTTTGTTTGTTTTTT^ 

TACGTAGGTATACGTGTGCCATGGTGGTTTGCTGCACCTATTGACCTGTCCT 

GTTTTATTGGTAGTATATTGTGATAAATAGTTTACCTGTCrGAGCCCTGGTGAAr^ 

CATATTTAT CTTTATAT ATGATTTCrGTTTTGTTTTGGGACACTTAAC CTACC CTGTAAAATTGCATC OTGAAATTAAC CCAAAACT 
CTGGCTTAGATCTTGTTAAAGGACATAAGAAAATAAACTCTTCTTGG 
GGAGGCCGAGGCAGGTGGATCACCTAACGTCAGGAGTTCGAGACCAGCCTGGCCAAC^ 
CAAAAATTAGCAAGGCATGGTGGCGGGCGCCTGTAATCCCAGCTACCCAGGAGGCTGAGGC^ 

CGGAG GTTG CAG TGAG CCGAG ATTG CACCACTG CACTCCAGC CTG AGTG ACAG AG CAAG ACTC TGTTTCAAAAAAAAAAAAAAACG 
GAAAAGAAAAACTCTTCCTGATTGAAACTGCATAATATCTGCTGTTGCTTACCTACAAAACT 

TGG CATAAACTT T AAATGT CT TCAGTGAGAATT ACTGTGATATACAATTATAAGG AGAG CTCTGAG CATCTGTTTCACCAGTGCC C 

TG TTGAGTCTAGTTCAGTAGCCC^GGTCCTGTCTGCATGCTTAGGAATGTCCTTCCT 

TTTTTTCTAATTATAGAAATTAGACGTTTTTCATTGTAAG 

CAGTATCACATTCCCCAAAGA(^GTAATCATTAACTGACAACAACT 

GTTCAACCAGGAATAAGGAAGTTCTGAAAAGGTTTATTAAATTTTGGTC 

TCTTAAAATGGCCTGGGCGTGGTGCCTTACGCCTC^ 

GTTTGAGATCAGCCTGGCCAACATGGTGAAACCCATCTCTACT 

AATCCCAGCTACTCAGGAGACTGAGGCAGGAGAATTGCGTGAACCTGGGAGACAGAGGTTGTAGTGAGCCGAGATCACACCACTGC 
ACTCTAACCTGAGGAACAGAGTGAGACTCTGTCTCAAAAAAAA^ 

ACTTTGGGAGGCTGAGGTGGGCAGATCACGAGGTCAGGAGATTGAGACCATCTTGGCTAACATGGTGAAACCCCGTCTCT 
ACTACAAAAAATTAGCCAGGCATGGTGGTGGGTACCTATAGTC 

GGAGGCAGAGOTTGCAGTGAGCCAAGATTGTGCCACTGCACTCCAGCCTGAGTGACAGAGC^ 
ATTTTTAACTTAAAAAACAAAGTCTTAAAAAATACCTGCT 

ATCAG AG ATGTTAT ATG T AGAACAACACAAACATTAGAAG CAACT AAG T AAT AAAGAGGAACAATTAGTCACATTTT AAAAT AGTA 
CATGCTGCACTITGGGAGGCCAAGGCAGGCGGATCACGAGTTCAGGAGATC 

CTACTAAAAATATAAAAAATTAGCTGGGCATGGTGGCAGACGCCIGTGGTCCCAGCTACTCGGGAGG 

TGAACCCGGGAGGTGGAGCTTGCAGTGAGCCaAGATCGCACCACTGCACTCT 

AAAAAAAAAAATACATACTTATCTTTTTTTAAAAGAATATGTAATTGTCTTAGTCCCTTT^ 

ACTGGGTAGTTCGTAAAGAAAAAAGGTTTATTTGGCTAATGATTCTGCTGACT 

GCTTCCACTCATGGAAGGTGAAGGGGTGCTACA^ 

CAGGCT CTTTTCAATAAC CAGGT C CTAG CAGG AACTAATAGAGGGAGAACTCACTACAAAGG GAGGG CATTGATCAGT TCATAAAG 

GATCCACCCCCATGACCCAAACACCTCCCTTTAGTCCTCCCCTCCAACACTGAGGATCACATTTC 

AAACATTTATACTATAGTAGTAATTATATGAGACAAAGTAATAAAGTTTGATTTTGTTTTGTTTT^ 

ACTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTTGGCTCACTGCAACCTCTGCCACCCAGGCT 

AGCCTCCTGAGTAGCTGGGATTACAGGTGTGCACCACCA 

GTTGGCCAGGCTATTCTCGAATTCCCyUSCCTTAAGCGATCTGCCCACCTCAGCCTCCCAAAGT^ 
CCATACCTGGCCAAATAATAAAGTTTTTAAAAGGTTTTGTATAACCCCAGTTTTGTAAGGAAAA 

TACAAG TGAGT AT AC CACATGTAAAT AT AAAT ACACAATTGATCCT CATGTG CAGATC C CATATTTACAAATTCATCT ACTT GCT A 
AAATTTG TG AC CT CAAAATTAATACAGCATTTT CAAGGT C ATTGTAAGAAATGTACAGAG CAGAGAAAAAAATTG AGTCACCAAAA 
GCACATGTTTCCAGCTGAGATTGAAC^GGAGGAGACTCCTTGACTTCTTGTTTCTGCACTCATG 
CTATTTAGTGCCATGTTAGTCATATTTGTGTGCTTTTCATTG 

TTGAGTGCCCAGGAGATTGAGGCTG(ZAGTGAGCCATGATCACGTCTGCACCCrrAGC^ 
TGGCCAGACGCAGTGGCTCACX3CCTGTAATCCTAGCACTTTGGGAGGCTGAGGTGGGCT 

GCCTGACCAACATGGTGAAACT CTrGTCTCTACTAAAAATACAAGAAAAATTAGCCAGGCATGGTGACGCGTGCCTGTAATC CCAGT 

TACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGTGGAGGATG 

TGGATGACAGAGTGAGACTCCATCTCAAAAAAAAAAAAAGTGCACACCTGAACAACATAGCAAGG 

TAAAAATTTTCTAGGTGTGTTGGCATGTGTCTGTAGTCCTACATACTCAGGAGGCTGAGGCAGGAG 

TAGAAGCTATAGTGAGCTATGATTATGCCAGTGCACTCCAGCC^^ 

AGTGGCCCCCAAGCATAGTCCTGAAGTGCTGTCTAGTGTTTCTAA^ 

TAAAT AAG CAT CATTC AGG CATGAT TT ATAG TG C CGT TGGC CATGAGTT CAATGTTAATGAATCAAT ACT ATG T ATT AAAT AAGGT 
AT CTTT AAACAGAAACAGACGTAG AACAAGGTTAT ATATTGAT^ CCCAGAAG CACTGAT CT AATATTGG CCAAGT C 

ACAGCAACTTTTTAGAACATAACTACCAAGAGGCrrGGGTGCGGTGGCTCACACCTG 
GCAGATCACGAGGTCAGAAGATCGAGACCTTCCTGGCTAACACGGT^ 

GGGTGTGGTGG CGGGCGCTTGTAGTCTTAGCTACTCAGGAGGCTGAGG CAGGAGAATGGCGTGAAACCGGGAGGCGGAGTTTGCAG 
TGAGCCAAGATCG CGCCACTGCACTCCAGCCTGGG CAACAGAGTGAGACTCTGTCTCAAAAAAAGAAAAGAACATAACTACCAAG C 
ATAACAAG AATTGACT GTGTGTGG CTAT AAGTG CT CGGAGAAAAACCTAG AAAT AG ATCAACAAAT AACAATG^ 
ATCTTACTTGTTCATTAOTTTCTAAATTGAATATATTGACTATNNNNNNNN^ 
NNNNNNNNNNNNNmJNNNNN^^ 
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NNNNNCCTTTATTATTOT 

CTCACTGCAACCTC^CCTTCTAGGTTCAAGCGATTCTCCTGC CTCAGCCTCCCAAGTAGCTGGGATTACAGGATGTGCACCAC 
CACG CC CAGCTAATTTTTGTATTTTTAGTAGGGATGGGGTTTTGC CATGTTGG C CAGGCT GGT CTCG AACT CG TGACCTCAAGTGA 
TCCATCTGCCTCAGCCTCCCAAAGTGCTGGGATTACGGATGT^ 
CATTCATCACCC^ATATCATCCATCCATTGTTTCCTCAACATCTCTC 

AG AAATCAGAGAGACTTGGGTAGTTTCTGAAAT CTTT CTC CCT ATGGGA 

TAGACACTCCC TGTAACATG CACATGTGTATTTGTTTGATTCATTCT CCTTTATCT CTTTCTCTCGCTGAGATG CAGGTAATTG AC 

TAAATGCCAATGTTTCTATTCTTTTTTTTTTTTTO 

GGCTCACTGCAAGCGGGTTCATGCCATTCT 

AACTTTTTGTATTTTTAGTAGAGGCGGGG 

GGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCCAGTGCTTCTATTGTTTG 



ACCTAGAGCCTTATCTTCATGACCTGGAGCTAGGCTCTATCTCTAGAAACTTGGGAACCT 
CTAAGGCATCTGTTTAATTTGCCCATAITTGGCAGGTAGATGAGAGGAATG 

AACTTTTTCTGTTATCTCTGTATTGGTGTGAATATAACGTTGGCATTTCTGTCATTTTTTTTCAGTCC^ 
TCTTTGTTTTTGCTCTTAGAGACAGGGCCTCACTCTGATGCCCAG 
ATCTCCTGGGCTCARGCAATCCTCCTGTCTTAGCCTCCTGAGTAGCTAGTAGCTG^ 
AATTTTTAAATTATTGTGTAGJGGATGGAGTCT 

TGGCCTCCTAGAATGCTGGGATTACAGGCAAGAGCACAGCATTC^GCCCTTTAAGT 
CCGTAGTTTTTTGGAAGAAAAGCAATGTTTTAAGCCTGAAGTAAAGTTGTTO 

AAAAC CATCAATAACCAGCACAGCGTTGAG AGT CAAGACAACGATCAGCC CGACTATGACAGCGTGG CATCAG ACG AAGACACAGA 

TTTGGAAACC^CTGCAAGC^AAACAAACCGGCAGAAGGTGAGAGGCCATGCGTCC 

CTAGCCCTGCCAAAGCTGCTGTGCAGTGAGGTTGGGTTTGTTACTAGCTCAGGCCA^ 

AAAATAGGGATCCTTGTCCCTGCATCACAGCCCTGCTGCAGTTGAGACT 

CATTCAACAAGTGTTTACTGAGTGACTTCTCTGGATATTACTCTCCTGAATGAGGCCTCGTCC 

CTAGGAGAGAGAGAACTGCGGAGAAGGCACAAAGTGATGTGTGTCATAC^ 

ATCTATGGCTGGGAGCATCAGGTGGGGCTGTGTCATAGGA 

GG AGGAGT CAG AAAGAAAGTGAAAC CAGGG CATCTAAGGGGAAG CTGG T CATG CAT AGGGACAGCCT AAAAAGGTTGGG ACT CGAA 
GCTGCAGCAGG AGG GG TATG ACACCCG ACGGAAGAGAGTGGGCC TCACAGG CAGTG CAGAG C CCAGG ATG CCATGT CTGGG GAG GA 
CTCTTCTGGTGGGTTAGAGAGTGGAGGGAGAGATCCATACTGGAGCTGATGCAGGCCACCT 
GTGGGAGAGACCTCGGCACCAGAGTGTGAGCTGGGGGAGAGGGCAGCAGGACCCCAGGC 
ACACCCACAGAGGGGCTTGGCAATCATTTGAGAATGGATGAATAGGAAGGTAGAATAGAA 

GAAGTGGGGAGACAGATTTATAG AAGATTTGTTTGGAGTT CTTCGGCCCACATGACTGTGAGAAAG ATGG CGT CATTAGC AGATGC 

AGGGGACACCCCTGCGCAGACTTCCCGGAA.GCCA.CCTGGTTCACTCAG 

CCCTCTGTCAACCGCCTGGCCTCCTGCCCAGTCC^TCCCACACACCTGCTCCCTCTA 

TTCCTCCAGATTCATGTGACTCATCCCCTCACCrCCTGGGGCTCTGCrCACCTCT 

CTTAACATGCAGCCCCCTACCCTGCTCTGTCTTATCTTTTC^ 

ATGCTGTTGGAATATAAACACCATGGAAGCAGGAATCTTTGTCTCTTTTCTAA 

TCAATGAAAATGTTGGAATGAATGAATTAATGAAAGAATGAATAAAAACAAAACAAGAGTGAAOSAGA^ 

GACAGACCTCCTTAGACAGTCATTGTCAGCATAGATGTGTAGCCTCACTGCCTGTTCTATTAGAAAATACT 

TAAATAGTATAGTGACCCTAOTTTATGTTTTAGTTTTGCATTACT 

GCTGGCTTGGTCAGAATTTGAAATTCTTGAGCTGTTAGTCTTTAATGTTCTCGTC^TCT^ 
TTTTAGAGCCTAGATTCAGATTTATCAGATGGACCAGTCACTGTACAGGAAT^ 

GGCCAAGAT ACAGCAG CTAATGAAGG TG AATAACAACTTGAGTG ACGAG CTG AGAATT ATG CAG AAAAAGGTAACATG TTAATAAA 
TGCCAGGACGGTTGACGCTTTAATCCCATAGTACAAGGAAGAAT^ 

TGTAACAG C CTT AG AAATCTTTGG G CCAAGTTGTCTTTTTG AAG CAGGATGTATTTGG AT ATTT AAAATGAAATATATCCAGAGT A 

AGTTCCCCAGGTAGGAGAAACCTAACATCTATTTAAGCATTTGTGCCCATTAGAATAACCCTTGGACTAT^ 

ACCTGGATGGGGCTCCATCGATGTGATTCTTCCCATTGCTAGGGAAGGATGATCACACCATGGA 

GGCAGGGGCATGTTTATCAGCATTCAAACCCTGCTCCCAC^ 

TACTACTAACTTGTATTATCCCCAGAGCCTACAGGAGGGCCTGCC^ 

GTACTAGGAAGAGCTGCAC^CTGGTAAAAAACAATCTCATTTGCTCAGCATT^ 

GTGTGAATATATTAATCCCTAAAAGGAGTGTGGTTGATGGTAGCAAGGAATACCCITAGTAGGAGGGGGATTG 
TTATGAGATGGCTAAATAAGGGAATTCGAAAAATTGCAAATTTTGCCCATGACITTGTC 
TGAGATTTGGACACTTTACTTAACCAGAAACAAAATGAAATATATAATCTATGTCTT^ 
AAATCATTTTCAAGACCTGAGGACGTGGCACACTCCATGTGTGCCTGGACrrCTTTTCA^ 

ACTAAAAAGGGGCCCATG CCCC TCTGTTTACTAGGGAAGGAAATTGCTGTG CACTTTTGCCTGGGGTTTGGTCTAGGATTCTTTTG 
ATGTTG ACTTCT AG AGCATTTTTGAGACTTT TTGAAATAA CTT AAG AT 

CAGCTAGCCTTAAACGAACAGACACTTGTTCTGTGTCTCCCAGCCCTGTCTAGGGCTGCCGTGC^GCTGTAAACGAGACAGACCGA 

GAGCCCCTTTGTCATAACATACAGGACCTGTATTTTCTTTGCCTTCT 

TTAATTGGACHAATAATTTTGACCAGGTGACTATTTCT 

TTTAAAATCTAACAATGCTTCTTTTAGTATTTTTGGAAATCACCAGAGAGAGCC 
CTTTGAGAAATGCAAAGTGAAATTATTTGCTTTTGCTTATTGATTATCT 



AAAATTTATGTGAAGACATCTCTCTAAATAC AG AG T ACTAACATTTTT AAAAATTG AAAC CAAG AACTTGTG CTCATG AGAACACA 
TTCAGACAAAATCCAGG CAT GAAAAG TGTTAGGAT AAAGTTT CAG ATTT CAAGGCCT TAACTAAATAACTGCCACAGAGGCTTTTT 
ATACAGACAAACACGTGATCACATGCACCAGTGCCCCATTGAGGGACCTGTGTCAGTTACAC^ 
TTTCACTCTAGGCCCTGGGTTCTCTAATGAATAGTTTCAACAGGCACTTCATGCCTGTOT 
GGCTGCTTTGTCAGCTGCTTTGCAGCTGTTCTTTCCCTGGAAAGG 
TAAAAGCATTCTATCATTCTGTAATGGCTTGCTCTTCAGAGAAAC^ 

TTTCACAGCAGTTTGGCATGATTAAT ATAAAGTATATGGAATTO CAAAG ATGTAT TGATTATATTT CAAAGTTGGG 
TAAAGTTTTTAATATAI^TAAAGTTGACAAAGTAGCATGATTTC 
AATTAATGAAGAGGAGCAACTACTATTC^TGTATTTTTCACAGATTGG 
GAATGGGTTTGCTACTGAAAAAGTAATGATCTTCTGGTGCAAACAGTTC 

GTCAG ATTTTACAAT CTCTGT CAAAGAGTAGACAG CTGAAC TCACACCACACCCAGCTTATAGAATGT CCATG GAAG ATGAAGGCG 
CACCAGAAGGGAAG GACCCTGCG CAG AATGGACGTG GTG AATG GTGTTT AAAATG C CAGATGC CAAAG AGTAACACGATT CC CTGC 
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TGACCCCTTAACTCTAATC CATC CAGCACCATG GAG CAGCCTGCATGTGAGGAA 
TCTCTAAAATGGGGTGGTGCCAGGCAGATTAGCATGTTCA^ 

GAGAG AAAGGCCAAATATT CCAG CCCTGGCCGAAAG ATGAT CACT CACCAGACGGAAG CAAG CGTGCTGCGTGGAGCATCCATG CG 
AAGATGTCAATTCCATAGATCAATAGGTTT^ 
5 TTTTTCTTTTTTTTTTT ACAAAGTG CTGAATTGTTTGGAATATCAAGAG TATTATGTAAT AAAAACTTGTTG ATCAT AATTTTT CT 
TGTTGG CTT TTTGGAAAGCAAT AGG CXlkTC TGATTTTT CACAAGCrGAACTT CTGT C'J .T T LT1 GCTTTGCAAAGCAACCCTAGTCA 
TCrTTGTGCTGCITCCATG43TGTCTGCTGT 

GATGTTTTAATGATCTCGAACT CTTG CTTGAAACAGCCTITAGTAATAATC CAGTGTACAG CITGCCTGCAAACATTTCCCAGCCCC 
ACCATGACAAATTTTAAAGTGGTTAGACATTTTGTT^^ 
1 0 AATTTGCTTACGTGATGAAGGGAGTTGATTGTGTTTCA 
GCCCGGCTAGCTGTTACTGTCCTTATTGTTT^ 
AAAACACAGTCCCACTGCCAAACATTTCTTTCAGCT^ 
ATGTATATCAGGTGCAAACTGGTTCTGAGTACACAGAC^^ 

AGGCCCATGTCCATGTACGAGACCGGATCAGGTCIAGAAACCATATCTCCCAATGGGA 
1 5 GAGACTCCAGCCCTTCCCCX3CGCACGTAAGTAACACCAC!TGGC^ 

CTTCTGAACCTACAGGTCCCCACAGCTCTGCTTTGCCAAATT ' 
CAGATGCTTGC^TATTCCCACAA 

CTGCIXS CTTTTTT CT CCCC CACATC CATGATTATTTCT CAACCAG CCAGTGTTGGAACri w lTU"l w rGGTAAAG CTTCCTTC CITCT C 
TACTTCCAAAAACAAAAATAAATCATAAAGTTCACTCTCAGA^ 
2 0 AAGTACTGGAGCCCTTAGAATCCAAGACGTCTTTTCAGCT 

AAGGCTGGAAGGCTCCAG^CTTCCGCAGCCTCTCCATGTGTGCTC^ 
CCGTGGTGGGGGAAGTGTGGCTTTGTTGTCTGAGAAAAGGAAATGTAATACT 

TTGTGTTCTGCAG CAGGGTT C TT ACAAT CrTTATTTTGCATTGGG AT AC!ATACTGTGTT ATAAGGTTTTCAAATATATT T CTGG 
ATTACTTT ATT AT AAAAAT ATGTG TTTTGCCAAGATGTTACATTTA^ CTTT TT CAG AG AG T AGTATTTGT 

2 5 CTTTTTGAGTGTCAAAGTTTATTATTTCT^ 

ACATGGTTTCGTGAATTAAACCCTAGTTTAATTTTGTCAATTTC 
TCTGTTATCTCAGATTCTTTTTAAGATATTTGGG 
CTGAACTAAGTTAAGAATCTTTTTTTTTTGAGATGGAGTCTCGCT 
CTGCAACCTTTACCTCCTGGGTTCAAGCAGTTCTCCTGCCTC^GCCTCCTG 

3 0 GGCTAATTTTTTTGTTTTAGTAGAGACGGGGTTTCATCA 

CCTC^GCCTCCCAAAGTGCTAGGATTACAGGTCTGAGCTACCATGCCCAC 

CAAAATTT AGGAAAAG CAACTTTGTTATYr CAGTGTACTTTCTTATTTT AG AAAAAATTAGAATATAAAG AT AT ATCAGGAAGG 
CACTGTAATTGTACTAAAAATTAGATTGATAGTGTATAGCAACTTAGAATCATTTTTATAGTTTCTC^ 
TGAAAAACAAATATGCATTT CAT AG T ATCCATTAAAAAATTATTGAGA&G TTTG AC 
35 • AGCAGCCTTATTAAATTCAGAGTAATTTCCAGGAGGTGTTTCACCAAGGGT^ 

TGAGATTGCAGCACTCACGTCACATAATGTGTGGTGTTCCAGGATTCTTTTACT 
TTTGTATTTTCCCGAGTGTGTTTCTCTTTCCCATTCCAGATCGGGA 

CCCCTCCACACTTTCCTGGTCGAGGGACGAAAGCGCCCGAAGGGTTAAGTACA.TTCTGAATGGT 
TGCTCCTGGGCGCTTTGTGOTTGGGCTCTGGCCCrCCTGTCCCTGCTAACTGCCCTTCCGTCCTGC^^ 
40 GGTTTCTCACAGTGCATCAGGCTCACACTCGTCCACATGCACGCTC^ 

TCATG TTCACG TCATATTT AACAAAGTTTGCAAAT CACC CT ACAGCATTT ATTCT GAG CAAACTGTTT TTTT AAACAACGAG AAAA 
TTAGCTGGCTGCTGGTTAGATATTATAATAGGCCAAATTTGTTCAATTTGTTCAGGGCT 
GTGGACTCGCTGAAGTACTAAAATGTCATGCCTTTTTTTTTTTTTTTAA 
CAAACCAAGGTTTACAAGTCCCTGCATCTTGAATCAGAATTGTAACA 

AA ??^?^ ,I, CACTTGTACGGGTT G GTTAAACCT CAC CT ACTGGTG T AT AATTTATG GT CTCAAT AG CAGTG CATC3AT T CAG GTAA.GKT 
CTCTTTTAGAGTAGTCAGGCAGTAAGTATTAA 
TAGCTGGCACAGCCCCTGCTCCCAAAGGGACTGTCAATTA 

AAAGCCGTGGAAAGGGCATAGGCTTTGAAATCCACGTCCCAGTTCTGGCTCCAT^ 

GAGGAGGAAGGTGGCTGTGGGTAAGGGACCACCTTTACAGGCAAGGTGGGGTTTGCACTGGGCCTTTGCCATTCCCTCCACACCT 

5 0 GGAATTATGGAGAGCCTTTCTTTCCTAGTTCCAGTGGCATATGGATTACCAAGGCCCTTTCAGGATT 

TTTTAGAGACAGAGTCTCATGTTG C CCAGTCTGTAGTACAATAGCTATTTACAGGCATGGTCATAG CGTACTGCAG CCTCAAACTC 
CTGGGCTCAAGCAGTCCTCCCACTTCAGGCTTCTGAGT^ 

CTTC^CTTCACTTTCAGTCTATTAAGAATTCTCAGCCGAGTGCAGTGACTCACACACTTTGG^^ 
GAGCCCAGGAGTTCAAGACCAGCCTCAGCAACATGGCGAAATCCC 
55 GTGCACCTGTAGTCCCAGGTGCrTTGGGAGGCTGAGATGGGAGG^ 

TGATTGTGCCATTGCACTCCAGCCTGGACAACACAGTGAGGCCCTGTCTCAAAAAAA 
AAAAAGAATTCTTGATACATTGTACTTAAAGTCTCGGCTGGGTGC^^ 

GTGGGTGGATCACTTGAGCCCAGGAATTTGAGACCAGCCTGGCCAACATGGCAAAACCTGT^ 
GGCGGGTGGTGGCGCATGCCTGTAATCCCAGATACTTGGAAGGCTGAGGCACGAGAATCAAGAATCCCT 

6 0 GGTTGCAGTAAGCCGAGATTGCACCACTGCACTCCAGCCTGGGCGACAGAGTGAGATTGTCT 

GTTCTGAAATTGAACTGTAATGTTTCATTAAAATAGCCCAGATAC 
AATTTTGTAAGTCAAACCTCTTTTGTCTACTGTG^ 

GCTTGCTGAGACAGAGTTTTGTACAAAAATCATTTAAACTGATCACCCTGATTGTTCCTC 
TATGGCGTGTCTTTATTCAGAGAAAACTGCAAACCTATTAAATACATTG^ 

6 5 GCTTTGAGAGAGCAGCACTCTTAACCAATAGCTTTTGGCTCTC 

GAAATTCTGTACATATGCTTTTGTTGACAACGATGGAAGGACTTAACTAACGTGTAAG 

TATGTTGTT TT C CATACTTG AAAAAACATTT ACCAAATG CATTT ATTAGGT AT ATGAGT CACT ACT CT TCTAAAATGTTTAAAT GA 
AACTTG GT ACTTT TCATCATTGAGG GT AATGCATTTGATTTGCGATTTATC^ GG AAGG CAAAAAGATTCATT AT 
AAAGGCCAGACATGGTGGCT^CGCCTCTAATCCTAGCACTTTGGGAAGCCGAGGCAGGTAGATTGCCTGAGCT 

7 0 ACCATCCTGAGCAACACAGTGAAACCCTGTCrrcrATTAAAATACAAA 

CTACTTGGGAGGCTTAGGCAGGAGAATCACTTGAACCTGGGAGGCCGAGGCAGAAG 

T C CAG CCTGGGCG ACAGAGCAAGACT CTG TCTCCAGATAAATAAATAAAAAAGATT CATT AT AAAAATGTAGAATTTT CTGGGTTT 
AATTCTGTTTCTAGTTTGTTTTCCACTGGAGCAGCTCTGGGCAAATTTTTCAGTATAATTACGTAAATGCAAGTT^ 
AT AT ATGTGTATATGT ATACACACACACACCTCTGTGTGGGTGT ATATATAAAAAC CAAAATAAAACTTAAAAGTT AGTTGGGG CC 
7 5 AGGTGTGGTGGCTAACACCTATAATCCCAGCACTTTGGGAGGCCAAAGTTGGAGGATCACTTGAGCTCAGGA^ 
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15 



30 



GGAAGCTGAGGTGGGAGAATCGCTTGAG 
CAGAGTGAGACC CTG TCT CAAAAAAAAAAAAGAAAAAAAAT TGG CAGGGTG CGGTGG CTCACACCTATAATCT CAGCACTTTTGGA 
GG CTGAGGCGGGAAAAT CACCTGAGGTCAGGAGTTTGAGACCAG CCTAACCAACATAGTGAAACC C CATC T CT ACTAAAAAT ACAA 
5 AATTAGCTC^GTATGGTGGTGCATGCCTGTAATCCCAGCrACT 
GGTTGCAGTGAGCCTAGCTTATGCCIATTGCACTCCACA 
GGCATTCCCTGTGAAAGATAAGAAGGGTTTTGGGTTTTT 
CTTGCCCATGATCTTCCTGCTATAAACTTCTAGCAGCCAAACTCACT 
AAGCAGAACTACTTATTGATAGGAGATATAAACAAAGGGATCATCAAAGTTGTC 
1 0 ATATGGCTCCAAGCTAACGTTTAATCATTTGCCAGATGGATT^ 
CCACAGCAAGTGTCTTGTAT/ITGAGCAGCTGGGAACT 

AGAACATGCGCCCTGGGAGCAGAAGAGTGTATGGGGATAGCAAACAAAGTGT 
GGCTCCC TTGC C TGTACCCAC CAGG^ 
TTTTTGGTTTTGTTTTGTTTTCTTTTGTTTTGTTT^ 

CTCATCTCACTGCA ACCTC CGCCTCCTAGGCTCAAGAGATCATCTCACCrCAGC 
CCACACCCAGCTAATTTTTGTACTTTTAGTAGCGATGACGTTT 
ATCTGCCCACCTCTGC CTCCCZ AACATGCTGGGATTATGGGTGTGAGCCACCGCACCT 
CT CAAAAGTAGCCACTCTTTTGGATGAAGCTACAGGG 

ACATTTTATTGCAGCAT CAATAAAACTCCTTAGT CCCATGTTTTGATT AACTTG GATGAAATTCAAGGAGTAAAG C TAGAAAT AGC 
2 0 TCACACTTATATTTTATAAGGTAAGTGGTTTAGGTTTT 
GGGTGAAATAAATAGGGAAAATAATtSTTCMGAAA^ 

CACCCTAGAAGATCGTGTGTTTAGCAGTGCTGCCZATTTGTTCJ^AAACATT^ 

GATCACCACAGTTGCACCAAATCTTTACATTTTAAGGGGTAGATTTGAGCCCA^ 

GACCTGGTGCTTGGTCCTGTTTTTCTGCAGACTTTGTGGCCTTTC 

2 5 GGGCTTCAATGAGATAATCTGAAGGTGGCTTCCAGTGCTGAAGGTCTGTGGTCATA 

TTCTGAGACAGGGTCTTATCTTACTCTGTTTCCCAGGCTGGACTACA^ 

G CT CAAG CAGTCCT CCCT CAGC CTC CTGAGGAG CTGGGACT ACAGGTTCATG CT AC CATGCCTGG CTGATTTATTTAT TTATTTAT 
TTATTTATTTATTTATTTATTTATTTATTTTATTTATTTTTTAAATGTTGCCCAGGCT 

GTCCHXIKICT CAG CCTCCCAAAATACAGGG ATTACAGGTGTG A C CTGTATTTTAACTG CATCTCATTATGTTA 

T AAATTTG AG AACTAAAT GAG GATATGTTG C CTTGCATGTCTGTT AGTT TG T ACT AAGGAT CTG ATTAGG AAGTGG CCACTG AG CT 
AAGAAGAAAATGGTGCAAAGAAGGAAAGAGGCCATTTCTGGATCCA^^ 

CTGTCCCAAATGAGATTTGGGATGTGAAGGCCAGTAGCCTAACAGGTAA.CAATTCCTGGGGAA 

T ACAAACTCATGAGGGT C C CTCTTCTTT CTTGT AGG CATC CAGG CTG GAG AAG CAG AACAG CAC ACCTG AGAG TGAC TACG ACAAC 
ACTCCCAACGACATGGAGCCAGATGGCATGGGGTACATGCACAGGACGAGCCT 

3 5 GGCAGGCCreGGCCAGCAGAAGCAGTTCACCACACACACCCCTTT^ 

TACGCTCGTAGTTTACATGTCTTGTCATCTCACAITGTTGCCGAAAGGCCTTTTTCTTTGTAC 
GTOTGTAGTGTGTTTAAAGTCATTACCCCACTTCATCATGAAGACAACA 

CCACCTGAGGATGGCTTTATAAACAAACACCCTCCCCGCTCCAC^TATTTCCCTCCCrrTTTCCCCT 
GTCAACA.CTTCCAGCCCTATGGTAGAGTGAAATAGCCGATTTTAGAGCCTTACCATGAGTCTCCT 
GACGAGACCAGCCTGCTCATGCATTTAGCCAGTGCATACAAAGTC 
TTTAACTGCATCTTTAATGTTCACAGGTCIAAGCCGAAAGGGACGGCAAAGAAGTATG 
CACAGCAGAACCCCATGTGGCCCCAAGCCCCACTCTCCCTAGCACCGAAGATGTCATCAGGAAGACT 
TACAGGAGCTCTTAAGAGCAGCCCAAGAAAATAAACATGACAGGTAAGCAGGATTGAAGACCCTGCT 
GCGCCCAGAAAGGGATGATGCCCATGTTCCrAATGGCAGCTATTCCCAGCATGCTGA 

4 5 AACAGGCTTAAGAACAATGTCAGTTTTGGTTTGCTTTGTCTTGTTTGATTTACATGGTA 

GCTCCGAAAATAAATGCCTCTAGAAACAGGATTGCGGCATAGTTAGGGTGGTGACAATGCT 
CCTTACACCTCGCTT 

HUMAN SEQUENCE - tnRNA 
50 CGCCGCCX3TCAGCGCCX3CCGCAGCrGGGACCCGTTAGAGa3GAAGCGCCGCCGCCACCGCC^ 

TTCCCCG^GGTGGGAGGTGGGAGCCATGTCGAAACGGCTCCGGAGCAGCGAGGTGTGCGCTGACTG 

TGGGCATCA.GTAAATAGGGGAACGTTTTTATGTGATGAGTGCTGCAGTGTCCATCGGAG 

GCATCTGAAACACACACCGTGGCCTCCAACACTGCTTCAGATGGTTGAGACCTTGTATAATAACG 

ATTCTTTG CTGGACCCTGCGTCTATTATG AGTGGAAGACGTAAAG CTAATCCACAGGATAAAGTACATCC CAATAAAGCGG AATTC 

5 5 AT CAG AGC CAAGTATCAGATGTTAGCGTT CGTC CATCGCTTG C CCTG C CGGG ATG ACGATAGTGTG ACTG C CAAAG ATC TTAG CAA 

GCJU^CITCCATTCGAGCGTGAGAACAGGGAATCTTGAAACCTGTTTGAGACTGTTATCTT 
' CTGAAAAAGGAAACACCCCACTCCATGTTGCCTCCAAAGCAGGGCA CTGAATTATTGG CAG TATATGGAGCAGAC 

CCAGGCACACAGGATTCTAGTGGGAAAACTCCCGTTGATTATGCA^ 

AATACAGT ATGAG CTAACGGACAGACTAG CCTTCT ATCTC TGTGG CAGGAAAC CAGAT CACAAAAATGGACAG CACTTTATAATAC 

6 0 CT CAAATGG CAG ACAGCAG CCTGGATTTG T CTGAATTGGCAAAAG CTGCT AAGAAG AAAC TTCAAT CTCTAAGTAAT CATTT GTTT 

GAAGAACTTGCCATGGATGTGTACGATGAAGTTGACAGGCGAGAGACGGATGCAGTCTG^ 
GGTAACCGAGACAACGGTCGTCCCCTTTCTTCCGGTCAATCCTGAGTACTCATCAACAC^AAA 
GGTTCAACGCCCATGAGTTTGCCACGCTGGTCATTGACAI^CTCAGTGACGCCAAG 
TCAAAAGACAATGTGGAGCTCATACTGAAAACCATCAATAACCAGCACAGC 

6 5 CAG CGTGG CATCAGACGAAGACACAG ATTTGGAAACCACTGCAAG CAAAACAAAC CGG CAG AAGAG C CT AGATT CAG ATTT AT CAG 

ATGGACCAGTCACTGTACAGGAATTTATGGAGGTCAAAA 

AATAACAACTTGAGTGACGAG CTGAGAATTATGCAGAAAAAGTTGCTTGGAAAAGATG CT AAT T AATG AAGAGGAG CAACTACTAT 

TGGTGTATTTTTCACAGATTGGTGCTTTCTAAATAAAAATTGAAAGTAACTCOT 

TGATCTTCTGGTGCAAACAGTTGCTTGTGGACTTAAACCTTGGCACTGGTGGGGAATT^ 

7 0 AGTAGACMCTGAACTCACACCACACCCAGCTTATAGA 

AATGGACGTGGTGAATGGTGTTTAAAATGCCAGATGCCAAAGAGTAACACGATTCCCTGCT 
CACCATGGAGCAGCCTGCATGTGAGGAATGGAAGGAGCATTCAGGGCCTCCAAGTGACAGT 

G ATT AG CAT GTTCAAAGCTGACAC CACTG AGGTCGTCTTTrrTGGGTGACAAAGC CAAAGGAGAGAAAGGCCAAAT ATTC CAGCCC 
TGGCCGAAAGATGATCACTCACCAGACGGAAGCAAGCGTG CTG CG TGGAG CAT CCATGCG AAGATGTCAAT T C CATAG ATCAAT AG 
7 5 GTTTCCAGTTTTCTTCGTGATATGTTAATATAGCAAACTTACCATGATCCGGTTTTCTCTTTm 
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TGAATTGTTTGGAATATCAAGAGTATTATGTAATAAAAACTTGTTGATCAT 
HUMAN SEQUENCE - CODING 

ATGTCGAAACGGCTCCGGAGCAGCGAGGTGTGOSCTGACTGCAGC^ 
5 TTT ATGTGATGAGTG CTGCAGTGTCCATCGG AGTCTAGGG CG CCATAT C TC CCAAG TG AGGCATCTGAAACACACACCGTGGC CTC 
CAACACTGCTTCAGATGGTTGAG ACCTTGT AT AAT71ACG 
ATGAGTGGAAGACGTAAAGCTAATCCACAGGATAA^TACATCC^ 
GTTCGTCCAT CG CTTGC CCTG CCX5GGATGACGATAGTGTG ACTG CCAA^ 
GGAATCTTGAAACCTGTITGAGACTGTTATCrrTAGGAGCA 
1 0 GTTGCCTCCAAAGCAGGGCAGATTTTACAGGCTGAATTATTGGCA^ 
AACrrCCOSTTGATTATGCAAGGCAAGGAGGGCACCATGAGCTG^C^ 

T AGCCTT CTATCTCTGTGG CAGGAAACCAGAT CACAAAAATGGACAGCACTTT AT AAT AC CT CAAATGGCAGACAG CAGC CTGG AT 
TTGTCTGAATTGGCAAAAGCTGCTAAGAAGAAACT^ 
TGAAGTTGACA.GGCGAGAGACGGATGC^GTCTGGCTTGCCA.CGCAAAA 
1 5 TTCTTCCGGTCAATCCTGAGTACTCATCAACACGAAATCAGG 

CTGGTCATTGACATTCrrCAGTGACGCCAAGAGGAGACR.GC^GGGCA 

GAAAACCATCAATAACCAGCAC^GCGTTGAGAGTCAAGACAACGATCAGCCCGACTAT 

ATTTGGAAACCACTGCAAGCAAAACAAACCGGCAGAAGAGCCTA^ 

ATGGAGGT CAAAAACG CTCTAGTGG C TTCTG AGG CCAAG AT ACAGCAG CT AAT G AAGGTG AATAACAACTTGAGTG ACGAG CTGAG 
20 AATTATGCAGAAAAAGTTGCTTGGAAAAGATGCTAATTAATGA 
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Table 93 

MOUSE NOMENCLATURE 
ICSGNM Anp32 
Celera mCG10172 

5 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG23810 

10 MOUSE SEQUENCE - GENOMIC 

GTTAAT AAAAGGTGAATT CGTGTTACAAGATGGTTCACAC CT ATAGT CCC!AACACAAGAGAGCCTGAGGCAG CAGTAGT CAAGG CC 
AGCCTGGGAGCTACAGTGAGACTGGCTCAAAACACA 

GTTTCCAGGGACACTTGTGCACACGCAGGCACCACATGCCTGTGTGTGCAC^ 
1 5 CTGAAAGGCAGTCTTTCACTVGCTGTAATCTCAGCATTTGAGATGCTGAAGCA^ 
TACAGACTGAACTCTOT'GTAAAATTAAATAAAGAAATTGTC 
CAGATCTCTGGGTTTGAGGCCAGCCTGGTCTAGAGAGTG^ 

AGATGGAAGGAAGGAAGGAAGGAAGGAAGGGAGGGAGGGAGGGAGGGAGGNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ - 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NCCTGATGCTATATAACTAGAACTGGTGAGTAAAGATATTTTTTCCATCCAG 

GTACAGGGGATATGGCCATATATGATCGAAGTTACTTCTACAGTGGTTT^ 

GGGTGGAGCTGGGTTTTGGGTTTTTTTGAGATAGGTTTTCTCTG 

GGCCACACGTGTGCCACTACTGCCCGGCTGAGTGTGTGTGTGTGTGTGTC 
3 0 AACCACCTGCCACAGTCTGGGATGACAGGTAAGTGCCATCTTGCTAGGGTATTAGGTACTCACT 

ATCTTG AT ATTTTATATGGGTG TTGAAGATTTGAACTCTGGT CCTGACTG CCCAGCAAG CCAAACCATCCCCTATCTCTTTGCCTG 

TCATTTCTTGAGCTTCCnTATAACTCTTTATTCAGAGTTCATTGGGATGTAT 

AAGATATATTGTAAGTATAGCATTATTCCAATTCATCGTTGATCTACAAAATCAGAM 

AAAGCCTGTTAGTGAATTAGTGAGAACAAGCTACAGTAATGAATCAAA 

3 5 ACAGGCCTGTAATCATAGCTCTGTGGGAGGCTGAGGAGGGAAGATCAC^ 

AGATAGCCTGGGCTACTTAGAGAGACTGTTTAAAACTGACrAGTACA^ 

TTGCCTAGCATATATGAAGCCCTGGGCTCAATCCCAATACACAA^ 

TTTCTCCCTTCACCCTCAGTTTTGTCTTAAATTTATTT^ 

GTCATACGACAATTCTTAG TTCTC CCACCACATGG CT CCCAG AATCACCTAGCTT AACAG CAAGCACATTTAC C CAG CAAGTTAAC 

4 0 TGGCTGACTCCCCAGTTTCTCTCTTGATAGACAGATGAGTAAGTTGCTGG 

CTACGTCATAATACATGGTTGCCCCGGCCGTTCTGTGGCATCCT^ 
CACTGCCGTGTAGACTCTCCTGGTTAGAACTCAATTCCGTGATCCCATAAGCT 
ACTATTCACGAAGAAGAGAAAATGGATGATTTTGGGTGAGTAATTAGCACTCT 
GGATTCCTAACACACGGAGTTCTTGGTTGTAGAGAACATTTTGTATAATC^ 

GTGITTGAACACATGGTCTCCAGCTAGTGGCGCTGTTAGCCTTAGAGC^TC 
TTTGTGAATGGGTAACTTCAGATTACTTCTGTTTCTGGCTGTCTGCT 
CITTGGCCCACCTGCAATCAGGCCTCCCCTGCrrGTAATGAACTCTGCCT 
CGGTTTGGTCATGGTGTTTTATCACAGCAACCAAAACGAACTAA^ 

5 0 AACACCTTTTTAACCTTAAAAAAAAAATGAAATTTATTAAGGTT 

GCGTGTTTTCTAGCTGAAACACAATATAACTGATCTGAAATGGTGTATGAA 
CAATTGTGGGGGTCAGGCAAGTACTCTATCACTGATCTATATCCCTAGATTCCTTGAACT 

ATTTTTTATCCTCTTTTCTCAGC CTCCCAAGAAGCTGAGGGTATTG ACCTGGAG CACCCATCTAT CCAT TTCTACAAGCGCG AGCT 
TGGTTACCTTTTTCCCCAC^GGAATACCAGTTTTAATGATGTGTCTGTGGGT 
55 TTTCCCACAGTGTTCTTTAAGGTGTCTACATCGTGCCAGGAGCCCA 
TCCTAGCCTTCAGCTTTCGATCTGCACCTAAGTCTGAACC 
AGTACAGCAAACAGAGAAACAGGGATACTTGAGTTATATGTATCCTAACCnTC 
TGAACTTCAAGGTCATGTCAGCTACTCAATGAGTGAACAAGGCCAGCCT 
GGGCATTCCTTTTTTGTTTGGTTTCCTTTAGCCATTATGAAAGTGCCTGGGC!AGCAGATCCC 

6 0 ACACGTTTCCCAGTCATGCATGCTCTTCTGCTACAGCTAAC^ 

CGAGTGAAATGTATTTGTCACCCCTGCTAAATTTTCCTTAAATAAGTCCAAGTC 
GTAAGTGGCGTTCGGATO^TC^GCCAGTCTAGCAGAGAATAGGTGTCCTTCTG 
CATCGGGTCATATGCAAATGATTTCAGCACTTTTCTATTGGGGCTGGTGA^ 

G ACCJTGAGTTCCATCCCCAGGAGCCACATGGTAGAGGGAGAGAG CTCACTCTTTTGGGCTG CCCT CTGATATCCACACAAAAGTAT 

6 5 ACATTTTAAAAGACTAAACCC CC C CCAAAAAC CTTTAT ATT CAAGT TACAAT CAT CT AGAGTGTC CAG AG AAGGCTTAAGG CCTTG 

CAATATTC CTCGGGG ATGG CC CAACATTTCAAATG AAAATAT AGTACAAATTTAT TAATTTATCTGAGTG CATT CAGCAG ACTTCT 
AACTCCAGGAGACTAGTGCAGGGTACAACTTTGAATATTTGTCAGTTCGAATGAGAGTTGCT 

GGCAGACATG CCTCGG AAG CTAATGAAGAACAAAT AAAAATGTGCTAGJU^GG CT CTT GGAATAGTTT ATTTG CAT C C CCAT ATTAA 
TATATTCCATTTT AATATTG AG ACAAGGTCCGG AACTCACT ATGTATAC CAGG C CAGCCTCAAACT CAGAAATCTG CCT CTT CAGT 

7 0 GTTGGAATTAAAGGTGTGCACCACACCTGTCTCTTAATGCATTTTAACAATTTTCCTATAGAGGTGAGCTTCAACT 

TAAAAAATAAATTAAATACTCAAAATTAGTTGATATTACTCTATTCTTTTTGGTTT 

GCTGGCCTGGGACTCATTTTGTAGACCAGGCTGGCCTCAAACA.CTGAGTTTCT 

TCT ATATT CT TTAGTTAATTG CCTTTAGTTAATT ATTG TTAATTGACTTCAT ATTCCATTGTCTT^ 

TATGAATGGGTACTCTGTGTCTAGGAATTAGATGGTTGTGGGCCACCTGTGAGTGCTAGGGATGGAACCCAG 
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AATAAATGCTCTTAACCACTTGGCCAT^ 

C^CTAGCCACAAACTCACTAGGCAGCAAAGGATGGCCCTGAACTTC 
GTATGCATGA.CTTTGTCCACTTTCTTTCTAAGATTTAT^ 

GGTACCCTCAGAAATGAGGACAGGGTATCAGATCCCCTGTAACTGGAGTTACAGAT 

CTTAATTCAGGTTCTATGCAAGAACAGCAGTTGCTCTAAGCTGTTGAG 

TTATTTTTAAAAATTTCAGAGCTGAGCAGTGGTGGCGCA 

AC^CCATCCATAATGAGATCOSACACCCTCTTCTGGAGTATCTGAAG^ 

ATTTTTTAAAAATTCATTTGTATGGGTGTTTTACATGGATGTTTG 

ACCAGATCCCCTGGAACTGGAGTTCTAGACAGTTGTAACT^ 

CCAGAGrrCTCCATTGCTGAGCCATTTCTCCAGCCCAACTTAACTTTTAATA^ 

CCATGCTGGACCAACAGGAATAGGCTATCATGCCTGACTTACTGTTTTGCCC^ 

TAGGTAGGAGTCACCATGGTATATATTTTTTAAGTGACCTG 

TAACAAAGTAACTATTCAATCATTAAAGCATTTTAA 

TTTGATGACCTGATTTCAATCTCTGGGCTCCACAAGGTAGAAGAGCTG 

GTAGCACATGTCCCCAGCAAC^CACAAAAATGTAATATACACTTAAA 

GTGACTTTTTAGAArrTGGACTTCAACATTCCTTTGCTATTCAA^ 

AAAGAATAAT CTACATTT CTAAAAGCTATTT ATATGTAAGGACAG AAGAG CACACAGGCAAG CTGGAGC CTTTC CAGGTCATTGT C 

TCTGGGATCCAGGCAGCCAAGCTCCAGGGTAAGCGGAAAAGGGTTATGGCTAGGTTCCAGTCT 

CAACTGTGCACACGTTAGCTTGACTAATAAAAACCAAAGGGCCTGCATCTTCAGAGACTGA^ 

GCAGCAACGATGTGGGTATTTCTAAAGCAGGACTTCTCACCATGTTGGAGATGGT 

GCACTCACAGAGGACATGAGTACCTGTCCCAGCACCTATGTGTCCATTCACAAGTTCC^ 

CCGGGCTAGCTTCCTTGGGCAGCTACACTTATGTGAGCAGGCGCX3TG 

CACATAGAGGCTAAGGACACCCACTGCTCTTATCTTTGAGAAC^ 

GTGCCCTTCTTACTTGAAGCCCACTCCTTTCTGGGGAGGCT 

CTGCTACTGACTGCCTAGTCCTTCCCCCCTCCCTTGTTTTTTTTCAGGCAGGGTTTCT 
GAACTCACTCTGTAGAGCAAGCTGGCCrCAAACTCAGAAATCCACCTGC 

GCCTGTTGTAGTTTCAGTCTGTAATGAGGAACCGCACCTGTTCCAGGTCCATATAAAAATTCTCCCAAGAG 
TACTACCTTCCTGAAATACAGATCAACTGGGAGCAGTACCTACCACT 

TAATAATTACACACATTCATATATATATATATGTGTATATATATATACATATATATATATATGG1TCAGC 

ACCGAGTAAAAGCCATCTTTCAGTTTTGTGACACTAAATGCAAAAGCTGTTCTTCAAAATTTGTAA 

AAGTGATGTCTAAC&CTTTCAGTTTTCTATTC^ 

AATTCrrGTTTTCTTCTGTCCCTGTCACTTGAGCTCrrTGGCTTCTTAGCAGA 
TTTGAGAGTTATTCCTTGGAAAGGACTCTCTCACAACACrGCTGATAGCAGGTCCTCAAC 
TTTACCAT CCAAAAG AACAAG AACTTCAAG ACTTC TGCTAT AAT AGATTTTGTACACT 
AGCTATTATTGCTTTAAAAACCTTATCCAGAAACATGAGCTGATACATTGGGAAAGA 
AGGAAACAATACAATTGAAACATTCCAGATATACAGCCTTTGTAAAAATTAACAAAAATCT 

ACGTAAAGAAATTACAT ATTT AC CCAGATGAAATG GTTTT ATTTTGTACTT ACAG CTACTTAG AATAAACATTTATCAGCTTTCAA 
ATCCATTTGTTTGATATAAAAGAACTAACTAAATAAAATACTTAGTTCTAATGTCT 

AGAAGCGTGTAGATGAAGACACATAGGAAAAGACACTATGATAAAGTTGTGTGAATAAAATTAAACATGTATC 

ACAGCCGGACATCCACATAGTACCATGAGGTCCGAGGGCTGCCCATAAGAAGCTGGTAACCCAATC 

TTCTTAAAAAACGTAACGTGGTCGAACAAAAACAAAACCCAAAGCAACTGTAAGACCT^ 

TGGCAATGCAGATCCAACGAGACGATTAGAATGAAACGTGCAACGCTCTCTC 

GGCTCTGAGAATCGGCAGTTCCCAGCACCTTCCTTTTCCACTTTTTAA^ 

TTAATTGCCCTATACAAGATCCCCCCGATGACTCTCGGGGAATTGCTTCrTTTT^ 

TCGAGATCGCGCCCTCGCTCCTTTGGATGTGGGAGCGTCATGCACTTAAGACTTGATTATTTAATTGCGGAGATGGGAGCAGACCG 

GTCGGAGGGGTTGTGCCAAGTGGGCACCCCGGGGTGTGTGTGTGTGTGGTGGGATTTGTCeiTC^ 

CTCCTTGGGCCTGCCTTCTGCCCGCGCGCrc 

CGraCAACCACCAAGGTGCAGCCGCGCTCCTCTTCGGCGGCCC 
TGAGCCTCCCTGGCCCGCTGCGCAGGATGCCCATTGGACCCT^ 
GCCATTTGATTGGCTCTTGATGCCG^CGCTCCAC^^ 

GGTGCTGGGGGCTCGAGAACCGAGCGGAGCTGGTTGAGCCTTCAAAGTCCTAAAACGCGCGGCCGTGG 

G AATT CCG CTG G CG CAGGATCCT CTG CAGAAAGAGAGAG CG CG AGAGATGGAG ATGGACAAAC GGATTTATTTAGAG CTGCG G AAC 

AGGACGCCCTCTGATGTGAGCATTTGCCAAAAATATCTCTGTCGGTCCATTCrcCTACTrTGTGTTATGTGTGTO 

GGGGGAAGGCACTTTTTTATGGAGGGGGGCTCCGGGGAAGGTGTGGGGTTTGGGGTCTGACTGCAA^ 

GCGGGGGTGGGGGGGTTGCAATGCCACCGGAGCATGTTTCGTCCGCCGTCGCTGCTGCCCTTGGTGTGTGACTGACTAC 

TGTGCTCCCGCGTGTGCGGGTATGGGTGCGAGGGTGTGCGTGCGAGTGTGCACCCAGCGGGGGGCCGGCGGGGCG 

GTCGGAGGGAAGGCGGCAATCTCACCCTCGTCCGAGCTCTCAGCTCCCACGACrrTG 

GAGAAGATGGGCTGCCTCCTGGATTAAACAACAAAAAGGGGTTGTTATATTTTAGAAGCGTAACCCCCTTTCCGCCCCCTCCC 
CCTGCCCCCTCCCTTTGCACTOTCTTTCGCTCGCTCGCACACAATAAGTTTTG^ 

GTAGCCCGAGGACCCCGGGGAGAGCGCGTGGGGACCCGACTGCTGCGGAGTTGGGAGATGGCTCGAGTTTAAAGCGCGTCTTCGTT 

TCTCTGCGGGGATGAGGCGAGAGGTGGGGAGCCATATTTGTAACTTTGTCGTGTTCGTGAACX5CGC 

TTTCTCTGTCTGCCTTCCTTCTCCTT^ 

CGCGGAAAGGGAGGGGGGGTACCTTAGGTTAAGGGGGAGGGATTGTGAAGTGTGCAAATGCTCAGATAATCCTCGCGCCTCC 
CCACGAGCGTCGCTAATGTAAGTTGCCCGGAGGCTGCGTCCCGTTCTGGC^^ 
TGACGCCTCATCGGCAGCAGACACCCCACCTTCCCGGTCCTTGCT^ 
TAATAAATGGGCACGAGAAACTTTATCAGAACCCGAGCTGCCCAGAGCTAGTGCC 

CGGGCACGGCAGGCACGCAGCTCTGGGATGGGCAGTTCGTCTGGCCGTAGCCGTTATGGATTGGGCAGCGGCGGGAGTATC 

AGGAAACTTACACCGGTGGGAACAGGGAGGCTGGGCGCGAGCCCACTGCAAGGGGAAGTGAGGGTTGGTGGGTGACTCGAAAGCT 

CTGTTTATTTTTGTCTCCCTCTCGCTTTCACGC^CTCGTGTTGTCTGCGAGTTGACCCATT 

AAACCCTAGC CAACTT CCCCTCTAGGCTCGGTAGTAGGGAG CAGGAGCTCGCGGGCGGGAAAGATCTCGGGTCTAGCCTGAGGGTC 

TGGGGCTCGGGCGAGTTGGGGGCTCGCGTAGCACGTTGTGCTGACCrrGTCGCCCCTCCAGCCACTTGGCCCCTCGAATCCTGACAA 

AGGCAGGCCTCATTTCCTCTTTTCTCCTAATTCCTCCTTCTAAACCTGAGTAAGACTTTTGA 

CCCATCCTCACGCAGCTGTGCACTGGAGACGTGCTACAGCCTTATGGCGCGC^ 

CTTGCCGCGGGGGCTGCCTGGTrCGCCCGGTGGCGTCCCGA^ 
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GGTGCTGGGGGGAGGGGGCCGAACGCGACTTCCCGCGTACTTTCCGG^ 
GCGAGTAGGTGCGCAGCAGCGCCGGACCCGACCGAGCG^ 

GG CTCTTT AAACTTCCCCG AGTGTGTG CG CTGGGGGAGGGG CGGGCG CTCTGGG CTGGG CGCG ATTT CT CACGCTTCGCC CCCTCG 
CCXrrGAGTCTGCTGCCCACTCGCCGGCCTCCGGGCrTATGGCGGGG 
5 ATGGGGGAGGGGA 

GGTTTTTGCTTTTTTCAAATAAACGTTTTTGGAAA 
CTTCTGC^CCTCTCTOrCAAGTCGCATCTCGGGGACTGGTGG 

GCGCCGGAGGCCCACCCCTCCCCCA03GCGGCCGCGGACCGGGCTCCGCGGGGACa31^ 

ACACAGGCACAGTNNNNNNNNNNNNNK^ 
10 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNIWNNNNNNNNNNNNNNNNNNNNNNN^ 

NOTOJNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNl®^ 

NNNNNNNNNNNNNNNNNNNN^ 
15 GGGTGAAACTAGAGGGCGGGAGGGGCTCAGGGCTCTTTAATCTGGTC 

AGGGACGAGTTGTTCGATGCGCTGGGGAAACGCCCTCCGGGGCTC 

AGC^CTCTGCAGGCTGAGGGAGCTCTCTCGCTCTTTATCCAGTTCCT 

GCGGGGAAGGTAGGTAACTACATAGTAAACGCTACCTTAAACCGAAAGAGA 

TGTCCITTTGTATTGCCITCAGACTCGGGCAGTTTGTTCCTA^ 
2 0 CGCAGGCTGTAGGCTTGGTGTCTGCATGTAGAGACTAACT^ 

TTTGACC TCTGGAATG CGTTG CATTTTTTTTTTCTGT CCAGG TTGACTTACTG C CTAATTG GG TTCG GG AGAGTGG CCTGGCGGTT 

TGCTCCAATTTATGCAAGCCAAGCGGGGATTGGGGGCAGAGAACTGAATGTGGCCTTGGTATGGGG 

CTGGCAGCTGCTGAGGCTTGTTTTTGATCTGAAGGTTCCATGGTGAGGCT 

CCAGCTACAAGGAGGTAGGATCCTGCCAACCTGTGGTCTGGGACAGA 

2 5 G T AAGACTGGACAGTTCTGTTTTTATCAATCAATAAGTCACT CC CTTCGCAGGTG CTTGATATAGTCGCAG 

AT ACTGGGGAGTG<3GGAGG AATAATTTAGGGCCC CTTG GAG GTT TAAAAT AACAACT CCAG CAATAATAG CAGATG CTCACAT AAC 
TCCTTACTGTCTCTGGCATGTTGCTAAGTG 

T ATT CTTATACT AAATGCAAGGAAATGGAGT CAGAACTCCAG ATCACCGCCAGTCGGTG CAGAACTGGGATTCTAACTCCGTTAGA 
CTCAGCCTCTGTACTGCCCTGAAGCACCACGGGAAGGACnTTTTCTCCrrCCAGTTTTGGAGTGA 

3 0 GGGTATGGGTTTGTTGGTGTTGAGGGCACAGAAACAAAGCATAGCrTCCTGGTGGAGCGATAGG 

TAGGCATTTTAAGGGGGAGCCTGACCCTGGTTTGGAATATTCAGTATACACT^ 
AGC CATTCTTAGTGGTTAT TTAGG TGTCTTATATAAGT AGT TTTTTCATT AGC CTTT C 
GAATAGACAGGACCTTAGTGTTGGGATGTAGCCATGCTAGTAATACTGCCTAGAGCCACT 
AAGATAGCCAACCTCTCTGTTCTGTGGATGGATCACTTAGGACTCAGAGTCAGCCT 
35 CAGGTTCTGTAGTTGACCAAGGCCCATACATGACCrATATT^ 
ACCCCCTGGGTGTGGGCTTAGTACCAGGTCACCGT^ 

AAG AGATTTGGGAAT CTGGGAG CGAACC TTTG AATGATC CT CCT C CCT ACAAAT CT CG CT CGTCGAGTTAACAAAGTTGTATTAAC 
. G AGAAAAAAGAT TGT ACTT ATTTTG AACTTGAT CAATAAAAGAATTAAG AGGG AG ATGTGTATCC TTTTG TTAT AAAAG CAC CTGT 
GTTATGTTCTCAATTTTGCTTCTTGAAGTTTACTACCTGG 

4 0 CTGTGCTTTGTTCTTTTAGGAATGTGGCAAGTCCTACATTT13 

AACAACTTTTGAT C C TCAT AGAAATGGG CT C CCGTTCACAGG CCTTCATG AG AAG TTGTTCT CATTC CTGGCTC CCTATGGAG CAG 

TCCTCTCTGTCTCAGAGGTAGAGAGGAGGAAATTTAAGTCCAGO 

TAGTGAGGGTCAGGACATATCACCACAGTCCCCAGGTCA^ 

GGGAGACAGCACACAGCAAAGCAGAAATGGCTTTGGGTTCAAAACCTTTGTCTAATTGCCT 
4 5 AGTGTGTGGAACGTGAAGTTCCCGGCAGCAGTGTCAGACCCTGTCCCCTCCCTGGGCTGGG^ 

ACTTACTTGGGACTGGTGCAGTTAGGCATCTTCrrGTGCAGCACTGTGGGTGACCTGACTCAGTACTAAC 
CAGGCAGGAGCTGTCTGCCAGCTGTCTGCTCTCCTCT 

GGTGGGGGTCTGATGGAATGAACTTGGCCTAGCATTCTGGCCTGAATTCTGACCT 
AAGGGTGTITTCCTTGGAGCATCCCCTTTTAGGCTGTGTCTCCTCGGACCT 
50 ACAGCTAAGAAGAAAAACATGAACCAGACTTGAGTCTTTTTTTTTTAAA 
TTTATTTTATGTATATGAGTACACTGTCACTCT^^ 

CACCATGTGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGCAGTCCGTGCTCTTAACCTCTGAGCCATCT 
ACTTGAG TCI^AT ATT ACATTGGCAT AT AATT ATACAACAAAT AC TGAGTT AAAG AG AGGG CT CCTGACTGGAC TG GAGGTAGTTA 
TTCAGAACATTTCAGTGATAGAATTGGTTGAAGCCATGTGAGCAAAACACCCATGGCCTCAACACGTCCAATTCC^ 
55 CTGCCTCCTAGTATAGAACCACGTGTCCCGGTGACACCGGTATCTTT 

CGGGAGGTCCCTTCAAACTGATCTTGTTTTTACrCTGTTCTGTGCAACTCCCCT 
TACCAAACTGGCTCCAGGCAGCAGGTCTAAAAGATGAGCCTACT 

CCAGAGGTATTTG CAT TTTGAGAGT CTGGTTTAGTG AGGAAATT ATTG AGAAG AAC CATT AAAG ACTTGG ACGCACTT CTCAG CTT 
GGTCATTAACrTGAGTTGGACAAGTTGTTCAGCrTCT 
6 0 GTGACnXSTTGTACTAATTGTAGTAACACAATTACTAGAGTTCTTAT 

GTAAGGCAGATGTAATGCC^GCACrTACTGCAATC CI^AGCACCT C^GGGGTGGAGGAGGGAGAAGTTGAAGGTCATTCTCTGCTCC 
ATATCAAGTTCAAGGTTAGCCTGTGCTACATGTGACCCTGTCCAAATAAATAACCTGAGGGT 

CAG ATTT TTTGAACCCACT AGTAACAGGATT CACAC CGTTGCT CAG CACATG CAT ATT ATTT TTTTAAACGTGAACACTTTTGATT 
CACTGAAAAATATGATAGCTCAGACTCATAAGTTGATTTCACTCT 

6 5 CTTCCACAGTTTTGCCTTTTGGTGTGGGGCTGGAGCTTGCCCTCCCGAGCCTTGTGTAGTG 

CCCAATTTCCCAATTTCATGGCAGCTCACTGCTGG<3TCCTCT 
AAAGTAGCCATTAGCTTAAGTGTGTATTCTCCATGGTGCCTGCAGTTATT^ 
CCCCTCCTGCTGTGCACTATGAAGAGCAGCCACACTTTCTTTTACT 
CTGTCTCT CACT CCCAGTCCTGGCGAJGTTGCCTCACAGTGTGGACTC 

7 0 GCACCAGATGCX3TCACTGCCGATGTGCTAGAGTGTGTGAGCTGGCAGGCGCTTAAAAAGAG 

GTGGTAAAAACACTTGCCACGCGGCCTTGACCTGAGTTCAGATCCTCAAAACCCACATAA 
GGGCAGCTGGAATCCGGGGCTCCAGACAAGAGACTAGGCAAACTACCC^ 
GTCTCAAAAAAGGAGGAATTGAAGGACCAGCACCTGTTGTTGCACGTGTGTGTATACACACA 
TACACACACAAAGGCTGACATTAAAGTGOTACTTGTTTCTGTAG 
7 5 TCTGTTCGGGATCAGCTGCAGCAGTCCAGAGCAGCCCAGCCCTTGGCTCTGCACTGCCTTCCC^ 
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TGOTATCCACTCTATCCACTGCCCTCTCTCTGG 

TCCTGGTCCTTCACCCTCTCCCAGCCTGC^ 

GTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 

AAAGGCAGTTTGGGGGAGTTAGTTCTCTCCTTTCACCATGTGAGTT^^ 

ACCTGTTGAGGCATCTCTCCCCAACTCCAC^ 

GCTTTAGAGAACIX^CCTGGACGGGGGAAGCA^ 

AACCTTGCCTCAAACAAAACAAAAAGACCAGACXCCCT^ 

ATTGTAACTGGGTTTACCCTCCCTGTCT^ 

GTCCTGAGGTATTTGGAGTCCTAGACTTrGGTGAAAAAAAAATCCATCT 

GCTTCTAATATTTTAAACT CAGTTGGATGTGGGACCCAGGTGCCCTT ATTCTATCCTT CTTCCCT AAGACAGC^ C T CCTT AGTGG 
ATGTGTACGAATACCACATGCTOTCTGGGCTGGGGTGTTTA^ 

CAGATAACTTCCCCATACCACAGTTCTGGTC^ 

AGCACAGATGACGGTTGACAGTGTCTTTTTTTTTTTTTTTm 

CCTTAGTACTCTAAGTCCGCCCCTGCTGGCCTTCTAACTGGTCTTTCCTCT 

CCAAACCACTCCCATCCCACTTCGGAGTTC 

CTTTCTTAGGAGTTTTCCTTGTAATTGCACATTTCT 

AGGGACCTGGGTTTCCTTCACCCCTGGGTATCCCCAGGG 

ACCTGTACGCACGTGGTCCGGTCTCTTrrCTGTTATTGTTCACTACAGGG 

TTTGTAGATGGGGCTGGCCTCAAACTCACAGAGATGCCTGCCTCTGCCT 

CTGGGGACHXMTCTCTTCTTTATACCCANl^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNN ^^ 

NNNNNNNNi> ^^ 

NNNNNNNNNNNN1 ^^ 

TTGTAGTGCCAATTGGCCTAGAACTCCCTATGTAGACCTTGAACTCACAGAGATCTGCCT^ 

ACAGGTCTATGCTATCACX3CCTGGCTTGCTTGOT 

ATGCCAAACGCTGTCTACGGGCTGAACGTCCCACCCCT^ 

AGCCTGCCTCAGGTACCCAGCTCATCCAAGGCCACTGCTGCATGGCAGCT^ 

AAAGCCCACTCTGATGCTCTTGAGGCTC 

GTGAGAGGCCCCATACCAGAGGCTCAGGACGAATGATACCTAGGTAGITTTTTTT^ 
T™^^GGAAGGTTC^ 

TAAAGTCATGAGGACCATGG GGGTAA GGGGACTGGCAAGGTGGAGGCCAGGAC 

GGGGAAGGGATTCAGTCCTCCTTTTTGCAACTGGTCCACCATCTTCCCTTGACTTTACAGGAAACC^ 

GGAAATCAGAGGACCCCCGCAGTCATATTCCCTGTCCGTCAGTTGGGTTCAACGTACTGGCCTTC 

CTAAGCCATGTGCACAGCTGACTTACTTAATTGCATT^ 

GAGGGCGGTGATGCCCATGTCTTGCGGTGGAGCCTGTGTGATAGACAGGAGGAAAGAATGT^ 
ATCTGGCATCAGAACTACCAGAGGAGCATCTGAGGGACTTCCCAGGCTCTTCC 

AATG CATGTGTTTTTAAAGC CCT AGGGG GGATGATGC T AAGGCACACTGAAG CAGG ATTCAG ATCTTCACAGT AG AAG CAAG CT AA 

AGGGAAGCAAAAGAGAGTGGAACTAAAGAATGGTCCAGGCTTTTAATGTGCTGGGCTGC^ 

TCCCCTGCACCTCCCCTCCCCCGCCACC^ 

CTTGTATCTATGTACCTGTGTTCCATGAGGCTOCTGCTACCAGAGTATGACCAAGAATGGGCCTGGTTGGG^ 
AGTGGTTAAAAGCACTGACTGCTCTTCCAAAGGTCTTGAGTTCAAATCCCAGCAACCG 
ATCTG ACAC GCTTTTCTGGTGTG TCTG AAG ATAGCT ACAGTGCACTTATTTAT AATAATAA^ 
AGAATAGGCCTGGTGAGATGGCTTAGTTGCTGAAGTGCCTGCCCTCTGAGCATCAGGACCCAGATGCCCA 

TCTATAATACCi^TACTAGGGGCTTGGCAGAAACAAGAGGATTCCTGAGTGCACCGG CTCAG CCCAGTCACTGAGCTCCAGTTTCA 

GTGAGAGACCCTGCTTTAGAAAATTTGATGAAGAGCTATTGAGGAGGACATCTGATGTTGGTCCACACCATGCT 

CTCCTGGACTCAGTGGAGGGAGGAGACCCTCTGCGTCTATCCAGGTGGCATACAGGATGTGAAGGCGT 

TTCCCATCGTGTCTCACGTGCTGGTTGCTCTC^ 

TTATGCTGAGTCCAGTGTGGCTCCCCCACAGCTG^^ 

CTTGTGAGAAGCAGCCCCATCCATCCACATGCAGGAGTG 

CGTCGAGCTGTGAGTGTGCTAGCCAGCCTCCCCTCGCCAGCT 

CGCAAGCAGCTACTTCTCTGAACACCGGCTAGTTTGCGGCTCCATTACAGA 

TGTAACTGTTCTTCCAAGGAGGAGTCTGGGGTGGTTTCTGGAGCAACACT 

ATTGGATTGCAGGCTGTCTGTCCTAAGCCACAAGGGTATTGAATTCAGACCATGTTTGTTCTCTGG 
TCCACTCTAGTGGCCTGTGTGGTCATCGATAGGAGTGGCACAGGACATGTTTACATGTTGACCAGTGGCT^ 
GGGGCGGGGTGCCCATTGCACCAGAACACAGACTTGTATCTTCT 
TGGTAAGCAACGCTTGGCTGCTCGATATTTGACATTA^ 

GCAGGTCTG^SAATCCCTTCCCACTGTGACCCCCGGCCTTGGTGGTTTCTGTTC 

GTCrecreGGTTTTTTGCAGGCTGTCTGTATATGACAGTTATC 

CATGGCAAAGGCCCCACTAAGTGCCATGAAAACAAAGCTGTGGCTCAGGG 

CATAAGAACACTCAAGAGAT CCCAGGAG AATAGGATGTGGATTGACATG C C TG TGACACCAG AGG CAAGGCCAAGAGG ACATGG CA 
AGACTGAGACCX3GGATCGACTAGAGGCAGTTCCAGGCTAACCTGGGCAACTTAGTGATACTCTGTGTT 
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AGAGCCGGGCTGTAGCTCAGCGGCGGAGTGCTGGCCTAGCCGCAACAG^ 
TGATGTGTTCCATCACTAGGGAGGTTGAGTCAGGAAAATTCAAGGA 

ACTGGTACAAGAGTGGGAAATGCIAAAATTATGTTGGGG CTGG CCAGT ACT CTGTG CACTGCAGCCTTAG ACAGGACCTTGCCATGG 
TAGAATGT CTTGTTACTCrrCTTAAGG CTCAGACCTTAAAT AATCTGA^ l"l ATTTATTTA 

5 TATATATATATTGTGTffrGCGTGTGCCTGTGTGTAGGGGGTGGAGGTTCAGGTGCT 
GCTGCTGTTAGTCAACGTGTTGATGCGNNNNN^ 
NNNNNNNNNNNNNNNNNNNbD^ 
NNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^^ 
1 0 NNNNNNNNNNNNNNNNNNNNN^^ 

TATGAGCTTGGGTTCTCCTGGTGCTCTCAGGATGTGCCAGGTTCT^ 
AGT CAGGTITGTGCTCACAGATGAGAAACATAC CCAGAGGTT TTGACCTA 

TAAAAGTAGG CTAAGGTGTAATT AGGGGC CTAGGGCCTGCCTCCG CAAACAGGAGCAGGG ATCX5GATAGGCAG CCTCTTT A CAGG 
AGCAGAGTAGCACCCCTGTGACrCGGCGTGGCGACCGTGTATGCAGAGCACCTCCCT 
1 5 TAAGAGGTGGCTTCCT TGACCTTT ACTAAGTG C CTAGGGTG AGTTT GCGGT AGTGCTTGTTGTTAGAT ACTTGGTAAACATT AC CT 

TTTGGATCTATTAGCACATCCCGAGGTATAGGCTGGGCCIACTTGTTTTTGAGATAAGGAA 
TAACTGAACCTCGATCCTGTGTTGTGATGCCCCTTCCAGA 

TC^GGAAAGGTGATGAGGCCCTTAGATTTGTAGGACTTTGAGGACTGTGACCTT 
2 0 TCC CTAAGCCAGGGG ATATAAAG TGACATGTGC CAGCTT ATTGTATTCCATGGTTC CTTT ATTACH C CAC C C 

AAAACACCCCCAGCCCCAAAGTCAGTGTTATAGGTTAGTTTG 

CCTGGAGATAGGTGGAATAGTTACTGGGCCAACAGGGAGCATATATAGTGGGGAAGT CAGG CCTTAGGATGTCTTCCCAGGTTAGT 
TTTGATGCCTTT CTGATCT ACAGAATTTC CT AGGTG CTAAG CACAAGGAGG AGT AGGGGACTGGGG CGGACAGT ACGTCATCT CTG 
TTCTCTGGGTACAGCATCTTGCTTGGTAAGGAACATGCTTGTGAAATGTATTGCAGTAATG 

2 5 TGCCTTCCTACCCAGCATCTAGCCTCGTTCCCTGGATTCCTCTC 

GTTGGCGAAGTCTTGACCAAAGCCACAGTTCTGGGCTGGGGCTGTATGTAACTCAGT 

TTGGGTT CATCT CCAG CT CAGCTGAAA.C CGGATACG CC CCTGACTT C C TGG T ACTTGG ATAGAAGAG ACAAG AGGATCGG AAGTTT 
CTTCTTCAATTG CAACAGAGACTTTATGACCACCAG GC CT ACATGAGACCCTGTCTCAAAACGAC^ 
CTTAGTAGAAAAGTCTAGCXSCCAAACCAGACTGACT 

3 0 GTCCCCATCGTGTGGCAGAGGCCACITCCAC(!AGAAAATGTGGCTATTTGCAGC 

GGTCAGAGTTCTGTTTGTCTGAGTCACTTCTCTCCTTCCCTGTGTGGGGTCCAGGGATTGAACTCAGTTC^ 
CCCACGGAGCCATCTCTCCAGCCCTCCCACAGACTACCCAGCTGCAGCGTGTGGCCAAGCCTGGTTCCC^ 
TAGCTGCTCCTTCCATTGTAAGAGCCCTCCCTGGTCAGATACT 
GCTGGCCACCCTCCAGTGCTGTTCTCTGTCACTGATGATTTTCTC 

3 5 • TTAGGATCTGACTCCCCCTGCACGTOMAGGGCATTG^ 

CATACTGGGCCTGTGGTCTAAGCAGGTAACTTTCCTCCAGAGGAAA 

CACACAC CT AG CAG CGAG C G CTT CATAAG ATATT ACTCATCCAT AT AGCGGGTGC T CAG CACAGTTTG T ATACAAAGC TCTT CTTA 
CAGATAGTTGCrTAGGGTTTTGATTAGCGTGGGTCTCCCACCTTTCTGCCTAAGCATCCCTAGTTCCT 

4 0 TGGGGTTTGGGACGCTCCC^TCAAGTGCCTTTG<K3TTTTTCCC 

GCTCAGACATCGGTGTCATAGCATCATGTGGAGGTCAGAGGACAACrrTGCAGGAG CTGGCTGGTTC TTTCCACATGTATGCTTCAA 
AAGTCIAAACTCAGATCATTAGGCTTGGTGACAAGCAGCTTCAGCCACCGA^ 

ATATAATAAGGCTTGTTAGTACCTGCTAGGCAAG CATTCTATCAGTGAGCCACACCCCCCACCATG CCTGCTCTTTTTTCTCTCAA 
CCCIAGCCTGAAGAGCTTAGCTTTGTCTGTGTGGG CTGCAGTGACTGAGGTAGGTGGGGTGTGCTCT CTCGGAGACTGGTCTCCCCA 

4 5 GCATTGTCACCTATTT CCAG CCAACTAGCCC CTCCCTGACT CTCAC^GTCCTAAAAGTTGTAACATTTTGTTCTTTAGCATTTTAA 

GATTAAAGCAAGCCaMTAGTGGTGQTTCAGGCCTTTAATCCCAGC^ 

CAGCCTGGTCTACAAAGTAAGTTCCAGGACAGCCAGGGCTAC^CAGAGAAATTCTGTCATGAAAAACA 

TTAAAGCAGTTGCTGGGCAGTTGGTGGCGCACGCCTTTAATCCC^ 

TTCAAGGCCAGCCTGCTCTATAGAGCAAGTTCCAGG 

5 0 ATTAAAGCAGCTATAGACCTGTGAACAACAGCTAA 

AGCAAGACCAGGCGGAGAAAAAACGGCACTCTGGAGCTAAACCAAGCCAGGCGT^ 

AGTTT TAG CAT AAGTATT TGATTTG TCC ACTTAGGCAAGTTG AATGCAT C CTGTT AGGCT AGGG GGAAAGGGCTG ATTAACTCACT 
TCTTATGCATTACCGTGGCTAATAGGCGAGGCTCTTCTGCGTGATTGCTGCCCAAATCATCrGCrCCCC^ 

5 5 AGACTTCAGCACCAGGAATTGCTCAGCTCGTGTGATTGAGGAGGCTGCTTGTCA.GGTCTG 

GGCAAAATTTCTTCTCT'GAGGCTATTAGACTTCTTGTCTATTTCC^ 
AGCTCTAGTCTGGGCACTACCCTCTCCGTTCCTTTTATTTGTTCCAGTTTTCTCCTC 
CCCTAGGCTGAGAGGACGGGGAACAGCCCAAGGTTTCCATC^ 
TCATTATGTGGCTGAGGATAGCITTAACCTCrTGATTCTCCTGCAACTACCT 

6 0 GTGGCTTATCTGGTGCTTGGGATCAAACCCAGGGTCTCATGTGTGCTAAATAAACATTCTACCA^ 

TAAGTGTGTTTGTTTGTTITGGTTTTTGCAAA^ 

GTGAGTCTGAATGCTTTTTGCCTTCAGGGTTTAGGGTGGTCACAGTCCTTGAGTTGCC^GTCTGTGGGCCT^ 

GGGG AAGTGAT CT CA.CTTG TATCTTTCAGTCAACAT CTCAAGGT AG AAACCC CAGGG AGTGTGCC CAGCTTCCCACAACAG G CAAC 

CATGAGCCCACTAGTGGTCACAGCATCACTTAGGTCTAAA^ 

6 5 CACAT CTG CT CTGC CCAAC CGTG GTTCG AGT AGAAAGGG AGT CGG CCATCTAGAAATTTCT CTAGAACGATTG C CCAC C T CCAG CC 

TGAGTGGCTTTGTTCCCATGTGACCAGGAGCACAGACTGATGCCAGCCCCTCTGTC 
TAGACCrGTCAGAGGCTCrCAGAGGGCCTTGGTGGACAGGAACATGCTCCT 
ACCTGCMTGTCATGTGGGGTGAGGCATAGACCGCTAGCTGCCITGGGAAGACGTGTGAGGT^ 
GATTTGCACTTGACCTCCATGACCATTAAAAGTTTTGGGGAGGAATAATGCT 

7 0 CCT TCTGAAT CCAG CTTTT CTATTGCACAGGGACAG TGCTG CAG CTCTTT AGTG CGAACATAT CAACCACCGTAGGTTTTT ATCAG 

GAATTACACTGTGGGTGAGTGAGGTGAGGTGTGGGGTCTTGTTAGTGGAGGTGGTTTCTGCCra 
TTGGGTGCCATTGGGTGCTTGCTCTCTGAGATGACTGATGACACTCCCACATTTGGAACCTC^ 

TTTTATAATCGTGGTGTGCAGAAAGAGCCTAGCTGATCCGTTTCTCCGTC - 
TTTGCACGCTGAAACCTGAGCTCTTCCACXSTCTAAGGTCCCTGTGCCT^ 
7 5 TGGAATTTTAGGACCTCATCTCTTTATTCCAGCCCCACrrGCCCAGTGTGTAGGCTCGAGGGAGGATG 
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GGAAGCAACTTCTACCAGGGCACACAGC^ 

T^CTAACCGGCCCTTTTCGGCTATCTGGACTAGTGCTCTGCGGCTG^ 
GTTAAGTGAATGGAGCCTGCTACGTTTGACCATCACCrrc^ 

AGTTGAGCGGGACCMGGC^ 

ACTTGGGGCAGGAATTCGCCTCAGCAGTTAACAGCGCTTGCTTGCTACTOT 
TGTGGCTCACTGGCGTTTGTAACACCAGTTTCAAAATATCTG^ 
CAAAGCACTCATAACACATTTTTTAAATCTTTTTTAAAAATTAAAGATAA^ 
CTTTTGAGCAAGCCIX^CAGCACCGAGCCTTTC 

CGAAATC^CCCGACCGCGCGGGGAAGAGTGGGGAGGAGCCAGGGACAAGAACGA^ 

CCTGCCCTGGCAGCTGGACCTATGAGCCAGCCTGATCTTCCTC^ 

TGCCCGCGCCACGTGCTCCCCCGCCGCCTGCCn^GAGCTGCTTTGTA 

ATTCTCGGGGAGGGTTTTGTTTGTTTTTAAAGCGAAGAAGGCT 

CCCTGGCAGCCACAAGGCTGCAGGAACTTGGGA^ 

AGATTCTGACCTAGGACAGGAGACGTGTTGTGGCAGAAAAGAAAAATTAATCACCCGG^ 
CCCCCCCCACCCCCCCCC^CCCCGGTCCTCTGCAGAACATC^TCTCTCTGGCCCAOT 

AAATTGAAGTAATAAGAGGGCTGGAGAGATGG CTCAG CAGTT AAGAGCACTTGG GG CACTTG CAGAGGACT TGGATT CTGTT CTCT 
GGACTCAGATCCGGCTCCTCACAGCCCCCTGTA^ 

CACAGAGACACATAAAT AAAACTTTTAAGTTTT^ CCTATGTGTAG TG TTT ATTTTCACCG A 

GTGTGCCT ACAG GCAAACAGTATGTGTGTGCTCTGAGTAAAGT AGGTTTTGTTTGTTTG CTTG CTTTTTGTTTTATT TG TAT T AAT 
GTGGCTGTGTAATTAATGCGTATGCATGTGCCTGGGGAGACTAGAGAGAGTGC^GAATCAATCCAC^ 

ACAGGCAGTTGTGAGCCrrCrGACCCTGGTGCTGGGTACCAAACT CAGACAAGAGCTGCAAGTGG CTTGCCTGCTGAGCCATCTCTC 

CAGCTCCTAAAATGTACCCNNNNNNNNNNNNNNNNNN^ 

CTTCTAGCTAGGAATGGGATCTGCCAGCAGGGGGTGGCCCAm 

GGGTCAATCTGATTTGAGTCTTCTGTTCGAAGCCTGGAACCTGGAACCTA^ 

CATATATTTATCCAAT/TCTTGTTCAGTAAAGGAAGCACAAATTATGTCTGGAGGTTG 

AGAGGTTGAACCCAGAGGGTTTGTTTTGAATGCTCTC(^CAGCTCC^ 

GTGGCAGGAAGGCATGGAGGACTGGGCCTGGTTAGGAGCCTTCATTCCCTGCTAC(^TCTTC^ 
ACTGGTAAAGGGGATGGTGGGTAGCTTTTTTTTCTTCAAGCT 

GGT CGTTGG CAC T AGAATTCTGAGAGGAAACTGG AAG CC CAGAG AGGTGAGAAGGGTG AGAGACCAGTGGTTCC CACTCTTCTTAA 
CTGATTGCAACCATACAATTTTTTGTTGCTACTTCATAACTGTAACTTTGCTATAG 

GTCTTAGG GAG AC C CCTGTGAAAGAGT CG TTTAACC C CAGAG AGGTGGAGACCTACAGGTTAG GAACAGCTGGGCTAG AG AG AGGA 

ACTTGGACTAGAGCCIAGGTCTGTCTGGCrAGAGTCCACCTTCTGCCCATCCr^ 

GGCTCTTAAAGTGTCCTTTCCCCCTGGGTClATTATTTTTTTTAGTAGGCCrGTCGTAGAC 

AGCGCCAC!AGCCATGTGACCTACACACATAGTTGGCTGTCACTGTCrrTAGAATTCTATG 

AGATGAGTTTCACATAGCCCAGGCTGGTCTCAAACTCAGTGTGTA^ 

TTTATGT AGTATTGGAAAAAT CAAACAAG AGACT AG TACAGGCTAGG AAAG CAT ATGCTAT C T AC CAGGTAAATTT C CAG CAAGG A 

GCCCACCATCTCATTTTGCACTGGGCTATAGTTACATAGCC!ATTCCTGACCAAGAC^ 

CTCTTATCTCCCCTCGTGGCCm 

CGACTCCTCCTCTACCCCACATCTTGGATCACCATGGATTCTGTGACCCTTCAGCC^^ 
AGAACTGACTGGAGTGTGCTGAGTGCITGCGTCAGTTTGGTAGGGTAGCGACTGATCCCATCCGCT 

G CATAGGCACACACCCAAAGCTGTTGCGGGAAAGCATGAGCAT^ CTATAGTGCACTGTTTATGCTTATGAGCTGT 

TTGGGCCCCTCCCCCCTCTCTCTGTATTCCCAGTACTTATTATATTGTGCCIX^CC 

GTCCCaTGCGTCCCTCTAGGCTGATTATGGGGAAGGACAGTCTCTGACCTCTAGGAGGGTCA^ 

TTCATGCTGCTGCCTCTGAGATCGCTGCCC^CCTTGGGTTCTTAGAGCCAGCTACAGGGAC 

TTAGATTTGGGTGGAATGGGTAGGTGACTTGGGGGAGGGGG 

AGCTGGCTAGAGTTTGCAGTTTATAATAGGCTAAGCCGATGCCACCGTCCTTTTCCTAA^ 

CATGATATCATTCAGCCCAGATCAGGGGGTCTTGCATCCAGGGTGAGGTTGGTC^ 

CTTCCCTGAGGTCACGGCCTTCAAGACAAACACGCCTGAAGTAACTAGGGCTGTGACGAAGGCTGC^ 

TCACATCTGAGAACTTCGTAAGGATCAGACTTGGTGGTTTTGAAACAAACGAGTTGTTGTG 

ATGAAGTTCAGCTTTCACTGGCAAGGAACGAAGGGAGACCCAAGTCTG 

CAGACCAGGCTGGCTTCTTATGTGAATGTGTGGTGCCTCCCTACAGCCAGCAGAATG 



- _ _ _ . _ .„ _ w „, * ^ * * GTTGTTGTTTTA 

AGGGCTAGTGGCAGTTGCTGTTGTGTGTTAAGGGCTGCTGGAACCGGGAGGGCATCTAAACCCAA^ 

TTTAACTTGCAGTTTGCTCACACTCAGGACTTGGCAGGGG 

TCTTGGATACAAAAGGGATGCTGTC!AC^TGGAAGGGCrAGAGAAGAGGCTG 

GTTGCAGGGTGAGTCCCCITGGTCGC^GGCCTTGGGAGAGAGGAGGGGGTG^ 

ACGATGGCTAGGGTTTTCCGCTTTGTCCGGTTTGAGCCACT 

CGGGTCTATGCCCTGGCCACCCCCTAAACAGTGCCC^GCACCCTTACTGTCTGT(^GGAAGGAGGA 
ACTGGGAAAGGCTACCTGAGACTTTGCCAGA^ 

CTGCACCTTGGTGACTGAGGAGGTGGGTGTGATGCCCACC7VAGCTGGTGGAACAGGAAAGGGCT 
GGGGACCTATGGAGTTGGGTGATCTCTGGGCAGGCACAGTTCCrrATACCrrTGGAAT^ 

TTGAGCAGCTAG CCTG CATTGACCTX3TTTGGGTAGGCAGCCACCTGCTGGGCCTG CATTTCAGG CTAAGTATTTCTTTTTCTTANN 



NNNNNNNNNN ^^ 

NNNNNNNN1 ^^ 
NNNNNNNNNNNN ^^ 

ACACACACACACATAGAGGCAACTTCCCTAGACATTCCCTTGAC^ 

TAATCGGATTAAGGGGAGAGGCCGTCGAACCACTCTAGGAAAAC 

TTGGAAGCAGGGATTCCCTGGAAAGCCAGTTAAGGAGAAAGCCTGTGCA.C^ 

CAGCCGCTCTTCCTCCATTTGCTCGATGCCCTCAGCCAGGCCTTTCTCTGCAACTGTCTTGCCCTC 

TTCATTCTTACACTCTAACTGGGAGT^GCTTGACCTGGACCCGTGGCCT 

GGCAGAGGCTTGGAGCTCATGATTCGGGGAGGAAGGTGGTGTGGTGTTCCTGGCTTT 
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GCTAGGGATGGGGCreTCCTGCATGGAGCACTCGCTGTCTCTGCAGGGACCCAGGTGCCCCATAGT 
TGGAGTGAACCCTGTTGTACCTCTGCAGAGTGGCT^^ 

T AGAGGAGAAGTG CAG CCATGGGTGGGGACTGG CT CGTGAGCTGTTT ATG CA.GGGTGGTGGTGGGGAG GACAGTGGTGACCCTCTG 

AGACGTGTGGGCCTGTGGAGAACCCCATGGATGGAGGGTGCAGCACOTCCAGTCAGTATTTTCC^ 
5 TAGAAGCTGATTCTGAGXSCGGAGTCCTGGGGTGC^ 

GAGTC CTTTCAG CCTCTU^CCAACTCTAAGATCTCTAATCCCAACTATTTC^GAM CGAGT CAGGATT ACAAATTGAAGGC CTTTT 

AAAG CTATGTGCAGGGACAATT CAGATCCAGTTTAGG CGACTCAJGAGAAAG ACTGTGTCAAAGTATLAAAGG^ 

AGTGAACTAAAGGGAGCAGGTGCTTGCTTGCCTTGCATATGTGAGCGCCT 

CTGGTATCACCAAGAGACTGGGTTCCCCAAGCACCTGGCTTGCCC^ 
1 0 CGCCTTTTCTGAGACCCACTGGCTITTCCCCrGAC 

TGTGGTTTCGATCCTCACGAGTAACrTCTACCGTCT 

TGTCCTTGGAGAGGGAGTGGGTAGGGACCCTCACATTTAAG 

AGGGACCAAGGAAGACATCCAAGAGCCATGTGGTCCGGTA 

TGTTCAGGACCACACGAGGCTTGTGGCTCCATTCTTGGACAGCACCAAAC^ 
1 5 TGTCTCGGCATGtfVGGATGTGGACGAGGGTTCTCTGCT 

GGCCAGGCCTCAGTGAACAGGCTGAGGCCAGAGGGAGGi^ 

AAGX3GGGTATGGGGGAGGTGGTGGTGGTGGGACAGGGATGGGGACCACAAGG 

TATCTGGCCTGGGCATTCGTAGGCAGGGATACCAGGTGTCATGGCTACCTCCATTGTGGCTGA 

CATTTTTATTGTATACTGGGATGTGTAGCAGGGGCTTATTCACACATGT 
2 0 CCTTGGTGACAG CAAAGCCTACTGGAT ATGTTCT CTGCAGAGGTT C CCCAGTT CTAAATATTT ATTTATTG ATTTTATGT ATTTG A 

GTACACTGTAGCTGTACAGATGGTTATGTGCTTTCATGTAGTTGTCGGGAATTGAATTTTTAGG^ 

CCCACT ACCTCAGTCCCTG CT CACTCTGGC CCAAAG ATTT AT C TATTATTATACATAAGTTCAC^njrAGCrGACTTCAGATG CACC 
AGAAGAGGGTGTCGGATCTCATTATGGGTGGTTGTGAGCCACCATGTGGTTGCTGA^ 

TCAGTG CTCTTACATG CTGAGCCAGCCCATGGCTCCCCAGTTCTAGAGGTGTG CTTATCCAACATTCTCCATCrGGGTCTGTCTTT 

2 5 CITAATGGTAGAAAGATGGGTAGCCCGGTCACCCTTTGCTCCAAGGC^ 

TTGGACCGCCTCTCTGACCCCTTCACTCACACCAACCTTCCTTT^ 
AAGGCAAAATCGAAGGCCTCACGGATGAGTTTGAAGA 
TTACCAAAGTTAAACAAACTCAAGAAGGTAAACAGGCTGGGAGCTGGGAG^ 
ATTAAGGGCTAGGCCAGTTGCTTGCTTC^CTTGCT 

GTATTGG CAG AGAAATGT C CG AAC C TTAAG CAT CT AAATTTAAGTGG CAACAAAAT AAAAGAT CTCAGCACTU^T AGAGCCGCT 
GAGTCCCTTGGTCCTCTCCTTTCCTTGGGGGCGGTGGGGAG 

TCTCGTCCTGTGCTGTGAGCTAACATGGGTACTATTTCrGCTTAGTTCACGGAGAATT 
TGGGCACCCAGTGTGGGGCAGGGAATGGAGCCTGGTGTGGTGTGTGCCCTTCAGCCCGG 

3 5 GGCTGGCTTGCCACCCTTTGAGGAGAGGAGCAAACCTAGATGTAAGCTGACTCrrGGA 

TTAGGGTTTAGTGCTCACACAGACGTGGCATT^ 

TGGAGAGAAGAGTGTCTGTCTTGGTGACCAGGGCCTCAGAGACAGGATGGATGAC^^ 

TTAGGGAAATTTTAGTCAGAATGGCAGTCTCTTGTTAC 

AGATGTGACAAAAAGAATCCATTGGAAATGGAGAAAAGTTGCAAAGCT 

4 0 GTTCCGGGTTCACAAAGCACTCATCAGTCTCTG^ 

TTTCAAGCTCAGACTGTCTGTCTGTCTTGCTAATAAAGCAC^ 
AAGGCAGAGCTGCTTTCATCTGCTGGGAAGCCCTGAT^ 

GTGTTGGGAGTGTGAGTACAGCTGAGTCTATACAGTCTGTCTAACACAGACCAGAGCTCAG 

CTAAGAATTACCTGAGCCTATATACAAAATAGTAGAGAAT CAAGACTGACT CTGAGGGGTCTACTCTTCCTAGTTTGACTTGGTCC 

4 5 CCATGGCAAGAGGGACTCGGTGAGATCCCCTGAATGAGGTGAGA 

C^CTTCCTGGAGCCTCGATAGGGCGTTTCCCCCTTCCCGCCCCTTTCCCACGGTCACCOT 

CAGAAGAAGTTAGAGAATCTCAAGAGCCrAGACCTGTTTAACTG 

GCTCCTGCCCCAGGTCATGTACCTCGATGGCTATGAC^ 

ATGACGACG AGGAAGATGAGGATGGTAGGTGTCCTGGG CTG CC 

5 0 ' ACCGGTTTTCAAAGGGAAGAGGGACTGCTGGCCTCAAGTCCCCCCAGAGAAGAA(^OT 

CGTGGGAGAAGAGACCAGTTGTAGGGGTGGACTCCCCAGGGCCT^ 
TGGCGCCCCCTGGTGGTTGCAGCATGCAGGACGACT^ 

AATCTCAGATCCCTTGGGAAGGAAGGAAAGGCTAAGGGTCTCrCCrCCCTAC CAGTT 
GGCACCCTGCAGCTCTTCTGCAGGGCTTTATAGCT 

GGCCAGTTGCTCCACTACACAGGCC^CITG 

CATTTTTTGTGTGAACACCACCTGTGTGCATTATTAAAGCAGTCGGGC 

6 0 TGTCTTCTTGCTGGCCACCCCCACAGAGGAGGAGTATGATGA^ 

AAGAAGGGGAG^AAGAGGATGTGAGTGGAGAGGAGGAGGTAAGGAGACGGGGCT 
AGGTGCCTCATTATCAGTCGAGGCAACCC^GGAAGGAGCCITGGTTGTCTACCTCCrGCC 
GTTTTCCAAAACTTCTATCGCAC^GACTTTGCrGCACAGACCT 
GGGGATACCGGAAGCAGAGGCTCCGCGTGCTTTGGGCGGGGCCTGTCGCT 

6 5 GTCCTTTCACTGTGTGATTTCTGGGGATTGGACTCAGGTTGCCAGGCTTGGCAG 

TGGGTTTCCTTGTGTAGCTCTGGCTATCTTAGAACTTACTCTGTAGACCT 

ACTTACCGAGTGCTGGGGTTAAAGGTGTTTGCCACCACTCCCCZAGCTTCTGC^ 

GAGCTGTGCTCGTTTGGACAGATTTCTTGTTTTTTT 

7 0 AGAGTAAAGTCACATCGACAGAAGGAGTCTGGTGAGGGGTA^ 

TGTGTTGGGGTTCTATGAGTTTGGAGGCTCAGGGTCTCTCAAAGCAGG 

CTTG C CACTAGT AAATGGTAACCAAGAAAGATTTG CAGTG CACACT CATGAAC CAG ATG C CTT CGGG CACACATG CG CAGTACACA 
GGCCATAGTCTCTCCTTCTCCTGGTTTTCTGCCTCCT^ 

7 5 GGCTGACAGCCCCACCTTTAAGGGATACTTGCAGGAGGGAACCTCTTTCOT 



1532 



WO 03/008583 PCT/US01/51291 



CTGTCCCTTGTACCATTCCrGCCTGCTGTCA^ 
GCCCTTTCTCAGTTACCAATGAAAGAGCTCTTAA^ 
ACACAGAGTCTTCGCAAACAGACCAGGGAGAGAACTATCCAATGATCT 
CTGTGAAACCATAGAAACCAGGCCTTCAGGTACAGAGAAGCTACCAGCATGGAACT 
5 ATGGGAGG ATGAAGTCACAGGGCTGGCTGAATC CCAAG CAGGTTGAAGTGAGGTT CTCAAGGACCCTTG AGTCTATCCG C CACACC 
CAAACCCTTTAGCTGCCTAGGCTTGGCTGTGGGGAGGGCCCGGGAAAGGAGAG 
AACTGGGGTCTTTCCTTGAAGCTAAAACCATATTGAGGACAGATGCAAG 
GCCTAGGGAGCCTGGTCTCCTAAAAGCCTGCATGCTAGCAAGCTC 
GGTAGTCCAAAACTACCCCACAGCCCAGGTAACCCTCAGTATC^ 
1 0 GTCATCTGACTCAGGTGGTTCCTTGATA 

G CTCTGAGATGGGATGAAAGGC CTTTCTAGG CAGGG ACATCTCT AACTCATGTAGGTGTG CGGGTATAGTGT ATGTG CTTGCGTGT 

TTTGAGCCAGGGTCTCTTGTTGAACCrcGAGCTCACTGGTl^ 
GTCTCCAAGGTGCTGGAATAGAAATGTGTGCTTGCTAGGCCCT 
15 GGAG CTGCCTC CCCAG ATCCAGCAGGGGCACTGCTAATACC CTG CTGTC 1TTCTT TGCC CCXAGGAGG ATGAG GAAGGTT ACAATG 

ACGGGGAGGTGGATGACGAGGAAGACGAAGAAGAAGCTGGTGGTATGTCAGCCTC^ 
CTCTAGCGTTGCCCTGCTTGGGCITTAAGGGCCAGGCGATG^^ 

GGGTGGGGCTGCCGGGAAGATGGCTGCCTGATAACTCTCCCTTTCCCTCCATTTTTCC^ 

CGAGAACCX5GACGATGAGGGCGAAGAGGATGACT AAGG AATG AAC CTGTTTG GGG AAATT C CTATTGTGATTTGACTGTTTTTAC C 

2 0 CATATCCCCTCCCCCTCCTATTCCTGCCCCCCGAAACTTATTTTTTTCTGATTGTAG 

GTACTGGGGGTTGATGGGGGGTGGGGGTGGGGGGGAGGGGAGGAATAAAATACT^ 

CTTTTTCCCTTGTGTCTGGTTTTGTCCCCATAAATGCAATA^ 

GATGGCTTGGAGGTCTAACATTTTCTGGTATGTCCTACATC 

CACAGCCCCTGTCATCTTCACTAACTCCTGTTCTGGTTCCGGTTCGTGTGGTTOCT 
25 ATAGCAACCAAAAGGCGTATTTTGGCACTGGTTTGGGGACAGTCCCCAT 

GG CTTTGCTTTTTGCAAAG AG AACT CT TAT T ACT CTGGGGCTATG ATC CCG AGAGCCAAAAAG AAAAAATCAAAATAACAACAACA 
AG AAAAAAACAAAAAACGG CAGGGTT AGG AG AT ACTTGT AAAATG GGAAAG AAGTGAAC CTTCATGT TT CAGcrTCTIAAAAACAA 
AAACAAACAAAAAAAACTGTTCTATTTGGTTCTGTAAAATATTAGAGCAACTGGTTTTGTAG 

GCGCTCGAGCTTAGAACCCTTTTACAAATCT 

3 0 TCTCACATACATTGGAGCACAGGAAAGGAATGCAAACTCGTATrrXCTTACCCGGTGGGACT 

AGGGAATGCTGTGTGCCTTCTGCATCCTTCTTCCAGATCGCTTTA 
TACAGAACAACrTGCCACTCATTTTTCTCCCGCCCTTGAGAAGAGCCAAAC^ 

AATAAATCTTTTTTATACAATTTTTTGGGAGATGGCTTCCTTTGAGTCTTTACAAGCCATTGATCGTCCT^ 
CCTGGAGGGAAGGCTTGTAATGTCACCTGCTAAACAGCTTrCTTACAG 

3 5 GTTCGCAAAGCAGCAAAAACATATCTGGGTGGGATATGTTCTACTTAAAGGCTACA^ 

GGAAGGGTACAAAACCTTATAAATAAATAAATAAATAAATAAATAAATAAATAAATAAGGCAAGCCGCCT 
ATAAATAGCTCCTGTTCTTGTTTTTAAAATATCTTTG^ 

TGTATAAAACATTTAACCAACCCTAGACTTAACAGCTGGTATTTATTGTATTTTTTACT 
GGTTTTCCGAAGCGCTC^GGTGGTTCCCTGCTAATCCCATGGCTGGGTTCTG 

4 0 GAATTATTTCCAGTTTCTCCTGCAGCATCCATCCT 

AAGTAATCATTTTAAAATCTATTTACATTGTGTGTGTGTGTTTCTTGTATGTCAGG 
TCCCGTAGGTCTCAGAGATTGAACCTGAGGCATCAGGCTTCACA 
CCAAGTACTTAACATCTTTCATTTTTCTTTAAAAAAAAAAATATTAG 
AGGTCAGACTGGCATCAATCTCCTAGAGATCCGCCTGCCTCTGCCT 

4 5 ACTCTT AGACAAGATGCTCATTGGG CCAAAC CT CTCC TAGCA.TTTTTGT AACG GTCATC C CTAAAACTTGT CAG AC 

CAGGGATGGCACACTGGCTGCGTGACCTTCTTCTCAGCCATACA^ 

GGAGGGTTCCAGTGACTCTTGTACATGTCTGAGAAGACCTr^ 

TGGCTTCAGAAGCAGAGTCAGGTGCTCTCCTGCIAGGCT 

AAGGTATCAAGAACATGGGCXSTAATGTGTCAGGACAGATGCrGTGGACCCCAGCrCCT 
50 gTAATGAAGAATACGTTTTGCAGGCAGCCTCCAGGGGA 

TTGATGAGAATTCTAGGGAGATGCTCAGTATAAACATGAGGACCTGAGGACCTTCAGCATCC^ 

GTGCTGCTTGTAACCTAGCACTGAGAGGCAGGCAGGCAGATCCCT 

CACA.CTGTCCAGAGAGACTOTGTGTCAATCAAGATGAACGGC 

ACATACGGGTACATTCTTTGTTCCCCITGAACTCTTCCCCCGTAAATGAGGAAGGCT 

5 5 AAACCCGGCCTACCACACGGTAAGTTTTTAGTTTTG 

CCAGCCCAGAACTTCAAACACAGCCAGTCCAGCGGGTTTTCTCAGAGTCTCAGAGTGATTCAGTGA 
AGCTGAGCTCCTGTCTAGTCCCAGAGCCTTACACTAAATCCCCCTACTGACTGTATCCTTGACTTGTGCAAGACTGCT 
CATCATGAACTGGTTCCTCAGCTCACACCGTCIACGACT 
CCAGTTGCCCAGTCCTAGATGTCCATAGTGGTACCTACCTGTAATCCCAG^ 

6 0 GTCTGTATAACAAGCTTTTGGACAGCCTGGTTACTTGAGAAATTGTCTAAAAACTAAAAC^^ 

ACTTGGTGAAGTTCAGGGAGGAO^TGGTGGGCCCAAGGTC^CATCAC 

ATAGCTTTTCGGGGCTTTGGTACACTCCrCTGCATTGAGGOT 

GTTTCGCATGGCAATAGGACCTGTCATraGTTCT^ 

CTGGAGAAAGGATGGTGGTOTCATCrGTCGTGTGGCCTCCAGGCTGCGTGCTT^ 

6 5 CACAAAGATGGAAAC^GAGCTTTAGCGGO^AGGTGACT 

TTCTTACTTATCCTGTGTTAGAAGTGCTAGCAGGCAGATGCAGGCCAGAGG 
GACTATTTATATCCTGGTCCCAAAACTCATTGTTCAGATTTGG 

AT AG G CTGG GCAT AG GCAAG AATAGGACAGAGAGAGG CTGGAGATGGAGACAGATCTG AAGC CAG CCAGGAAGGTGTT AG CAGGGT 
AG AGACTAG AGAAAG C CAAAC CCAGG GGGGACATTGTCACTGGG ACTGCAGG AG CAGACACAGGAACAT AACAAACAATT CC TCTT 
70 AGTGGGGCACCGTTGGACTGTGGACGGCCCAGTGGATGGGGACATTCTCCACTCCAGATGTTC^ 

CT CACAAGGACAG CAGGTG TGAAGGAG C C C CTG C CTGGTGTC CAG AG ACATGATTGG ATAGTGGT AG AGT AGAGACAGTTTGGTTA 
TTGT1TGAAAGACTTAATGTTTTAAATTATGCATGTGTGTGTGTCTGTGTGGGGCTATATT 
GGAATAGGAAGTCAGAAACCCTGAAGTTCATAGGCAGTTGTGAGCCACCCAGAGCAGA 
AGTATGTACTCTTAACCACAGGCGCCTCTCCTGGTCCACAGAGA 

7 5 ACACCTTTAATCTCAGCACTCAGAAGGAAGGAGGATCTCTGGGAGTTCAAGGCCA 
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GAGACCCAQTCTCAACJUACAAACAAGCAAAAAACCAACAACCAAAAG CAAAACACAGATCTTGCTGCACGGAACTGGAAGGAGGA 
GCCTGGTTGTGGAGGCACGTCTTCCAGGGAAAGGCAGCCTGT CACCAG AG C CAGGAAG CCATGGAACTTAACATCCCGTGGCACCA 
TTCAGCCTTGGTCTGACCAGCGCAGAGTGGGTAACCCC 

GTTTATAATC^TCCTCACT ACAAAGACTGCACACACGGAAG CT CT CAGTGAAG ATT AGCTGCTTAG CACT CG GCACAGTAACCGAT 
5 GCTCAACCC CAGTGATTC CAAAGATTTCCTGACTGGAGT CC CCAGGGTTTGT CC CATGGG AATTCT TG 

TTCIACTCT AGATCGGGACCTGGAAAACCCAGGACTCAAGGTTCT C CTTG CCATG AGGGATGTTTGGAGAATGGTGT 

ATTAAACTGAAT ACT CC CAGTGACCTCCAGAGAAAAGTT TGAT CGTAC CACATT CATT CAGTTAAGGGCAGTGAGACTCAGAGACA 
TTGAATGAT CT AATTAATTTGTCACACAGCCCACAAGTAG CAGAGTTGAGTTT CAAACCTGGGT CAG CAT AATTTTCATAGCT C CT 
GT AG AGAGTAGGTGAAAG CAACTGCCTGTATGCCTCAAGATT CGGAGATACT AG AAAAAAGCAAGGGG AAATAGAGAGTGAAAACT 
1 0 ATTTTTTTTAAGTGATGTGATAGC^AGATGATTCAGAGGGTA 

GAACCAACTGTGAGG GCT AAAGTTAAGTTTAAATTACCAAG CCT AAG AAATCTTGGG CTGGAGAGATGGCTCATCAATT AAGAG CA 
TTGACTGCTCTTCCTGAAGGTCCTGAGTTCAAATCCCAGCAGCCACAC^ 

GGCGCACGCCTTTAATCCCAGCACirrcGGAGGCAGA^ 
15 CAGGACAGC CAGGG CT ACACAGAGAAACCCTGT CTCG AAAAAAAAACAAAAAACAAACAAG 

CCAGCACAGACAGAGTCACTTCAGGAGCTGATGATTT^ 

GTCACCTAGCCAGAAAGGTACATGTGTCCTACGTTCTCAGGGCTAGCGTGG 

ATTTCACACTTTCCCTCAAGGAOTCAAGCTCTC^ 

AGTGAC^CAGTATTCCCATTATCCCTGGGGACAGATCTCTCATAAC^^ 
2 0 CCTCTTGCTGATTGCAGAGATGCTGCTGTCT 

AGCGCAAGATGCTAAACrCTTGGGCTGTCATGGGCTTGGGCT 

ACCTCTTACTGCCCATCTCTAGAAGGCTCAGGAGGTC^ 

TTGG GTAATAGGGAAG CATGG^K5TCTTG G ATTTAAATACAAGTTTGACTTGCGATTG^TATTCTT AGTTGT CAACATGACTACAT A 
TGGAATTAACTAAAAT AAAAAATG AGGG CACAC CAATG AGGAATTTTTGCTTACTT TGAAGTAGG AAGATC CACTTCAAATC CAG A 

2 5 TCTTGAAGGGGACGATAACGATACACATCTTTAACCTGGGCCATACTTTCTGTTGG 

CTTTTGCTCTTTGCCTCCTTGCTCT 

CTACACTGAAGACCAGCCAAGGCATCCAGCCCCATGGACTGGGCAACTTCTAGATTCTTG 

TTGG ATTAG CTGG ACTGG AG C CTGTAAGTCATTG CAAT AAATC CC CTTTGT AT ATAT AGAG AG ATTCATTCT AT AAGTT TTGTT AC 
TTTAGAGACCCTTGACTAATACAGTCCTCCATAGAGACAG^ 

3 0 ATCACCTAAGTGGGTTGATTCCCTGGTGGGGAATTGGAAGGTOGATACTTATTGGTC 

ATAAGACACAGGTACAGG^AAGGGTAGGATGGGGGTAGGGGCTTGAGGTGGGGACAGA 

GGAAAGCAAGCACAAGGCAGCCAACGACTGAGTGTGGCTTTAACGTGGTAGATTA^ 

TTCTTATCTAACATGCACAGTCTTTGGGTTCCTTCCCTGATATTGTA^ 

TTGGG AGATGGAGACAG AAGG AT CAG AAGTT CAAGGTCATCTTCAGCTG CAT ACT ATGT T CAATG CTGGG CTGGGATACATG AG AC 

3 5 C CTGT CTCAAAACAAACAAACAAACAAACAAACAAAC AATAGC C C 

AATTCTAAGCTGGTTAAGGAGGTGATTCGAAATTGTCAGGAATGAAGAAGGAGAGGGCCA 

GGAT ATCCAGGG TG GAGAAGGG AG CCT CAATGGAGGTG AGGGGTGTTTT CCTAG CACTT CAG AGGT AG AGACAAAG ACATCAAG AA 

TTCAAGGTTAGCCTTAGCTTCAGCCAGTTCCTAGCCRACCTGGGCT 

TCTCTCCTTCCATGTTTCCCCCACATCGACCTGTCTCT 

4 0 TCAGCACAAGCCCAGAAATGGACTCAGTTGACCACCGATCAAAAGT^ 

GTAACTGkTCTCCCCCAGATATTTGTCACAGTGAC^ 

GAAAAATGAGAAGGATCCCOVAGTCCAGGTATAGGAGCCC^TGAAGTA^ 

(ZAGGGAACAGGGTCACAAGGAAAGACCTTGTCAGACAAAGCTGACAA^ 

GCTACAAGGAGTGTACTAGTTACTTTCCTCATTGTTCTAACAG C CAATGGGAACGGCTAAGGAAGGAAGGCTTGTTTGACTCTCAG 
45 TTTGAGGGTATGGGTTGGTGGGTAGTCATGGCAACAGGAGAGTAATGTTGCTGGTAGCAATGGGTCCAC^ 
GACAACTCTGGTGTTTAGCCTACTTTCTTCTTTTTATTC1AGTTCAG 
GTCTTTTGACTTAATGTGGGGAGAAGATGAAGTCAGGTCJAGTTGCCA 

TAAATTCTATCAAAATGATAATCAAAATTAACCATCATATTTGGTTCCTAAAAGTTGAAGTGTAAGGGT 
TTCTGGGCAGAGCAGGTGAGGTGTGTCCTACTAACGCTGAAGACCCA 

5 0 AGCCAATTGTTTTCTAACCTCCACATGTGCACCTCGGCX^ 

AATACAT ACAT ACAT ACAT ACAT ACAT ACATACAT ACAT AAATGT CAT AAAAC AGGAA CAATTT C 

AACACGGTTCAACTATGG^CAGGAAACGTACTACACCCTCCCACTCCC^ 
TCTGTTTGTCTTTTCTGAATCTCTGGAGCCrcTTGGCCCCTCATTTGGAGCCCT 
TCTAACTGGGTACTGGGATGGTCTTTCTCCrGTCACCCTGGATCCATCCACT 

5 5 GGAACCCCAGGTACAGGCTCTCCCAAACACGGGCACGTGTCAGGTGTCTCCTCTCCTGGCT^ 

CAT CACCAGAAGT C GG AAGTCAAGGAGAAAAT CAG G AGG CACC CACAGCAG CCCAGCTGTG AACAGGT ATTTGTGAGAC TT AAAAC 

GAATGGCACCTTTTGTCACTCAAACACATGGCTGCGCCTTGGATATTCAGTAAGTTCCACTGT^ 

TAGGCTGTTGGAGCCAATT71CCACITGCATCAGGGAA 

GG ACAAGAT CG CT CCAAGGT AAG ATTG AG AAG AGGTTT ATTGT AGCCATG AGGGATAGAACTGCCAGC CG CAT CTGG AGGAG AATT 

6 0 CG AAGCTG AGAGAAGGGAAGAAGAAGCCCAGAGAG AGG TTT AGGGT AGGGGG CGGGGTGAGAAG TG CGGGG AGGAG C CACAGGTGT 

GGAGTGAGACTTGTCCTAGGTTTCTTTGGGCCATGACAGATGGTTAGCTCCTGGTGCTGGGACAGATAT 

AAGGCACATTCACTCTGACTCATGGTTTCACAGGCAGTCGGGCCAGCTGGATCCC 

GTGGAAAGGCAG 

65 MOUSE SEQUENCE - mRNA 

GGGTTCGGGGTTTATTGATTX^TTCCGCTGGCGCAGGATCCTCTGCAG 
TTTATTTAGAGCTGCGGAACAGGACGCCCTCTGATGTGAAAGAGCT 
GGCCTCACGGATGAGTTTGAAGAACTGGAATTCCTAAGTACAAT 
CAAACTCAAGAAGCTTGAATTAAGCGAAAACAGAATCTCAGGGG^ 

7 0 TAAATTT AAG TGGCAACAAAAT AAAAGATCT CAG CACAAT AG AG C CGCTGAAG AAGTTAGAGAAT CTCAAG AG CCT AGAC CTGTTT 

AACTCTGAGGTGACCAACCTGAATGCCTACCC^AAAACGTGTTCAAGCTCCTGCCCCAGCTCATGTACCTCG 
AGACAACAAGGAGGCCCCTGACTCTGATGTTGAGGGCTACGTGGAGGATGACGACX3AGGAAGATGAGGATGAGGA 
AATACGCCCAGCTAGTGGAAGATGAAGAGGAAGAGGATGAGGAGGAAGAAGGGGAGGAAGAGGA 
GATGAGGAAGGTTACAATGACGGGGAAGTGGATGACGAGGAAGACGAAGAAGAAGCTGGTGAAGAAG^ 
75 ACGAGAACCGGACGATGAGGGCGAAGAGGATGACTAAGGAAGAACCT 
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cca: 

MOUSE SEQUENCE - CODING 

ATGGAGATGGACAAACGGATTTATTTAGAGCTG CGGAACAGG ACG CCCTCTGATGTG AAAGAG CTGGTC CTGGATAACTGTAAGTC 
5 AATTGAAGGCAAAAT CGAAGGCCTCACGGATGAGTTTGAAGAACTGGAATT C CTAAG T ACAATCAACGTAGG CCTCACCTC CATTT 

CCAACTTACCAAAGTTAAAX^AACTCAAGAAGCTTC 

TGTCCGAACCTTAAG CAT CTAAATTTAAGTGG CAACAAAAT AAAAGATCTCAGCACAAT AGAGCCGCreAAG AAGTTAGAGAATCT 
CAAGAGCCTAGACCTGTTTAACTGTGAGGTGACCAACCTGAATGCCTACCGAG 

ACCTCGATGGCTATGACAGAGACAACAAGGAGGCCCCT CTGATGTTGAGGG CTACGTGGAGGATGACGACGAGGAAGATGAG 
1 0 GATCAGGAGGAGTATGATGAATACGCCCAGCTAGTGGAAGATGAAGAGGAAGAGGATGA 

GAGTGGAGAGGAGGAGGAGG ATGAGGAAGGTTACAATGACGGGGAAG TGGATGACGAGGA^ 
AAGGGAGTCAGAAGCGAAAACXIAGAACCGGACX^ATG AGGG CGAAGAG G ATG ACT AA 

HUMAN SEQUENCB - GENOMIC 
15 ATGGAGATGGACAAACGGATTTATTTAGAGCTGCGGAACAGGACGCCCTCTGATC 
AATTGAAGGCAAAATCGAAGGCCTCA(X3GATGAGTTTGAAGAACTGG^ 

CCAACTT AC C!AAAGTTAAACAAACTCAAGAAGCTTGAATTAAGCG CTGGAAGTATTGG CAG AG AAA 

TGTCCGAAC CTTAAG (ZATCT AAATTTAAGTGGCAACAAAAT AAAAG AT CTCAGCACAATAG AG C CGCTG AAG AAGTTAGAG AATCT 
CAAGAGCCTAGACCTGTTTAACTGTGAGGTGACCAACCTGAATGCCT^ 

2 0 ACCTCGATGGCTATGACAGAGACAACAAGGAGGCCCCT 

GATGAGGAGGAGTATGATGAATACGCCCAGCTAGTGGAAGATGAAGAGGAAGAGGATGAGGAGGAAGAAGGGGAGGAAGAGGATC 

GAGTGGAGAGGAGGAGGAGGATGAGGAAGGTTACAATGACGGGGAAGTGGATGACGAGGAAGACGAAGAAGAAGCTG 

AAGGGAGTCAGAAGCGAAAACGAGAACCGGACGATGAGGGCGAAGAGGATGACTAA 

25 HUMAN SEQUENCE - mRNA 

GATTCCGGGTGCGCCGCGGGTGCTGGGGGCTCGAGAACGAGCGGAGCTGGTTGAGCCT^ 

TTCGGGGTTTATTG ATTGAAT T C CGCCGG CG CGGGAG CC TC TG CAGAG AG AGAGCG CG AGAGATG GAGATGGG CAG ACGGATTCAT 

TTAGAGCTGCXSGAACAGGACGCCCTCrcATGTGAAAGAACTTGTCCTGGACAAC^ 

GACAGATGAATTTGAAGAACTGGAATTCTTAAGTACAATC^ 

3 0 TTAAGAAGCTTGAACTAAGCGATAACAGAGTCTCAGGGGGCCTG 

TTAAGTG G CAACAAAATTAAAG AC CTCAG CACAAT AGAGCCACTGAAAAAGTTAGAAAACCTCAAGAG CTTAGACCTTTTC AATTG 
CGAGGTAACCAACCTGAACGACTACCGAGAAAATCTGTTCA 
ACAAGGAGGCCCCTGACTCGGATGCTGAGGGCTACGTGGAGGG 
GAAGATGCTCAGGTAGTGGAAGACGAGGAGGACGAGGATGAGGAGGAGGAAGGT 

3 5 GG AT G AAGAAGGTT AT AACGATGG AG AGG TAGATGACG AGG AAGATG AAG AAG AG CTT GGTGAAGAAG AAAGGGGTCAG AAGCG AA 

AACX3 AGAAC CTGAAGATGAGGG AG AAGATGATGACT AAGTGGAAT AAC CNATTTTGAAAAATTC C 

HUMAN SEQUENCE - CODING 

ATGGAGATGGGCAG ACGG AT T CATTTAGAGCTGCGG AACAGGACG C CCT CIGATGTGAAAG AACTTGT C CTGGACAAC AG TCGGTC 

4 0 GAATG AAGG CAAACTCGAAGG CCT CACAG ATG AATTTG AAG AACTGGAATT CTTAAGTACAATCAACGTAGG CCT CAC CT CAATCG 

CAAACTTAC CAAAGTT AAACAAACTTAAG AAG CTT G AACTAAG CGAT AACAGAGT CTCAGG GGGCCTGG AAG TATTGG CAG AAAAG 
TGTCCXIAACCTCACGCATCTAAATTTAAGTGGCAACAAAATTAAAGACCTC^ 
CAAGAGCTTAGACCTTTTCAATTGCGAGGTAACCAACCTGAACGA^ 
ATCTCGACGGCTATGACCGGGACGACAAGGAGGCCCCTGACT 
4 5 GATGAGGATGAGGAGGAGTATGATGAAGATGCTCAGGTAGTGGAAGACG 

GGACGTGAGTGGAGAGGAGGAGGAGGATGAAGAAGGTTATAACGATGGAGAGGTAGATGACGAGGAAGATGAAGAAGAGCTTGGTG 
AAGAAGAAAGGGGTCAGAAGCGAAAACGAGAACCTGAAGATGAGGGAGAAGATGATGA 
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Table 94 

MOUSE NOMENCLATURE 
ICSGNM Map2kS 
Celera mCG9283 

5 

HUMAN NOMENCLATURE 
HGNC MAP2K5 
Celera hCG1787lll 

10 MOUSE SEQUENCE - GENOMIC 

TAATT AAGTAGATAAAAAAT CTCAC CTGGGAC CATT CTTTT AGTAGCAT TAAAAAAAAG ATCTATTTTGTTGGAGAGT ATTTAAAA 
CAAGAC C CACGTG AAGTATGAGAAG CAGTAGCAGACGCAG CATC CTTTATT AGGTCCTCGTGGCTGGGTAAGGAATACCAGTGCAC 
ACCAGCTCAGGGCTGACATAGGATTGCTCATAGCTATTTTAGTTGTATTCATG 

TT CCATTACAC CATTAATTT ATCTT CAGAGATATTTAT AGTAT CAATTTAAAAGCTGT AAAACTT CATGGAAAGTAG CTT AT ACTA 

1 5 TAGGGCTTCAACAAACAAGCTTGCCATTTTG 

TT CTGGTTGATAGAATT ATG AACCTTATAAACT CAG TACTTTTT AGTT AACCT AAATACATAAATCT ATTTGGATT CATCTTATGA 
AAAG AG ATATGT CATT AGATAT ATGATCATT ATATG CTTCAGTGATTCATGGG CCITGTTACATACATG CTTTGT AATGTATGAAT 
TATTTAGTAAGAAAAATCACAATAATTTTGTTGTTACTCAAAAATTTAT 
GTGTTAAGGTAACATCATTAACTCATGTGGTATGAGTTTTTA^ 

2 0 AGTAATTTTGT ATAACAAT CTG CCTGTGAACAG AATG CTGAA CAT TTTACTTTAT TT C CAT ATTA 

GT ACAT AATTT AAATC C TAC CT AAAAAGTGT CTACCCAAAC C CTAAAG AAT AAAAACATG CAAAATAATGAATG CT T CC CAAGT AC 
ACAAAAGTG CAAAATCACATTT AAT AAGAGCAAC CATTTCT ATT TTT AAAATGTT AGATG CTT AAG AATTGG ATGAAAATAATTT A 
TG AGAGTAGATT ATCC TTAAAG C CTT TTTT ACACATTGT ATT ACAG CCACTACTAAGAGAT AAAGAACA GAAG 
GCAGCAATGCCACAAATGCAATACCAAAGGTC^ 

2 5 GAAT AT AAACTTAGG CT AG CTCAACTTTAGACT ATTTGC CTTGAATGTGTAAGGTTCCACA.T CACAGTA 

CACACACTAATT AGTAT CACTAATTAAAACATTT ATTTTT ATTTTCTATGTAG TGTTTTG CATATATATGT CTTTGT AT CACATGT 
ATACCTGGTGCCCACAGAAACCAAAAAAGGATATCAAATCCCCTAGAACTTGAGTTACAGA(ZAA 

CAGAAATTTAACCAACATCTGAAAGAACAG CCTGTGCTCTTTAATCGCTGAG CCATCTCTCTAGTCCCAGAATGTC ACTAACTTTA 
AATAATATTTAAAATTCGACAATCATTCATGCACC^ 

3 0 TTTGCCAACAGACTCTAG T AGTCCTG AG T CT ACACAGAGAAGTCAGACTG CAGAAG ACTCT CTG CACACGACAGACTG AG AGTCAC 

GTGTTGGCAGTTTTTGTTGGCAACATTTATCATAGAATGGATGCr 

ATGTTTG CCTT TAATAG CAT CAAG ACAAGG ACT CTT TAATGGAAGGCATTTG G ATCTGT CTTG CAAG AAGAG CAATAAAG AGGACT 
AGAGAGATGG CT CAGT AGTTGTGAGT ACTG ACT T CT C TTCCAGAGGT C CTGAGTTCAATTC CCAGAAAC CACAT ATTGG CT CACAA 
CCATCTGTAACCATATTAAAGTATACTTAATACATTAAATAAAGAAGAGCAATAAAGTTTGACATGTCT 

3 5 GTATGCTTTCTGAAAAAAAAATAAGAGTTTGGAATCCTTTTGCCATATCOT 

AC ACATG GGAACTG CT ATTGT AAAGTCAAATTTAGGTAATG CACAGAAGTGG AACAG A^ CATTCTT 
TTTATTTTACATTTGT ATT ATCATCAT CAC CACCAC CAC CAT CACT AACAC CACCATCATCATTGAAG CATGGGATG CAGG CATTG 
AGCAAGAATGAT AC CACTGAGCT AT ATCCCATT CAG C CTT ACT CTTTT AAAACAAAACT AAT ATTTTGAT TGCGT TG TCAAATG AG 
CAATCATCTCTCAGAAGGAAGCCAGGCTTCTGTTGGTTTTCAGAGGCTACATGA 
40 AGTAGGACAGCAGATTATGATGGTGATTGGGAAGTCTCCATGCCTCTGTCT 

ATACAT AT AATATAAA7\ATCCTATTTT AGTGGACATGGCTTAAAT TACTTCC CAG CCATT ACCAGTTGGATAACAG AGTAGTTT CA 
GCTTTCTGAGCTTTAGTATCTTTGCTATACAATAAGGAGACTATCTAACAGAAAGCATTTAAT 

T AAG TCACTTAAT CAAAC CACT CTATG ATGTAAG CATTGTGCACTGT ATTT C TTCTTGTTATTTAGTTAG CATCAGCT CATCTTT C 
ACCTGTGGAAGTTTTACC CAAGG CCAGACAAGGTTCCTG AGTCTACATTCTTGGTCACGTCATTCCACTTOT 

4 5 GATAATTCTGTCAGGTG ATAACT GAG AAGTCACATAAATGAGACTTCATTACTTC CTCTG CAT ATAGG GG AAATT CAG AAGTG CT T 

AGCAGTGAGTCAGAGTCATGCTAGAGTTTGTTTTCCAGAGACTCTAGCCTATTCTAC^ 

AATCTGGTGCATTTTGAATGAAAGAAGCCTTCGTCTTAGATTATCTGAACT CTCCAGTTCTGATCCCGTG CCTGGTG CATGCTGCT 

GCCCCATGAGCACCTGTCATCGCCGACCCCCCTCGCCGGTATGATTGTTACCACTGCT 

AACACACTTGCCTATCATAGTTTTGGAGACTGCTCATTTTTTTCATATCTTCATATTCTTCTT 

5 0 AGTAT AAAGAAAATG CTTGCTTGCGGTAGGGAGATGGC TCAATGGTT AAGTG CTT ACAGTACAAGCTTAAAGAC CAG AGTT CTGAT 

T CCAGT ACTCAC CTAAATGCTGGGTGGGAGTGG CAG C CCATCT GT AATTCT AGTCTCAGAG GCAG AG ATAAGGG ATT C C CAAAG CA 

AGC!TGATTAGCAAGACTAACCTTATCAGCCAGCTTTGGGTTCCATTGAGAGACTC^ 

ATTCCATAGCAACCTCAAGTCTTCTCGGGCCTTCAGATATCCCA 

GTAGATAACGTAGACATCAAGGGTAAGAGGATTCCAAGCAGAGAGGCAGTAGAGAGAACCCTGGAGTGTGATGT 

5 5 AGAAGCATGTGGGACAATGACTTGTGTGTGTGTG AGAAGGAAG AG GAG ACATG GG GTT CAAGAGTT CAGAAGTGTGTAG GCTCTGG 

GAGATGGCCCAGTCTCACTAAGTTCTGATCCCAGTACTCATTAAAAGCCTTGTGCCTGGAATTCAAGCT 

GAAGGAACCCTGGGGCTCCCTGTCCAGGAAGCCTAGCTGAACAGGTGAGTTCCAG 

GGTGTCAAGAAATTTAGGAAGACACTTGATGCTAAATTCTGGCTTTCACATGTGTGTCTG 

TGTGCACACAAACACAAAT ACAT C CAT AGGC CCT CAGAAG AG GATG CAG AGG ATAATAAAG CT CAGTGGT AG GC T CCATG CT CAGT 

6 0 ATGCAGGATGCCTGCAAACTATGCCAAGAACCAAATAAATGGATC 

GGTTTTGCOTGCTCGAGGTGAGAAGCTATCAGTAAAGCTGGACAGAAAATAACAAGTTACTATTG 
TCAGGAGTGGTAAGGATGAAANNNNNNNNNNNNNNNNNN^ 

CTGTGAGATTGAGGCCAGCCTGGTCTACAGAGCAAATTCCAGGACAGTCAGGGCTATATAGAGATACCCT^ 
TGGGGGCGGGGGGAGGAAAAGGGAGAGGGAGGGGGAGAGGTAGAGGGAGAGGGAGAGGGAGCAGGGAG AGG GAGA 

6 5 GCATGG ATT CTGGTT AAGGATT TGAGATTGAGGT TAAGAT AATAAG AG AGAAAAG AAAAT CTCT CTG AACTTATAAAAGCAAAATA 

AAATTCCTCATTAAATGACTTTAAAGCAAGAGTCAAAACTATAAAAACCTTAGCACCAAACTTG^ 
AAGTGCGTGAGGGTGCGCACACACACACACACACACACACAC^ 
CTTAGCCACAGGCAGCTACAAGATAAAAGTACAGAAAGGGAAAAAACTTTAGTGACAT 
GGTATGTTTCTGCCGTGCTGTGGCTATGGTTTGTATCTCCGTCACTGCAAAGGTTG^ 

7 0 TACCTCATGAGAGCTGGTGGACTCTTTAAATGGTGAGCACTTACAGAAGCTCTGAGAGTCATTAGGCCCGTG 

CAGCTGATGTCAGCACCATCTCCTTGAACCTCCAGTGCTATGTGCTAAATAACTATCATCTTTATAAAGT^^ 
CTGAAAGGGGACAAG CATATGAACTTCCATCT CTAACTAAGAATTCCTCTCCAATAGACACCCACTTGCAAAGG A 
TCCAATGGAGTCTCAGTG1ACTTTATAAATCACA 
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ATGTTATTTTTATAGACTTTTAATCTCATATTTCTTTGTTTGGGCTTATG 
AACAGTCCTGACTGTCCTGGAACTTGGAGATCTACCTGCCTCAGCCT 
GCATTATCATCTTTCTGGTCTTATTTATGTGTTATGGTTTCTGATTTTG 
TCTTGTGTTTACTGTTATTTTATTTTTGTTTTTTTTA 
5 GAGAAAAAGAAGGAGTACAATGAGTGGTGGGGATAAT CTGGGGGGAGTTGGGAG CAGAGAAACCATGATCACAATATAT AAT AAAA 

AAGAGTAAT AAAAATACATAAAGTTAG CATGTCTTAGGAATTTCATTACAGT AACAAAAAA^ 
AAAATTAGTCTGGAACATTAGTGTTGTTGATGATGATAGTTGTC 

TTTGGGTT CAGGAATCACCACACAATCCATACAAACACAGAT CTTAGTT AAG CAAAAAT AGTTTATTGAATG CACACCC TAATACT 
GGTCAGACCAGGGACATAGTTCAGAATTATCTGAACTATGAGAACAGATAA 
1 0 CAAAAGTTC^TATGCAGGTGCAGGCGTGACTCAGGATATCTTAGACTGAA 
TAAAGCTTTATGGAATTTCTTAAGATATCAAACCTCATACAGAAGAT^ 
GTCCTTGAGCCAAGTC^CTTTTCTGTGTCAATGAC^ 

AATGGCCACAGACAGCTATTCGGGATGGGGTGGGGTGGGGTGGTCAGACCATA 

ATAAG CATACTGTGTTCCACTGAGGTACCCAGTC CTGGAATAGAAGTTCTTACATGCACAGTCATAGCAGC^GCATTATTCACAAC 
1 5 AGTCAAAAGATGG AAGTAAT CTGAGATTCCATTG G CAAATG AATG AATGAAG ATT ACAT ATC CAAAGGACTTAT CTTTAAC CCAGC 
AAATGAAGGATCTTT AGTACAAAATTAC CACAGAG ATG AACTTTGAAGGCAT TACT CT AAGTGAAATAAACTAGGCACAAAAG ATA 
AAT ATTGT CTAACT C CAATTTTATAAGGTTTT AAAAGTATTTCAAGTTCAAAAT CAGAAAACTAATGCTAGTG AGAGGAAGGGGGA 
GGGAAGTCATTATTTAATGGGGAGGGGATTTCACTTTTGTAAGTGA 

T AAACAGTGCCGAAGTATAT ACCT AAAAAGGGAAAAT CTG AT CTT ATGTG TGTTTT ATCACAATT AACG TACATT AAAAAC CAGTC 
2 0 GATATCTGT CTCAX3GGTTT CT ATG CT ATG AAGAGACAGCACG AC CAT AACAACT CTTATTTAAAAAAATATTTAATTGGGCT 

TACAGGTTCAGAGGACTAGTCCATTGTC^TCATAGGC^ 
TTCTGC^TCCAGATGTGCAGGCAGCAGAAAGGGAGAGCAACA^ 
TAACACACTTCCTCCAACAAGGCCACACCTCCG\AGAGTGTCAC^ 
ACACCTAATTTTCAGAAACACCTTCTTTTCAGTTAAGGTTA 

2 5 GTGTTTCCTTCCCATATTCCCTCTGGGCCCATTTCTCCTTGCTTTTACCACCATCT 

CATCIAGAATTTAGAGAGAAACTTTCTATCATAGTCATTTAGAAGCA 
TTTTGTTTTTTGGGTTTTTGTTTTGTTTTG 

TCTATAGACTATAGACCAGGCTAACCTCGAACTCACAGAGATTTGCCTGAGTC^ 
ACCACAACTGGTTCATGTGTCAGTTTTCAATGATAAG^ 

3 0 CAG ATTTT AGGCCATGATCT CTAGGC CTGTGGTTAGGCAGACTT C CACACAT CAG CG AGTGTGTGGGAAAAGG CAGGAAGTAAAGG 

TGTTTTCTTCATGGCTGACAAGAAACAGAAGAAGGAAGAAGGGCACT 

ATTTCCAATCCIAAACTTTAACCCCCTTTAAGTAGGAAAGAAACA^ 
AAACTACCTCAGTCAGAACTTGTGTGGCTACITATGTCTACT 
35 ATAT ATCC CT TT AAT TTGGACGGTGGTGTTT ATAT T ACTT AACAAAG AGG TACTTTTG AATGT T AATAGTT CTGTGAAACTC TG CT 
CAAAT AAT AAAGTCTTAAGATTT CATGC T ATGTAAAGG CTACAAAAAGTTG AAATG TG ATT AT ATTTGCTCTTT AG ATA GGGA 
AGCTGAGAGACTGCCATTCATATATTTAAAAAAGTGCTTCAGAAT^ 
TACTATGAATTTTTAAAAAGACGTTTTAACAAATCATAGGTTAATAGAAAATCT 

AAATAAAGTTT AG ATTAAG AAGT ACATGG C AAAAT AGTG AATTTT TCTTTAG CACTG AAG AG CAG TT ACAAAG GGGTAGTT CCAAC 

4 0 AGTACCTGGAGAATCGGTCTACG TG AATTCCT AC CT AAGCTATTAAG GAATAATAACT AAAAG ATGGTCATTT ATCT CCTACAATG 

GCCTTTCAAATGCAACAGGAAAAAAGCCCAAGTAGAGGAAATTATGCCACAAATTTTGGTATTACA 
ATTCTTCCTCCTCTTACATTAATCACATATCTGTTTATTCCGGGGAAAGT 

G CAT AAT CAGCCC C AAAAGG T AGAG AT AATAG CAT AATTTAATT ACAAATAAAGG G CCTAC CTAC ACGTTG CCATTTGT AAG C AAT 

TCTTTTGTATAGACACATAGATCAACAGGATTATGCACTTTAAAATGTGAG^ 
45 CTACTGCCCTGCTATCACTGACTGCATAAACACTGCAATTT^ 

ATAAGATATTATGATGCCAGAAATACAGCTGTCCCCAGGCAGTTAAGCGCTCTCCAT^ 

GAGATTGCTTGGTACTGTAGGTCACTGGTTAGGGTAAAGGACCTCACT 

ATAGTGGTATTTTGGAACTGTTAATTGGAAGAAGGGGCATAAAAACATGCTTTTTTTTT^ 

TTGCCTGGCTCTTGCCGATTTAATGTAACAAACCCGCGGATAGTAAAAG^ 
50 TTCACCGCCTTTTCTGCATGATCAAACCTTAAATTGTTTTACGGACAGCCCCC^ 

GCTCCTTTACCACrrACCTGAAGAGATGGAAGTCAGCCAGTCCTAGCA 

GGCGCCCCAGGAGGCGGGTCCCACATGTCGGCTCCCCXSGGTTCTOGCGCC^ 

GCCGCCITCCrCCJTCCTCTCGCCGCCACCTCCGCTGCCGGGACGCGGGCCTCAGTCGGACGAT 

TTCCGGCTGCGGTCCGGGACCCXSCCCCTTGACCCCGCGGCCCTGGGCCCCCCTGCCrGGTTGGAGGGCAGGCXj 
55 TCCGAAACAGCCCCGACCCAGCCCGCGGCGCCCGGCCACCGTCCCCTTGCCCACC 

TGTACCTGCCTTAAACCAGGCCGTTGCTCCGCTGTCTCCTCTTACTCACAA 

CTTTTGTAACTCCAGGACCTTCCGCTCTCTACTAATTCCTCTTGG 

ACGACCTGGGCCTGGTTTTGGGCCTGCAAGCTCCCTCCTGATCGCTCCTCTTCACTTC^ 

CCTGCCTTCTGTCTATCGTTCCCOTTGTC^CTTCITGGGGCTCCTCCTAAGTAGGGGC^^ 
60 AGACCCTTTAACCTGTAATGCTGTGGCTGGCCCTTGGCCCCTTCITGTGCC^ 

CCAAATAGTGGCGCGGTGGACTGGACCGTGCACTCCGGGCCG 

CCAAATCCCAGCGTTGGGGATTATGGATCTGGATAGGCAGCATCTTGACQATTGGTGGCCTGAG 

TACCTGTTTTAACCACAGTCTCGGTTAATTGCACTCACTGAAAACTGGTAAATGTGCAGCGCCCTGGGAAATTGATGTATCCGACT 

GAACCCCTGCCCTCAGGAAGCTCACAGTCTGGACTGTGACTTACIACACCAGTGCAGACi^ 
65 TTTCTTATATTTTTCTCAAGTATTTGGAAAGGATTCCTGCCCr^ 

GCTGGGAAAGGGACACCAGGG CAATCTTAGTTTACAATGGTTAAGCAAACCTTAGATG CCACTGAATGGGAGACCTAGACTTAACT 

ATCAATATGG AAAGGATG CTG CCTGTT AG AACTACAGCACAT CATT CCG TGACAAACACAG CT CAATT TCAG AGATGTTGAGTGTT 

CAGGGAAAAT TT ACAT CTT AG AATCAACAGTGCATTGCCTTGGGATG CTTAGAAT ATATTT CACACAAAATTATGAAGGTT AATGT 

AGGCTGGC CCTTGAGTCCATATATAATATATACTATAAAACATTATGTATGTGTGTGTTTGTC CTGAATAGAGAGTAAAAGGACCT 
70 TCCGAGGACTTGCTCCTGAGGTTGAGAGTCAGTGCCAAAACTATAGCTTCCTTACTTAGTATATGAGTATATC 

GTGTGCGTGCATGTGGAGATCAGAGATCAGCCTCAGGTGTGCTCAGGTGCCATCCACCTTGTTGTTTTGGGATAG 

CAGCAAGCCCTGGGCATTCGCCTGTTTCAGCATTCCCAATGTTAAGATTACAA 

AGCACCCACTTCCTAGGTTTTTTGTTTATTTGTTTGTTTTTGTTTATTTTTTATTT^ 

CTCAGGCCCTCATGTTTGCATAGGAAACACTTTGCCAGCTC 
7 5 CC CCATATACTTCAGTCCACGTG CAT CT TTC TAG TCAGTTGACATTAACAGGGTGAC^TATATGT ATGT CCACAGT AACTCAG CAG 
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GAGAATTTGGTGTACGGGGAAGAATGTTTTAACAGGCAGATGTGCC^ 
CATTTACTGCCTTGGGTTGATAAAAAATGAAATGGTGGTATCra 
TGCTTGGGTCTAAGCCAAGCATGTGG<^CCTTASGTGAGCTAG 
CTGGTGCTCTTCACAATATGCACAGTGCTAGTCTCCrTTCCT 
5 ATTATCCTGGCACCTATGTCCTCTCTTGCCACATCTTCTTGAG 
GGTACCCAGGCTGTGCCTCCACTTTGCCTTCITTGTCTGGCT 
-GTTTGCCAGGTGGCTGCCATGGTAGTCCTATGTCCATC^ 
TGTAAACATCTGGAGGAGAGAAAGAGCTCTGTTTCTGTTGCCT 
TGTGAGCCTGAACTTGGGTACAGTCCGTGGAACCTATGGTAGAACCAACT 
1 0 ATGGCATGTATG CATCCGTGTTTACTGTTT ATACACCTG CA^ 

CCAGCTGTGTGCAATTTTAAGCTGTGCAGATCACTGGCTTCT 
TCAAGTTAGCTATTTTAAGAGGGAACCCTGGGACTGCTCTCCTCCTG^ 
TCCGGCAGTTGGAACTTCACGGCC!AGGTGTGCC^ 

AGGGACACCTGCACCACGTCGTGTGAAGATTAAACACATAGAAAGTGCT 
1 5 GGCCTTCTGCTTCCTGAGTACCCACAGGAGATGTCCAGCGT 

TGGTTGCTGTTTCCrrGGCAAGTCTGACAGGGGGCCACCCAAGGTCCT^ 
AGAGTAAACAAAACCCTAAAGAAACCAGCATCTCTGACTT^ 
TGACCCAAGCCCCCTCITCACACnTTCCTGTTTTCAGA 
2 0 GAGCTAGTTGGCTTACCTAACACAAAAGCCAGAGGGGGTTTCAGCCACTGAGTCCA 
AGTTGAAGATCCAGTTGCTCCnTCTCC^CATGGCTGGGAGACCTTGTGT 

TCTGT ATCTCCAGG ACCTAAATG CAGTATGAAGGAAG CCAAAACCAAGTG CAG AG ACACTGACTGGG CTCTCIGTATTCCTT CAGC 

CTCGTTTCTCTTCATCCTTGGAGACGGTCTAGGACAGTTCTAGGACTTGTGAA 

TAATTAGTGACATTTCTATCCAAGTGGAACCAGTGACTCTTTCriT 

2 5 CTTAGGGAAGAAAGGTTTGATTTTCACTCACAGTTTGAAGG 

ATAGACTACGAAGGCTGCTCAGCTCCTGTCCATGAAATGGTC 

TAGAAGCTCCCAT ACAGACATG CCCAGAGGT TTGTG C C ATAAGTGATTGTCAG TC CT AT CCAGTTG ACAGTAAC CGTTAG CAATCA 
CAGTGAACACTGCTTGCCACTGAGATCTTGTATTTGG 

AGATTACAGGC CGGCTGAAGT CT AG AGCCAAGTC CTG ACCTGTGCAGACTTATAAG CT C CTGGCCTGGTACAGAGGAGGTTTGACT 

3 0 TCTCCACTGTTATGTAGTTTTCTGTCTTACTGAGAAGAAGAA^ 

ACCATTTATCATTAAATATACCCTGCTTTAAAATGGCCAGTAGAGCGGCT^ 

AGACTTTCCTCTGGATTG CCTGTGT ACTTAATT CTGT ATTCAGTTT AT ACAT CTAG CT C C CTGTTAAGAAACG CGTG CT ACCTGAG 

TTAAAGGCCAGCCTGATCTACAGAGTGAGTTCCAGGACAGCCAGGCCTACGCAGAGAAACCCTGTCTCGA^ 

AAAAAAAAAAAAAGAAATGTGTGGTACCTGTGAATGTGTAGAAAGACAT^ 

3 5 AAGTAGGT ATGT CATCTG CTAAG ACT AG AAT CAG AG AAAACGTTTTTTCTGGCTGATGTGTAAG GGTTGTTG IT LT CCCC CATCAA 

ATTTCTTTTGCACGGATTTATTTTTTAAAAGAAAGAAAAAAAAAAGACGCTC 
ACACTCCTAAGAACTGCCTTTAGAAGGAGGTTGTAGCCTCCA^ 

AGGCTCTGTCAG CTGGCTGCCTGGTGCCAGTTCC CG CACAGC CTGGTCTG CG CCATCG CTCCTGGAGAGGAAAG AGGTCTGCAAGT 
CTTTTACTGTGGGACTGGGCATGATGGAGGACTCTTCTCCCAGAACTTAGGTGTT 

4 0 CTAATTTTTTTTTTTTTTAGTATTTTCAACATTAGAG 

CAAATCAACTTATATCCCAGCACTATTAAATACGTTTCCATTGC^^ 
TGCTTATCATCGTTGTAGCTTGTTTGCTAAAGCCACTGCT 

CCAGG TCTT AGGCI'lTTG TTT AT CACACCATTTGACTT C CACATTATCCTTT CAGTGTCACTG CATC CATGTTTTG AAAATAATCA 
GAATGAGGCTCAGGGTGGTCAAGTAAATATTTTTCCCAGGTTGTACATCTAATAGTAAAGA 
45 CATGCCACCTCCCAGCCTCCGGTACATGAAACTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGCGCGCGCGCGC 
GATGTGCACTGTTGAGTTGAGAATAAAAGACTGTCTITCAATACAG^ 
TTAAACTTTCTGGG CTAAGTGTGATTGCAAGCTCATGTTTCTGTGGTTTATGCCAGCA 
TTCCTGTTGGACGTGCCACATGTTGGCAATAAAGAAGATGGCTAA 
CAGGTTCTGCCTGAAGCGACGACGACAGCGTTTGAATGTGAGTCCGGCTC^ 

5 0 TCACATGuri rATGGGTTGTATTACTTTGAGAACTGACAAAAC 

AATTAATG CAAG TTTAATG ATG ATACTG AAC CG AG TAAT ATTTTC T TATTTAAT CATATCTTGGGTGTG CAT ATCTCT ACAATAT A 
GCTTT AATTAC CAACATGACCTTTATCTGC TCTTCT CATTGTT ATAACAG TG CAGGGCAAAATG AGC CGTGT AAACTACCTGTAG A 
CCGATGATCGTTGATTAAGAGGTTTTAATCCTTTAGTGCTTTAGGACATCGCAGTAGAATAAAGCT 
TGAGTCTGTTTCAGTGGGGTTTGGCTGCCCCCATGOTAGATGACAAGGGCACTTCTATCCT 

5 5 GGCCTAGAACAGGGCTAGGACGAGCTAGTGTTTTGACGCAGTCCAGGTCATTGCAGGAGGAGTOT 

AGAAGGCCTCAGGTATTGAGAAGGTAAAGTCAACGTCTGGAATTGTGGTCTTGCCC^ 

ATTGCTTAGTGGAG AAAATGCTTGTTATGC CATTGTG AG AACCTGAGTTT CAGTT CCCAGAG CCCATGTAGAAG CTGAG CATAGG A 
GCACACATTTGT CACCCAGTG CTCCTGTAG CAAGAT ATG AGATGGAG ACAGGAGAGT CCTTGTGGG ATCAG G TAGC CCTAATG TAG 
ACAGCAAACAACTAAAAGACCCTGTCTCAAACAAGAGAG 

6 0 CACACACATGTTGAGGGATGCACACCCACACTCAC CCATAGGAACATG CCTGTGCACGCTCACACACATACATACACAT CATG CAC 

ATGCACATGCACACATGGACATGTGTGCCACTGTCCAACATGGAA^ 
TTATCTTAAAGATGATTACrGGTAAAGTGTACGATAATAATGGCAGT^^ 

AAGCTGTCCAGCAGAAACACTGAACACCCTTGATGAAGTGTAAT 

6 5 TAAGAAACTAAATGTAAGCAAAATGCCTCTGTACGTGGACTGATAAATAGTAATGTGTCCCT 

AAAATGAACAAGTTTGGAGTCTATGTATTGATATGGAAAGGGA 

AT CTCATCGAG TCT AGCCCTCCTTAT CTAC CT CATATGTTTATAGGGGG ACAGG ATCTACAGAGGGCC CATTG CAGTG CTTTCTG C 
CCTGGGAAGTACACATGGGCCCTGGGGGACTG G ATTTGAATAGG TACACTTTT CATTTTATGTATGTGTCCATTGCTGGAATAGTG 
TTCAGTATTCTGAACTTGCTTTTAAAAGCTACAGTGAGTAGGACT 

7 0 GTTGAGG AATAT AT ATGTGTGTGTGTG CATGTGAGTG CATG CTATAAGCTCTATTTAGATTTCTTTCAGTAG CTACAGGGAGTTGG 

GGTTTCTGGAATAGTCTGAGAGTGAGTGTTTGGTATTCATACTGCTGCCGGGTGAG 
TCCCATGAGGTGCTTTGAAGGACTGCTGTTGTTGCT 

TGTAAGTGAGACCTCTGGAGTAGATTCCAGAAAGTGTTTTCTCAACCACTGTGC 
CGTCCTGGCTGGCTTTCCTTAAGCATGGCAAGGAGAAGTTTACTTTTCCACTATTTTAATCT 
75 CTCTCTGTCTCTCTCTGTCTCrrGTCTCTCTCTGTCTCTCTCTCTCTCTGTGGGTGCA 
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GTGCTCCCATCCCCCACTCGGGCATGTGTGGACAGCATTTCCGTGTCAGTTCT 
AGGCCATCAGGCTTGCCCCCACAGGCCTGGCITCGTACTGTCTACC^ 

TACTCATTGTCCTGTCACCGAra^ 

TTTTGTAGTGACTCTCCAAAGTAGGAAAATTA^ 

CTAGGAAGTGGCAGAGCCCAGAATCAAAAACAACACACACAGTCTTT 

ATGTrCAGACTTCGAGGTTTCATTCTT^ 

CAGACTGGCTTTGCTCACCTGTTATGTCAGGOT 

CATCTTTCTGCCAAGTG CTACCTAGGACTGTAGGTAT TTG CC CTCACAT CCAACTATGGACCAG CTCTAGGTAAAATAGCAAGAGC 
AACAACAACAAAATATACACAGAGAGACAAAAATAATGACAGCCATTGGT^ 

TTTCTCATGACTATTTTTGAGT AGT TTGATTGTCTGACCCAAAAAT CTGAACCACT AAGTGGAACCAATATGCAAG AAAT AGTGAA 
GATTTGGTTTGTAGGCCAATTTGCTGGGAAGTCTGAAGCATTTTGGTGGm 

TGAAAGGACCATTTAACACTG CAGCATGT ATCCAACCCTGACT C CTTACATTGGAGAGATGGTGTGAT AATTATAGAG CAG CAAGA 

GGCAGTGCAGAGCTCAAAGCTGCAGCTTTTCATAGATGTAGGAAGAGTTAGG^ 

CAGGTTGCCTGGGCCAGCTCTTACAGGGCTTGTTTCrCAGCA 

TCCCTTACTGGGCCAGTCATGTCCTTTGTTCCTCTGTTAGCACTAGTTCT 

TAAGAATTGTGAAAGGTCATGAATTAGAGCTAAGTGGATGTTC 

TGTA1TTGGTTATGATGGAGATGTTGGGCC CCAATTCAAGTTG C CACACATATGAGAGGGAAGAGTTTACAGTTG CTATTTCTCTC 
TGGTTTTGCCITCTGGCCTGGAGTTTCCAAGGGAGTTA 

GGGGCTGGTGGTCCTAAGCCCCTTTCTTTCTCATTACCCTTGCAGGTTCTTCTCCCA 

AGTTAATTTTAAGTTTCTC^TTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCATG 

GGATAACTTGCAGGAGTCAGTTCTCTCCTTCTATAG 

GCCACCCACTGAGCCACCCAGCCAGCTTGCTAACCCAGAAGTACTTACTAGCACCTACT 

TTTTAAGTAGCTTAGGAACAAAGTAAAAAGTTCCCATTTGCT^ 

GCTGCCTGrcCCACCCTCATTGGCATACATTAGACTTCTTGCAGGCATTA 

T AAATT ATTTAATATACTAGATGGG CATGT AAATTTTT AC CAG CTTTTTTATCT ATATATTTT ATGTATTTG AGT ATTTTGT CTAC 
ATATC CATCTGCACGCCAGATGATGGCAT CAAGTTCCATT ATAGGCAGTTACAAGTCACCATAGAGTG CTGCAAATTGAACT^ 
ACCCATGGGAGACCAGCCAGTGCTCTTAACTGCTGA^ 
TCCTAGAACTTACTTTGTAGACCAGGCTAGCOT 

ATATGCCATCACTGCCCAGTGACCCTCCAGCTTTTTTAAAACCAGAAAAAAATCGGACGTC^ 

CACTGAGGAAGCTGGTGTATAATTTGGATTTCTAG . 
CCATCCACGAGCTTTCCGTCCAGATGGGCCTCATCCACAGCCTGTCAGAAAGCCCT 

TGAGCACATTTCT CT C CTGTT AAT G CGGTGGGTG CAAAGCTACTG C CTTT CCT CG TTATAT ATTTTG G AAGAGTG ATAGTTT CTTT 
TTATCTTTTTATCTGCAT^^ 

TGCAGCGCTTTCCAAAGTAAGAGCATTGGTCCATAGAGCAAGCATTGGTTC^ 
TCTTTTCAGCTTGTTTTTTGCGCTCTCTCTCTCTCTCTCTCTCT 

CAG ACTTACCATATTTC CCTTTATTT CTG ATTTTGT CATG C TTAAAAAG AT CTT CC C CAT CC CAGGGTG CGG AAAAACATTTACTT 
GCATTGCTTTTCTTTCrGTTTTAATAGGAGTGTGCACAT 

G CTTTTTTl'T'l'l'TT TT T TAATAAAAAGCATGAGTTTTCCAAG ATTAGAAGG AGTTGACTTT ACT CTT AC CAT CTTTAG CATGTGTT 
GGAGACAGCCGGATGGGGGTTCAGTTTTATTTTTCTTGTGAATAGATAGTTGATTT 

TCAGATGTGAGACTACATTCTGTAAC CATAAAAAATCCTGTATGCTGCATGTGTAGGC CTAAGGATGCTGTGGGAGCTTGG CACTT 
ACGAAGG C C CAG AGATCC CTGCTGAGTCT CT CAGTT CCATTAG CAG AT T AATGAG CG CTCCAACACAC CAG GT GATAAAC ATCAG C 
AGGGCCcWlAGATGCGTTTCCT 

ATGTTGTGGGACAGGGAAGTCTCATGGG CCACTTGAAGGATGAGCTGGAAGG CCATCTAGACAAATGCTGCCTCGTGCTCAGAACG 

CCATGCATTTCCGCAATTCCATACTACCTCCATATATTTAGGCOT 

CAATTTGACTTAGAAAACATGGTTAGAAGATAAATGTGTCTT^ 

ATTTTGAG CATAGTG CT C7^TGGAG ACCAGGAGAGGACATCAAAT C CCTGGG AG CT TGAGTT ATAGACAGTT ATAATCAG CCCAT CG 
TGGATGCTGGGAACGGAGCTTAGATCCCGTGGAAGGTAGCAAGTGTTCTTAA 
AGGGTTTG TATC CACTT CTT ATTGATT AAGTATTCT CAGTTGTTTTAAG^ 
TCTTTTTTAAAAATTTGCCTACATGTGTATATGTATGTA^ 

GGAACCAGAGTTACAGATGGTTGTGAGCCTCTGTGTGGGTGCTTGGAACTGAATCTGGGTCCTCTGCCT 

AATTGCTGAGCCATCGCTTCAACCTTCTTGCTTAGGAACCTTTATCTTATTTTATTTTT^ 

ATGCTTTAATAAGATTACTACCGGGAAGATCTCTGTGCAGCCTTCATGCCGCATGTCTATAGA^ 

TACTATGAGTATTCTATGAAATACCCATAGTGTATATAAAGGTGTCCAATAATATGTAGGTCCCT 

AAAAAATTGTATATTTAAGTCTTCAGTTGTAAAAAAAAAAAW 

CCTTTCTG CAT AG TAAACTATCAATAATGTAATTTCTTG CT ACTCTGT AATTATTG AT CACIT AGAACATGAGTTCTTTGTTTAAA 
TATTGTTATTGTCTGAATTTCATAAAGTATCTAAATTTAAGTACTCTGCCACAAATG 

TTGATG CCAGG CATGTCAAAATTTACAAT AT TGGGACGGT AGATAAATTT AT CAGC T AAGGTGG GACACATTT GGACAT CT AGG CT 

GGCCTTAGGTACAGAGGAAGACAAAAAAAAAAAACAAAACAA 

TCTGTCTTCCCAGACAGCCTGTCrCCCTGTCTGAATGTCCTC 

TTTTAAGTTGGACATATATGAGCTCTCATAGGGAGACTGCAA 

TGAGT^^GGAACCGTGCCTGTATGAGATGC^ 

ATTATTATATTATTATATAGAAAGG 

CAGCTTCGATAGGTGGGAGTGGTGCGCACTGTGTCTCAGCGGTTCAGC^ 

ATTACCCTCA^CTATAGTAGGACAGCAGCTGCCTAGGTCCCTC 

CATCCTCCGTGCTGACCCCTTCTTGTTGTCACT^ 

Sgtcttcc^ 

TOTGTGTATCATGTGTCATGGGCACTGCTTGGTTTTTG 

AG^CAAGGAGGCAAGCTCATGAATATAAAGGTATGTCCATAGGAACT 
TGAAC^T^CATCTTTTCTGGTAATTTCCTGCTGATC 

ccXIIgccctc^ 
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CCCCTACCCAAGGTGCTTCrTCTCTCTGGGGCCT^ 
AATCATGAACAACATATGGCCTCJTTATGATTCTCTTTCT 

AGATTGCTTCATAATAUT W1TUTSCCAT CTGCCGTACTCTATTTGGTAT CTTG CTAGGATTTTTG CCTC CAT1TT AAGTG ACATTA 
ACAAAATCCAGCC^GTGCATAGACATTCCTGTTCTCCCCTTGCCrCCCrrAATC 

CT AGAGGAGTCTGGGACTGTCATTGTGAAC CAGTGTGTCAGATC TCTG AGCAC C CC CTCCC C CCGGTTATATCCAGTAATGTTATT 

TTGGTCAGATGGCAAAGAACCTCXSTGTATCAGGTTAT 

CCAGCCCTTTATGCCTTTA6AAGTCTTGGTTCCCTTTTAGTAAGCT 

AGTGAGTGTAGT TTTTAAACCATCAGTACAGTTTGAAAGGCAA^ CTAACTT 



1 0 CATTTAACTGGTTTTGGAAAGAGGTGTGTCCTCrCTTTAAAC^ 
GGACAGGTTCAATTTCACTCTTTGGCCAGCATCACCTGA^ 
CTTCXCTTTTGACAAGAACAGGGAGTCTTCGTTGAAGC^ 

TTGGAATGCTCCTTTTAGTTGGTCTG'AGATCTCCCT 
1 5 TTCTTGTCGGCTITAAGGAGCATGGCTTGGTCCrc^ 

AG GAAAAATCTCAAG CTT AC CT AG AAAC CAG AATCC CAG AACC CTTG TTTGAAGTAGTCTTAGG CACCT AAGGCTGGGTGTGAGAA 

CTCAGAjGATTCTCTACCCCATTACATCTCCT 

ATTATATGAATGGTGTTTATGGAATTTATACTCrCTTATCaGAM 

ACTCTTTTGTTCTATTCAAAGACTGCCTTCCTGAGGTTAC^ 
2 0 CT CATTATClTGCACAGTTAGTTCrGGTGCAGAAATGAG^ 

TGATTATATGTATATTCATGCATGTGTCTGCA 

TTGTTTTTCTCTATATTTTATTCITTAAGACTCCTTCT 

GAATTCCCCCTCCCCCGCTCCCACGTCTTCACCTCCCCAGCAAAGTGATTCTATTTG 



2 5 TTTATACTACCTTTTTTTTTCITGTCTACrrGAAlTTCAA^ 

GAAAGACCTTTCTTTTGTTTCTGATAGCATGGTGTTTCTGTCTAGTTTGTCTTTAGAAAGTCT 
AAGCAGCTCITTTTGTGCAGAACTCTCCrre 

CTTGGCTGTGGTTTTCAGTAGGGTTCTTGACCAGTAGGGGCTCTGCTC^ 
GTGGGCCTGGGGAATTCTGCTGCATGAGGCTTAGGGAAGAAGGCACTTTGAGAGCT 

3 0 TTTCCCTGCTT CrGTCAGGTTGCTGCTGGCTACTGAGAACAGAA CTTAGTAATAGACTAACTGGGGAAA 

AGCCGACTCATCCCATGGCCCCC^TGGCTTTGTTACCAAACAGGAGGA 
CTGTTGGTATGCAGCAGAAGATTATGAGCATTGAAACAAATCTCT 

GATGTCCTACTTGCCTGGTTTGGAATAGGTACTTCTAGTCAAGGCCGTAACTATATGCTGCGAA 
ATTTGGATTCTGAGCATCCriTCCCAGGGTCTACCCACCCCTTCTAGATTG 

3 5 TAGG CATTTTT CTTTG AACTATGTAATT AG AGCTT C CAT ATCAGCTG CCAAATGTTCCTG CCTCTGTTAAAATTTTGTTATTATAT 

TTTATTCATTCATTCATTTATTCATTCATT 

AAGCAAGTACTTTTATCCACTGAGCCATCCTCGTTACCAATGTGTCCTACTCT 
CAAGACAGTTAGGAATAACCAGAAGTCATTCTGAGTCTAGCCTAGGAAAAGATGTG^ 

4 0 GGTT AAAATTG AG ATCTGCTCT CAAGAGGT CT CT CTTTAGGTTCTTAAAAG CAG CTTTTAAAACT AAGAG AATTACTG CAATGAGC 

ACATTACTTCTAGATGTGGCCrrCCTTGCTGCTTACTCAGAT 

GTTACATTATT C CAGAAT AACATG CAT CAC CATCTGAGGCTT ACAGG TCGGAGCAGTGGGGCAAAC G AAAGTCTTCGGAGGGAGCT 

TTGCAAGTTAGGGCAAAAGATGGGAGCCCACCATGTCTCCTC 

TGGTTCTTAAGGTGGTATTGCACATGAAGCACAAGTCAGAGCACCAT 

4 5 TGCTTCAGTGATGCGTTAGGTTGCCTGGGAGAGAGGAGGAATAGAGAACAGGGAATCCGGTGC 

GGTAGAGCAGGGCCTCTCTTCTCTCAAAAGCTGACGTGTCAGAATCCAGTTAAATGCACTTGACTTA 
AGGGAAATATTTTGATTTTTCACCTTTCTTATAAAAGAAAATTTTAGTTCT^ 
AGGTGAGCCCCGCAATCATGTGCACACACCCCTCAATACACAGTGGAC^GTA^ 
CAATAATAGTGTGCACACACCCCTCACATACACAGTGGACAGTAACCGTGTGCACACACCCCT 

5 0 GTGTCCATACACCCCTCACATAACACIAGTGGACAGTA 

CAGACCCCTCACATACACAGTGGACGGTAACCGGGTGCACACA^ 
CCTCACATACACAGTGGACGGTAACCGTGTGCACACACCCCTCACATGC^ 
TGCACAGTGGACGGTAACCATGTGCACACACCCCTCACATACACAGTGGACAGTAAA^ 
TTCGAGTCTTACTGCTT/TCATCATATTC^ 

5 5 GATGAAGGCAATGCTGTCCTACGTAAGTGTGCTGCATATGAGGGCTGTTTCAAAAAATO 

GTGrcTGCGCACCTGCITGCrCGCATAGTTCAGATGACATCTC 

GCAGCTCAGGCGTCAGGCCTGGTGGTACGT<3CTCCTAGCTGGTGAGCCATCrCATCAGCT 

TGTT AATGTAATTGAGG ATG CTGTTTCTTGGG CACAGTGGAGAT AGAAATGGAACCACAGTTG C CACAGCATTT TTATGTGT AACA 
GATAAGGTTTTCAAAGATTTCCTTATGTGAAAGTTAGGGGCATTATC 

6 0 AT AATGCAT CTT AATGACTTTTG CATT AGGTATGGTG CAGACTAAATGGACT ATCAATTTCCTGT ATTTTTTATG CAGT ACTT AAA 

GTAACGGCTCATTAAAAATGAACCCCTGGGCTCTAAAAAATTTTTCAAAATCTACTTTT^ 

ACTTCAGCTATAACATAGCTGCAGGCTTGTGGATGCAGGATTTAAAGATGTACTCOT 

AATGAACTCACCCACAGTGATCTGAGrGTGTGTGTTGGGAGCTTGCTTT 

TTGTC CCGAGACTAAAATAGGTGACAGAAACGGTAC^TTTCAAATGCTT CAGAACCAATTG AGAAACCirCATAAAOVACCAAAGA 

6 5 CATATAGCTTCCCTGAGCCCCAGTTTGAGCTTTCCTCCCAAGTTTTAT^ 

TCTGGTGGCAAGGACCACTCTAGAGGGGAGCCACAGCTCTGC^ 

CAGTTGCCTCATCTGTCAGTGTGTGGTGGGTAATAGCACATGTCCTTCCTATTGCTCGGGGTTGTGTAATG 

GATGTGAAACGCTGTCACATGCAAAGTATTGTTACAGTGGACACGATGGAC^^ 

TCTAGCCAACAGTTAGGGTGTATGTTTTGGAGAGCCCT^ 

7 0 CTGTGTGGGTTTCCGCCAATAGTTTTAATTTAGGGTAGGAGGAAAGAGAACAAGAGGGATGGCATGA 

CTACCAG CCCAG AGTAGGAAAAGTCAATTTCCCATTGACTTCC CATTAACTTATGTG GGAGTCTTTGT GTACTGAG AATTT AGAGC 
TGAGATCATCGGTACGTGGAAGGCAGAGOSGGATTGTCrrTTGCCAGGGC^^ 
GCGTGACTCCTTTGTTTTTAGAGAACGCATGCACACAGACACAGACGGGTTCT 
ACCCCCATATAAGCAGAGTGGACTTCCCAGAAGACCTTACAAACTGTGCCT 
7 5 G CTCAC CTTTCCATTT CTG AGTGACAGATGTGT AAACTCAGAG TTTGAAAGAC'TTT TTT AAATTG ACTTTT ACTT ATTTGTGTGT C 
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TGTGTAT AGGGGTGGCAAGTGTGTGTGTACATATTATGTG AGTGAGTG CAGGTGTGAGTG CA.CATGGAAGTCAGAGGATGAC CTTG 

TOTGTCAGTCTTTACTGTCTGCCTGTTTG^ 

TTCTGAGCAGTCTGCTGTCTCTGACCTCCATCCTCCATGCTC 

CATGGATTCTGGAGAAGTCCTCATCCTTTCATGG^ 

TATGGTAGATACAAGCCCGAGATTAAACCTGGATGCTCATATTTCCTAA^ 

ATCTCTGTTTGGTAAATGGTTGAGAGACATGCCCAAGGCCTTAAAGCT 

GGCTCAGTAGGGAAGCTCCTGCTC^ 

ATAGTCCAAG^GATTTGACCCCTCCTTCTGGTCTGTGGCT 

TAAATTl^AAAAAGAAAGAAAATATGTTTGGCCTTTAATCTC^ 

GAAAGAGCATCATAGAATTCXSAATGATTTGTT^ 

TGTTTGTTTGTTTTGTTTTTTTGAGACAGTGTAGCC 

CTG CCTCTG CCTCTTGACTCATGGG ATGAAAG CTGTGAGCCAC CACTGACC CCAG CTTTATGCATGTT TTTTAATTGTAGTACTTA 

GTCAATTTTAGTTGTAGAAACATGTTAGTCATGACCTTGTTGTCATA 

TGAGATTTTTTTGAAAGATACCT 

T/TTTGTTTTGTTGTTTITTCGAGA 



TGACrTGGAAGTTCTTCTGGGCCCAGCTCTGTCATAGAGAACCGTC 

AGCACCCATCGAATACTAGGATTTTGCATTGGTATTTATTTAT^ 

TTTTTTTTITTAATACAAACCAATTGCAGT^ 

CTGTTCCTCC^TTTCCCTTTCGGAAAAGAACAAGCCTCCCAGGG 

ACAAACCTTCACATCAAGGCTGGGTGAGTGAGGCAATCCAGTAGGAGGAAAAGCA^ 

C CC CG C CC CCACTGTTAGG AGTG CCAAAG AAAACACCAGG CT AACAAC CAT AACATCTCTG CAGAAGAC CT AGGT CAGACTCATGC 

AGTCTCTOTGAGGCCCTAGGAGCTCTGCTTGGTTGATTCTGTGGTCT^ 

TTCTTTTCCCTCTTCCTCTGGGTTTCC^ 

AAATTGGAGCAATCTTATACTAGAAAACATACCTGA 

TGGCTCAGGGAGTGGCAGTATTTGGAGGTGTGGTCTTGTTG^ 

TTTGAAG CTCTTAT AAGAG TTG CTTTGGGAAGG AGTT ACAGAGACAAAGTTTGGAG CTGAGATGAAAG GATGG ACCATGTAG AGAC 
TGCCATATCGAGGGATCCACCCCATAATCAGCATCCAA^ 



G CTC CCAATGGAGGAG CTAGAGAAAGAACC CAAGG AG CTAAAGGGATCTG C7VACCCTAT AGGTGGAACAACATT ATG AACT AAC CA 

GrACCCCGGAGCTCTTGACTCTAGCTGCATATGTATCAAAAGATGGCCTAGTCGGCCATCACT 

GGAAACTTTATATGCCCCAGTACAGGGGAACGCCAGGGCCAAAAAGGGGGACTGGGTGGGTAGGGGA 

GGGGACTTTTGGTATAGCATTGGAAATGTAAATGAGCT 

TATCTCT/TCGCAGCAGTAAAACCCTAACTAAGACACACAC^ 

GGATCAATAAAATAAAAACAAAGAGAAAATTACAAAGAATAAGTGAA^ 

AAACCGTT ATT CAAACTAAAAG GG AG AGAGAAT AG CT AAGTT AACAAAAT CAG ACACAAAAAAGAGGACATGATAGATAG ATATCC 

AGGAAGTCC^AAGAATCAGGATGCCATACTTTCAAACOT^ 

GTATTTTTAAAATTCAAATTGAAGTGCTGGGTGTGGTAGCTCATG 

GATGCTAGCCTGGTC7ATGTAGTGAGTTCTAGTCCAGCT 

AAATAAATTCAACTTGGCTCTGAAAAGGATTTGTTGCAGAGTGTGGCGGC^ 

TGGTAATAACAAC3GTACAGCAGCTTCGGCTTTCCTAGGGAACT 

TGGAGATGCTTCACCTCTGTCCTTTAATGATTTTOT 

GCTTGCTGAATCATCCCCCT^^ 

mAAATAGATTTTTGTATTAGACCTGAAAATGTTGGCTTTTTAATTGAA^ 
(TTCTTAATTTTATGAGTGCTGTAATAATGGCA 
GTGTGTCGCTAACTAACCAGCAAC^^ 
TTTTCTCC^TTCGATATATTTTTAAG^ 

TGGTCTCCAAGGAGATATTTAATGTATTGAGAGACTTCAGCAGCCA^ 
CTMGAGTTTGTGTC^ 

ATGGCTTCTCCTCrGACACTATTGTATCTTTTTAAl^CCTC^ 

AGCATACAACCTACATATTCTGAAAACTAAATACCTGATC^ 

^TAAATTTTGTTGTTTTTCTTATTTTCCCATCTTAm 

GTATTGCAGTGTGGCAGTGCCATGGAGTCTTTGGTG^ CTAACCTCTTCCTTTGGGTGG CCTTTCACTGAGAAT 

SagStgtaccgctgtgaag^ 

C^C^CCTCCTAGGC™ 

CACATTGCCAGAAGCGTAC^^ 

rrVlTGGTATACTTGACCACTGTGTACCT^ 

SSStCTIGGCCMT^^ 
TTCTCATCCTATTCAAGATACACACACACACACACAC^ 

TGCTG^^GC^C^CTTTTTCCCTACCTTGTGGCATAGCTGGGGCATGCCCGGCA 

aaagggaaaS^gtaggctcagcctccaaaactctggatctccgctacagcttgagtccccag 

tcttctoatotcgaggtggacggcctaagtagcagtttagacccacctcttcagagaggaaagc^ 
^ga^tgtggaSc^ 

TCCTAATCGAAnraACTA^ 



aaarrTrrrf^TTTTCTTTCTCCTTATCATCTCAGACCATGACCACCT 
TMCGCAGAGCTCTGAGTTGtS 
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AAGAAGGCCATCTAGCTTTTGTCCCTGAGCLAAGT^ 

AAATCCAAGGGTTTCATTCTTTTACCAGAGGTTCACACCAG<3CTACA 
TAAAGCCCACAGAGAGCCCAAGTGGCTAAACCTGCTGTGA 
5 GAAGAACACACCACAGTCTTGTGTGCACCTCrrTACTCAGGGTGTTAT^ 
CACCTCCX-X5GGAGACAGAGCTAACCCGG 

GTCAGAAACATTGTTTTTCTGGGTT CAGTTG CTTAATGGT AAGAG GTTCAT AGTGCTGTG CCrri'l'TI'T TTCC CTAAGTACTTTAT 
AG&AGGTTTCCTACGTRC CTTTTTTTTTrTTT AATTTTGATCTTCCT 
ATTACAATTGACTATTTGGACACAAGCCTGTGTGCCTTTCCTTAGGT 
1 0 TCTGTGATT CACTCACTCATT CAC CTACTTCAG CTAG ATT CCAAGAGCCAGGATATTCT CAT 

GCAGGAGXSTCTGCACGGTGATGTGCAATCCAGGTCCTCCCCTCCTCTCAAGAC^ 

GCGGAGAGAAACCCAGATTAGGACTGT AAAG CCAAC CCTTACCAGG CAGAATGACAAGTGAT AAATG CTGGGG CATAGGGTGCCTG 

TGAATACGGAAAAGCACTGAATCCATCTTTGAGGAAG 

GCCCGTGGGACTGGGAAGGGTGTTCTGGGCAGAGGAGCGGTGCATTTAAAG 
1 5 GAGATTCAGCCTGTAGGGATAAC^GGAGAAGGAGGGTGTGTGGGAGAAGAGTTCTAA 

CCAAAGACAGAGGTGGCTGCAGGTGTAAAAGGG CAGGAGAT CT7GT AGAGG TC^ 

GGAGGTGGCCAGGGCCAGTCCAAGCCATCTGTTAGAGCACAGCATGG 

^GGGAGGGTAGAGAGGCAGGAAATGCAGTTTTTGATTGTAG 

TGAGAGAGACATCTAGGCrroGAAATAAAGCTTTGTGACTCGGO^ 
2 0 GGGGAAGTGGGGCAGGGG ATACACAGTGAAAGGG AGGGTGAATAAGG AGCCAT^ C CATTAAGGG CT CATTCTCA 

GGAACTAGAATGAGGCTCGCTGAGGATGGGGGTGCTGNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

GTGCCCTTTT CTCAGCCAGGGAAGATGGCTGAGCTCACACACTGTGGGAGCAATGAATG CAGAAGCCAGGGCTTTCTGCTAG CTCA 
GCAGTCTGCTTGGTGATTCTGTCTGTTTAGTTAAAGGTTTGGCGGATATGCTACCCAC 

2 5 CGTGTTT CATGTITTARGCA.TTGAACTAGAATAAAAC'1'T 1' TTTTT T T ATTGTTCACTAATAAC CT ATGCTT ATTGGT AAGC CATTT 

GTCTCACCTATTAAGCAGTTGATTAGGGTAGATGTATGGGAAATTAGAAAAA 
CAGGC^TTTGTGGGGCTGAGGGTTGTCCCCCOT 
GTTGATTTTCTTTTTTAATATCATAGATTTCCATTAAAATT 
TAACAAATAGGGACACTGCTTGATAAACTTGCAAACAAAGGAAGCAATTGTACT 

3 0 GAGATTAATTATATTTTTGGATAACTACACTAAACTCTGCCA 

AGACTCTGAAACCGATCCAAGCCTCGTGCTAGAGATTCCATTAGCTATCGTAAAA 

ATTATTG CTGCAATTACTG CTTGT AATGGATTATGGGAG GGACAG CTGAAG CAAGCG CACT AACTG CTG CCTGTGGGATT CG TTTA 

GGCACCTGTTCTATAGTTTCTGTATGTAATCTGCGCTTTACATGCGAGAGTGCAC 

TGATTGCTGAGCCTTGGGAGGTGATGGGCTGTGTGGATTTGAGCGTATGAAGTC 

3 5 CTGCTGTTTAGCCTGGGATTTCACAGTAACCTGGGTGATAAGGCAAAC^ 

ATTGGGTGACTTT CTGAGGG CTGTGCACAGTAGTGATTGGT AGAAT CAGGGTCCAGC CC CATTTTAGT CAG T ACACACTTCACAAA 

GTTAGCTGTCTTGGTGTCTGAGGTCTAGGCTTCT^^ 

CTGCCATCACTGATGATTTTATTACTATTTAGAACAAAATGGGAGAGAATAAATC 

4 0 TTCATTAAAACCAGTGGTTTAGTTGTGGTCTTTGCTATTTCTCTGTGAGGCTTACAG 

CCCAGCAGAGTGAGATACAGCAAGGCAAGGCAGCATTACCCCACTGGTGC 
GATGGGCCCTCCAGGCCTCGAGCGCCTTTATCATTCACCTGGGCAGCAA^^ 
AGTGCTCTGTGCCAATGTCCTGCACATGCCCOSCGTTTTTCCCT^ 
TCACAGTGGGTTCACACCATGGGCTTGGTTGCTACCCAGTTCTGTGTC 

4 5 GCACTTGAGTTTGAGAG AG CTGTAAGAAAGT AAAAGT AAAGCCG AATT AACTGTTTGAGAG AAAAGTCTTAAT CAG CACAGTGTGT 

ATCACTGCAGAGGGACTCTGTTAATCATAGTACGGAGAGCTGGGTATACACr^ 

GCAGTTTTCCAGTTG ATTTGG CACAGTG CT AAGATGAAAGTG AAAG CCTITTT TATTTATTTTGGGGGGT AAATAAAT AAAG CTGG 
AAAACAGGTCAATTTCATG CATGCAAGGTTTTATAGTAGGGGAAAGGCACTG CACCTGTCCTGACCCTCACACTCGAGGTTATTTG 
TT CTCAGACTTGAC CCCGAG CACCCGTTT CAGACAATTCZT ATAATTAG TTTG TT AGAT AAAGCAT AAGGATATTTTTTT CCTGATT 

5 0 TCTAGGACCATAGCAAAGTAGAAACCTACTGTACATAGCAGTTAATTAAAGAAAAG 

CTTGT AT CAGT AT AAAAAGGCAG GG AT TT CTTTGACAGCCACTG ATTGGAAAGTTG CTTGCAG C CTTCCAGAAG CAAGGGTACATP 
CTGTGGATCAATAACCGGCAAATTTCCCATTAAAAAGAATACCTGAG 
GAAACCTCAAGTACTTGTGTGCCTCC^ 
CGAAAGGTTGACAGCCGACATGATTGATATTTCACrTACCT 
55 TTTCGGGAGCAGCCTCTGTGGCCCTAGTGTATTCCAACATGATTTGCA 

AGAAAGAGAGAATTGCCTTrCCTTTACAGTTTGAAAGCCATTGCAGGAGGGCT 
TGCCCCTGCCACCACGGTCACCTTCACCTTCCTTGTTTGC^ 

GTG ATG CAG AATATT CAG ACCTTTG CATATTTTGGAGG AAAAAGATTG ATTCTAAAATAACCTT AATC 

AG ATT C CAGTTTTT AAAATTT ATTTTT ATTTTATGTTCATTG CT ATTTTGTC^GGT CC CCTCGAAGTAG AATT ACAG ATGGTTGTT 

6 0 AGCTGCCATGCGGGTGCTGGGAATTGAACAGAGTCATCTTGAAGAGCATCCTTTGCA 

CAGTTCTCCAGCCCCAAGATAGAAGGGTTTTTTAGAAT^ 

GTGGTATGTGCATATTTAAATATAAAAACAGCAAAAATGAGAAATGGG<3AGAG 

AAGTCAGGGCGGGAGAGATGGCTCAGTGGTTAAGAACACTGACTGCCTTTCCAGGGA^ 

GTGG CT CACAACCATCTGT AATTC CAGATCTTTTT CTGGT CT CTGCAGGT ACCAG GCATG CTAGTGGTGGATATATG CAG G CAT AC 

6 5 ACATAAATAC CCAT ACACATAAACAAATAAAAAAACAATTTT AAGC CTAGT CAGTGGTTTTCATTGTAGGTATTTTTCTTCATGAA 

CATTATTTTCTTGTTTGTTCAGCTTAAA 

GTGTTCCAGG-AAACAGTGTGCTTACTATTTTGGGCTTATTTTATGAGGG 

ATG TGC CTAG AATTTTACAGTAAG CTTTGAT C CAT CTTAAAGT GTG CCTAAAG C CAAATT ACGAAT CAGAAGG AGACTTGTTAG G A 
GACTCCTTTTACATGGGAGTTCTGCTGTCCTTGGTTGAGTACTATC 

7 0 TTAGCTCTGTGCTTT AT ACACAAATGTGTGTGTTTGATT GACTGACATGTG AATT CAAG CCAAATATTGAG C CTATT CATTTTG CT 

TT ACAAATTTCAG AGGAAAAG CACAAGAAGCAATTTGAGT AATTT CT ATAAAATGCTTTTTTATGATTT AAAATATAT AT CATTTT 
AAAGTATGTTAAAT AC T AACATGGGGT CTTAGTTAATACTGTT AG T CTT AAAG AT AAATGGTGGT ATGGACGATAACCC CCT ACTA 
GACTTCCAAATCTTGAATACATTCACTGTAGCAATTTTGATOT 

AG CCATTTACAAGAGTACCATGAATT AAACAGCCCGTAAGTGT CTGT CC CAG TG TCCATACATTACTGAGT AATCTCTTTCTATTA 
7 5 TCT CTTTGCAGT ATTATTCCACAGTAATC4GAACAGCAAGT AAATC G CCAG CT AATAGAG C CG CTGCAG ATATTTCCAAGAGGTAAT 
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GTTGAACAAATTCTAATCAA^ 
ATCATGTTTAGCTC^T^ 

AAAATAT CT AATCT ACTAGAGTAAATT AAACTAACTAGATTGTTl'rrrrTTTT T T GAAGTGCAT ACTTACATTCG CTT AAAG AACT 

CTTTGGGAGTCCCCATGATTCTAAAGCAAGGCAGATGGCTG^ 

GGCrGCTGTTGAGATGAGAGCTGTTAGCTTGTGTGGAGGCTGTGAGAAATGGGG 

TTTGTCTTGGGATGGGGGAGGTTTTTAAAAACT 

GAATCCCTCCTTTAGGGTGCK5CAGTACCAGCCCTGTACGGGCT 

AGTGAAGAGTGACTTAGTTATACCAAAGTAAGTTm 

CAGACCAGGAAATCCTCTAAGTAACTCTTCTAGAAATAGCTTAATTGAAAT^ 
GTACAGATAATCATGGTATGAGTTTTCCAATTAAACATTCTAGACm 
CCACTGGGCTCTATTGTAGGGATGGTCTCAAAGGCAGTTCGCTTG 
CITTGGGCATTTGCGTGGGCAGTAGAACCAGGGGGAT^ 

ACTGTGCTACTTAAGATCAGTGGTCATTCAAAAGAAAGGTATGATTAATCGCGTTCTGTGG 
AGGCTGCCGTGCTTTTGTAGCTGTCTCCTGACAGTGACCCAGATTAm 
TAAAAATTA1TTCCTTCACAAAGAATAGTGTTTTACCAGCTC 

GAGAAT CT AGT CACCAGGCTGGAGGG CTTTT CTTTATACTAAAC ~™ 



CAGTTCCC 




i . | UJ , , ^ . ^ v . TAGGTTTTTCTC 

CTAAGCATTTAGTATACTGCGAGTGAGAAGGTGACITAAATTACA^ 

GGTGTGTAAAGGGAGTTTGAGGGGTTTTGTGCGGTGGTGCTCAACCTGTAGCT 

AO^AGGGCTGTCTTAGTTATGTTATGATTCACAACAGTGOT 

GGATCACTACAACATGAAGGACTATATTAGAGGGTCGTAGC^TTAGGAAGGCrrGAGGACCACT 

CACAGCTATGACATG CTGTGAT CACTTCATCCCTGGCATAACT AACCCA.TGTGGCTT AATATGCATT GAATGTAGT CTCT C CAGG 

CACATTGAGAATTTCAAAATGGTAGTCCATCCCCCTTCTCGGTGCTGTGACTCA 

GGTACCTGACTACCTCACTACAGATGTAGATGCCCCCATGTGATGAAATACATATTTGTTCA 

ACGGACTAGAGACTCTTGGGAAGAAGGAGTCTCAGTTGAGGAATCACCTGTCTCCATT 

TGTCTATAGTGCCAGCCCTGGGTATACTATCCTGGACTGTATTAAGAAAG 

AGCCCTCTATGG CCTCTACTTTTGTTCCTG CTTCCAGGTTTCTGCCTTGG CTTCCCTAGATGATGGACTGTAACCTGTAAGCTAGA 

AAAAACCCITTCCTCCCCAAGGCGTTTTTTTGGTCGTGGTGTTTATCA 

TGTG^CATGGTAAAAGGTTCCCTGGGTCTTCCTGATTGGTGAGA 

TTAAAAGAATCTTTAGGCTGAGCAAGTTGTTGGGTCTGGAGACAAAGATm 

ACTGTTGCCTTGAATAGCATTCCAGTCTTAGTTCATGGCCAAGTCTTCCCTGTGGCTCAGAGC 

AAAATTGGTTAGCATTATAGTAAOTGCCTTCTTAACTTCAGGATCTATGAGGTT 

ATTTCTAGTCACAAAGGTTTGATGGGCAGTTGAGAAGCAA 

GGCTCACCCCCATTTGCCTGCACAGGTTACCGCAGAGGCAGTCGTCTGAGAGAATACT 
AGGCAGCCGTTGTTTTGTTAATCTGTATTTTAAGGAAAGAAGATTTGGAATCTTAATATATTAAGTATCCT 
AAAAACATTTATTCATGTTTATGTATTTTTGGAATATTAATGTAATAAATTCATTTAAAATATATA 
TGTGTATGCCTGTGTACCATGAGTGTGTGGGAACTGGAGAAGGTCAGAAGAGATGTTTGATCCCCTAGAACTAGGGTTA 

TTGTCAGTCACCATGTGGGTATTGGGAACAGAACCTGGGTCCTCTGGAA^ 

AG CCCTAAT AAT ATAATAAATT CAAATTTACTT GAAG CT CCCAAGTG AAAGTG AAACC CAT ACCTTGT AATTAT AATG CATGTCTT 
CTTAAGCAGCTTGATTTCTGTTGTCAAGAAACAAAAAGAAAACAAAATCAAATTC^ 

AACAGC CTGCTGTAGGACACTCTAGTATTTGGG C^GAATTGGTGTAGAACAGGG ATAGGG AGCTCGTGTTTGAAACGTGATTACGA 
AGCACCTGCTGGACTAGAAATATCTCACTGTAGCCCA 

CAGAGCTAAGTATTCATGG ACATGAAGATTTT CTGGAAACCTCATTAAATTATATATCTGTTTAAATAGTCCAGTTG CTTATGGCT 

TTACAAG ACTTGAAT ATACGCAGACTT G CAT ATT CCATCTTGGAG TTC CCAGCAGTCTTTTTG G CATTCAAG AACCCTTTCT CCAC 

CCCTCCATG CCTTCAAAGCAACAAAAGCAGAAGCCTGCTGTTTAGCACGC CCTTTGACTGCCTTGTGCAGTTCTGTAGCGAGGGAA 

GACTGCAGACATCTGAGCCACGAGAGGAATCTCATCATrCTAGCT 

ATATGAAGTGGTGTCAGAAAGCTCTGTGCATTTATA 

TCTGGGGGCCACmTCAATCATTTCTCrCTGCTA^ 

GTAAATGGCCCTGATGTGTTCTACAATTTAGTCAATTACT 

AGGTACGAATTTCAATAATTGTTTCAGTAAAGTTAGATGGATTTGTAATGCTG^ 

ACTTTCTCAAGCTCTGACAAGCAATCm 

TTTGCTTACAATGTGTATOTACGTGGATCCTGGCCTTTGGTGCTGTATTTTT^ 
TCCAGGATGATCCTGGGCTCTGTTGTCCATTAA 

AGCATATTGCACACAGGCTGCCCTTCCTCTTCTCTCCACATATTTATG 
TGCATGTTTTAAATTCTGAAOT^ 

AGATAAGAGCTTACTGTGCTTGCATCGGCAGCCTGCCT^ 

cJ^TCTCAACACAC^^ 

Sa^a^CTTGAGTAGCTAGATAGCTC^^^ 

£^^TCGTTCAGTT^TAA*GATGA^ 

CCTGAGTWGATCCCCAGCACCCATCTA^^ 

GMCCCAGGAGCTCACTGGTCAGTCACCTAGCTGAATCCATGAGCTACAAGTGCAGTAAGAGG^ 

TGACTCATTAAGGAAGATCTCATACATGAACC^ 

TACCACACACACCCATGCAAAATA^ 

M^StcCCGAGTCTCCAAATTATTTATO^ 

TT^GCTCCCCATCTTTTGACAGTGTTATT^^ 

^CTAGACCAGCAGTGCTGCTCATTAATGGCA^ 

CACTTCATGCrAGCGTCACCCCACTCTGGGACAGATCTTAAGCCTTTCATAGTCTTCTTTTATCCC^ 
XGGCAATGCGCAGGTGTTCAGTTAGCCTGCCTGTAGACATTCCTCACCTTGTGATCCTGGGAGGTGCTTGCT 

Z^^ 2CT GfiAAAGCTG^ 
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TGC^CAGCCTTTAACAGGGTGAGGTGCAGCTTAGG 
TTTACCTGTGGGTTTCTGGTTTGACT 

ACCTTGTCAGTCACTCX3CCTCATGGTGGGGACATGTG CAAGAGTGAGTG CACCT AATACCGG AG C CACAGACTCTGGTCCTATGCT 

CCTGCTCT L 'I TCXTTA TTGTCTTTAATTGAAACATTTTATATG 

CCACTGCCCCAAGAAGCCAGAAGTGGTCGGAGCTCCTGGAAOT 

ACCAAAACAGGTTCTCTGTAAGAGGAG CAAGTACTCIT AACCACT AAGCCAGGTCTTG CTTGTATTTTTTTTCTTTAATTTTTGAG 

ACTCTTGCTATCTAGCTCAGGTGGAAATCAAACATAGCAGTCCCCCTG^ 

ACCACACTTTCTCTACAATTCCAGGTGACCTAAGCT 

GGTATCAGTATCTACCCAGGAGCCTTCCTGCTGAGAACACAGGCTCTAAAAGGC^ 
CMTTAGCTAGCTAGTCACCAGCATGTCCTTTGCCTGGTCTTGTGGG 

AGATT CTGGTTTG CATGCACTGTCTCCATACTG CCTGGACATGTGAG CCG CGG TCTTGCAGACTAGAGGCGTAT AAATAG AGGACT 

6CATGCTCACAGAGTTCCTGGCRGGCAGAACS3GTO 

AGCCXZTGGGAAGTCAAGCrTTCCCCrGGTATTTTAGATT 

TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTCTCTCTCTCT 

TTGTTTTTTTTGTTTTTGTTTTTTm 

AACTCACTCTGTAGACCAGGCTGGCCTCAAACTCAGAGATCCGCCTGCCTCTGCCTCC 

CCACCACCTGGCTGAAAAGATTTCAAATCATAGAAGCAGTGTTACrrTTCCCTACAAGTC 

TGTCTCTTCATGGAGTACAGTAAGATTCTTGGATCAGATGGAACTTTACCACCT^ 

TACGGGAT ATGG CAGCCTC CTAG CTTGTAAGGATAGTGGTT CCTGTAATGG AAGAATGTTAATTGAAAAAGT CCT ATGGGTGTTTT 
AATGTT ACCCTGCATTGG CCATGTT CTGTGCCCCAGGTTGAAAGCTAATCAG CAG ATATTTGTTGAGTAT CT CTT ACTCATT AGTT 
GCTGTCCTAGGAGTATGACGTGTCAATAGTGTCGTGAGGGAAGCAGATACATGCAGAG^ 

T TGAACTGAATACTCAC CG CTGG CTCCTTGGGG AACGTG ATTG CTTCCATGTAGACCTGATGCACAAGGAGGACCAAGTT ACATCA 
CAATACAGACATCTCAGCTAAAGGTGAATGTTTGTGTACATGTTTGT 

GGTAGGACAGGGGTGTAGGAGTTGGGGLAGAGCAACTCAAGTGCTGGAAGGGAAGGATGCAGATG 
GAAGGCCCATCAGTGCCTACCGTTCTTGTCCTGGCCTGGAGACCAGAGA 

GGTAGAAACTGTCTGTGGGGAGCAGAAAGAGGGCAAACACAACCTACCCTGCCCTTCTTTTATC 
TTAAGAGGTTCTTCCTTTCTGAGCTGGGATTTGTC^CTCrTCCACACCCACTTG 



CACTTGCAGCATCCCTTTGGCTGCACAGGTGCTCCAGAGAATCTCCACAGTCTCCTCT 

CTT ACTAATTGT ATT AGT CT CCCTT CACTATAATGG AAT ACCTG AAAT AGTGACAT AG CC CATTTTGTTAAGAT AAAAAG TTT AGT 

TAACTTTTGGGGNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNinilJNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNl^^ 

GGAACTTCTGAGTGTTCTTATGCCCTCAGGATGAAGTGAGACCCCCGCCT^ 

ACCGCACACTCCAGCCTTGCKSGTGAGCACGTC 

TTCCTCACCTTCTCTCTCAATCTGGCTAACCTTAACTATC^ 

AT AGTAGGTACCC CGTGT GG ACTC CTGTATCTTGCCCTGC'IT CTTTTGTC TGTAGCAGGG AG CTCATTG GAGCACAATT ATCT CGG 
TGTCAGGCAGTGACTGCAGTGTACCCACTGCAAGCCCGGGCA^ 
GTGTATACTTATATCCTACATTC^GATTTCTCTCAGCCTTCTTTC^ 
TCGTATTTCITTTTAGTTTCCGAGACAGCATCTTGC^ 

GAAACCAGAGGAGTTTTTCTGCCTCAATCTTCAG AGTGTTAGGATTGT^ CATGT C CAACTCTACCTTAACAGAA 



AGGTGGTCATGAGCCTCTTTGTGGGTGTTGGGAATG^AACCTAGATGCT 

TTCTCTAGCCCATGGCCCTCTCTTTTTATTCTITTATGC^ 

TGAAACTAAGACCCCACCCGTGTTAAACATGCGCTCTA 

GATGCTGGCATGTC^GTGGGATCAGTTTCCTCCrrGTTACTT^ 

AGACTGAGCTTATTAACCC^CATGGTGTTCCTAGGTCA^ 

CTTTCT CCTTCTGTAAAATGAGGATGATTAAGCTCT.CT TAG CAT AAGATTG ATG AAG CT CTGTTAAGGAATCA 

TATTTTG TTT CTTGGATGAAAGACATTTTGT AAAG G CCATG AATCAAAGTGTT ATTATTTAAG CTTATGAC AT AATGGACTATCTG 
TGGAGAGACCACAGTGGAAGCTCTGGTTGGTTGCTTGTC 

ACAAAGACTTAACTATTTCTCAGACTGTGTGTGTGTGTGTGTGTGTGTGTATTAATC^ 

CTGTAAGTT AGG AATAATAAGCATACATGAAGGAGTGTAGTTC CATTGCTCA 

GCATTCTTCAAATGAGCTGCATGACCAAGTTAGTAATTGGACAGGT^ 

TAGTTCCAGTCTACAAAATTCITGACTGTTCCCCTGG^CTGATCAGC 

G CAT AGTTT AGACAGTTTTAAAAAGTGTGGGAACTG AGG CTCAGT AGTCTGTGTTGG AC CACAGCAG AATTG TAAGG AGT ATTTAG 

AGAGAATGATTATCAGAAAGCTAGCTTGAGTCTAAGATCATATTGAAGATC 

TGAGCTCCGTTCCTACTTGACATATTTTGCTCTTTATGTCTT 

TTTCTTGAACACTTAGCACTTGGTTGTTCGGGA^ 

TCCAGTCTCTGGTGAGAGTGTAGGGGAGGAAAGAGCCACACTGGCAATGAGCAGATG^ 
GTGGACAGTTAAGGATAGAGATGGCCAGCTTCACATTTCAAAGACTAGGAGTTGGC^^ 
TATCTATCTATCTATCTATCTATCTATCTATCTATCTATCT 
CCTATCroTCTATTTTTGGAAGGTGTGTCCrTCCGTCCATC 

ATGTCCGTGCACCGTGTACATGCAGTGCCTACAGAGGCC^^GAAGAGGGACTCAGATTCCCCT 

GAACCACCATGTGGGTTCTGGGAACAAAACCTGGGTCCTTTGAAAGAGTATCTAGTGCTC 

TGTGAGATGGTATTTTATATCCCTTGCTACTCAGTCTGGTGGCTGTGGAGGCTTAACOT 

AG CTCTTT CTTAAG CATTTTCACT AAACAGG ACT AAG CT ATCTGCAGAAATGTTTG CAT AATGTATT ATCTTCTGCTATTGTAACA 
AATTCCTGAGCTAATAAAGAAGAAAGGCTTATTTTGGTTCACAGTATAGAGTT^ 
TGGTGCAAGGACAGC^TTTCACAGTGGCACCATGTGATGGAA^ 
CAAAGCAGGTAGATATTTCACAGTCTCTOTGGAGAGCACGCCCCCCACAAACG 

AGTACTGTGGTCAGGG C CTGTGGGATT CACTCTAACTAGGT AG CACATAGTCAT ATATG TTC CG AT AACATTTTAATAATTACTAT 
GTAATAATGGAAGCTGAGTCTCCTAGGGAAGCAAAAGAAAGAGAGGGCAGCCTT 

TAGTGT ACATG CAGCG CTGTTTGGT AATGTGTCACTTGTCTG^3TTACCTAT AAT ACAGAT ATGCTG C CTATTTTATAGAAAGAGTA 
ATTCACTTTCCAAGAGGAGAGGTAATCGGTCCCAGGTCCCATAGCTC 
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GTGCATAACTCTTATCIXrTAGTAAGAGA 
GAATATTCTTGAAAAAGCATCCAGTGTTTTGTGC^^ 
CCGTAGTCTGCTTCCTAACTCTGGAGACTCC^ 
CAACAAAACCTTCTTAGAAAAGGraTCTT^ 
5 CTGGCCTCCAACTCCTGATOTAGCTAAGGATGACCTTGAACTG^ 

CAC^CTTTACCTGGAACTCAGGATCAACCCCGGGGCCTGGATGTGTGCTC^ 
CTTGGTCTGGAATTTGCAGTAGTATTGAGAGG 

CAG CTGAACT GAG AAAGATACTGGCCAACGGCCAGGTAGGT ATTGTTG CATACT AAACT CCAGAGTAGCACTGTTG CTGT CTGTCC 
AAAGTG T AG CCAT AGTGTTGAATGATGAAGAGTCTG CACATTT CTT CTTGTCATAAATGAATGCCTAAG CTTAATTTGATTGGCAG 
1 0 AACAATCTGCAGCACGTTTTTCAACCTGAGCTAATGGGTTCCACTCCCAGC^ 
TTCAGTTTAAAGAAGCCAATCAGCTTGGAACAAGTCG^ 
TTTCATTATCTCTGGGGGCGAAGGAGAAAACTGT^ 

GGGGACTGAAGCCAGATCGGTCAGACCGTGACACATGTGTCCCTGGGC^ 
TGAAACTGAATGACCCAACCCAGCCAGGGAAGGACAAAGAGGCCTGCGAGCCACTG 
1 5 G ACACTTCTG AAATAG T ATAGTAC"l"riTAAACATTCACTGGTG AATG AAGCT ACCTACAGGGGCTG CAGTTCCTCAGTAG CAACAT 
AAATGCGTTTTCATGTTTGTTACAGTGAAGCCCTTTG^ 

GCTCTATTGACTGAGACATCTGTGTCACrGCAAATAAAAGACTCTTATTGATTTTCTTAACT 

GAAGGCTG AGATCTTAATT ATCTCAGATG CTTATTCAGTG C C TTCG CATG CTGACCTCTCCAAGCCT CG TTGGCATTTCATGAAT C 
AAAGACTTGTAGTGCAGCATGTCAAGAGGACAAAGAGGGGTCCTCACGCGTG 
2 0 GACGTGCGC CCT AAAACAG GACTCG CGGAAGAACTTCTGG AAGCATGG CACAC AAT CTTTTGAAG AGTTACAAAAAATTGAGTTG A 

TCTGGTTCTGACAGTCTGTCAGTCAGTGTGTGTGAGCTCTAGTTCTTAAATGTTCCACGTACAGTGG^ 
TTGATTGTGTGTCTAATTCTC^TAGTCTCAAATTCTGCGAT^ 

AGCCGGTGCTATATCGTCAGCCTCC^CTGTTAGATAGCATTTACCGACACCTGTTACCTTGAAGCAC 
GAGGTCTGGCTGGCTCCCCTGTGCCTGGTTCACACTGGCAGCAGTGTGAACA^ 

2 5 ACATGTTCAG CCTG CTTG CATT CTAGTTACrTAATTACAGTTTATGGACAAGTCACTTGG CT CAGCTGT AAAATAT ATTTCAA 

AATAGACATAAATT AAG AAGTTTT AACTACATT ATTCAAC CT AGGO C CTGT CATACAGTCTTCT TTCCTA 

AATGAAAAACACTT CATGTTCACTGTG CG CAC G CATG CACACAAGCACGCACGCAC C AC AT CAG CATTCTG AAATAGAG CC CCACA 
AATATCAGAGTAGTGTTATGAGGCTCTCTGTAGTGGAGATTGGCACA^ 
CTTGGGGATGTGCTAACCCCATAGTAAACAATGTGACAGATTTTTTTTCTTTTTAA 

3 0 CAAAAATAAATCAGTAAAATAAATAAATAAATAAATAAAG 

CCTGCTTGTCCTCGGCCCTCCCTTGGTGCTGTGGAGTAACTAAGCMGC^TGAGATTG 
GGGCATCAGTAAATCTCACGGATGAACACTGACACCGTTTGCTATGGAAGGT 

TGT AGTTTTATTTT AGTGGTTT AACT CTCAAG CATTAATAATGATTTG CC T TC C CACT AGC C TCCACTTAG ATT T CATAAGGAATT 
CAT ATT AAT CATT AACAGAAAGGGACATGATTC TT CAAGAT AATCAAAT AT TG ACAGGTTAGAAAT AGCTTTCTG CCTG GGTTCAT 

3 5 TGATAT AGGTTATG TGAT AT AAATATTTATTTCTTTT CTGT CACT CAG AGTAAATGC CACCAATTG ATTCAG CATTG ATAGAGTG T 

CAACAGTTCMAAGAAACATTTGACTGGCTTGTGTTTTCCG 

TGTCTGCTCATGGAGTGTGACGCCTT(TTGAAACCATTATTGCTTCGATTT C AAACGT CTCATG CTAGGACTCAGTTCTGT 
CAGGCTTT T ACAT ATAAGATAGG AT TCATTGAGC ATTTTT ATTTT AATGATTTAT AAG C GATG CAT ACTT ATTCT CTTTAACACAG 
AC1AGACZAAT AAACAAAGAAC!AT ATTCTTTGCT AGT CCATAATTA C CATTT GG CTG TCTC CATT AAT TT CATT AAT ATGTTAA 

4 0 ATCAAGAACCACITAAAAGTTTTCCTTTAGTCTTTG 

TCTCCCG CTTAATAAATAAATCAAAAG CTATTTGGG C C CTGT L~I IT CTGAG ACCTCTG ACT CAG ACT CACTG CAGTArr J.TTAG AG 
T ATGTTTTC CC CGCTCAGAGTCCAGTGTGGCGGCG CTGTCTGTCTAATGTTGGG CCTGGGTTTCAAATTGCCTTGGAAAGAAC CCA 
GAGGGTCAATTCAGAGAATTTTATAAAAGAGCCAGTAGTCACTCCACAGACCrrA 
CTCTTCAGGTTAACACTCTGAGAAGTCATACTGGCCAGTCCCTTTCCT 

4 5 AGAAAACTCCTGGGAGTAACCTGGGAGGGTAGGAATCTGCTGTAAAAACTATCTTTAAA 

TCTTCTCCIATTGTGAAGCCTGAGGCAGAAATGAAGCTCCAAGTGAGTC^GAACTGCAG^ 

G G C CT CAGTG CTGGCTG CTGGCCCTGCTGCT1TCTCGG CC CTGCTG CT IT CTCAGC C CTGC TTCTCi~i"i"i AAG CAGAATG GAGG AG 
TCACCAGTCAT^CCTGTTTGTGGCTATGTAGTGACTGTGACTCCTGGGCTCGCrCTTCCTCTTCCCTC^TG 

CTG CTTTGCATTTT ACAGTT AAAACCAAGAAAATCAATTAT AATTAAAAACTAG CAT CTTCTTCGAGTT AATAAAAACCAATCTCA 

5 0 C^TCTGTACAGTTTGAAAACCATGTGAGGTATGTGTGAAGAGAAATGCATGCATGTACATGCATGGCATTG 

GTCAAAGATCGGCTGAACTGTCCCCCTCTATTGCrrGACCACTTTATTTTTT 
TTTTGGTTAGTTTGCTTGCTAGCAAACTCTGAAGATGATCTCCCTTTGCCTCCTGGCCCTGAGG 

C CCACTTTATATGG ATGCTG AAGAT ATGAACACAAGTCCACATGCTTACATGG CAGG CGTTTTG CCAACCAGCCAT T TCT CTAGT A 

CTTGAGCTACACGTTTTGTTTTTAAATTTTCCTTCCTTAGAAATAAATG 
5 5 GCCTGTCTCTGCACAATGACTGTTGGCAGTCAATGTCCCC^CACTGCACCAG 

TGACAAGTTATAAAGTGACJTTGGGTTCAATGGCCTAAAATGCTATAGAACT 

CACATGAATCCCTCTGTAGGCTTAGGAATCCTGCCTCCATTNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 
60 NNNKNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNOTtfNNNNNNNNNNNNCT 

65 ATTATTTATTTGGGTATTACTATATAGCCTTGGATAGCCTCTAAGTCATTATGTAGACAAGGCTGGCCT 

CCCTGCCTTTTCCTCCCTACTCCTAGTACTAAAGGAGTGCCCCGCCATGCCT 

TGCATGTCTGTATGTACACCATTTGCATGCAGTCCCTACAGAGACCAGAA 

GGTGATCATGGGCCTCCCAGTGTGGGTTTTAGGAATAGAACCCATGTCCTCTGTAAGAGCAGC 

TCTCTCCAGCCATTGGTGTTTTATATTTAATATTATAAAAAGAAAAAAGAAAAAAAAAGAAA 
7 0 AAGACTTGTATACCTTGAAACAGCATACATCGTTTCATATGGATGTCGTCT 

AACCACCTCTGTGTTTATGGTCTTGCATAGTCTTCCTCATTGTCTGGGGGATTCCTTAGTTTCA 

ATATACACATGCAGCATCAGGCTTCGCCTCCCTTGACTTCCC^ 

ACCCCATTTCCCTAATGAGGCTGACATGCATGATACTGGCCACTCCTCTTTTGTGAGTACTGCTGTGAT 
ACCCATACCITGTCCTTTGCCTATGGGGAGGACCTGATCACAACT 
7 5 G CCCTTTT ATCT CAGTGCTG CAAATTCAGAATTTGTG G AC TTGACCAG GTGGAATT C CT CT CTC CTAGAGATTT ACTT CAAG CCAC 
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AGTGAGAATGT ATCTTGTT AGG CAAGTTAATGG CG CAAATTAGTTTGTTGGAGG CATGTTTAGATT ACCT AGAGGAATG CTCT CAG 

GGAATTGGCTTGGTGGCGGGCTGTACCTGCCAGTAG^ 

CCTGCrGGCATAGCATCTCAGCTCACTTCCTGGGCTGCTGTTGCTAATGCT 

GG CTTT CTTTCCTTCATG CCACATCTTTTCTACAGGGTAAAATATCAT^ CACGGAGAACAAA 
5 GTTTTCCTCAAATAGTGAAGCMjTTGGCAATTGTTCTGACCT 

GATTGTAAATTGCATCAATCAGGTAGTACAACAATTAAATCTGGT^ CTTTT 
AGGAACTGG CAAAATGGCTTGG CAAGGAAGCTGTCGCTTT^ TATCACAGT CCTCTGAAG CTTCAAC 

TGCAGAAATGCATAGAAGAAAACTATTTCCCTTGATAAAT^ 
TATTTAAAGCTTTACCATAATAATGGTCCAAAAGC^ 
1 0 CCCAATGCAAGGTAGTGCAATTCCCTAGAAATTGATTTTTTTCAGGGCTTCC 
CTITATTAAACCTACTTGGTGAATAAAGTrGGCAGTTAATTTTTA 
ATGGAGTCAGATTAGCCTTAAAATCATTTGATAAATGATTATTT^ 

GTACATTAAGTTACAGGAGTAGCATTAGTTGATATTACACAGATGCTCAGTGTG^ CTGTGAT CTGTCATTTAC CA 

1 5 G CATAATATGACTTCTATTCAGCTTTTGCTAT^ CT CTTTCT ACTCTAAAAAAATT ATAA 

ACATTATTTAGTTAGTTTGTTCAATGTTAATTATAGGGTC^CCAGAGGAGTTTTT 
NNNNNNNNNNNCACACIACAGACACIACACACACAGACAC^ 

ACACACACACACAC^C^^CACACACACACAC^CACGTTAGTTCTGCGC CATGCTGGAGACCGCTCTC CTATCAGTATC TCTGACC 
TGAC CC TTCTGAG CTATGTTATATCATT C CATTTTGAC^CCTTTT ATCAAACTTT AATGAAATG AC CCAGG CAAACAAATTGT CAT 

2 0 TAAACTTTCCATCACTTTGTCCCrrTGGTATCGTCTTTTATAAAT^ 

AACITGGAATGACCCAGCCTGCTTGTTGTCT 

TTTTTCTT TCTTT CCTGTGGAGATGAATGAACAAGACATAC GGT AT CGAGACACC CTTGGTCATGG CAACGGAGGCACAGTCT ACA 

AGTXlAGTAGTCATTCTGTCTAAACTTGCTGrTTACTTCGCCAAGTGGTGGGGAC 

CCACCGCTTAGGTTTTATTTCCACATTTG^ 

ACGTGTGCTCGCCTTTGCTCTTTAACCATCACTGATTATTAGTTTTGGAAGCT 
ACAGTTATG CC CAGTGGATGCCAAACTTTATAAGTGCCATTGCTGTGCCCTGTGCT 
GTGCTTCCTCCCCTCTCGTGTCCCCCCACCCCC^CCCGAGAGJ^TCCCTACACACACn^ 
CCCAAATTCCAGTTGCGGCGATCTTACCCTAAAGGACTGTTCATTTTCATATTA 

3 0 ACAGGGAAGTCCCTCTC!ACGGCAGTGCAAAGTTTAAACTACAT 

AATATTCCACTAGGAATTAACAAATCCATCCrrCrGTAAGAAAGCCCT 

ACTGCACTGGGCTGTGCTTTGACCGTAGCrTACTGTAATATTAAAAATAATTGGGGTTATAAAAGAATTAATTT^ 

ATCCTCGTAAGCCCAGCAC!ATCCTTGTGTACTGATGCTGCCT^ 

GATCTAGCTATCTCCTCAGCCTCAGTCCCCAGGCGCAGCCCGGCTGCGAG 

3 5 GGCCTCCACATCACGTGtZACACCCTGAAT^ 

CTATATAACAAGTTCCTGGAAAGCCAGAGATGCAGTGAAACCCTGC^ 

ACTCTTTAGTTATGTTGGTCAGTCCTCAGGATGOTGGGTCAAGTTGATTGTCTGAGGT 

CCAGCATATGCAACAGTGTGTGGCGATGACTCAAAAAAAAATAAAATA^ 

T ATGCTCAT CtTt T AG ATTAT TTTTC CTATAGA.CAGTTTAAT AAG ATAAATTGTTGG T CTGAAATT AAGTC CTG CCACAATT AGAA 

4 0 TCAGAGACAAATTTGCTTTGGGGCTAGAATTGAACCTAGGTCCGAGGGTTCCTGGAAGCTGTATCTTCA 

TT AAT CAT CT AAAAAAAATT AAT CAT CTGTAT AATTG TTATAGGAAAAT AAAAAAGAAGGGACTCTACAGGG ATGTCTAAGT AACC 
TGCCCACCT CC CCAAAGGACACHTTCAATGTGAGATTACACTACTGTTGTTACAAAAAT^ 

TATTATCAT AAC C ATTTTCTATAATTTT TTTAAGACAGGGCTTCACTGTGTAGC C C TGGCTGACCTGTATCTTGTT ACTCAG AC CA 
GGCCAGCCTGGAACTCACAGAGACCCACCTGCCTCATCCTCTGAGT^ 
45 TAAGCAATTTAATGTGTGGTATAGTGGCTTTTAGTGCIACr^ 

C CTACT CAAT ATT CTGC CAAC CCT CTTACG CTACAAAACTGG ACTAG CAATCCTCTTATTGGGACAG G CATGT C CAAGGAATATTT 
CAAATACCACAGACCCTTTTAGCTGCATGCCACTCTCAGCTTCTCCTAAGGGTTCACTTAGCCA 
CTGTCACTGAGACTGCCACTGTTACTGTTGGGACAGCATTTGCTTCTGGAAA 
TTAGTAGATTTCTCATAAGAAACTTG^GGATCCT 

5 0 ACCACATTGGCCATGGTT CTAAATTCCCTGGTC CAAG CAAGCCTTTTGC CTTAGCTT CCTGACTGTAGATGTG TAT CACTGCAG TT 

GGTTCATATCTAOTTCTTAAGGAGAGTGTGAAGTGGTCCATGGTTGTGCCTCACTTCTGGGT 

CCTGGCTTTCAGTCCTCC^CAGAGC!AGAGCTACAGTCAT 

AATGTGCTGTGGTTGGGTTTCTCTTACACCAAGTTTACT 

G AAAAAAAAATG ATT ATG TTAGGGGCTATGCT AC!AT AAACT AGATG AAG AAAATATTGGAAAAT C CACTCATTGGTAACAAAACTG 

5 5 CCTAATGGGTCTATTTAATTTTTTTC(^TTGGAAGTATAATAAAG 

GCTTCGCTTGCTGGGACATTGTAAATCTCAAGTGATGTGGTGCCTCACGGA^ 
TCATCTTTTCAGTCTGGATACrTTC 

CCTGTCCTGTCTGAGCCATTCTCTGTGTTTTCCTAAGCAGGTGGTTTTAGTTGGTATTTTTT 
AGAAATTGAGATTTGAAGTATAGCCTAAGTGTAGCAAGATUU^CAAGTTTGGG 

6 0 CCCCAAGGCCTGTCTCTGAGAAACAGTGTTTTTACAGAAAAGACAGACTTTATTTC 

GGCTATAGCAGGC^GTATGTGCraTAGTTAAGTATGCCCCATGTGAATCGACAATCTGTATATAGT^ 

TTTGT AGAGTGGGG AAAATAAT AGT AC CT ATCT T TTCAG ATT GCTGCACAT TTAAATGAG ATG ATC CACAG AATGCCGAG CCCT TG 
GCTTGCACCTTCCATGGTGGCTGCTGGTAATGATAGTACTGACAGTGGACACTACTTAAACCCT 
ATGGGGAAACAGCTATAAAACTAATCTCAGAATCATCTTTGGACTCAGTTAGCT 
65 TTTCTCTAATTAATCCTACAAATAAAACATTATTATATCATAT^ 
TATCACAGTTTGTTTTCAGGTTGTCCCAGAGTGGC 
CAGCAGCGGATCCTTGTCTTTCTTTTTTCCCTTCTGTGCriTTTCC 
GTTCCTCTGTGTCCTAGCATCCTAGGAGCAGCGTTATGAACTGAGTGGCT^ 

AATCAAGTTCTGTAATTGCC CCCTT CCCTGTCTCTCCAGCTTC AGAGGTTGCTTAATACATT CAGAACATTTGCCTAGAAGGGGGT 

7 0 T CCCAGAC CAAGG ATAT CTATGCAATGAGTAGTTCT CT TGTAGGTGGTAAGACTTTGATGACATAG CATCTC CTGACCT AGCCAAC 

TGGTCCCTGTCCCCTTCCTTTGGATGGTGTGTGATGTTGAAACCTGAGTGG 
TCCCAGGGCCACTGAGGGCCTGCTTTGGATGGAG 

GGGCTGCCTGTTAGCTAACAGTTGCCCAAGTCACCAGAGGCTCATCATTTCT 

T TTAGG CTAAGT CTTAG ATTGATCATG AGTTCCAGTCTTTAGG T ATTAGACAT CAGTAGTG AATTTTCTATTC CAG CT CCTTCCTG 
7 5 GGGC CCAG CGTATGGGTAAACTTTTTAG TGAT AC AAT AGGTAG CAT ATATTCCAC CCAGATGGTGTTTAAAGG ATTG AAT ATTT AT 
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5 



0 



5 



0 



CATTGTTTAAGATTTTTACCT^ 
GTGTTCATCAAGCCCTGGGCTGGGAACAGTCOT^ 

T ATGACTATAAAAC CAACT AAAATAAATACGTG CTTAGAAG CTGGG ATTCT AATAGATCATTAT AGTAATGTGGTTTCTTTATTGA 

AAAACAGAATCTCTGATGGGCAGTTACATAAAATACATAAGTTACCTAATTTAAATGCT 

AGAGGATTCCTGAAAACATAGCAGAACTGGCCTTGATGGTGAGAGC 



ATGATCCTTTCAC^AGTCAGTATAATTTCATATATCAGTTATCCTACATATC^ 

AGATAGGAAGTAGCAACAAATATAATTTTGTGGTTGGGTATCACCACAACATGAGGAACTGTC 

CTTGAGAACCAGTGCTGTAGAGTGTAAGTTGGAGGTATCTTTCCCAGTATATGCACAATCT^ 

GCCTGCATGCATCCATTGTTGAGTGAGGCATTGAGGTTCACTCTGG^ 

TGACACCCCAGAAGCTTTGACTTCCACTCTCGCCAGCCAGCCTGCACATCT 

C CC CAAC CTGT ATCTCGT AT ATTAAAAGT AGT TTTTAAATG GT CTTGAT ATTGACC TG AG AGG CAG GGGGCTTGTTTG ATTTATCT 
TGAC CTTAAATTTCCAATGAAATGAAG AGCGTTTTCrr TGCAG C CAAGATCAT ATTTT AAACAAG CAACAAG ACATCCTCCT AATG 
TAAAATTAAATACTTACATATGGCT^ 

GTATTTGTCAAATAATATTAATTATGTATGTGTCCCCTCTTCCCTG^ 

GCACTGTGCTGCTGAGATGCTGAAGATCAGCCCTGTACCTGGGCTTCTCACTCTATC 

TGTACATGCGCCCATATCCATTATGGCAATGATAGATTATTGGCATCTCACCAACATGACAGGG 

ACTT/rCAGACTGTGTTTCCACTCGAATTAACTATTATTGAAATAG 

GTAG CT T AAGGG AAGGAAGAGGAGGCAGTGG TGTTT TTG AAAGACACTAAATTTC^ 

CTTT TT CCTGTTTTAT AGC CCCT AT AGTTTT ATTG CTTGG AAAAGGGTATCITGGCTTTATTT TTATG ACAGTATAATTATACTT C 
AGAAATG CACTTGATTACT ATGGGT AACAAAATTGTTCTACrrATATTGGTTTG G CT AT ACAAACAACTTTT CAGT CA 
ATTGAG CCTGGTAAAAATCAATACATTATGGTAATATTTCAAGTAAGA CTTTCA 
GGGAAAAATAATTCTGAGATAGCAAGACTTTTTGTAATAT^ 

GCGAGGGTGTTTATGCGCAGGTGCAAGCTTTACGTTAGCAGCATCCATTAGCAGACTTGG 

CAGGTGAAATCTGACTGTGAATCCATCCTGTAGCTTACCCTCTTGGCATTCTTGAGGGGATTTm 

GACTTGAGTTCAAAGGACAAATGTGGGCAGGTAATACAGGCCTGGCAGTGTGCCT 

CTT AG CATGCATGGTGT CAAGTG C C CCATGTCCT AAG AGC CATG AG TC CTGT C CACAG GTG ACACAGGATCTCAGGCTAG ACTGTG 
TACTCTGTGTGGGCCTCCAGCTGTGGGGTCATGAGAGACAGACCAGGAACAG£ 

TTTTACrrCAGTGTAGAGAAAGTAAATGGCTCCATAGAAGTGACTGTTGCTGCACCTGGACAGTTTGCT 
TTGTCTCCAGTATCTCTCTGAGTTCTATTCTCTCCCTGATGCTCTCCCAT 
CATCTTATAGCAGTCCTCCTCACAGTGCTTGGCATGGTGCCTTGTCTATACGAATCACCAAG 
GCAGAGCCTTCACACAGTGGTAATACAAGGTGGTGACCATGG 

ATTGCCCGGAACTGCTGAGAAAAAGAACTATCCCTCAGACCTGAATATAAGCTGACTGCATAGATCCCTCCT 

TAATAAAGGGCCTTTAAGTGTAGTACTAGCAACCATAAATATCTTATGCCAGATCCA^ 

GTTATGTATTCATTTATTCAGGGAACTAGCAGTTACTC^ 

AGACAGGACAGTCACAAAGGGTCCTTGAAGAAGTAGGTGCCAAAGTAGGATGATACTCGACCATGCAGTTGT^ 
AAGTGTGCCAGAGAAAGATTTAACTGAGGAAATGTTAGTTG AG CTTGAT 

TTCCAGAAAAAAGGGG C CACTCAAACCAAAG CT CTG CAATGGG CACAG AG ATAT CACACCT AGGC^GTGTGATTGG AACAG AG AA 



TTTACAAACGAGGAAGAGCAAGGAGGGGTAAAGTGGCATGCCCGAGGCTAAGAGCACAACTC^ 

CCTGGGGTGCTGCTTCCCAAGCCAGTTCCACTCTGTGCCATTTATTCTCTGAATTTTA 

TAAAATAGAAACTAATGTACGTCAATGGTGAACTCTGTCCAGTGGTTAATTTTAGAAGCT 

ACAGCATGACTAGCrrGCTGGTCTACCCTTTGGGTCTCTTTATTTATCC^ 

ATTTGGTTGTAGAAATGGAGGTGGCACATTTCTTGACTCCT 

TAGAAAATTATAGTCTCTTTGTTGGCATATCGCACAAGTAAATATTTGTA^ 

TTTTGGAAAATAATTCACTGGAAAGTCCCAAAGGCATTTACTGTGTATAACTTTC 

TTTTAATGTAAGAGACCATGTTCCTGTTACATTTCTGCTTTTCTGGCTCTGTTA 

CT AAAGT AGG CAGGTAG CAGTAGAGTCTGGGTT AT AAATATT AAG C TTAAAGTGATTTTGTTTGTT TGTTTGTTTGTT T TAGTATT 

AATTTGCTTTTTCGGTGCAACTTAGTATAATATCCTCCTGACCATAAAATGTC 

TCACTCGGAGCAGAGTGCATGGTTITGTTTTGTGAA 

TGGGCACAATCAATATAGCACTTTAACCAGCrrTTTGTATTTAGG 

AGCCCGTGGCCACAGAGAGGGTAGGGTGCTCAGTGAGAAAACGGAAAGGCTTGATTAAACCATAATTAAGT^ 

TAAGAACTCTGAATAAGATATGGTTTTAGATGATTTCAACAACAAA 

AGCACAATCTCAGAATGACTACCACAACATGTGCACCGAGCTAAATTTGTATACACC^ 

AAAGATAATTCTTAGTGAACGTTTGCGGAGCAAGGCATCTGGGGTAACTGACTGAAAGGGCCMTCAGAAG 

TGAAGCACAAGCGAAACAGTCCAAGGGCCTCCTACCAGCCTCTGTGCCCCCAGGG 

TCACCAGGATGTTGGAAGTGGCTGAGCACTGTAAGGCTCCATCCTCATC^ 

AACTCACCTCTCAAACTCAGACCTTGTTATTATTTCTTAGTCT 

AG G CTG AAGGGTTTGT GC CAG C CTTTTCACGGGG CAAG^CTGGAGTTAGTTATTGCTATTTTTTTTTT^ TGACTTGGG CTCATG 
TTTCATACTGATGGCCTATGCCAAGTAGACACTGTCTGCCATCCTGCCrrCCT^ 

GAGCCAGTGTCAGACCTGTTGATGTTGACCTCTGACCCCTAGTCATTTCATATGGTTCCTTGCTAATTO 

TGTGGTTGTGACTTGGTGAGTATTTAATCTTTCTCTGGAATGATGTTAGTGTTTTGGGTTTTC^ 

CATCCCCACCACATCACCCTTTCCCTCCTTCCACATATCTGGOTTGAAAATGTGTGTGG 

GGATTAGAGACTATGACCTTCCITTTGACTGATCTCTGAGGACrCTGTAGAGCA 

CACGCAATTTATATGTT"GTTAGAAGTCACTAAATATACAATAAAAAATATGCCATGT^ 

CTGGGGGGTGGGGGAGCACCCTCATCCCTTCTTGCTTGTTTG 

AAAGTAACAAGTTTCTCCTGGTTTATTTTGACCrrGGTCATGATGTGTCCATT 

TGAAGATAAAGTGCXrTCGCTCCAGCCTGACAAAAGCCATTCCTTTGCAGAGTGGTAGCTGCAGTC 

AGCAATCACAGCCTAGTTGATGGAACTGTTCAATTAAAGCACAG 

TTTAGACACAACCAGTGTTCAAATTGATTTCAGCATTGATTTTGGCTGAAGCTGATAAATTTCTC 
GCAGAAGACTTTGTACTGCXSGTATTGAAGTGTTCATTTAGCTGATG^ 

CTGAGATGG GACAATT C CCCAATCAT CTTTT ACTTTTAATTTTTT AG G CTT CAGT AT AC CAATCAATAGGTG ATTGAAACTGATTA 
AAACTT ATCCAACCAGCTG C CT ATGGCAGTTT AATT AG AAG C CGGTTGATTCTTAAAT TAAACG TCTTGTTTT CATCG CATTG C CA 
TCTTCCAGAAGAGTC5AGGGCTAACCXTGCTGAAAACCTAAAACTGGGCTTCA 
ATAGCTCCTCGGGCAATCCAAAACACAGACGGGCTTTCX^ 
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GTGTGGTACAGACAAAGGGCCATCTGTTTGTGACATGTC^ 
TTGAAGGGGCCATCAGAATAGTCAGTGAGTCTCTGG^ 
AAAATAACTX3TGGCAGGGTAGTAAAO\AGGACCTTCACA 
ATTTTAGCCATCCCCCACTTCCCTCGCTTCCCCACACTCATTT 
5 TGATATTTCTGTCAT AAGGGACATATTG CATCTGG CACTACAGGGAAGGGAATAAATGGCT CACTTTTCAGAGGTGCATTTACTCT 
TTGACCCACTAGGGTACTATTTAGTGTTCTAGAAGAGGT^^ 

TGAAAAGTG AG CCATTCGATCGATTTCTCCTATTGCrrrrTAA CAACCATATCCAGGGTG C CCGGGCG ACTGTTGG 

AAAAGTGCT CTAGAAG CAGCCAGTTCACACTCATGTAGAGCTTGG TCTCGCTT CA^ CTACTCTGGTGT 
TTATCAAACAGAAATGTGAAG CTTCCATT CTTGACT CCTTGACCX3 TAGACTGACTGACX?A^ 
10 CAATAGCAAATAGCCTCTCTGTATAAAAATATTTCCTAACCAGGTTTCTCTCTCGTGCT 
GACCTGGCTGCTCTCAGCCTTGGTCAGGGAAGCTTAGTT^ 

CTAAGAATG AGAGACTGG CCGAGTG CTCAG CCCAGACTTGG ACT C CT ATCTCT CAG AACATGGGAGGAGCGACATG AAGTGCTGG C 
TTCTGGGCCTGGCATGGCCATTGTGATCGTCTl^ 

GAATGGGGCAG^GTAGQCGGTCTCCAGGCATACTGAGGGGCrACTGGCAG 
1 5 CCAATAGGCTTTAGTGGTCGGCACTGCACCCAGGCACATAAAGGC^ 

ACATGAAGATTAAAG<3TCGGAAATGGGAGTACATATGGTCAT ACTCATA CT CAAAACAAGGAAAACTTGAA 

AGATTCTTAACAC^CATGTTATAAACCATACAC^ 

AAGGC^TGTCCAGGAGAGAGGTCCCGAGCTGGTTAACCCTCAGCT 

2 0 CTTCCGCTATCTCAAAGGTATTTAGGCCATATATTAAAAGACATTACTCTAGTT 

G CrrTCAACTTACAGAAGAAAT ATGG ACGC CT AACCACATT AACTT CACAGGAGGTATGAG AG GAAGAGGGGGCAT CT CAG C CAACT 
AAACAGAGGAGCCAACATGAAGCTGTTTCCCCTTTCTATGACT^ CCAAGTTG CCCGTGTCTTAGATTGGG 

TTGATTTATGAGGCCTATATTTGGTCTGGTACATAAGCAGCTGCAGG^ 
TGTAGTGGACTTCTAGGCAGAAGGAAOTTCGGCCT^ 

2 5 TTGTAACAGCGAGCACTTGGCTAATGTGTTTATCTAT^ 

TTTTCTCTCCTTGTGAATGGCTGTGGCTATTTCAGTGTGTGAC^ 

CAAAGATAAG ATCCAAG CATT AAGG C^^CATTTTCATTGG CTGTT C CCACG AG CTGTTTTG CAT ATGGTTGCGGGAG ATGTCAT TG 
CAATT CAAAGATGACTT AAG AGAGT ACACTT AATAATTGACTCT G 
AGTITCCTAAATAAACTGGATTACTAAGGAATGCACTGGTTTGTCTTCCT 

3 0 CTCTGAGCTTAACTTCTGGGATCCAGCCCTGAGCT 

GGATACGCATTGCCTGAAGAACA.GAGAATCTGCTTTATTGTTOTGG CCTGTTAACACTGCAG CT 

GGCCTTGTAATCTGTACATATT1TTAAATAAGCTTTTTCTCITTCCAGTCCC 

ATGAGTACCTTCTCCTGCTITCCTAACGCTTCCCCATTTG 

AATTTCCACCAATTGATGGCTATCTCTAAATAATTCACAGGATGATCCTCCTGCTI^ 

3 5 AGCAGAACTCCTGTTGGTTTATGAAAGAAGCAGCCCCX3CCCT 

GTG ACTTGGT CTATG CATCG GATTACTCTTG AGGG CTTT C TCCAAG AC AGG AG GAAAAAAATCG ATGCAGAGGTGTTTTCATG CTC 
ATTTTTGTTGAATATTTGCrGATTGATTGATGTTTATAGTGTATGTGAGTCATGCT 

GTCACCCTGTCTTAAACTTCCTCTG^TCACC^GAGGAGCTAGATGCTCTCCAGAATAAATCATCTTTGCGTATGAGTTAAAGTAAC 
TTTATTGGACTTACGTTCAGAAACTCACATGCAGGAAGTGCTGTAGAA 

4 0 TTGAGAAACGCATGTTCGGCTTGGGGCTATTTATTTTGTAACTAGAAAT(ZAGCTGGGACAACT 

ATGTGAAATOTTAAGCATAAATAATTTCAGTAAACACTGTAAGATGGGTG CGATGCGCACTTGCATGG CAGGCCAGATGCAGGACT 
GTGAACACAGGTTGCTTCTCCTTCTGTCCTGGGGGATGCAGGACTTTTC^ 

AGATTATTGGAGAATTTTAAAAAATATCTTTTATTTGTGTGAATGTTTTG CCTGAATGTGT CTCTGTGTACCACAAGTATGCCTGG 
TGTCTAAAGAGGCCAGAAGAGGGTACCAAATTCCCTGAAACTGGAGTTACAGGCAATTTTGAGTGTC^ 
45 TCTGGAAGAGCAGGCTGTGCTTGCAGCCTCTGGGCCATCTCT 

AC TT AGATG AAGGTC AG C C CACACAT TG CTTCCCTTCTAAGCTG CACTGGACT CTAAAG TCTGAAAGAACAAG CAATAG AATCATT 
CACACTGGTCCCCAAGTATTGCCITTGTGGTCAGTCACATTTCCTGTCT 

CAGTTTAGATAGG AAGCTGAT CAC CTCT CACC CCT AC CT CTG T ACCT CTCAGCT CAAGTCAC CCTTACTC CTTG CTG GGGACAT TT 

5 0 CTCAGCTAACCCTC^GTTTCAAGCTCCTCATCCAGCAGCTACTATGCTTTCTTGAAGT 

ACAATTTGGAOTAAAGGCCGGAGACCTGAGTGTGGCCTAAGTGACATGACAG 
TCTCACTTAACCTGCTGTTC^TTGTTTCACACACATTTCC^ 

GAAGATGTGAGTGAGGGGTGTCJTGGAATTAAGGCTTTTATATTGTTTAGAGGGGGATAGA 
GTCCACTCTGATTTAAAAAAAAAAAAAAAAAAAAAAAAAAAGGTCTACCTGACAAGAAT 

5 5 AAAAACAAATGAAGAAAGTGTGGGAGAAATCCCAAATATATCAGTAGTTACAACAAACT 

AATT AG ATT G AGGTTTTAAAAGT ATAACTGTTTG CT AAT TTAG AAG AAC CACAAAACCACAATG ATAT ACT AAAGTTT AAAGAAAA 
AG CCGGGAAACGACATCGG ACAAAAGCAGTAGTG CCIAACAAAACCTTCTCTCTTAAAAGCTTGTACATTGATGTATAACAAACTTA 
AAAG AAAAATAATT AGT ACG CAT AAGGAGGG ACT CC TATATGATGCTGT AAG CAACAAT AAG CCAAT A 

TGT AATT AATAACAT AATTTT AAAATACATGGAG CAAAATCTT ATACAGTT AC C CAG AACATTTGAT AAAC C CACACAGTGGGAG A 

6 0 ATGT CTCAACAATCAACAATTGATCAAT CAG AT AAAATTAGTCTTTG AAAG ATTTTT AAATTTTTT TTATT CATG CT TATTTT ATT 

CGTGTTTAAGTGTGACTGCTTGTCTATACAAGTACrACATGAATGCAGTTGC 
GCTGGAGTGGCATGAGGTTGCAATCC^CCCAGTGJIGGGTGCTGGGAACCGAACCCAGGACCT 
CACTGAATCCTCCTGCC^GCCTCATATGGAAGATCrTAACATC^ 
AAAGTGGATCTTCCCACTTCTAATGATTTAATTAAGAAAA 

6 5 CTATTATTCCTTGTGCCATCCCAGATGTCTATGCTGTGTGTTCAGTTTTG CAGGGGACTAGTCCC CAAAGGC CAAGTGGAGCACAC 

AGGGAGAAGAGCTAAGAAACGCCTCATACAAATGCTTCCTCCCGTCTCTCTTTG 
ATTTGAAGCGTTAGCTGTTAATTAAAACATTGCAGAGGTGTGTAGTTGTCTTTTGGTGTAA 

AATCACTTTACAGTT AAT GTGCTGAGGAAGGTCAAGACAATTGAGAAAGAAAG GATGCTGTT 
CACAGAGTTGAGGAGTTGGCTTGGCTTGGGCTGCTTCTGCCITT^ 

7 0 GTCCCCAGACTGTAAACCCAGCC^CAAGGCATCTTTATTGCTTGAAATGGGACCCAAACTTT^ 

T AATTAAAT CAT CAACTCT CC CTTTTAATT CACTTGGAAATGG GAG A CAG CTTTCTTGACAAGCT 

AATTAAGTTGCCAAACTGCCCAGTGAGGTATATGTATGTTGGGCTCTCTGCTAATC^TGCT 
GTCCATGTGTCACTT'ATGTTCAAAGTATGACACTGCCCTTCCCACGGCr^ 
TGTTCTAATACTCAGGGTTTCAAACAGTTGAAATAGGAAGTATCTATGGTCTACTGGTTGA 
7 5 CATTGAATTATTT ATATTTG CATGGAAAG GG CATCTAAATTCT TCACAT ATATT TT AAAGAACACAT TCCAACAAC TTCT AAAGCC 
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ACTGGAGGAAATTGGTCTCTACCAGTGTGCTC 

AATGTAGCTTTT AATAAAAGTCATATTTACAT AGATGGTAAATAAT^ CAGGATAATATATGGG CT CCATTT 

GTAAAATGTTGTATCCrrTTATACGAAGCCTGAAAGATTTCATTTA 

TTGATTGTAGATTCATAACTAAACCATTTCTATAAATCA^ 

CAGTCTTGTT AATATTTCITAACCTAGGTTATGAC CTATTT CAG1OT AGCTGGGTGTClTAATTACTl"iUXJT AATG CTGTGGAAAG 

GGCCAAGGCAGCTTTAAAAAGAAAGTGATTTTAGGCAATTAAAGTT^ 

TTACATTCTGATCTACAAGTTGGGAATAGAGCAAGAGAGACACTGGGAAT 

CAAGGCCATGTCTTCTAATCCTTCCCAAATAGTTTCACCAACTAA 

GGAGGGGNNNNNNNNNNNNNNNl^^ 

G C CACATGCT AAAACCTCATTGGTGTATGT CTTACATCCGTCAATG TTTT ATT AT AGTAAAGC CGT AGTCAGAGCTCAT C CTGGAT 
TGGTGATTATTGATGAAATACAGTAATATGATTTCTCTTTCTGTGTA 

ACAACT ATAGACACACAT ACAGTG ATAT AACAGAGGATCTAGGGTCAGA CTGC CG CTG 

TG TAGTTGG GCATCG AG CAAACTG CTTAAATT C CT CATGCTG CAGAG CTGTCTTTAAAGGAGGATAAT AAT AGCAGTGTGAGGTG G 

CTCTCTAGACCAGGCTGGCCTTGAACTTAGAAATCTGCCTGCCTCTGCCTTCCAA^ 
CCCAGCTAGCAGTGT^CTTTTGAGCAGTATTTT 

GCTAGCATGGGCCATTTGCCTTGAAGGTCTATCCCCCTTTGGGCCAG^ 

GGTTATCTAATGACAGCAAGTAACTTTGTGTCTTTTTCTTT^ 

AAAGGTAAGCATTGGGTATATTTTATGCAAGra 

ACATGGAAATCAGCGGAAAGTGCTCGGTGGCCTGTTTTTTCACTCT 

TGGAGCTAGGCTGATCCTCCAGTCT^ 

GCATCAGGTGCTGCTGCCTGCCGGGCCATCTCCTCAGCTCTGTGTTCTAAGATTOT 

ACT ATGT CACTGGTGCACAACCACACTTTG CAGCAGTAGG CT ATTTG CTG AAACGTGG CCAG AC ^ ^^^^^^^^^^^^^^ 

GTTTTTGAATAGCTGCTGTGTG CCAGGTATCACA.CT ACATGTTTTGAAC^ CTTAAAACTATGAGGTAATTCTC 
ACCAATTTTAGATTTGAAGGAAGAGGGACTCCTGGGGGCTGATTTC 
TTACCTGATAAAGCTAATGTCTACC^TGCTTTGTAGCTTGTGTACAGTGACrrGCTGT^ 
CAGTTCATCrCTGAQATGCCTTTGTAACGCCATCTACAGTQAGA 

CTTAT ATTT T AAAG AAATTAAG AATTTGCAT AATG GT TGTT CTTTTTTT ATTGAAGTTTU^CT AGAAC CG AGG CT CAAAT CTT CGG 
AATTCTAGTGCGTTCCAT<3GTTTCTTTCTGATCTCAGA 

AT AACAG AT AG C CTGT ATGAGG AGGTCT AG CAGAGTT CTCTG AGG ATG CAAATT AT AT C CACACAG GTG CAGC TTGGG AGG CAC CA 

TTG^GGATCCTCATAGGCCTCACGTTTTGTACAGGAGGAAGTGGGTCCAGA 

TTCCAGTCTTCCCTCTGTGAACCCACCTT-GATCCATGTTCrGTGTCTCTCTTGAGCTCCCTT^ 

AGCAGTCTTTTCCTCTCTGCCC^TTA^ 

TCAAATCTTCAGTTATGCCAAGAGAGAGAGTGTC 

TTCATAATTCCATACATTTTAAGTTGTGCITAGGATAG^ 

GGACCAGCAGCTCTTTCTGGAAAAGAGTTAAAGTTTACTTGATAAAAATA^ 

TTACAAATAAGCCATTCTCCAGACTGGACCTTTAAACTGGGCTTTCrrAGCAAGAACTGTATTG 

TGGCTGATTTTTGTGTATAGACTTGATCTTGATAACCAAAAC^ 

TACCATTTCT^CAAAACAGCCAGAATGGTAAATTAGTTATTAGAATTTTATTAm 

ATTTTTCAGTTATTGACTTCTATTTAATTCCACTGCAGTCTAAGAACATA^ 

AAATATGGTCTTTGTTGGTGAAATTTTCCTGTGCCCATAAAATAATATAT 

TAGATAAAATTGGCTTTTAGTTTTTATATCTTTCATATCTGATTTTTT 

AATCTCCAACACTTAGACCTTGGTCTGGTTGTTTTTTTTACATTTC^ 

TTTGGTAAGGTTTTTATCATTTGGAAGAATT^CCTCTTTA 

ATCCATAGTGGTTCATTATTAATCTAGCTAC3VGCAGCT 

T ATTGG AAATGAGTTT CTTGTAJGACAGTATATAGTTGGGTCTTAAT TT AAACT CTAGT CTGAT CTTTGCTTTTTAATTTGGCrGTT 

TCTCCCTC CACTGTTT CTAAGG ATTGTCTAGAGTTT AT AGCGC CCATCATT AGT CT A 

GCTAGTCAGTGTTCTATTGCTGTGAGGAGACACCAGACCCAAAGCT^CTCTTATCAAAGGA 

AGT*TTCCAAAGGTTTCCAAAGTTCTGAAGGAACAAGTCTCCACGATCAT 

TCTGAATTTTGAAGATCTCGTTTCATTGCTGGCTTT TTATTTTCCAGCCAGGTGCTGGCTGTCCTT TGTGTTCATTGCTGCTCCAT 

GCGTCTGCCCTGACATCTCAGAGTTCTTTCTTTGTTACW3GTGGTGGTTATAG 

GATGCTTTTTTTTATTATTATTATTCAATATGCTTGAGGTTTGTTTCTTGAAA^ 

ATTTGGAGCACTTTAACCTTTCATCAGATCATGCTTCC^CATG 

TGTCTTAGTTTTGGATAGTACCTCTTCCTTCCATGTCT 

CTGTGGTTTTTCATTTATAATGTTCTTTTTATCCCTAAAAGTGTAGGTAGGCTT 
TTGATGTGCTC^CCTTTTTTCTACTTTCTAGAGTACATATAATAATTA 
GAAAGAAGCTTAAGTCATCCAAGAGGAGGGATCTTCAACTGATAAA^ 
AATTTCTTAATTTATGATTAATGGGGTAC^^ 

AAAGTAGG CTG AG CAAATCATGGGAAG CAAGT CAGTAAG GAG CAGGCCCCTCTATGG C CT CTGCATCAG CT CCTGCCTC CAGGTTC 

CTTCCCTGCTTGAGTTACTGTCCTGTTTTCCTCCAGTGATATGGAGGTGTAAGCCAAAGAAGCCCTTTOT 

GGTCATGGTGTTTTGTGATAGCACAGTAACCCTGACCAGGACAG^ 

TATTTCTGGGTTACTTCCAGTGCATTTTTTTCCTCTTCATTTTGGGTTC 

GTAC^TAGATTTGCATTCCTATATTCTTAAAGTGTTCTGAGA^ 

TGAACTTTGCTTAAAGGGCCAACCTAGCAGTCAGCCTGCAGTGAATTTCCCCATTCTGCT 

TGGAGGGAACACAGGCCGCGGGGTTAGGGTTGCTTCCTCCGGTTCTCTCCTCCTATCCTGGGTAACCT 

CACTCTACTCTAGACTCTGATGTCTGTCACACTCCCT 

GCT CTGT CTT CT CAG CATAAG GAGACTG AGTT CAGCCTG CT TTCT CTCCCTGTACAGGGACCAGAAGACT CTGTTT AGTCAGTGAA 

CTGAGCAG<!CrrAAGATTGGTCrrTTATC^CCAGTCTCTAGATGTAGTGT^ 

GGTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTCTTGGTTTC 

GAGACGGGACTCACCATGTATCCCTGATTTGTGTATAACTTTCTC 

CTCTGCTTTGTTTCCTTTCCTTTTTGTAAGACCGTTGGCCAGCTTCTGAATGGTTCT 

TGATTTAIGCTGCTGTAM^ 
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TTTJVGCAGATCXIAGAACCTGAG^ 

CCTGCCTTTGCTCATCTATACCCCTAGATTAGCTTC 

TGTCCTGGTTGCTTCTGCACTGGTTACTATTCT 

TTG CTCCACAGTTGAGTGTCTAGCCATCATGGCAGCAGGAGTG GAAG AGA.GATGG AAACTG CCATT CTCTTCCTTTTCTCT CT TAT 

ATTTGGTTCAGGACCCCAGCCCTGGAGTGATGATG 

ACAGATACACCAAAGGTTTATGTCCTCGGTGArrCTGGACCT 

CTAAACACACTCCAC1AATGCATTAGCAGTCTGCTG 

GTTTTACTTGAACTTCTCAC CTTGTCIX3AACCT CAACAAGTT AATTCATGTG CACCTCTCT 

AATTAGAATGCCTTCATCTGTGTCTTCAACTGTTAGAATCTTAGCAGC 

OJCATGTGTTGTATGTGCCAGTCACCICTTt^ATTCT 

TTT^GTTTCATGTCCATTTCCTCTAACCT 

GCCCAACGCTAAATGCAGTCTCTTAATACATAATTTACTGATTGAAA^ 

TTGAAGTAACATTCTGAAAATAGGTTTAACAGCATACAAAATTCCCTT 

TTTCTTTGCrrTCCTTTGTTATTTTAAAGGGTGAAATGGAAGGTTAAG 

ACTTITGTGTGACTTGAAGCATGTCACTTGAACCTCTCAGTGAC 

ATCTTAGTCCTTGTCTGAGCAGATTCA^ 

TCTATTGGACCACACTGAGGCTGTCTGGGGGCCGGTTTGGTGAGAGCCCrACT 

GAGGCCTCTTTTTCACAGACAAACAACACATTGCTGCT 

GTTTTTAAAAGGAAGATCATTTTAGGAAAAATAGAAATGTGTCTATACCAT 

TT AG AAAATTATG CT AAC CACATGTT AG ATAC TGT CTG AT AATCACCTG CACACTTGCTAGOT CGGATTTGAC C^CTTTTGTTTG 
TTTGGT ATAGGT AG CAAGACCATTGCAAAT AAGT AG AG CAGAAAACAAACACAAAT ACTGC TAG CCATCTAGAAGCAGAAC CT CTC 
TCTTCCATTACTGCCTCrGTAAAATCTTTCCAGTAGTAAAGCTCAGGGAATTTACTC 

AAGAATC CATCT AG CGGGATTTCTTAG CTGTGT ATGTAGCATGCATTGAGT ATCATT C CTGG AGTGCTTTGGATTT CTGCAG CTTT 
TAAAGCAGCATTTAAATGGACCATTTGATTAATCATTTGTATCC^ 

TGTAAAAGACAATTATGTCTCCACAGATTAAATAAGTGAATTAAAACAGAATAGTGGCTTA^ 

TG CCTTT AG T AGAGCGCATG CCACT C CAG CAAAT AAATGAAGTCAT CCATG ACAG CTCGTG ATGAATTG CCGTATTTATACTGT AT 
ATTTTAATAGTCTCACACTAATTTAAATTGAGCACAGAGTAAATTATAATTCAGTCACTTGACTGCT 

CATCCTTGCGCGTCCTGTCCGGGAGATGTAGCAGAGAGAGAGAGAGCGGTGAGAATGCAGGCAGGGACCTCCATCCTGGCTG 
CTTCCTCCCAGAGCTGCITTCCAGCTCCTrCTCTTCATTCCCT 
CACACTGTGCTGGAGCTTCAAAGGGAAAAATCCACACTCGCTGTAATTTTCT 
TTTTTAGCATTTTTAAAGCTCACTGGCAGGAATGCCCGGGGGAAGGGAGTTTACCT 

AGCTCGTGCACATGCGCTGTGACAGCATGGAGTGGTCTTCCTCTCTGGAATTTGTAGAAATGACCTGATTTGOT 

TGGACTCCCAGTGCTCTGGACTACAGGAAGTCTCGAGAGA 

CTAGTCGAGTTTTGGTTAGTGTGGCACTTTGGTTTC 

TCCATTTCCCTTGGATAACAGCACATGTGGAATACTGCTTGTAAGGAACACTTTAA 
CTGAGGCTTTGTCCTGGGTGCTAAGGAGTCA^ 

GGGGCATTAAAGCCAGTTGTGGTAAAGATGAATAAGGGTCAAGTCAGAGATGAATTCT 

TTCAAGCTGGGTCTAAAAGGTCAAACCAGCTTTCATAATCAGTGACTGTGGAGT^ 

TGGGCCGGAAGGCAGCTGTGTAGGAAGGATGAGAGGAGGGTTAGCAGGAACrCCAGAGTTGCAGCT 

CAAGCAAGGTTCTTGTTCAAGGGACAGATAAGGTCrCGGAAATTGCCATCAACrr 

CCTITACrTCTTACTTACGTGACTGCTACCATTAGAAAGAGGGATATTAGAATAGTCA 

CGTTTTTTGCTTTATTCAGTATAATGATACATGGTCCCCGTCTGAGATCGGA^ 

GCTGTAGCAGTCCTGACGGTTAGGCTTTACrCAGAAGTGTTTACAATCAAACTCTGCTACAGTTCTTAAGGAT^ 
TTTTATGTTCTTACTGCCTTTTACAAATATATTTTTTAAAGGTTTTTTA 
GATGGGGTTCTGACTCTTTGGTAGGGTACTCTTGGTACra 
CTAAGGGAAGCGGAAAGTCTTTGAATTAAACTCGAGCCTG^ 

ATAAAATCCACCTTGGTGGCAGATACTTAAGGAAAACCACAACCCGTCAGGAGGTCCTGGCT 
GGACCrrCTCCAGCTCTCAAGTCTGTCAGCATTTATAAAGAAACATTACCCACATGGGAGGAAGTAAGGTGACAGG^ 
CCCATTTCTTTTATTTCTGAAAATCCCATTTTAAAATGTTCTTTTATTTACCATCCCCTCTG 
GGGOTCTGCTCTTTTAATGATCTTCTTCTGAAAATAGCCGG 

ACTTTTAAAGCCATTAATGCTGACCTCTGAGACCTGGCTCCTTCC^GGGTTCCATCAATCACCTCTGTACCGCCTCCrCCAGC 
GAGGCTTAGCTTCAGAGACAGCTTTTT r i TO ^ 

ATTTCTTTAGTTCTTGTATCTCACGTGCTATGTAACTTAAGGTAGACGAATGTGTTGACGCTGCTGT 

TTAGACATTACACTGGAGCTTCAGAAGCAGATCATGTCTCAGCTGGAAATCCTTTATAAGGTATTTT 

TATGTAAATGCACATGGTGTCAGCATCTAGGTGTTAAACTACTTTTGTCTTATTTTTGTTACA^ 

GATTTTACGGGGCATTTTTCGTAGAAAACAGGATTTCAATTTGTACAGAATTCATGGATG 

AAAAATGATTTTTTGAATCGATTGACGTAAATGTAGTCTTGACACAGATGTCC^ 

ACTGCCTTCTTTACACTGGCAGTTCTCTTCTGTTAGGGTTCCTGCTATTAGATATTTGTAGm 

GTTAATCTCCTTTTACTAAAAATTTGCACTCGGGTCCAGTCGGCGCGAGGTCCCTGCTGCCTCTGTG 

ACAGATGCTCTGTTCTTTTAATGTTTCAGTCACCAAAACAGATTTTCTTCTAAAGAAATCTATT^ 

AGAAAGAAATAGGTTCTTAATTGCTAGACAAGAAATTGTTTTTCAACACAGCTGTTTAGCCGAGCTGCGTT 

GTCAGGCCTGTCCTCCTCTGCTTCTCATTCATGGCCATCAGCAGCACTCCCTTGATCTGAATATT 

G AAATCCT CT CACTGAAATGTCATTGTT ACTL TT T T TTT AATGTATTGT AAATT TG G ACACGTTC C CAC CTTTTACAAGGT CAGTT 
TCAGTATCTAGCCTCCATTAAAATGTAGCATATGTGCCTTTCTACTTTCTTAGGTACATGCACAC^ 

CATACATACAT AC CACACATGTG CAC CACT CACACAAGAAAT AT ACCAC CATACATACATACACACAC T ACACATACAT ACAT ACA 
T ACAT ACAT AC!ATACACAAATGCAT ACACGT TTTTACTTCT CATGCAG C CACAT ATACACACACACCACACA 

TATT CT ACACAAT ACACTCCCCACACACATCACACAT ATACAT AC CAT ACACATG CAT C ATCT AT AT ACACT ACACACATG CACAC 
TCACACAT AGG CIACAC CnTTCAACTTCTTTAGG TAAACACAT AT ATAC CACACACATAAACATACACAGTTACACCAT ACAT ATAC 
ACACACCATAAACACACACAGGTACACACACACAAACA 

GCATGGG TTTGTAATCCCAGTACAG CTCACT CTTGTGAACGTG CATC CTTTGAG AT CTGTATAACITCAGTT ACATT AAAACACTT 
TTT ATT AAG ATAG AATT AAGATACT ATAAAACTCAC CCTTI^GAAG CATACAACTTACCATTGTTGTCT AATTT CAAAACATTT CT 
AAAGACSAAATCCCTAGCCACATTAAGTG^^ 

TCATGCTGCACATCAGTGGAGTCCTrGTAGTCGACAGCCTTTGTGCCT 
ACCGCAGGAGTAGCACCCACCACCCCTCACGTCACTAGCAGTAGTT^ 
TGATGGTCACTTGGGAAATTTCCACTTCrTCCriTTTAC^ 
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ATAGGTGTGCATGTGTGCTACTCCCTGCCTGGAAGGCCAGAGGACAGOT 
• GATCAATTCAGGTGATCCAGTTCGCTGGGAGTGAGGTGTCAGGT^ 
CTGGGAGTGAGGTGTCAGGTCCTGTGGCAACTCTTAATTTTTG^ 

TG CAGTT ACTGTAAGTACTAAAGAAATACTGCAACGC^ACGAGAAA CACTCTTT AAAAAAAAAACAGACAAT AG ATT 
GGTTATGAATTTCTCAAAGAAAGACACAATGGTCAAC 

AAAAACACATACATATT CATACCTGAGACCACATACACACAAARCAAAATCAAC CACAT AAT AAGTGTAGACAAAGGCACAGAGAA 

GTTAGAATCCTCAGACAATCTTGGTGTGAAACTGA 

GAACCAAGTGAGTCC^CTGCTACAGAAACCCAAGACAAC^ 

CACAAGTCACCAAGATGGCTCA^ 

G CCAGTTTCAGTACACTACAT 

CCACCTCCG^^^ 

CCOTGACCCCGCGGCCCTGGGCCCCCCTGCCTGGTTGGAGG^ 

GCGGCGCCCGGCCACCGTCCCCTTGCCCACCGAGACACTCAGGACX5GCAGGACCCTGGCCACT 

TGCTCCGCTGTCTCCTCTTACTCACAAGGACTACAGAATGCTTGTC 

CTCTCTACTAATTCCTCTTGGCCGCT 

TGCAAGCTCCCTCCrTGATCGCTCCTCTTCACTTCTCCCCAT 

TTGTCACTTCTTGGGGCTCCTCCTA^^ 

TGGCTCGCCCTTGGCCCXOT^ 

GACCGTGCACTCCGGGCCGCAGTTACTCTTCAGGGATGTTCTGGATGTGATAG^ 

TTGAATATGAAGATGAAGATGGTGATAGGATTACAGTAAGAAGCGATGAAGAGATGAAGGCAATGCTGTCCTACT 

GTAATGGAACAGCAAGTAAATGGCCAGOTAATAGAGCCGCTGCAGATATTT 

ACATGGCCTGAAGGTGAATACACGGGCTGGGCCATCTCA 

AGAAGTCCTCAGCTGAACTGAGAAAGATACTC^CCAACGGCCAGATGAATGAACA^ 
GGCAACGGAGGCACAGTCTACAAAGC^CATCATGTCCCAAGTGGGAAAATTTTAGCTG 

GGAG CTT CAGAAG CAG AT CATGTCTGAG CTGGAAAT CCTTTAT AAGTG TGACTCATCGT ATATCATAG GATTTT ACGGG GCATTTT 
TCGTAGAAAACAGGATTTCAATTTGTACAGAATTCATGGATGG 

GGAAGAATTGCAGTTGCAGTTGTTAAAGGCCITACCTATCTGTGGAGTTTAAAGATTTTA 
GCTTGTAAAC^CAGGOSGACAGGTCAAGCTGTGTGACI^ 

GAACAAATGCTTACATGGCACCAGAAAGAATTTCAGGAGAGCAGTATGGGATCCATTCTG^ 

ATGGAGCTTGCTCTTGGGAGGTTTCCATATCCTCAGATTCAGAAAAACCAGGGATCTTTAATGCCT 

TGTTGATCAGGATTCGCCGGTCCTTCCGCTTGGAGAGTTCTCGGAGCCGTTTGTAC^ 

CCAAGGAGAGACCGGCGCCTGAGGAGCTGATG^ 

TGGGTGTGCCGAGCTCTAGAGGAGAGGCGGAGCCAGCAGGGACCCC 

ATCATGACACAGCAAGCCGTC^CCCCTCTGTCTGCTGTACTGGCCCAGCCTCCTGCTCGCACCT 
CCAGAGGGCCAGCCTCTGOTGAGCGGGGGAAGTGGGCACTGACTACAGCA^^ 
AAACTATGGAACCTCCTCT^CCCCACCCCCTTATCTTTCCCTACTGTTCAT^^ 
GG AAT AAAAGT ATTATTG CTTTG 

ATGCTGTGGCTG^ 

GGACTGGACCGTGCACTCCGGGCCGCAGTTACTCTTCAGGGATGTTCT 

CAGCCnTTGAATATGAAGATGAAGATGGTGATAGGATTACAGTAAGAAGCGATC 

TCCACAGTAATGGAACAGCAAGTAAATGGCCAGCTAAT^ 

GAACATACATGGCCTGAAGGTGAATACACGGGCTGGGCCATCTCAACACACCAGCCCTGTGGT 

GCTTGAAGAAGTCCTCAGCTGAACTGAGAAAGATACTGGCCAACGGCCAGATGAATGAACAAG^ 

GGTCATGGCAACGGAGGCACAGTCTACAAAGCACATCATGTC 

TACACTGGAGCTTrCAGAAGCAGATCATGTCTGAGCTGGAAATCCTTTATAAGTGTGACTCAT 
CATTTTTCGTAGAAAACAGGATTTCAATTTGTACAGAATTCATGG 

GTCCXTGGAAGAATTGCAGTTGCAGTTGTTAAAGGCOTTACCTATCTGTGGAGTTTAAAGATTTTACACAG 
CAACATGCTTGTAAACACAGGCGGACAGGTCAAGCTGTGT 

ATGTTG G AACAAATG CTTACATGG CACCAGAAAGAATTTCAGGAGAGC AG TATGGGATCCATT CTG ACGTGTG GAG CTTAGGAAT C 
TCTTTCATGGAGCTTGCTCTTGGGAGGTTTCCATATCCTCAGATTCAGAAAAACCAGGG 
GTGCATTCTrTGATGAGGATTCGCCGGTCCTTCCGCTTGGAGAGTTCTCGGAG 
AGCAGCCCAAGGAGAGACCGGCGCCTGAGGAGCTGATGGG^ 

TCCATGTGGGTGTGCCGAGCTCTAGAGGAGAGGCGGAGCCAGCAGGGACCCCCGTGA 
rv^ATAT^ 

ATTTTGATAAACAAAAGCTTTCAGTTTTAAAAGCAAATCATGGA 

TG AT AGAAT T ATGAACCTT AT AAACT CAGT ATTTTTTAGTT AAC CT AAATACAT AAATCT ATTTGAATTCATCTTATG AAAAGAG A 
TATCTCACTAGATATATGATCATTATATGCTT^ 

CAAGAAAAATCACAATAATTTTGTTGTTACTCAAAAATTTATGGGTTAGCAGA 

AGGT AACATCATTAACACT ACGT AGT ATAATTTT T TT AGT ACTTATTTGACAT AAC CTTT ACTGCTT CTG CAAGAATAT AT CAGT A 

ATTTTATAACAATCTGTCTC^^ 

AATTTAAAATCAGACTTAAAAA^ 

AATAGATATAAATGTAAAACTCCACTTAAAGTG 

AAATTTACAAAATGAAGGAAGGAAGAATTTGAAAGTTTGAATA 
C^TCA^^ 

OTAACTTTCCCTGCCAACAAAATCTAGCAGTTCTAAATCrrA 

n rTT mTGAG AAC CATGTGTT ATGAAGTTT T CTTTCCAGCACTTGT CAT ATAAT AAAGGCTCAATGT TT AAT AAAT ATGTGATGAT 
nn^AACAGCAATAAGTTTTGAAATATTATCTCAGATGTAATGAA 
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GAATTTCCTTTTGAGGTATTTCAAATGG 
TGTAGAAGTAAATGAAGTC^U^CATGAAACAGAA^ 
GTTAGTAACTAGAAGACAAAAAGAGTGTTTTAATCCTCAGAA^ 
GAGCTGTTCAGAAGCTACATGTC^TTCCTTAGTC^ 

TGATTTTTTAAGTCC C CTTTAACTCAGAAATTTTATAAr CATTGCTG CATGTCT ATGTCTGGTT CCCAACCATG CTATTT AGTTTG 

GTACTTACCGCTTTTCCAGTATATGTGGAATACAAAGAACCCTGCTTTGA 

TGCTGGTAGAGTAACTTTGAGAAAACTTAGAGCTTTCTAAGTOT 

CACAGCATTTACTTGTGCCAG^ 

GGTACGGTTTTTATGTTTATTTTTCAGTTGTGGAAACCAAAGGGTC^ 

CTACAGAGTCCACATTCTCAGCCTCTCCTCTACACTTGTCX3TAAGATGCACAA^ 

TTTGTTATTATGATTAGGACTACATATA^ 

GATTTGTAGGAACACCAGTCTACTCTACTGAAAGAAAAACGTGCGGCCX3G^ 

GAGGCCGAGGTGGGCGGATTACGAGGTT AGG AGAT CGAGAC CATCCTGG CTAACACGG TGAAACC C CGTCT CTACT AAAAAATACA 
AAAAAATTAGCTGGGCATGGTGGTGACGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCG 
AGGCAGAGCTTG CAGTGAGCCTAGAT CC CGCCACTG CACG C CAG CCTGGGCGACAGAGCAAGACT C CATCTCAAAAAAAAAAAAAA 
G AAGAAG AAGAAAGAAAAATGTGTT ATGT AGAAATTATGT CTAATAAAATTTT AAATT AAT AAATT AATT CATTTTAG AGT ATTT A 
AAATCCAT AGGCTTAG CAT ATAGTGGTGGTCCATGAATGACCTGTGG AACTGTAATTTCTGC CTCTGTGTGAGTGTACTTT CATTT 
CCATTCCTGCCGAAACAGAACAGGGCAACTCTCCTGTTACACITTTGGGAGTTTG 
TTAAAAAAAATTAAAAGGGCTGGGGGCAGTGGCTGACT^ 

GTCAGG AGTT CCAG AC CAGC CTGG CCAAT ATGGT AAAAACCCAT CTCT ACTAAAAATACAAAAATTAG CCAG ACGTG ATGGTGT AT 

GCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCACAAGAAT^ 

CACTGCATTCCATCTTGGATGACAGAGTGAGACTCCATCrCA^ 

TTGTT ATTG CT ATT AT ACATT ATTTTTTC CAAT CAAAATTT CAAAGACATTG CTGGTG TGT AG AAGGGCAACTTI" 
GAGACAGGGTGT CACTTCrGTCACC CAGGCTGGAGTGCAGTAGTG CAATCAT AGCTCAC CGTAACCTTGACCT C C CAGG CT CAGG CA 
GTCCTCCCACCTCACCCTCCTGAGTAGCTGAGACTACAGGCATGTGCCACCATG 

GGGTTTCACCATGTTGCCCAGGCTGGTCTCGAACCTCTAGGCTTAAG CGATTGGTCTGCCTCGGCCTCCCAAAGTG CTGGGATTAC 
AGGTGTGAGT CACTG CACCTGGCCT AT ATTCTTTTAAAT CAAG ATT AAT CGACAT ATATTGGAGTGT CAT CTG CAAGGGC C CATAC 
TGC^TGCTGAAGGAGACAGGTG^^ 
CAGTG<3CATGATTTCG^CTTACTGCAATGTCCGCCTCCCG 

CAGTTGCATG CCACCACG C CTGG CAAATTTTTTGT ATTTTTAGT AG AGATGG GGTTTT AC CATG TTAGC CAGGAT GGTTT CGATCT 

CCTGACCTTGTGATCCACCTGCCTTGCCCTCCCAAAGTGCrGGGATTACAG 

AAACATAGGAGGCAATCAATATTTTAAATAAAATGTACTGTTTAATGGGAGTCTATAA 

GGTGGTGGAAAGGCAAGGGGGCAAGTGATTGATGTTC^GGCA^ 

CAACCTACTCTGCGTTCAAAACTCCCTGTTCAGTCAGC^ 

AGGGATATGGAAGTGAATAAAACACACAATTTCATCCCTTCATGAGTAAAAC^ 

TGTAGTGGTTCACGCCTGTAATCCCAGCACmGAGAGACATGCAGATCACT^ 

TGGTGAAACCCCGTCTCTACTAAGAATACAAAAAAAACAAAAATAGCTGGGCATGGTGGTGCACGCCTGTGGT 

GGAGTTGAGGTAGGAGGATCGCTTGAGCCCAGGTGGTGGAGGTTGCATTGAGCTGAGATCACACCACTGC^ 

CAG AG CCAG AC C C TGT QTC AAAAACAG AAAG AAAAAAAAAAT AGCAATAAGTG CCAGGAG AAAAACACT 

ATTGTGCATGCCTTCATGCTTGTATCTATGGGTTTGAGGATGGCCTGCAATCTCAATAATGTGCCA^ 

GGTGACCTTTGAGTCAAGACCTACAGAATGTGAGGTAATGACCCATGCAGATATCT 

GCAAATGCAAAGGCCCTGAC^TGGGATGCGGGACATC^ 

GAAATGAAGTTTGAGAGTTTAGATGGGATGAGGGGCAGAATGTGTAGAGTTTTGTAG^ 

GCACAAGATAAGAAGCTGTTTGGAGAGTTTTAGTCAAACAAGTGACTTGATC 

AGTAGGGGCAAGGGCGAAAACAGAAGATGAAAGAATAGTTTGGACTGGGGTAGTA 

TG GATTAAGGATTCGAT ACTGGG ATCAGGGT AATAAGAGATG AAAGAAAAC CTCT CAAAATGT AATTTTTC 

TCCTTAAAAAAATAAATCACAACAACAAAATAACCCTATTAAAATGGGCAA^ 

CCTGTAATCCCAGCACTTTGGGAGGK^CTGAGGTGGGCAGATCGCCTGAGGTCAG^ 

CCCCATCTTTACTAAAAATACAAAAATTAOTGGGTGTGGTAGTGCACGTCrGT^ 

AATCCCTTGAACCTGGGAGGCGGACGTTGCAGTGAGCTGAGATTGTGCCTCTGTACTCCAGTTT 

CTCAAAAATAAATAAATAAATACACAAAATGGGCAAAAAACTTCA^ 

AATAGGT AAAG ATGTTC^CATCACri AATCATT AGAG AAAT ACAAATT AAAACT CACAT T AT ACC CAACAGG AT 

AGCTACTATAAACAAATATAGAAAATAACCAGTGGTt^TGAAAAC^TGGAGAAA 

AAAATGGTACAGCCACTCTGGAAAACAGTATGGTGATTCCTCAAAAAATTAAACAT^ 

agtatacatcccaaaaagto;aaaggagagacaggaacagatatttctac 
aaottggaaacaacttaaatgttcattgatagatgaatg^ 

tgaaaaggaaggaaatggcc^gggtggtggct^tgcctgtaaccccagcactttgggaggcca^ 

TCAGGAGTTCGAGACCAATCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAA^ 

CCTGTAATCCCAGCTACTTGGGAGGCTGAAGTGGGAGAATTGCTTGAACCCAGGAGGTGGAGCCTAGATCATC 

GTGTTGATGACACAGCAAGAGTACATCTTAAAAAAAAAAAAAAAGTAAGGAAATGCTGACACAT 

AGG CATTAT AC CAAGT GAAAT AAG C CAGTCACAAAAAGG CAG AT ACTGG ATG ATT CT ATTTATATG AGG T ATCTAGAGT AGTCAAA 
TTCATAGAAATAGAAAGTTGATTGGTGGTTGC(^GG 

GTTTTGCAAGATGAAAATCTGGAGATTGGCTGTACAACAGGATGAATGAATGTATGTAACACTACTGA^ 
AGTACAATGGTAAATTTTATGTGTATTTTATCACAAGGAAAAARATCACTAGACATCTAACTT^ 

TTAAATTTG C C AGTTAACTG AAG C CAATAT AGGCCAGTTTTCAT ATTC CTT AGTGTT AT CACACTGGTGCACTTACTGTTTTAC CA 
TTTTCCCTTCTCA.TTTCATTTTTCTGTTAGCATTTACTACTATCTAACATATAT 
ATAACTTCATGAGGGGAGGGATTTTCTATTACACTTAGTGAAAAGTAAATCCCT 
TTTTTTACAGTAAGTTTGCTTAATGGCTAGTAAACTATCTCAGCCAGTACCTGAGTGACTAT^ 

AAAAAGGCCTGG CGCGCTGTCTCACGCCTGTATTCCTAG CACTTTGGGAGGCC6AGACG(MTGGATCACTTGAGGTCAGQAGTTC C 

AGACCAGCCTGGCCAACATGGTGAAACC^^ 
AGTTA(^TTGGGAGGCTGAGG(ZAGGAGAATCGCTTGAACTGGGAG 

TCTGGGTGACAGAGTAAGACTCCGTCTCAAAATAAATAAATAAAATAAAATAAAATGAAAAAAAG 

GATCACATGCAAOTCGATGTTAAATACTAAATATTCCTTAAATTTAGACAGTATTC^ 

TGTCIATTTATGATATCAAGTGGAATGTATTATCA^ 

G CT C AAATCAT CAAAGTTAAGATTTCTT ACT ACATAAAGGCT ATG AAG AG CTGACGTGT AATTAT ATTGGCTCTGCAAAT ATTGGA 
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AGAGAAATATGAGTTTGCCATTAGTGT 
TGAACTTTTAATAAAAGTACATC^ 

TAGAATATCTG AAGGAT CTGACT AAATGAATTT CTAC CTAAGCT ATTG ATTT ACAGTAACTT AAJATGTGAT AATTT ATCT CCT AC 

AATGGCCTTTCAAATTAAATATAAAAGTAACTCAGAGTAAAGTA 

TAATTTTTTTCTCTCACATTAATCAAGCACOT 

ACCCCAGAAAGTAAAGCCAATT03TGCACCTTCATAAGAAAGAAATAAAGGACGAACCTA 

TGTAGA«5CACCATCGTTCTACAACCTTATGCACTTTAAA^ 

CGGTGGCTCACACCTGTAATCCCCCCM^ 

CAACATAGTGAAAC CCAGT CT CT ACTGAAAATACAAAAATGAG CGGGCACGGATGGCGCACGCCTGTAGT CC CAGTT ACT CG G CAG 

GGGTTGGGGGCTGGTGGGGGTGGTAGTGCTGAGGCAGGAGAATCGCTTGAATCCG 

GCCGTTGCACTCCAGCCT^ 

CCTGATATCACCATATCACCAACTGCGTAAGCAGTGCT 

AATAAGAGACG ATGATG CGACT ACTGCAGCTGTCCCCAGGCAATTAAACT CTT CGG CGAATTTCTTT ACTATTGCTTTTG AACAGG 

GAACTGTTTGCTATTTTAGGGCACTCGTGCGGTAAAGAACTTCAOT 

ATGATCCTCTGAACCTGTGACTTGGAAGGGGGGATAAAAAAGAACTAT^ 

(^CAACCCCACGAACATrCAGATTCATGGCTTGACAAAGTGGGGAT^ 

ACAAACGGTCTGCAATGTTAAGCCTTAAATTGGCTrT^ 

CTGTCATTAAGGAGGGTCAGGAAATGGAAGGGCCGT^ 

CAAGAC CAGTCT CTNNNNNNI^NNNNNNNNNNNNNNNNW 



NNNNNNNNNNNNNN ^ 

CACGGCGGCAGAGACCITCACCATAGCGTTCGCTCAACTC 

CCCGGTGCACCCTCCCCGGGAGACACCTCAGACCCCCGA^ 

CCCCTOTCCCTCCCCCTCATOT^ 

CCCTCCTAACCAGCGGCCAGTGGGTTTCCCATACCCCAGGATGTGAGCCTCTTTAACCT^ 
TTTCCTGCCATGGAGAACCAGGTGCTGGTAATTCGCATCAAGATCCCAAATAGTGGC 

GCAGTTACTCTTCAGGG ATGTG CTGGTGAGTGGTAATCTCAGTGTCCGGATGCCAGCAAGGGGGACT CAGGGACTTGAGTAGTCAG 

CACCTTGACCACTGGTGACCTGAGCCAGTGGCAATGGCTACTGCTGGCTTCCT 

TGATAAATTGCAACCACTAAAACCTGGCAAATGTCTAGGACCCT 

CTCATGGCCCGGGCTGGGACACACATATAGACACAACC^ 

GTGTTTCGGGGAATGTCCTTGGACAGATTCCAGCCTTGTTTTTCTAC^ 

AAGGACAACTTTAACTTACCATGTTTGAGCACACTGTTTCATTA^ 

GTGCTGTGGCACGATCTCGGCTCACTGCAGCCTCTGCCTCCTGGGCTCAAG CGATTCTCCTACCTCAGCCTCC CAAGTAGCTGG CA 

CATGCCATCACACCTGGATAATTTTTGTATCTTGTAGAGACGAGGTATTGCCATGTTGCCCA 

TGCTTGCCTCGGCCTTCCCAAAGTGCTGAGATTACAGGTGTGAGTTACTGTGCC^ 

T AAGACATTTTACTT AG AAAAAAATG CTAAC CATG ACAGTATG ACACAT CAT C CGTT ATT AAAATG AG TC CAG TTTCAGAG ACGGT 

AAAAGT AAGGG AAAATGTGCATC TTAGAATTG GTG AT AT TTT ACATTCAGATTT TTCAG AATAT ATTT CAAATAAAATGATGGAAT 

TTTAGAGCJUjAGTGTAACACTGTAGGTGGCCTff 

TGGCTTTTCCAAAGCTGTGAGCTCCTGAGTGG^ 

CCTACATGTTGCAGCCGCTTCTAATCTTTCTGGTATTCAT^ 

TGAGGCATGGAAGACAGAAGCGGACATTCCGAAAAGCACATCACTACCTGCAAGTTAAGACT 

AGAGGAAGATTTGTGATGTGATTTTTCATTGTCTTGGGTAACTTAATCAAAAT 

GTTACTGACTGCCCAGGCCATGCAGATTTTTGGGTGCCTTGACCAGGAGCTTTC 

GTATCCGGAGTTCTCCACACAGTCATGCACATACAGATTTCCCCTCATGATAAGGCCC^ 

ATCTGGATTGGCCCCTGTCTGCCTGGCACCTGTGTCATCrCTCT^ 

TTTTCATTTCTCCCAGGTGTCCAGGCTGTCrGCACTCTGCTCTT^ 

TTTTCTCTGTGTCATTCTTGGTTATGTGCCCCACCTCTGTGTCXaTCACAGCTTTC 

CATGATTCTTTGTCTCCTCCACTAGATGGTAAACTT^ 

ACTTGTTTAAGAAGGGAGCTTAAAAGAGGATGCATGCAATTATGGTTCT 

GTTTAATTGCAAGCTGTCCCACTGACTAGGCAAGTCAGTCTACCTCrCTGAGCCT 

GTACCTTCCTCAAAGACTTC^AAAGATTATGTGAGATTGTGACTGTGA 

GGAAGGTTGATTATTATGTGATTATATTATTTGGTTATGTGATTA 

CCAGGTTGTTTAACTTGCAAGTTGTACTACTGGGGCAAGTATCTTCTGTATCTC^Gm 

GGGGTGAAAGTAATGCCTACCTCCTAGAGTTCTGAGGACTCCTTGAGTTGATACATGTAAAGCACT 

TAAGCGCTCCATTAGTGTTGTTATTATTGATGGGATTGTTATTATAAGGGATTGGACTTGGGT 

AGTTGTACCTGGCTCAAGGCTGACTGGCTTAACAGGAAAC^CGAGTGGCCCCACC^ 

CTTTTGGCTAGGTAAGCCC^GGTGGAGCATGTTTTGTCAGAAAGGCCTGACra 

CCTAGTGCTCCATTTTAGCAAATGre^ 

CAAGAAAAGGCAAGAACCAAGAGACACGTCAGCATCTCTGGCTT 
CCTCATCTAATTCAGACTCATTTCGAATGAAGAGGCATGAACC^ 
TGGCAGAACCTGACACAGATGCCAGAAGCAGGTTGTAGTCACCGAGTCAGCCAT 



ATCAGGGACT^CTGAACTATGGAGGCTGACCTGACCCTCAAAGG 

CTCrTTTTCCCTGGAGACTGAGTTCCTAGTTrTGGGAAAACTATACA^ 

CCTCTCCCCTCTTCTTTTTCTATTCTGTGGTCTCCAAG 

TAGGCCTGTAATGCCAACACTTTGGGAGGCCGAGGTGGGTAGATCGCT 

AAAACCCCATCTCTACAAAAAATGCAAAAA^ 

GGC^GAATCTCTTGAACCTAGAGGTCGAGGCTATAGGGAGCTGTCATTGCAC 
CTATCTCAAAT^AAGAAGAAAACACACACACACACA 
TGATTTTT'TCCTTTCTCCTGAGTTACCAAATACTATTGGCTCTTTATG 
TACTAGTCTTTCAGGTTTGTCTTTATCTCTGTAACCAGTTTATAATCTTCCTA^ 
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T ACAG CTTTGTG ATCTTTATGCTCGTGGCATGTTGT AGATGCT CAATGATGTTTTGGATCTGT CAGAT ATTT GTC TGATTT ATTGA 
AAGGAAGAAGAAAAGTATCAAAAAATGGTCATCATCAAAATATGTAAACTTCGGTTTTCTTTTT^ 
GACAGAGTCTCGCTTTCTTGCCCAGGCTGGAATGCA 
TCTCCTGCITTAGCCTCCTGAGC^CTGGGATTACAGATACACACCACA 
5 GGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAAGTGATCCAC 

GGTGTGAG CCACTGCGCCCATCCCGGTTTT CITTTGAAACCTT CTTT AGGTTAGTTAGGT ACATGTTATATTGTCAA 
AAGGCCGCTTGGAAATAAGTCACTTTAGCTGGAAAATGTATATAAAATCATCAAATGTACCCT 
TACTGATGTTTAAACTrCTTACCTCrrTTOTCATTATG^ 
CATGATTCTCTCACTCAAGTTTGCTTTGTTCTTTCT 
1 0 GATGGGCTAAGGCATGCCATTGCCACTGAGCAGGTTTTCAGTTGGTTGTTAGAACT 
GTAGGAAGGAGCCTAAATTAATTTGCTAACACTATAGAGGTACAGTTCACAT 
TAACCTAGACTTAGAGAGTTGGGGATGAAGCAGTTGTTTTCCCCTCGTTTT^ 
AGTACTGCTGAAGGATGAAGT'CAGTGAGTGAATCACCATCCTCCACTGATTGTAGGG 

AAGCTTTGTAATACTCCAG TTATTGGTGTTTTGTTTTATTTATTTGACACT AATATGG CACTTGCCATGTG CTAGACACTC TTCTG 
15 AGT ACTTT ACAAAT ACTAACTCATTTG TT AGGT ACAG CT AAGAT CTGAT AT TACACATGAGGAAAGTGAGG CACAG AG AGATTTTG 
TGCACTCAGCTGTGCCAGAGAGCTAGCATCCAGCTGTAGCTGTTTGGCT 
GTGTGCTTTATTCTGTACrCCCTTTACAGTTTC 

AAAATCTTTTATAGATGCTTCCTCAAAAAGTTGATGTTTCATTTGTGAGGACT 

TG AG CAAGGACTG C CT AAGATAGT C CTCTTTG AT CAGTTTTAT ATG CA.CAGATTTATTTTTG AAAAAAAAAAAAAGCACTAT AGAG 
2 0 GCAAG ACT ATTATGCAAATTGC CACTG CAG CAAAAGCTTCTAAAGAATTG CCTTT AG AAGGAAGTTGCAG CCACTCTG CTAAGTGC 
CCTGCTTGGAAAGGCTGTGGGCTCCCTGCTGAGAAATACTGCGAGCGGC^^ 
CCTGGGAGGGACACTGTGGCAGAGCTCCTGGAGAG 
TTTCCCCAGAGCGTAGGTGTTCTATGCTTATTCAATTAAGCTGTCT 

ATTTAAG CATTGATGTCAGATTAT AT AC CCTTTTTAGATTTTTGTTGT AGG CTAG ATTAACTTTATTT T ATGT 

2 5 TTTCTGTTGCTTAAGAGTAAAGGTTGATAAGAACTGTTTGTGTTAAAT 

GTCGAACTCCTACCATGATGAGTGCGAGGCACAGTGTTGCTTGTTGTATATCAGGAACrACATTTGTTGAAT 
AGGTCTAAGTTTATGTTTATTAGATTAATTTTTAAGCTACATCTATA^^ 
GCTCAGAGAGGTTAGGTGATTTTTTTCCCAGAGATTCAGACCTACGTCA 
TATATTCCACAGCCTCCCATAGAAGAAACCAAGTTTTTGTAAATACATGACX3GTTTA 

3 0 AAATATGTTATTAATAAAATGTGGAAAAGTTTCCTTAAATATTTTTCTTCCTGG 

GGGCTAAGTGTGATTATGAGCTAATGTTTTTATAGTTTATC 

ATTTGCCACATGTTAGCAATAAAGAAGATGGCTAATTGTGTTTCTTTATTCTTTCTTTGTTTAGGA 

CTGAAGCAACAACTACAGCATTTGAATGTAAGTCTGGCTTGTATACTTTCTTGACT 

TTATGGGTG<3CATTACTTTAGAAACTGACAAAAACAGATTATTTATGACCATCATATGGTACATTTAA^ 

3 5 AAGTTT AATG ATGATACTG AACTGAGT AAT ATTTT CTT AT TT AAT CATATCTTGG GTG AG CATAT T TCT ACAATAT AG CTTTAATT 

ACCAGC^TGACCTTTATCTGCTCTTGCTCCrGATT'GTGATAATATTG 
GATTATTGATTAAGAGATTTTAATCCATTGCTGCTTTAAAACATAGCAACARA^ 
ATTCTGTTTTAGTGGGGCTTAGCAGGGATGCTAGATGGTGACAG<3AACACTTCAGG 
GCCTGGGGACAGGGCCAGGGTGACCTGGGGACTAGAAAACTGTTTTGTCATGAACCAG^ 
40 TCATGCTACGATGAGATAGTCAGTAGCATAAGGGCCCGGATGCIGAGAGGGATGAAGTAAGAACTGTAGT^ 

GAGGG CTACTTTTTTCTTTT CT TTTTCTTTTTTCTTTl^TriTriG C CG CAG ACT CT CACTGTGTTGC C CAGGCTGGAGGGCAGTG 

GTGCACCACCATGCCCAAGTAATCTTTGTATTTTTAGTAGAGAC^ 
CTCAAGTGATCCACCCACCTCAGCCTCCCAAAGTGCTG 

4 5 TGTAGAAAAATTTCAGTTTTTCCACAATAGGATACTTTTTTGTTTCT 

GTACTTCCTTTTTGGAGGATAATTAGGCATTTCTTATCAACAT^^ 

AAGCAG TT C AACAGAAAT AGCT ACACAT TTGTTGAATGCCCAT CAAACAAAGTGTT CA.T TGTGAGATCATGATGATAATT AT TGTT 

ATTTTTAGAGAC^^ATCTTGCCTTGGTGCCTAGA 

CACAATCCTCCCACCTCAGCCTCCCAAGTAACTGGGACT 

5 0 GATGGGGTCTTGCTATGTTGTCCAGGCTAGTCTTGAACT 

TTATAGATGTCAGCCACTGTGCCTGGCCCACrrGTGGGATTATTTATTTATTTTC 
AGTGCAGTAGTGAGATCACAGCTTGCTGTAGCCTTGACCT 

GTAAAC TAAATG CCT ATT AT AATGGATT AATAAATT ACT ATTTGT CTATAATTCAAAG C ATTCAACAGTG ACT AAAAAGAATG AGT 
TAGQATTTTTTAC^TTTGTACGGAAAGATATCTGTC 
55 AAATATCCAACCACGCTTCCCCAACTTTATGTGTGTATTGAAAGATAGG^ 

TACCCTGTAAAGTGGACTTGGGGCCTGAGGCTTGGGTTGGTGGGGGCGGGAGGGGGTGGATGAATGATCA 
TGTTTCTCTATTGTTTGAACATTTTrCAACAATCATTGCCTGTTTTTA 

T ATTAGT CACA^TGAACTCTG TACT ATAGAAAAAT AGT ACATCTG T CTGAAATACAT ACT AGTGTTG CAAG CTTT ATT CAGACTCT 

TTGCAGTAACTCCTTGCAGGTTCCACGGGTTTCATGGACAGTCTGATAGTTAACATTCGGA 
60 CTAAGAGCATTCCATGGGTTATCTTATTTCTTTTAACCATCACAAGATCCCCACGAAGTA 

TGCTCTGTTGCCCAGGCTGGAGTG CAGTGACATGATGATAGCTCACTG CAG CCTTAAACTCCTGGG CTCMGTGATCTT CCIGCCT 

CAGCCTCCCGAGTAGCTGGGACTACZAGGCACCCACCTCCACACCTAGCTAATTAAA 

GTGTTGCTCAGGCTGGTCTTGAACTCCTGGCCTCAAGTGATAGTCCCATCTTGGCCTTTCAAAGTGCTG^ 

CACTGTACCCACGAAGTACTTAAAAGGACATTATTAGCTCTGTTTTTTGGTTGAGGAAAGGAA^ 
65 TCTCTCAGCTGTTAAGTGGTAAAGTGGAATTTCTGGAGTAGTTTCCAGAGTGTATATTCTC^ 

GAGTTGTCATAATTATTGAAATCATATATTCTGGTTGTTC^ 

TCGGATCCCGAAAGTGGCTTGATGTGCCTGTGCCTGCCTCATTGGC^^ 

TAACTATCATTCATTGATAGCTTGCTCTGTGCCAGGCAGTGTACTAGATGACTTGCCTAAG 

TGTGT^AATAGGAAAATTATACCTGTGAGGAAACGAAGCTTAGAGGCCTGAAATGATTG 
7 0 CCCAGGTTCAAACACIACATCTTTTTGACTATAAAACTTX3 

GATAAACCTCTACrTACTCC^GGTTTACGTTCTGTGATATC^TGCTCTTTCTTTTGCCCCATCAA 

TTCATG TGGGCCAGTTTT AAATG AAAAACAATAAATACAAAACATAC C CACAG ACAAAATAATG ACAGTT CTTAGT TGTTTTGTGT 
TTGGATTATCATGTACTTTCCACTATAGACTTGAAATCCTAGGCTGGCTTTTTTCATGATTA 
ACCAGAAATCCGAACCAATAAATGGAATCTATGTTTAAGAAATAGTTGAGGCCGGGCGTGGTGGCTCACGCCTGT 
75 TTTGGGAGGCCGAGGCAGGTGGATCATGAGGTCAGC'AGATCGAGACCATCCTGGCTAACAAGGTGA 
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TACAAAAAATTAGCCGGGCGCGGTGGCGGGCGCCTGTAGTCCCAGCT^ 
AAGCGGAGCTTGCAGTGAGCCGAGATTGCGCCACTGCAGTCC^ 
AAAAAAAAAAAAAAAAAGAAATAGTTGAGAAGATATAGCTTATAAGGAAT^^ 
ACAAC C CTTGAATAAATACAACTTTAATAAAAGAAATACTTAT^ 
CCCAGGCTGGAGAGTGCAGTGGTG03ATCTCAGCTCACTG 

TGAGTAGCTGGGACTACAGGTGCATGCCACTACACTGGCTAACTTTTGTATATTTT 
CCTGGTCTITGAACTCCTGGGCTCAAGCAATCCATGCT 

GCCTTATAGTAATACATTTAATACTGAAAATATACCTAACACTAATTGATATCT 

ATGAATTAAGCAACAAGAGATAGG GTAG CTGTCAAATGTAC CTGTTTTGT ATAAATGTAGAGAAAAAGAGTTGAGTGAGTGGAA^ 
GAGGGGAAAGGTCAGGTAATGTTACAGATTCTCCAGTTCTGAGAAATGGCGTCTCTTCAATGAAAC 
ATTAATCTCTCTGCACCTTTACTAGAGCAGGGAACCCCTTCAACCreCC 
ATTTTGAGTCACTTAGTGATAGGTACGATGT1TTATTC 

ACTTGTTGAATGAGTGGAACGTTATGTGTGTCCATATTCAAATGAAAATTATAAAATGTATATGGTCT 
TGCGTTTTTGGATTATTTGTGTTATAGCCTGAGCATGGGA^ 



TTAGGGCTGGATGAGGAGAGGGACATCTGTOTGCAGAACTCACCTTTTGGGCTGA 

ACCT CTTTTAGGT CCTTATTCTCCATTACAT CTAG AGCTCACAGTTCTACAGGCTGTACAAG CATGGCACT AGCATCTGCTT G CAT 
TCTGCTGAGGAAGCTTTTACTTATGGCAGAAAGTGAAGGGGCAGCAGGCGTGTCACATGG 
CTCCTTC!AACCAACCAGTTTCTCACATGAACCAAC!AGAGTGAGAACCCACTACCATAGA 
CCACCCCCACGACAAAAACACCTCCTTCCAGGCCCCGCCTGCAACATTGGGGATCACTrrGGA^ 

CATTGTGG AAATAAATTTGTAAG AT AAATT CTTAGAGTTGG AACTACTG AGACAAAG AAT AC C CATATTT AAATTT CAAT AT CTAT 

TACTGTGTTTGCTCACCAAAAGGTAGAACCAATCTACACTCACA^ 

ACTCTGGATATTAGCAAACCTTTTAGTCAATGTCAGTTTGATAGGTGAA 

AATTGTAAATGAGCITGAATATCTTCTCIATTTGTATTTGGCrrATTTGTATTT 

TCTGTCGCCCAGGCrGGAGTGCAGTGGTGTGATCTCGGCTCACTGCAACCT 

CCTCCCAAGTAGCTAGGACTACAGGCATGCGCCACCACACCCG 

TTGACCAGACTGGTCTCAAACrrCCnX5ACCTCAGGTGATCrTGCCTGTCT 

CGTACCTGGCCAGCTATTTGTATTTCATTTCTATAAACT^ 

AATTG ATTT AAGG GTTT TCCAAT ATT AAGGTTGG CC CTTTG TCTTGTGTGTTACAAATGTri'U"rGTTGA.GCTTATTGCTCTTTTCG 

GTAACTTCTGGGTTTATGTCATGCTTAGAAAGGTCTTCTACAGTCCAGGAT^ 

CTACTTTAATGAGGTTAGACACACACACACAGTTATTTATATAAATT^ 

TTTTTATTACAAAGTTCIAGAATTTGTTTATTTTTCTTTTTCTTTTCT 

CCTAGGCTGGAGTGCAATGGCATGATCTCAGCTCACTGCAACCTCCGCCCCCTGGAT 

AGTAGCTGGGATTACAGGTGCCCACCACCATGCCCAGCTAATTTTTCT 

GCTGGTCTCGAACTCCTGACCTCAGGTAATCCACCTGCCTCGGCCTCCCAAAGTGCTGGCATTAC^ 

GCCCTTGTTTTTCTTAATTCTTGGACCATCTTCAGTTTATTTTGGTGAAATAAATGA 

GATAGTTGATTCCACACCAGTTATGGAATAGTTTTTCTTTTCCCAGATC 

ATAAAAAATCCTGTGTGTTTGCATTTCTGGGTCCTACATATGCTGTA 

TTTTCACAGTTTCATTAGCAGATTAATGAGTACIACCAACACATGAGGTGATAAA 

CATATCTG TTACTTGTGTTCCTCTGCCAGCATTTT CT CTAGAGAAG TAACTG CTAT AAAATGACT GTTGG AT TGGTG TT CG TGG AC 

AGTAAAATCTCACTGGTGACTTGAAAC!ATGCATTTTAAGTCCATGTAGACAAATGCTGTCT 

CIACTTTTCTATTACACrCATGTATTCTAATCTGTAGCTCTCAGAAGTATCTT 

AGAAAACATGGTTATAGAATAAATATTTATTGAATTTGTGCCT 

ATTTTCAGCTGTTTTATGAACTGCTGGAAGATTATTTTGTATTTAGGGT^^ 

ATTTGG CATAGTATG AC CATTT AAAG CT AAATTGCAATTCTGGAAGTTGTAAATT CT CAGAGTATAGG AAAC AAC TAT AG CTGAT A 
ACTGGAGTGACACCTGTAGGAGATACTCTTGTGTTCAGCTAGCCATGACA 

GCTT CAG AG AAG AGTCTAAAT CIATAAAAT ATTTGG AAGAAATGAAGTTT CATTTC CTTAAACTATTGTATATG TAAAAATAT AAAG 

GGTCATTTTTAAGTAAAACTAGGTAAGTTAGTGAAAAATGGTCA.CAATAAACCTA 

AGTTAATTATGCATCCTAGAATTTGGGGCAAAATAAGAGAGCTTTAAT^ 

AGTCCAAATCCCTTTTAGTCTTATGAGAGTAGTCATTTCTTATTAATAGTCrTGCTAATTTGTATGAAAATTAAGAAGTTTGAGTC 
ATC^GCAAGTCCTGTTGGCTCTATCTTCAAATTTATCCAG 

TCTCTCACCGGGATTAGGACAGTGGCCTCGAAACTGGTCTCCCTGTTCTACTTTTGCCT 
CCAGAGTGATTCTTTTAGAAGTCAGGTGAC^CCACCACGCTCACAACCCT^ 
AGTCCTTAAAATGCCCTCCAAGGCTCTGCATTTTCGCATCTGGCCTC^ 
TCCCCAAACCTCTCATTCTGTTCTAGCCACTCTGGCTTCCT^ 

TGTATTAGCTGCTGCCTCTGCCTGGAACAGTCTTTCCCCAGATATCAGCAAGGCCCACTCCCTC^ 
TAGAGTGTATGTTCCATGAAGGCAGGATTTTCTTTGTTTTT^ 

ACATAGTGGGT ACTT T T ATTTGCTGGAAGG AGGAAG AAAG CAAAG AAGT AAGGAAG GAAT CAGGAAG AGGG AAGCAAG AAGCT AAA 
TTAAAACTCATGGATATGAAGATGTGGCCATGGAAAGTACGGCCTCTGGTCCTGCCCACGTCATTAACT 

TTTAAACATGACATCTTTTCCGGTAGTTTCCTGTTGATGTCTTT CTAACTTCTATAATTATGTTTT CATTT CTTTTCT CTTACCTG 
GGAAAAGTGCXTTTTGCAGTAGTTCTAATTGCGACCTGGC^^ 

TCCCTGGAGTCTGCTGTOATCTGCTGTTTACTCAGTAGGCCATTGTGACCGCCCCTTACCT 
GCCTTTATGTTTCTCTTCTTGGTTT'lTCTTrTGGATTGCATGTTGCCTCAAATAAAATTGCT 

CTCTTTTC C CAAACT CTG CCATT TTGTATATTTG TG ACACACTG CAGATTTTGTGAAGATTG CTT CATAATATTTGTTT ACCATCT 
GCTG CACT C TATTTGGTATCTTG CTAGG ATTTTTT C CTC CATTT TAAGT GACATT AACAAAATCT AGC CAGTACATAGGCTTTCTG 
GTTCTACTTTTGCTTCCCTAATGTGCTGGAAAATCATATTACTATGCTCCAGGCAACTAGAGAAGCCTGGAA 
CAATGTATCAGATCTGTCTGCTCCCCTACCCCCACCCCTTTTCT^ 

CCT C CATGTAT AAG AT AG AAAAAG CTCAAATTATTTT CTTGCTTCTTAG TACCACT ACTTTCTAGTGTTGTTC T CTAACATATTTG 
TACCTCTGAAACTTTGTGGCAGTTTAGTAAGTTTTTATGATTTATGGTTCCAGCTAGTGT 
AGGCAGCTAGCCTGAATCirGGCTAGATGGGCTCTATATATCCAAGTTTTTGTTC^ 
TTTGCTTAGTATGTATTTCTGCTTTCTTACCCTTCACTAACTTTTCGTAGCT 
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TG CTTT ACAGGTTAAGCTGAGTGGTTTG CT CTTGGG CTGC CATCAAGGG GTCAGAG ACAG CT CAGCTTTTGCCTTTTGGCTGGAG C 
CACCAGTCTTCTCATGGATATGGTTACTTTT 

AT AG ATATTAAAGAG CAAATCAGACAGTCTCC CAAAG AGCAGTTTGTAAAGAAAACCAATAC CTATAT A CT AGGTAT CCT ATT 
AACCCACAACAACTATTCATCAGTATGCCIATA^ 
5 AATCTTCTTTTGGAGTCAGACCCCCTAAGGGGAGAGTTCCAACATTT^ 
GCTTCAGTTTTCTGTCCACTTAAGCAAATGCTCTTGTAATGTTTCATA 
GGTAGCTGACTTCTTTCACAGATCTTTTTCTTCT 
GTTTAAGTAGCCTTAAAGGTTCAAGCrCTAGCTTTGAGA 

TATGTAATCGGTGCCCC CCCClTri' 1 W 1'T AAAT AAATTATAAGAAGGGTTTTCACTCGTTTT AC CAAAAT ATAAG CACTTG CC CATC 
1 0 CT ATGTGAACAATACCITATTCAGGT CTTGGGTATATTGAAATTCAAAT ATAAGCAGTCT CTTTTTAAC CATAATCTCTATAGCTA 
GCTTAGTTGCAGAAATGAGATTTATTTTAAAAAAACTACTTTATTCA^ 
AAGAACAAATCAGTTTAGCCTACATTTAAAGAGAATTTC 
TGTAACATTACATCATTTGTACTTTGTTTTTTCCTGTCT 

T CT AG AACTTATATATAG ACAG ACTC CTTT CTTCTCT CT CTGGAATGGCATTTCTC CCCCAATTTGG CTCTAGAGATT CT 

15 ATATCT AAAG CGAATGAAAG AATCITTATGTG TGT AACATCTCTCAGGACTC CAGTGT ATG 
CCTTTT ACTCTGAAAACAGG CAACTCTTGATAGCAGAATACTGGTAG TGACTC TC CT AG ATTTAGACAGTGTGAT CTTGATGAT AC 
CGATTCACACTGAGTTGGGGAACTCTGCAGCAGTGAACCAGGATCCAGAAA^ 
GTGAGAGCTATGGGTCACTGAACAGGACTGGTCCAAGTGCTGGTTTCGATCATGCT 

AAGG G AGAAAGT ATTTTG CT TGAAATTAGG CT AATTGTG AAAGAACACAAG CTTC CT C CT AGGGTCCCCATAG CCTAGTCACCAAG 
2 0 CAGGGGGATATC CTTGGAACT ACAGCCAGACAGG AGGGAAG ACCCAC TGACTTGTGGT AG GGGAAG AT C CCTGTGTTGAACTGAAC 

TCCCTCTGTACTTTCAGTAAGCATCCCCTGACCATGTTGGGCAAGCTTATTCT 
GGCTTGGGATCAGAACAGTCTTTCTATATGATCAA^ 
ACACCAGAACTGAGGCATCCTGCTGAGAGCCTGCCCCTTCC 

GGCATTTCTTTCAACTGTG CATTT ACAGT CTACTACAT AGGCTGTC CTC ACTG TTGACAAATCTTT ATAGAAG GATGAAGAGTT AA 

2 5 CT ATAT ATTTGG AAAT AG CCATAAGT CATTCTT AGACATG CTTG GCAGATGATCTGG GT AAGTG AT CAG G CCAACAATTTCAT TTT 

GGGTTAAAAATG AG ATTTAATAT TCT CACATAGTCT ATCTT CAG GTTTTTAAAGCAGTTTTG AAG AGATATTTCAAAAACTACTTT 
GAATAAGAGAAGTATCATGAAATAAATACATTATTTCT 

G T AGT TTGTGAAGT CTTGTC CT CCAG C CTTAAG ACTTAAATTACTTCAAAATAGCATG AAT AG CATTTG AAGCTTTCTATGG CTGG 
AGAG TATGCAAAACAAGTGAAACTGTTCGGAAAAAG AG CTG CT AAAT AGAATGGG G G AG AC CTGT ATTAGAGCTG CTTTACAAGTT 

3 0 AACCTTGCACTAGGCCTAGGTTCTCCTGTGGAAAGAGAATAATAATAATGATGTTCATGTC 

GAGGTATTGCATATGAAGCATAGATCACAGTGCCT 

AGATGATAGATTGCCTGGGAGAGAGGAAGAATGGAGAAATCGGGGTATGGTGCTGTTTGGGAGATTTAAAAAG^ 
ACAAGACTGGCCTCTCGTTGGTTTTTAGTTCCGCTAACCAAACCTGTTTTGTCATACTCCAG 

ATGTTG TGAATGACATG AG CAATATTT TAATTTTCTGCTAT C CCT AAAAG AAAGTTTTAT ATTGAGATT CTGATCAT ATTC CAAAT 

3 5 GGAAAACAAAACAACAAACTAAT AATGTCAACATAC CC CCAAAATGTGGT CACATTGTAAT AAAC CG TT 

T AT ATT ATGTTC CCITTGTG CATT AAACAAATG CACACCT T ATCATTTG CATT ATGTGCTTTT AAACAGATGAAGATGAAGATGGT 
GATCGAATTACAGTGAGAAGTGATGAGGAAATGAAGGCAATGCTGTCATATGTAAGTATACGACAAATGA^ 
TCTTAACGTGATTGAGGATGCTGTTTCTTGGGCATAGTGAAGACGAGTAAATAAATCACAG 
GGTAAAGGTTTOUyGGATTTCCGTATGTGAAAGTTAAGGGCA 

4 0 ATAATGCATCTTAATGACTTTTGCATTAGGTATG 

GTAACGG CTCATTAAAAATGAACCCCTGGGCTCTAAAAAATTTTTCAAAA CTCACCCTTCCAG 
ACTATAGT TT ACAATGTAG CTGCAGGCTT AAT ATGG ATACACGATTT AAAG ATGT ACTGTTG GTT ATT T ATG CACT ACC CATAAT A 
CTCC CACCGAAAATGAACT ACAGGGCTACTTACTGAAT CAAAGTGATCTG AAT ATATTCTGAAACGACTTTT C TTTTG TGTTTAAA 
ATTTTACTGAAAATCTGCGTTAACAGTTATCTGATACTGAA 

4 5 TTGAAAAATTCAGCTATCATGAGGAGACATAGGTTACCTGACT 

AGTCTGTAATCTGTTTACAAAACCTTTTCCCGTGGCCTGGTAGAAAGAGCATTGGACTGGAAGTCA 
GGTTTCTTGGCCCTAGGCAAATCACTTAACCTTTCAGTGCT 

AT ACTGCACAGGGCCATTGTAATGAT CAAATAG AAGAATAGATGTG AAAG CCCTTTGAAAAGTTAAAAGTTCTATACAAATGCAAA 
TGATGGAATCATTTGGAATCATGTGGAAATGATGGAATCAGOT 

5 0 TTGT AGGTTTTTG AT AGACTT CAGC TT CAG C CAGG AT ATTG TTTCATTG T AATGAAAACTGTAAACTGTCTGTATGGG TTT CTG C C 

AAT AGTTT T AAT TT AGGGT AGG AGAAT AGAGAACAAG AAGG ATAGG ATGATT AACAGAGCATTG CAT TCCAACCAG CACAG AAT AG 
GAAAAGTCAATTTCCCATTGACTTCGCATTAACTTATGTGAGAG 

TGAAAGGCAGAGTAGGATTGTCTGCTGAAAAGGTCTACCACTGGTATTGATAGATTGGCTAACTCTCCT 

ATTTTTAGAAAACCCACACACACAGATG 

5 5 ATGTATTCAACTTAACCACATGCAAAAGATATGTACTTTTGGGTATGATCTTACAAA 

GGTCACAGTG ATAATGATGATTT CCATTT CT AAGTT ACT 

TCACT CTG T AAG ACAGAT ATT ATCAT CCTT ATTTTTT AG ATG AG G AAG CTGTGGTT AAG AGAT AAGCC CAAG GTCTTAGAG CTAGT 
ATGAATGTGAATATGGAATTTAAGTTCAGTTTCTTTTCATGCCCAAGCTGTTGCTCATTTACTG 
CTTCCTCTTTGCAGGATTTTTTTTATGTCATCACTGTTATTTATTTTTTTTTCTCCGAGATGGAGTCT^ 
60 GGAGTGCAGTGGTGCCATCTTGGCTCACTGC^CCTCTGTCTCCCGGGTTTAAGCAATTCTCCTGCCT 
GGATTACAGGCACCCACCACCATGCCTGGCTAATCTTTGTATTTTTAGTAGAGACGGGGT^ 
AAACTCCTGACCCCGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGTGC 
TCATTATTAAATT CTG AAT AATTTGGGTTT ATATAAT AC CTT AGTTGTG ATTT CTGCACCATGT ATTC CTTACAGCAT ATGCTTAT 
TTTCTCAAATTCAGTGTTTAATTATTTTTGTGTTGTAGGA 

6 5 TTCTG AGAT T ACTTGGGTC CACTTTTTTTTATCTGAATAGAAGTCATCTAT CTCATTG ATG AGAT AATTTTAAAAAATT TTTTGTA 

GAGATAGGGTCTTGCTTTGTTGCCCAGGTTGGCCTCAAACTCCTGGCCTCAAGTGCTCCTCCTGCCTCAGC 
AATTACAGGCGTGAGCCAAGGTGCCTGGCCAGGTTGGCTGTCATTTTGATCT 
CATTTGGACGTAACCTTACAGTAGGTTTTGATTGGTAGTCCTCATGATTACCAGCATGTTCCT 
TTGTAGGTTCAATTTGGCTCTGAAGTTTGAGTCCAGGATTTCGTATAAAGGCATA 

7 0 TTGTTTTTTGCAAAAAGCTGTTTTTCTCTTTTGTTTTTGGTCTTGTTTTGCTT^ 

GAGTGCAGTAGCCTGATCTCGGCTCACTGCAACCTCCGCCTCCTGGGTTCAAGTGATTCT 

gattacaggtgcccgcc^ccacgcacxjcctaatttttg^ 

GAACTCCTGATCTCAAGTGATCCGCACACCTTGGCCTCCCAAAATGCTGGGATTAC^ 

gcttagtttttcatatagaattcagtaca^ 

7 5 CACTTTT A C CATTAGT CCTTT AATCATTCTTTTTCZATTGATT ACAAAGGTTAAGTAAATAAAC^ CT AT ATCT CTG CAT AAGAAT 
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AAAAACAAATTTGCAACCTCTGAGTTGAAACTTGAAGAATTA 
GTACAAGTGTACATTTTTATCAGTAATTTAAATAGATTTTT 

TTATTTAAATCACTAGCAG C CTGAGG CTGTTAATTTT ATG AGTG CCATAATAATGG CAT ACAGTGGTTTTTGAT ACATGTG ACACG 
TGGAATATTGTATTATGGTAAGGATTTGTCATGACAATAGGTTGTTCT 
5 TGTTGTCACACATTAGTACCATTTTATGTAAGTTATTTAGAAGTTC 

GAAGAAAACAAAGCTATAATAAATTGTCTTTTAAGAAAATGTAAAGTAATTO 
TTCAGCAGCCAAGTGTTA1TACAGCACCATTGTGATAGCTGCTCTG 

AT AGCATAGTTTCTTGTAAGTTATGTTCTTAG^ CTGACAC T ATTG CATCTTTTT A 

ATTCCTCCATAAAG TTTTTTTTTTT T TTTTTTTT GAG^ 
1 0 AATCTCGGCTO^CTGCAAGCrCCGCTTCCCGGGT^ 

G CCACCGCGCCCG^KH'AATTTTTTGT ATTTITAGT AGAGATGG GGTTTCAC CTTGTT AG CCGGGATGGTCT CGAT CT CCTGACCTC 
ATGATCCACCCGC CT CGG C CT C CCAAAGTG CTGAGATTACAGGCGTG AG CCACCG CG CC CGGCCT AAAGT ATTTTTT TAATT AGAA 
TTTGAAATGCTTTATTAGAAAAAAAATACACACA^ 

TTAATGT AGTTT AAATTTTGTTGTTGT TGTTCTCATTTCCT CCT CTCAT TTT CT AGAAAGATTGAAAATAG CGTG CTGCTTTTTTC 
1 5 CTTCCATTTTTTTGTGTTTGTTGTCTGGTAGT^CTATGGA 
GGGTGACCTTTCATTAAGAAATTC^ 

CTATTCT GACTT CTT^AT ATCATTTTTT CC CCAG G AGAGT ATGGCATACATTTTACTGT AGTGC CTC CCATT AT^ C 

TTGTATTATATTT AGT ACAAGAGT C CTT AT ATTTAAGTGAAGTGT AAGT ATT CC CTTTGGTGTATTTG AC CACAGTGT ACC CTGT C 
2 0 AGACGTTGAGTACTATGAGGGAAAAATACmTTGACCTGT 

CAT AACTTGG ACATTG CCTCT AAAATCATGCAGTTACT CAAGT AGAGTTGG AAGGT AAC CACAG AGGTTTTGTTCAACTCTTT CAT 
TTTACAAATAAGGACATTGAGG CCCACAGAGAATAG AAGATTCAGAT ATCAGTTG GAAGT CACATGGCTAATT AAAGGCAAGG CCT 
GGATACAGACTCTCAGTCTCTTTGCTCCTGGTTG 

AAAT ACTGCTTTTT CCTG AGTGT AAAAAAT AATAC TT ATTACCAAAATGT CACTGTACTTGAGTTTGT AGAG AAATGT AAAAAATA 

2 5 CrCTGTTGGCTGGGCGCGGTGGCTCACTCCTGTAATCCCAGCACTT^ 

TGACCATCCTGACTAAGA.CTGTGAAACCCCATTTCTACTAAAAATACTAA 
CAGCTACTCGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGTfZ^ 
AG CCTGGGCGACAGAGCAAGACTCC^TCTCAAAAAAAAAAAAACAAAAAAAA 
AGTTTTGTCCTTACCTACAAGXSCroTGTTTTACCT 

3 0 GGCCTTAGCTATTAGGAAGTTTTCTTTAAAGAACAAAGAGGTTAGAAGC 

TAAAT AGGTT CTACTTTGAACnTAACTTTG CAGGTGGCAAAAT CAG TTCAGCTTTCAATG AGTTGTTAG TC CCCTAATT CAATG CA 
ATTTATCCAAACAACTGATTACACAGTATATTOTTCAGAGAAGGTGGAGAAA 
ATAAATTGTTTTGGGGATTTGTTTATTTACTCAAATAATGACATGCTGCAAGCT 
TAGTCAAAAAAGAAAACTTTTTTC(^GAGACAGATTATTTTAATTC^ 
35 ATAAAAGTGTAGTGACCCCTTTTAGCCCCTGTTGTGATCTATGTTGGAGCCTGTTTGGTA^ 
TGCATCAGAACGAGGGATTTATTATGTGGCTCTTGGAAGCAAAGGA 
GGCACTCTGGGGACCACTGGAATAGGAGGTTTGGACC^CC^^CATTTTC 
CTGTGGAGGGTTCCGCTTAGCCTATJTTATAACAAAA 

AGTGGCAGGTTC CCTGAAGCTGCTATT AAAACTTCT ACT CCTAAAAATCAGGTGAAAGG 
40 TCAGTGTGGACACTACCCCAAGCrAACCAAGGTAATAACTCTGAAAGTTTCAGCTGAGAG 
TGAATACAAATTTAAACTCCTTCTGCCAACACAT^^ 
TTCTATGAAGGCCATCTGGGGCTCCITAAAGTCCTTTATGGTTTT 
ATTTTCACTTTCAAGTCATTAATAAATACATCTGTCCA^^ 

(ZACATTGTGAT^rTTCTTTCTCATCATCrAAT CAATCTGTGACCATATGTGGAC CCTCGGTCAAGTCTTTTAAGGAAAAG CTTGTAC 

4 5 ACATTGTAGTCTCCTCATGACAGGAGCAATGAGTTATTGAGGAGTCAGAG 

ATATTCAAAATAGATTGGCTTCATATGT TATTTTTGACCACTGGGG CAGTTGTGTGCTCTGTGAGAGGTCAGCATCATAAAGGAAA 
AGCCT CTTAATGTAAAGTATGAGAAAT AATTG AAAG AAGGTGT AACTA^ CTGAGTAG CGTTATAATTGAATTTGTAG C 

CATTTGATAGAGTTAAGATTGTTGAGTTCTTCACTGGGAAAACT 

ATTATTGCTGTAATTACTGACTGAGCTACATATAATAGGGTATGCTTGCAGTATTGACAA 

5 0 T ATGAAAAAAACAGGATG AGTATTCCTT AT AAG TGAAAT ACTGT ACCAATT ATGT TGTCTG TT AGG CATG CT AAGGAGAGATTTGA 

GAAACTCTCCTGTTATAGATGTATGGAAGAATAATAAGTAG 

GAAGGTATGACCAATGAGAGTCGTTACTATGTCTATATCAGTTCCTCTOTGGATTCTTCCT 
TGTGCTAGGCACTATATGGCAGTGATTTCTCAGGTTTTTGAAGGCATAGTCCTTTTTGAAAAT 

TCCCCCAGAAAATTCACATAATG CAGCATTTCACCAG CTTCIAAGGATCTCCTAAAGCCCATATAG ATCCCAGGTTAATACCTCTGG 
5 5 CATAAGAGGAAATAAAATATACAAGGCATGGTTCCTGCCCTTGTTGAGCT 

T ACTTTATT ATTT ATAGT ACTGTAGTAATTG AGG GGAAGGG AG G G G ATACTG ACT CAACCTCAATAAAC CAGTTAAATTCTAC CAC 

TGTACTrTGTCCAGATATCTCAGTCGCCTTGTTTTATGGTTTCTGTGTCACC^ 

TACTTGGAAATGAGAGATTCAAGGCACCATTTCCTATTTCAAGATTTTTTTTTTC^ 

AGCTCCTTTAATTTTGATCTTCTTAACATCTTTATAAG ATCAGCAAGGCAGGTTTT CAGAGGATACAGAG 



60 — — ■-- 

TCCTCATTCATCCAATCAGTTTTTCATGCAGCAAATGTT^ 

TGCAGTGAGCGTACAGTTCATGCCCTCCAGGAGCTCATGGTCCAGTC!AGGGAGATTAA 

AAGACAAG AAAAACTCATAATTACCAAG CGGAGTG ATAAATGAT AT AATACAC CTAAGCATT AG GTG CT C CATGAG CACAAGAAGT 
GTCTAATCTAGCTTTGGGGACCAAAAAGGCTTCCTGA 
65 ATGTTTGGGGCAGAAAGAACAGTGTGTTTAAAGAATGCCTGGTGAGTCAGTAGGAGCT 

ACG GGG G AGAGG GTGGG AG ATGGGG CTTGGAGGAAG GTGGGTT AG AC CATGG AGG GCCTT AG CCCAGGG AAGTG ACATG AT CAT AT 

CTCATATTGTCAGAGGCATTTGAGCCATAGCAACTCTGTTAGTCT 

TAAGAGAGAGTTCTTGGATCTTGTGCAAGAAAGAATTCAGAGTGAGTCCACA 

AGTGGTGAAAG AAT AG CT ACTCACT C CATAG AGT AG GACGTT C C CGAAAGT AGG AGGATGAATGTG C CCACC CTACGTATG ATGTT 
7 0 TGTTTATGTATAGGATAAAAAAGGATCATGGGGAGATGTGCTCTGCTACAAGGGTTTGTGATAA^ 
ATATTTTGCAAGAATTGATATTATTATTATTATTTTTTTTTT 
TCGATCTTGGCTCACTGCAAACTCACCTCTCGGGTTCAAGTGATTCTCCT 
TGCTACCACGCCCAGCTAAGTTTTGTATTTTTAGTAC^GACGGGTTTTCAC 

7 5 AATTAGGAATG CTTC TGTTTT CAAGAT ATTAGGGTAT CAG GACATTC CTGAATCTGG GTATG TTT AGTAAACG TTAT CAATCTGTT 
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CCCTTAATCATAAATAGCTAGAGCCT 

AAAGAGGTTTAATTGACACAGTT CTGCATGGCTGGGGAGGCCTCAGGAAAC TTATAATCATGGCAGAAGACAAAGGGGAAG CAAGT 
ACTTTCTTCACAGGGTGGCAGGAGAGACAGTTGGGTAGAGGGTGGG^ 
5 AGAATCCACTCACTATCATGAGAACAGCAAGGGGGAAATCCACCCCCATGATCCAGTCA^ 
' AACATGAGATTTGTGGGCGGGGACACAAATCCAAACCATATC^GACT 

TGCTGGGCCGCAGCCCCRGGAAGTCAGGCATTCTTAGTTACRGGATGTTTA 

AGATTCAGG ACTTAACAGACCAAGGAAATGT CCTGATGT C CTGAT ATCTT AAGAACAAAAG CATT CTG AGTTTTGCTTT AAAGAT A 
AT AT CTTTAAAT AAT AATATCTTCTTGTGGAAGACAGTAGTTA 
1 0 GCATCTC CTCCAT ATGGlTrTTTG CTTTC C CTCCTCTTGCT T 

TTGGGAACACCCAGCTCTGACTGGGTTTATTGGTGGAGCC^ 

TATTTCTCAAGAGTGGCTTGGGGTAGAGAATTTATGAAGCGTGGAAGCCACT^ 
CACTGGACTGGTGAGAAGTTAACAGATTTGAGAAGTGGGGAGGATGTCAAGTTAATAAAAT^ 
15 TGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGTGGATCACGTGAGGTCAGG 

AACCCCAT CT CTT ACT AAAAATACAAJlAATT AGCTGGGCATGGTGGTGGGCG C CTGT AGTATCAGCTACT CGGGAGGCTGAGG CAGG 

AGAATCACTTGAACCCAGGAGGCGGAAGTTGTATTGAGCCAGGATCGCACCATTC 

ATCTCAAAAAAACAAAATAAGGCCAGGTGCTGTGGCTCACATCTGTAATCCC^ 

AGTAAAGAGATCGAGACCATTCTGGCCAACATGGGGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCT 
2 0 CGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGACAGGAGAATGATTTGAACCTGGGAG^ 

CCGCTGCACTCCAGCCTGGG CGACAG AGTGAGACT C CATCT CAAAAAATAAT AATAAAAAT AAAACAAAAT AAAATAAAATTT AGT 
GGTTAAATACGGGGGAGAAGAGAAGAGAGATGGTTTGGATTACTTCCAGATTTCTGGTTT 

TTCCTGAGACCATAAAC CCAGGCAG CAGAAGACAGGTCTAGGGT CTG AT AGGGGATAG GAG AGGAAAT CAG TTTTGG AC CTGTTGA 
ATTGGAGGAGGCTCTGCACAATCAACTGGTTATGTTTCAGAACTT^ 

2 5 CAGCTCCTAATTAGTCTCAGTGATTCCTCrGGTAGTTGAAACTTTGAAGTGGC^ 

GCTAATGAAAAACAGTGGGGGAAATCCTGCATTTAAAGGAGAGATGGGGTAATAACC^ 
AGAAGTAGGAAAACCCATGGAGGTGGCGGGGCX5GGGGAGGGCAGTGGGGGAGTGGTTGTTC 
AGGTCACCCATCAGCACAGATGTCTTCACACTGCTTCGCCAGCCTCTGAAAGA 
AGCGAAGCCA.CAGCTCTCCACCAGCTCAGCA.GTCTACCAGGTAACCCTA 

3 0 AGATAAC CT ACTTCTTAACT TXH^AAGCAAT AAAT AT ACGTTTT ATG CCCTGAGTG CATGG 

TTTTTGTTTGTTTGTTT ACTG AT AAACGATTTGCTT TAA 

AAAACTAAGTGAATAAATGTAGATATGTAGTGCTGAGCCTAACA^ 

TCTCTATCTCTCTATCTCTCTCTCTCTTTTCTCTTCTTT^ 

GATGGAGTCTTGCTGTGTCGCCCAGGCTGGAGTGCAATGGCACGATCTCGGCTCACTGCA^ 

3 5 TCTGTCTCAGCCTCCTGAGTAGCTGGGATTACAGGTGCACACCACCACGCCT 

TAACCATGTTGGCCAGGCTGGTCTTGAACTCCTGATCTCAGGTGATCTGCCT 
TGAGCCACTGCTTCCATCCTTCTCTCTTTCTTAATAGAT^AATTTGTGCCT 

TTTT AAAAAATATGG CAG ATTTC CATGAAG AAAAAAATGT AAT T ATTG ATTG AATTGTG CCATT AG CATT AG AATT AG ATTT AT AA 
GAAATATGAACACTGTACTAAACTTGCAAACAAAACAAAGCAA^ 

4 0 TCAAATTAATTAGATTTTTGGTAACTATGCTAAACTTTGTCACATCCTTATTAAAATGACTC 

GGGACTCTGAAACTAATCTGAGTCTAGTACTGGAGATTCCATTAGCAATTGCAAAGCTGATTGATCTm 
. GATTAT CGCTGCAGTT ACTG CATGT AACGG ATT ATG CAAGG GGTAACAGGAG AAAACACATTTT AT AATTG CTG CTGATG TATTCA 
GTTTGGACACAATATTTCATACTCTATATAATTATTT^ 

ATT GTTT CAAT ATG AGG AGCTCT ACTTTT AAATTT C TTAACTAAT AATG CATT ACAGTTT AC CAAGTG C CT T CACTAATTTT AT TG 
45 CTGCAGTTTAATTTTTGAGCCTCACAGTAACCCTATATATATATATTTTTTTTT^ 

AGGCCGGACTGCGGACTGCAGTGGCGCAATCTCGGCTCACTGCAAGCTCCGCTTCCCGGGTTCACGCC^ 
CCCGAGTAGCTGGGACTACAGGCGCCCGCCACCGCGCCCGGCTAATTTTTTGTATTTTTAGTAGAGACGGGGTT 
CCGGGATGGTCTCGATCTCCTGACCTCATGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCC 
CG»GCCAGTAACCCTATATATTAGGGTAAATATTTTTATGCTCATTTGGAGGAAACTGATGTTT 

5 0 AGG CTACACAAGTT AATGAGTGGCAGACT T AGAATT CAAC CC CATTTCAAT CAGT AAATGTTTATTAJ\ACAC CTACT ATGTGCAG A 

GCTTTGTGTTAGGGCTAATAAATGCCAGGATAAGATTATAGACCTGAAGATGAAGTTAT 

GGTTACTTTCTTACATTTCTGTCCAAGCTATTATGTTCAGTATCT^ 

TTGTTTCTTTGTTTTTTTCTTTTCTTTTGGTOTTGGTCATTTCT 

TTT AAAG GAAG AGTATATAG AAATGACACTAATAG AT CCAGATCT AAAA^ CTGGCCGTGTGCAG TGG CTCACG C CTGT A 

5 5 ATCCCAGCACTTTGGGAGGCCAAGGCAGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTC 

CTCTACTAAAAATACAAAAATTAGCCGGGCCAGTGCTGGGTGCCTGT 
TGAACCTGGGAGGTCTAGGTTGCMTGGGCCAAGATTGTGCCACTGTACTCTAGCCTC 
AAAAAGAAAACCCCAATTTGTTTTGCAGTTTATTACTTGCTGTCTTCTCTCT 
AGAGCCCGTCTTGATTTTACAAATGACTAACATGAGGCACAGTGAGCATTT^ 
60 GTGATGGAGCTCCTGGGTTTGAGCCCTCTGATTCATTCATGTGCCTTGCTCATCTC^ 
GGACTGCTCTGTGCTAAGAGGTACCATGCATGGCCCTGCATTTTAT^ 
TTTCTAGGCCTAGTGCCTTGTTGGTGGCTGTTCCTCAGGGGCCAGTTGTGT^ 
CAGCATGTCCTTGATGGTGAATTACTTTGCTATGTGGAACATAAG 

G CAGGAGAAAATATACTTGG ATT T AGAGGAAAT ATAAAAAACAAAAAACACAACCT AACT ACTC GCT TT AGAG AAACTGTTAAGTG 

6 5 GCATACTTG^TATCATTACTGTGACTGTGTTAGTCGTAGTACAGAGA 

AAAAGTGTATATAGTTTGCCAGTTGATTTGGCACAGACCT 

T AAAT AAAGCTG GAAAACAGGTCAATTT CATG CATGCGAG GTTTTATAGT AGGAAAAAAAAGCATTG CACCTGTT CTGACCCT TTA 
CATTTGAG^TTATTTGTTTTCAAACTTGACCCCAAGCACC^ 

7 o AGCA^AACTATAC<3AGAACACCACCACT 

AAAGTTGCTTGCAGCCTTCCAGAAGCAAGGGGGTACATGCTGTGGATCAATAACCGGCAAA 
CAATGTTCCAAATAGCCACTGGCATCTTAATTTTCCATGGAAACCTCAAATACTTGTGTG^ 
TTTTCCAGACTACTTGTTAACAGTTATTGCTGTCCCAAAAGGTTGACAGCCTACATC 

TTTGAGT AATGG CGAGTG CTGTATTTCTGTGGTTTTAAGGAACAGAATCT CCTGAAAGACATTTG C CTT AGTATGT ATTC C AGAGA 
7 5 GTGTTTCACATGTTTATGCCGTATGTATATAAGTTTTTATGTCT 
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GGAGAGCTGGGTACACT^ 

TATTAACTTGTGATTCCCAAAATGATGTGGCATTTTATAGCTCCCCTTT^ 
TAATTTTAATGTACATAAAAGCAATAGGAAGATATATGTGATATCTTCTAAAAAAAAACTTT^ 
AATCAAGGAGGGATTTCCTAAACATATCGTATTTAAATTIATATGCACATGT^ 
5 TC TCCTCTTTGACAGCCCCTCAAAACATAAAATC^ 

TCGAGCATTATAAGATGTGAGAGCACAGAAGCCATCTCCAAGTTAGGTGG^ 
CACAGTACCACCTACTGAATAATTTGTGTTTACTAAGGAGTGTG 
TTAC^GTGCAATTTGATCCGTACTAAAGTGTTTATAAGGCrrAAAGCAT 
CATGTTAATCCTGTCATCTTTGGATGGGTC^TATTTTGTTT 
1 0 TTTT ATACACAAATTGAATAATGTTTGATTAACTGGCAT GT AGGT C C^U^GC CCAATGTGAAGCTTCT T CATTTTG CTTTG CAAAAT 
CCAGGGAAAAAGAGCAAGAGGCAATTTGGGAAATTCCACTAAAATGCT^ 
CTTAAATACTAATATAAAGTCTTAATCAACACTTTTACTOTAAAGATA^ 

CCT AAATATT CACT ATAGTCATTTTGATGTGCCAGTAGGGG AAAAGTATTCTCAAGG CTG ATATGTAT AT AC CAG '^^^^^^ 

15 TT ATT CCACAGT AATGGAAC AGCAAGTAAATGG ACAGTT AATAGAG C CTCTGCAG ATATTT CCAAGAGGTAATGTTG AG C AAATTC 

TAATCAACAATGATTATTT CAGT AAACTGTTTCATCAACAGTTATGTT TCCAAAGGTTT AAATGCACAT AAGATGATGTTT AGCT C 

TTAATTCGAAAAACAAAAGAAGTATTC^TTATCGTGGGGATTGCAGGCTATATTATA^ 

AGAGTAAATTAAACTAACTAGATTGTTTTTTGTTTTTGAAGTAC^ 

GTTT CT AAAGCAAGG CAAATGAATGACTTT AATTCTGGATATTTAT AAATTGTGGCTCR 
2 0 AGAAAGCTT T(^GTTAGCACAG AAGCG AAGATTTACTG AGGTGTTAG G AGAGAAAT TTTGGTGGAATGTT TGGTATCTGGTTTTTT 

GAGG AGGTATTTTCT AAATAATTGAAATTCTT CAT AAATGGAAATTT GATT AT CT C CAAAAATATG AGCTG ATGAGAGT AAGTTTC 

AGTGAATCACTTGGTCAGATAATTTTCTTTTTCCCITTT 

TATAAATGTT CC CT QAAAATTTGAAT AATGGTG AACAAGAACAGTTTAATT ATATAAACC^ 

GC CC CACTTGGAAT C CATGTTTT CT AACTAATGGAACAGAT AAGGAAATT CTCT AAGT AACCCTT AGAAAT AACTITAATTGAAA 

2 5 TAAT TAAAAGTT AAACTTAC CTT CGTT AAC TG AT AC TGTACAAAT AATCATG GT ATGAGTTTTC CAATTAAACATTC TAG ACTTT C 

ATTACCTGTAACTCTCTGAAGTTTGAATGGCT 

TTGGC CAAC C CACAAGAAG AT CTGGGTT CAAGAAT AAT ACTGCT CTTC CAG CAT ATGTGTGGG CAG CAG AATCAGGAGTGCTTAGT 
AT ATTACT AGTAACTT ATT GAAGTG CTAAGT AAGT ACAGT TGGG CT ATTTAAG G C CAGTG ATAGT CCAAAACAAGGATTGTG ATTA 
ATCCACTTCTGTTGTACTTGAAGATTAAGCACCAAAACAG 

3 0 ATGTAGATGATTTCTTTTTTrTTTTTTTTTTTTTCGAGAC^ 

GCTCACCGCAACCTCTGACTCCCTGGTTCAAGTGATTCTCCT 

CACCC^GCTAATTTTTGTATTTTTAGCAGAGACAGGGTTTTACCATGTTGGCCAGGATGGTOT 

CTCCCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCAGCCAG^ 

TGACTGGAGAATTTGGGCAAGTTTC^AATTCTT^ 

3 5 TTGACATTATAGCTCTGCCATAGAAAACCCTAATTGCAGT 

GTAATCTCAGCACTTTGGGAAGCTGAGGCAGGTGGATTG 

TGTCTCTACCAAAAAAATACAAAAATTAGCCGGGCGTGGTGCACACTCCTGTTGTCCC^ 

TGGCTTCAGCCCAAGAGGTTGAGGCTACAGTGAGCCATGTTrG 

CAAAACACACACACACACACACACACACAC^CACTTCCAATGTTC 

4 0 TCATCCAAATTCTTTCAAGTTTTGTTTGTTTTCTT^ 

TCC CATGATAATCACAACCATTTTTAGATGT CTG CT CTGTT AAGG CATT AAT ATTGCTTTACG CATCAT C ATCTGTAAT CCT C C TG 

ATAGTTCTGCOTATTAGGTATTACrrGTTTCCAATTTATAGATGAGAATATT 

CAGOTATAAGTGCTTACTATGCCCTrTGAGTAAGAATTTCATT^ 

ATAT AATTAG AGTGTG CATG G GAGTAGTTTGGAGGG CTTGTTTTG G ACTGGAG AGGAGT ATGTTATG GGGAGAGG AT ACTT AAAGG 

4 5 AGG AAATT ATATGAGATGTCATTG T CAT ATGT AG T CTTAGCATAAAACCCACT AGG CT G CAAT AAATTGTTAAACAT AGTCTTTCT 

AAGCCAAATTTAAATATTTCAGAATATGAGACTGTCTTTTTATTTCTGTGTTTTTT^ 

TTTGGTAGTCTTTTGCrTTTTGCTACCTGTCTACATCXaAA 

TAATCGACATGTAGACATACTGGGTATATTTCCTGTGTCTACCTAGTTGGT 

C CCATGTTAAAAAGAATCTTTAG ACTGAACAAG T AGGAAGG AT AGGAGAAAATG ATTGTT TCCTTATTTATTT ATT T ATTT ATTT A 
CAGTTCTCTCGCC^ 

ACAAG-GTATCACAGTGTTGCCCAGGCTGGTGTCAAACTCCTGAGCTCA^ 

GAAAGGCATGAGCGACCATGCCTGGCCATAATTCTGATGTAATAACATGTTCTCCTGA.TT 

TAACTCCCAGGTCATGCCTAAGTCCTCTACATTGTTAAAAGATATGTTAAATATTAA 

5 5 AATTCCTGACCOSATAACrTTAGGAAATATGAGGTTACTGGGTTTTTO 

GTCAATTGAGGAGTGAGGCAGATGTTAAGGTTAGAGATAAGAGTTGATGTTTGTCTCTGATTCCTTGGCTAGCCACTTTG 

TG AAATTTATG AATG CACTGCTTACTT CAGTG GAAAT ATTTTGAAGTGTTGT CTTAG GAAGTATTGATGGT C CCTCAAGGACCAGA 

CGTCAAATGCATGTGACTGTAGTTCAAAAGGTTAATTTCCATTTTAAGGAAAGAAGATTTTGAG 

AAAGG CAAGTAAAAACATT TTGTTATGGTTTAG AT AT ATTTT CT AGG AAAAATAAC^ 

6 0 AAAATGAAAGCAACTGCAGTGTTTTGTAATTATTACC^ 

GCTCTGTCGCCCAGGOTGGAGTGCAGTGGCGTGATCTCAGCTCACTGC^ 

ACGACATTCTCCTG CCTCAGCCTC CCGAGTAGCTGGGACTACAGG CGCCCIGCCACCATGCCCAGCTAATTTTTTGTATTTTAGGTA 

GAGACAGCGTTTCACCATATTC^CCAGGTTGGTCTTGATCTCTTGACCTCATGATCAGCCCACCTCAGCCT 

TTACAGGCTTGAGCCACTGCACCCAGTCATTACCCACATCTTATTAT^ 

6 5 ACATGTACATCTGAGAGAGGAGCAAAAAAAAAAAAAAAA 

CCTTAACAT ACAATTTG TAGG G CACT AT AGTTTTCAAAT AG AAG CAGTCT AT AACATGGG CAGGAGG CTTTTGTATTTTT AACATT 
TGTTG AAGAAG CATTTG CTGAATTGGAAAAGTACTAACCTGT AACT CAAGATGGTGT CAGGAAATT AATATT CTTTTTTCCTGGCA 
CTG C CTTAT CTCT AGGACT AAATCTT AGCATATGTCAAG ATTTG CAAGAAATGT AAT AAAATTAGTTAT AT ATCTGT ATTTTAAT A 
CATTTGACTGCTTATGG CTGTG CAAAGTAGTATTCAAACT AGAATG TT C CAT CGTAG AAT CC CrAGCAATCTTTTTGGCATTAAAA 

7 0 CATC CT TTT CTCAC CCCG^TTG CTCACAAAAAAATCCCT TAGCT 

AG AATG AGG ACAGATTG CAGAAATTTG AATT ATGAAAGG AAT CTAATCATTTT AG CT CAGTTAACTTGATTTTT AT ATGTTTT AAG 

CAGAAATTGGAATTTATATGAGGTGGTGTCAGAAAGCTCTTTGCATTTATAAATG 

AATTCCCACTTTGGGTCTTGGGAGCCACTTTTCAATCA^ 

CATTTTGAGCAAATTTGTAAATGGCCCTGATGTGTTCTACAATTTAGTCAATTACTGT^ 
7 5 AACATACATGGCCTGAAGGTACGAATTTCAATAATTGTTTCAGTAAAGTTAGATGGAT^ 
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TGTCAAATAAAACTGCACTTTCTCAM 

GTATTTGCTCAGTATGTGTATATATTTGCATACTCGGGGTTTC 
TCTTCAGGATAATCCTAGGTGCTGGTATCTACAAAGAT^ 
CATATTG CACATGGAATAC CCTCTCTTGACATATTTATC 
5 ATTCTGGATTT CTGTTTAGGGCTTCACATCTGAAAGA 

ACCTCATTTTTrAGTGAATAAGAAACATTCTACATACAG^ 

ACTAACCCCTGGGTATATTCTTAATGTACTITTTGAACATACTG 

TTTATGTCTCCTTTTTCATCACTGTTTAAGTATTTAATGGAATAATGA 

TTT GTAAC CT AT ATACTG AAT AGACT CCAACTG TGGGTTTGT CT AG CAACAAG CT AATT AACCTT CAT AGTGAAATGAGAAATTTG 
1 0 TGGCAAATTATAGGTTAATTAGAACTGTGTCCTCAGAACACAAGACT 

CCGGACCCTCTCAACACAG CAG C CCAGCAGT CTCAGATTCACTT CCAAGCAAT AGGTGCGAG CGAGCAAGT AAAGTGTG CCCTTG A 

TGTGATTTATCT ACAGAGTTGGGGTGGGGTGGGGAAG AAGCT AG AAAG CAGTT AGATAACCT AG CTACGTATTGAC C 

AAAGAGAAAGGACCTCATATGGAAAGATGAATGTTTAGATTTAGACTGGGTGCTGAGA 

TTTTGACCTGAAGAAGCAGAGAAGGGCGTTTCAGACGGAGGGAAGAGG^ 
1 5 GTTTGTG AGTAGGC CAGTTTGGCACAGGGAGGGGTGCTGGGTTG GGGAAG GG ATTTGGAGAG AGGAGATG AGAG AG CAATAGGAGT 

GATAGGAGCTAAGGAGGTAGACAGGACTTATCATGCGGGGCCTCGTAGGCCCTGTAATGTTTC 

TAGCAGTTAGAGGGCTTTTAGTAGGAGAGGGACAT^ 

CAGAAGCTCCCATCTAATCCTGCCCTCCTTTGATGTTTGACTTCCT 

CCTGTCGGAGGAATGCACTTTATCAGCCAACGCAGCTCCTTCCCT 
2 0 TGAGATTTGTCTTTCTATAGCGCCCACTCATAACGTTTTAGTTAAGCCCTCAGAACGTGG 

CTTCCAGTTGTCTTCCCTCCAAACACAGTTCTAGATTTTCTGTCT 

AACTTGTCAGTCCTCCTTCACTGCTTCCTAGGCCTCCCTCAGTGCCACACCCTGTCAGT 

AGTAGCTCTGCGTCTTTCTACTCGCCAGCATCCCTrGCT 

GAGTCTACTCCCTCCGTTCTTGGCCCATTCAGCCCCTCCTCCACCCTGCAGCT 

2 5 CAGTCATTCCCTGCTTTAAAACCTTCAOTGCTCTTCATGCTT 

CCTTCACATATGGGTGCTCTGTGTACTCCAGCTTGCTGGCCCACCTCTGTACCCCATCCCACCTGAC^ 
ACACATAGTGCCCTGAATGGATCACTTTCTTCrrCTCTACTGGGCCTGC^ 
TGACCITCAGTCTGGCTAACCTTGGTTATCTTTGAGGACrAAATTATCTTCT 
CACACCAGGGACCTCTCTAGGAATTCCTATGATACTCCCTT^ 

3 0 TGT C CTCCAGT AG ATT CT AT ACTATTTGAGGG CAGTGACTGT AG CATATT CATTG ACAAAGTTCCAATT CCCAG CATGATG CTTG C 

CACATAGATTATAGTCAATCATGTTTGAATGAATGAATGC^ 

TGGACTAAATTCTCCCAGCTCTTTCATCTTTTCTGATATGACATT^ 

AGTGTCTCTCTTCAGGTGTTGTACTCAAGTTACAGTTGTAACACATT^ 

ATGGGAT CTCCCTCTGTTGTCCAAGTTAGAGTGCAGTGGCACAATCACAGOT CCTTG ACCTCCTGGGCTCAAGTGGTC 

3 5 CTTCCACCTCAGCCTCCCAAGTAGCTGGGACTACAGGCTTGCACCACAATCCCT 

TTTTTAATTTTTAGTTTGTAGAGACGGGGTCTCACT 
TGGCCTCCCAAAGGGTTCTGATTACAGTCATGAACCACCACAGCTGGCCT 
GTCACOSTGAACATGCAGATGAATGAAATATTGAGTAAA^ 
CTTTTTAACCAGACAAAGTTGATAGCCAGATACTAGCATAATGAATCy^ 

4 0 TTTTTTCTTTTGCCTTAGGTGAATAACTGTTTATTAAACTGG 

CAAGTCCCATTGCCAAGGCAAGGCCTTTCCTAAACTACTTAG 

TACTGCTGGTGTTAGAAAGATTGATGAAGCTGTATTAGGAATC^CGTTTTATTXCTTG 
ATCAAAATGTTATTAATTCAAGTTTATAACATAATGGAATATCTATGTAGAAACCTTGC 
TTACAATAATTGGAACTATCAACTGGGAGTCATTTGTGGGAAAACTAGGATCT 
45 ATATGTGTGTGTGTGTGTGTGTGTGTGTGTATAAGAGATAGATGGACTTCGAGCAAGATCTGT^^ 

ATC CTTTATAAATT AGGGATAAT AACT ATACCT AAAG TAT AGTTATTATAAGG ATT AAATT AG G ATAAT AAG CATTTAGC ACAATG 
C CT AG CACATGGTCATCACTTGATAAATAG CAGCTG CT ATTATTATGTT AACTGG ACTGATTTTAG GCAGCTG TGTATTTTTT CCA 
TATATGTTTCAGCrrTTCATCCrATTCTACAAAATTGTTAATTGCCA 
TCAAGAACCCTTGGATGAAGACATGCTTTATAAAATCTAGGAAAATAGGGACTCMTCTC 

5 0 AGTTT AAG ATGT ATTTAG AAAG AAGAG TAATTAT AACTTG AAGTGGT AAAATGTG AT CAAACT TG AAGT CTG AG ATT CT CT CAAAT 

GTAACCTATTAAAGGTACCAAATGGAATATGTGTATTTCATArTAGGTCTTGAGTCCTO 

TTTCTTGCTCCTTCCTTCCTCCCTCTCTCCCTCCCTCCCTCTCTCTCCCTCCCTCTCTCTCT 

CTCTCTCTCTCTCaCTCTCTCrrrGTTTC 

GTCACCACAGCTTCGACCTCCTGGCCTCAAGTGACCCGAGTAGCT 
55 ATTTTTTTTAGAGATGGGGCCCTACTATGTTGCCCAGGCTTGTTCTTGAACTCCTGG^ 

caaagtgctgggattacaggcttgagacgctgtc^ 

TACTACTACTGCCACCACAGTATAATAATAACCAGAGCAGCGACCACCATTTATGGAA 

atatttaagggtgagaagagagtggagtatatagaatacgtaagatgtctctgtgttctcagggagctcccagaot 

GGGGGAGGGTATGGTAGGAAGAAAGACATCTATACTGACAACTAGCATCGGTAGGTAGGTAAAG 
60 ATAGAGTTGGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCATTTTGGGAGGCC^^ 
GAGACCAGCCTGGCCAACATGGTGAAACCCTGTTTTCGCTAAAAATACAAAAATTAGC 
CAGCTACTTGGGAGACTGAGGCAGAAGAATCGCTTGAACCCAGGAGGCGGAGGTTGCAGT 
NNNNNNNNNNNTTTATTATTATTATAAAAAAAAAACrGGA^ 
TCACTGCAAGCTCTGCCTCCTGGGTTCACACC^TTCTCCTGCCCCAGCCT 

6 5 CTAATTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCGTGTTAGCCAGGATGGTCTCAATCTCCT 

CCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCACCGCGCCGGCCCCT^ 

GGGAAGGTGCTATTTTAAATGCTACTCTACTAGTCAGGTAGATATTCACGGGTATAA 

TCCCCTTGACTACCATTTTTCTTCAATAAGGACTTT 

GAGGCGGGCAGATCACCTGAGGTCGGGAGTTTGAGACCAGCCTGACCAAC^ 

7 0 agccAGGCATCGT^CACATGCCTATAATCCCAGGTACTCTCMAGACT 

GTGGTGAGCCGAGATCGCACCATTGTACTTTGCACTCC^ 

AAAAAGGACTTTC^CTAAATTCATCCATATGAGTGTTAACATAGT 

CTGGGAGCTGAGTCTCATACCCATAGGAAAGGAAAACCAGCTAAAATTTAT^ 

AG ACACAGTG CT AGATT TTTTGTAATATTT AATTTTT CTGGTT AC CCT ATAAAATTGGTATTTT CC CATTTT ACAG AAAG AGGAA 
75 ATTGAGGTT CAG AGAGGATAAGTAACTTG TC CCAG AT CACACAGCT AGTG AAT ACCAGGATTT ATTT GT ACTACAT ATTTG CTTCC 
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CAGGAAGCTG AATT ATAACITTTAACTGTGTAAG AGAC CAGG ACTTAAATTATCT ATTTTb r r ri'GAAGTTCTAT AAT CTTTAT 
TTGCTGTG^TATTAGTGCrTAGCAGATGACATTGTCCACATGAAATTGC 

CCAT AAAAATGT CACTGAAG T AGTT CATAAGATATTGTAGCTT CATCTTGACTGT AGG C1TCTT TATTGCTTTCGATT ACT AAGAG 
GGTTTTAGTCTCTTrATTTGATACCTTTT^ 
5 TGCTCAATCC CT ATATGTAAAAG CAAAGGTGTATTTCAGTCTGG AATTTT CAGTGGTGTT GkGAGGTGTTGATGATCTTGCCACTA 

AAAATTATCTTTCXrrTTTCAGCTTAAAGAAGTCTTCTC 
ATTAAGATTTTCACATAATAATAACATATTACCCT 

CCTGT CATAAATGAATAACATGG CTT AATTTGATTGGCAG AACAATTTGCAGCCCGT^ CAAACTGTGCT AATX3GGTTT CACT C 
CCAGCAGATTGAAGTACCATGCACACACCCT13CTGCCA^ 
1 0 TGGGCTGTTGAAATGGCCCTGCATTCTGAGGTGACTGTT/rCTTT 
CITAGCTCClT C lTTTCTCCTGAAACrAACACTG<3 

CCTT CAGG AG AC CTTAGTTTGCCT AGT AGACTG G CCATCT AAAGCTGAATGACCT AAC CCAG C CAGTGAGGGACAAACGAGCCTTC 
AGGCCAGTTATTTCTTACTAATTTTGATTGAA^ 

AAGC T ACCT ACAGG GTC CT CGATTCTT CAGTAGCAACATAAATG CGTTTT AATG 
15 GAGG CAGGCG CCZTGTCCATTGCG ATCAAGCCTG C CTATT AGGCTGT ATTC ACTGAGACATCTGTGT CACTG CAAAT AAAAGACT CT 

TATTGATTTTCTTAACTGCTGTrCACTTCCCGTCCATGTTTCGA 
CACACACTGACCTCTCCAAGCCTCACTGGCATTTCATGAATC^ 
C&TGCCTGTCCTTTAATATCACCTGTAACAT^ 

AGG C CTTGCTTATTT AG CACACAATTCCTTGAATACAGTTAGAAGAAATC 
2 0 AGTATGTGAATTOTAGTCCTTAAATATCCTATATACAGCTAAGGAGAATCTAGTGTGGTGTATGTAT 

TTTTAGATAGT ATTTAT CGAAT ACCTTT AACTTT CAG CAG CCAAAGGTTTGTTT AGATTGAG GTCCACATCTGTATCACCTCAGTG 
CCTTAT ATTTTGTT AG AAC CAGCAT AT AT AAACAAAATCTGACT AAAAG CAAAAATATTGAG CCT AGG ATATTT ATGT ATCT TCAA 
CATG CTTGCATT CTAGTT ACT AATTGT AATTT ATGT AT GAGTCACTGTGTT CTTCAGCTGGAAAAT AT ATTT CAAAATTGACAT AA 
ATTAAGAAG CTT T AACT ACATTTG TCC AG CCCAGACAATGGAAAAT AACCTGTCATAGACTCTTCTT^ 

2 5 TTTTCCACTGCAAAAAATAAGCATTTTCATAATGTTTTGACAAAAAT 

TAGATATCTTAGTGGCTTCAGTTATGAGGTTCTCTATATGGTGGGAGGTTGTCCT 

G CTTTCCCTT GGGAATGTG AT AAC CT CATATT AAAT AAT ATGG Cri'IU'lTlTTCTTT AT AAGGATTAGC CITTCTT ACCTTT ACTT 
GCACTCAAAAAATTCCCTCAAAAACCAAAAAAACCC^ 

GCTATAGGATAACT AAACAGTGAGTGAAATTG CATTG CAGTGAGG T ACTGTG CACAG TTGT AGGCAT CAATAAATTTCTGGG CTG A 

3 0 ACACTGACACGACTTGATAGGGAAGATAAAAGT(2ACA 

TGCTCTCAAGCATTAATAATGATTTGCCTTTCCACTAGCCTCC^ 

G GACATGATT CTT (^^G AT AATCAAATATTGACAAGTT AGAAAT AGCTTTCTGCCTG GGTT CATTGAT AT AGGTT ATGTG ATAT AA 
ATATTTATTTCTTTTCTGTCACTCAGAGTAAATGCCACCAATTG 

GACrGGCCTGTGTTTTCCGATTAGTCTCTGCTCTATAAAGTGGGATTTACTAAATCTCT 
35 GCrrTTCTGAAACCATTATTGCTTCGATTTGTGAAGQUAGGTCTAGTGATGGGACTAGTT^ 

AGGATT AATTG AGCATTTTT ATT TTAATGATTT AT AAGCG AT AGG T ACTG ATTC TT CTT AACACAGACAATAAACAAAGAAT ATTT 
TCTGTGTTGATCCATAATTGAATTCCATTTCATTTTTTCCAT^ 

AC^CTTAGTCTTTGGAATGGAGACATTGTTTTCCrrGTTATCACTTTATGTTGGAGTAATTCCATTTTTCCAGCTTAAAAAAATAAA 
TCIAGAAGATATTTTTGGCIAAGTTTTATTGAGAGCTATAAATCAGTCT 

4 0 TATAGTGTGGCATCATCTGTTTAACATTGGCCCTTGA^ 

AAGTGCACAGTTACTCCAAAGATATGTGAGTTTCGCCTTTGCATCT 
TTATAGGAGAGGGTGGCAGTATTTTCCTATTCCTGATAAA^ 

AGCTTGCAT CTTTTGG CTGTT ATTT TGATAACTT CAAAAAG CCTCTAGAAAAACTTTTTG TATTTGG CTGGG CGTGGTGG CTCATG C 
CTGTAATCCO^CACTTT AGGAGGCCCACTTGAGGCCAGGAGTT TGAGACCAGCCTGG CCAACATGGTGAAACCTCTTCTCTACT A 

4 5 AAAATACAAAAAATTTGCTGGGTGTGGCGGCCCATGCCCAGC^ 

AGAGGTTGCGGTGAGCCAAGACTGCACC^CTGCACTCCAGCCTGGGCAACAGAGCAAGACTCTGTCT 
ACAGGAAAACCTTTTTGTATATAGAAAAAAAAGGGAGGGGGAATATATATCTACTGTAAAATTrA 
TTACTAGTATATGTAACTCTTAGATTCTTACAGTTCATCT 
GTCCAAACTACACAACAGCCCTCCTTTACCCACACAAACCC^ 

5 0 GGCCCCTGCCTCTCTTTCAGCAGGAGCAGAGCTC^ 

TTGTAACTCCTGTGGCTTTTTCTTCCTCCTCCTTCACATCTATTAT^ 
GGACTGTATTTAAAAACTACTrTGGAATTGGCCGGGCAT 

GATCACGAGGTCAGGAGATCAAGACCATCCTGGCTAACATGGTGAAACCCCATCTCTATTAAAAA 
TGGTGGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTAAGGCAGGAGAATGGCGTG^ 
55 GATCACGTCACTGCACTCCAGCCTGGGCCACAGTGTGAGATCTGTCTCAAA 
TATTGTTGTTGAATAAAAAGCTCTCCTTCACCTCTTTTCAATT^ 
TAACATCGTAGCTAATGAAATTTCTTTTAACTITGTGTCrrTTCAGTTGATTGTTTACC^ 

ATGT CC CTT CCAT ACTTAATACCGGAGAC^WICTG CTG AGCTATGATG TTG AACTGC^GTATACTGAAAGACTG AGTTT TTGACTAG 
CTATGTAACTTAAGTAAAGTGGACTAAATGGTCTTAGAATGCTGTA 

6 0 AATATTTAAATACAATATTCC^TAAGCTTCTAAATAGCACnTTCTTATATAATC 

AAATTAATATCCTTATCTAAATGAAAAAAGCAGAAATACT 

C^ATTAAATTAGCCCATTTGATAAGAGATATGAAGTTCTAAAAA 

ATCCCCCCCAAATTTTTTTACATGAGGAGTATTTGTACAGCATATAAAAATT^ 

G ATATTTTTTAATATTAAAACACAGTCAGG C CGG GCGTGGTCGCTCATG CTTGT AATCCTAG (^CTTTTGGGAGGCTG AGG CAGG C 

6 5 TGATCACCTGAGGTCAGGAGTTCGAGACCAGCCCGGATAACATGGTGAAACCCCATCTCTACTAAA 

ATGGTGGCGTGTGCCTGTAATCCCAGCTGCTCGGGAGGCTC4AGGCAGGAGAATCATTTGAACTTGAGAT 
CCAAGTTCACACGACTATACTTCAGCCT^ 

CTGTCAATCAGTTCACAGGCCAAAGGTTTGCATAACTTTAAAAC^ 
AGATACTAAGTGACATGGTTCCAGAACTTGCTCTTCAAAACCATTTTTATCT 

7 0 AG GATTC CATATTTGTCTTGATTTTTT CCATTCTT^ T AATTAATG CACATGCACATACACTTGAAATA 

AACATTCTCCCTCTCCCCTCCTCCTCCCTGCACTCCTTCCCCGCCrGTCCCCTG 

TTGGAGTGCTGGGAATGTGGATTGGGTAGCTTCTTCCCTTCTCTTTTCCCT 

AGAATTATACCATTTGAACATGCTCAGAGAGATGCCAT^ 

GTGGCCCCTCCAGAGTGTAGACTCTATCTACTACATAGCAACTTCCTGCTTACTCTTAAA 
75 TGTACTTGAGTGAJGTGAAATGTCTCTCGTAGAAATTTATGTCAAGCCM 
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GGTTGTTACGGACATGTTTAGATTACTT^ 

TGCTATTT ACGTTTC CTTCATTGATCAAAGT AAAC CAAACAGCAACT CTG CT AGTTTAGCATCTCGT CACTTT ATGTG CTACTTTT 

CTT CATG CTGCTTTAAAATTTATCATTCGGTCCAGAA 

ATTTTTTTTCTGTATTAGTAGTATAAAGGTAGTGATGAACAGAATTCCCCT 

TCTTTCATTACTTCACTGACCCTAAATATAATAATTA 

CTGGATTGGCCTTTACACATTAATCTGTCAATATTCrTTTA 

CATTGT ACTCCT AATTATCACAG CTCTCTGAAG CCCTTGGTAAATGAACCC 

TTGC TTTATT CAATTGGTGG AAGTTTAGAGTAG AGCAAAAT ATC 

CCAAGTGATATTGAATTAAAAACTCCATCXATTTTCTGCCCATGCAAGGCAGTGCA^ 

TTCAGGGCTTGCAGTGGCrrTTCCCCAAGCTTTAGCTATATGG^ 

GTTAATTTTTAAGC^TCTGGACCAGTGTGTAGAAAATTACrGAATGC^ 

ATAATTGTATAGGGAGCTCTCTAAACTCTAAAACATTATTTTATGGGCACC 

GGTATAAT AGGTTTTATAGAACCTTG AAAAACATG CAAACTT ATAT ATAAAGTT ACGAAAATGACTTTAATAAAAATTACATG CCT 

ATGT ATTTTT C CTTTATGTAT TAT AATATTCG ACT ATATGTGCATCTTAAGTG CATTG ATTT CGTG CTTTTCCATCTTCAT AAT AT 

GATTTCTG C CT AATT ATTGCTGT AT ACT AGTTTGTT ATTTG CAAAGAAAAGCTAT ATCTACT CTAGGAAAAT CAT AAACATT ATTT 

AGT TTGTTAAATGTT AATTATGGGGGT CATC TGAGGAGTTTT CTAGGG CTGATTTATTTACAGATCATGCATTAG ATTACTGAGTG 

TGTGTGTGTATTGTCTTCCTAAGTTTGACTTTGGTT^ 

CTTATCAGTATCTCrGACCTGACCCTTTTGAGCT 

CAGGCAAACAAATTGTCATTAAACTTTCCATCZACTTCGT 

ACAGCATGACACCCrTTTTGTTTTTCT 

CAACGG AGG CACAGTCTACZAAGTGAGTAG TCAATTTTGTTT AAACI^ CT ATCCGCTTTT C CAAATACTG AGT AT C CAAGTT CTCT 

GTGGCAAAGATTTTTAAATGACCACTAACACTTAGATTT^ 

AGAATTGCCTGGAAAGTGCATTTATATTGCCTTTTTAATATTTCCGAGT 

TTAGTAAGTGTACCTTTAACTGGCCrGAAGTTTAATGCATTTTTTTCCITTGCTT^ 

CTAGTACCCTACGCTAGTCCCCTGCCTGTTTAC^^ 



GATGAATATGTTAAGTTATAAAGTTCTCCCAAATTCCAGTTGCAGTGATCTTACCCCAAAG^ 

TG CTAGCCATAAAAAAT AGTT CT CCGACAGG GAGGT CTT CTCATG CAGT AGTG CAAAGTTT AAACT ACAT AAT GTAATATCTGGT A 
CTT AAT AGTCT GCAATGGTGT ATG AAA7W^T AT TC CACT AGG AATT AAAAAAT C CAT ACT CTGTAAG AAAGCCCTGAGGGAATAATA 
AAACTTT CCT C CAGAGAGCTG GTTTTCCATGTATACACT AATC CTG CATCACATT ATG GTTGTGAT ATT AG AT C AC AACCACAGAT 
ACTTT AACATT GAT ATT AATATGT ATTT CCAGAGTTGTAAAATAATTAACTTATTTT T TAC C CATGTT CTT AGAATTT 

TAAGTT CTGACTAACCACATGAATACAC ATAAAAATTGCCCTATTCTATGACCATGAAGGTTCACATC CTAAAAATCCCAGAACAT 
CCCCTACTCATTGATGCAGCCTTGGTATAGAATTTGATTTCTCAGGC^ 

TCAGGGCTTCAGTGCCCTTTGO^TACAGCCAAGATGTGAGTCCTTTCTCCTGAATATAGTCCT 

TTTGGATTGTTTTTTCCTGCAGATGACTGAGTATTC 

ATAAATAATAACATGTTTCTTACATGTTCCITGGGCCTTGGGCCT 

GGCTACTATTCTCTTTCCCTGGATCATACTAGCGTTCCTTCCCACGCAACCT 

TCTTCCTTACGATCTGACTTAGATTTTTCCTGTGTTCTGAAGGTACT 

ACATG T ATACTCTGATTTGCTT CT ATTGAGTTCC CTGG AGGGAGC ACTTTT CAAAGCTTGAACCTTTCTTCTTAATGCCTT TATGC 

TGATAGCCAAATATAATAAAACAGGAAAGATTT(nX3GCAACT 

TATTTAGACTCCTAACCTACTTA^ 

CCTAG CTTATTTCT AGAAGGGAAGAAG CAT AAG ACCATGAAACC CCAG CAG AATTTTTACGAGACATAATATTCT AG ATGT TGACA 

TTTTAATCCTTTATTTTAAAACATAGGTTTGTTTGTTTGTTGGTTGGTTTGTTTATTTTGAGATGGAGTC 

GCTGGAGTGCAGTGGC^CGATCTCAGCTCAGTGCAACCTCTGCCTCCCAGGAT 

ACAGGCGTAG3CCACCATGCCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGG 

CCTGACCTCGTGATTCACCTGCCTTGGCCTCCCAAAGTGCTC 

TAAAATTGTTATTTTAAATTGTGGTGAAATACACATAACT 

CAGTAACTCCTCCATCACTCTGTTAATGTTGTTTTTAGCTAA 

AAGGGACCATTCTTCTATATGTCCTTGCCACAAGTTTTTTAATTTGCT^ 

CTCATTTAGCTACCGTTGCTAATGCTCACTGTGAGTGT^ 

AGTATCCTGATAATTTTCATTCTTATATAGTAGATTTCTTATAAAAAGAAATTT 
GGGGCGOTGAAGATACCAAATTGCATAACCGCTTGTCATTGATCATTGGGCATCATTTTTGCT 

TCTT AT AT C C CTGG CTTTT AG T ACIAG TGCAAAG AAATG ATAT AATCATTG AT AAATG AG GATC CACTTT ACT AAAGTCTCT AT AAA 
ACTAAGTGTTTGATGTGTTATGATTGAGTTTCTTCTACACTAA 

ACAGCATGATTACATTATAGGCTATGCTACATAAAC!TAGATGAAGAAAATATTGGAAAATC 

ATAGGCTTATTCCATTTTTTTCTTTAAAGTGTAATAAAGCT^ 

ACTTGATGGAATGTTGGGAATTTCAAGGGATATTATAATCGTGTAATTACA 

TTTAAGTGTAGATATCCTTCACAGTTGTACAATATGAAATTCTGGCCCCAGACArr^ 

CTGTCTCCTGACGAGCTGTGCTCTATATTATGAGAGTGACTTAGGCAGGTAATTTTAAGTTGGTAT^ 

G CT ACATGG AACAT AAAGGAAAT AC CTT AAAGT ATGG CCCAAG C C TAGG AAG AAGAAAAGTTT AGGAAAAGATAATGAATGTATTT 

TCCTTGTTGCCTCATTTCCTAAAGTGCTGC^AGTTGCTCCAAAAAAATAAATTCAGGAAAG 

GATTCATAATGAATTT CAAAAACTTTGAG AAAGT AGAGAATAGTATGTG^ 

CCTCTGTCACTAATTAGCTACGTAACCTTTGGCAAGTTATATAATGTCTTATGCCT CAGTTTTC CCATCTGCAAGATGGGGATAAT 

AATAGTATCTAGGTTGGGCGGGGTGCGGTGGCTCACGTCTGTAATCCCAGCACTTTGGGAGGCCGAGG 

AAGAGATTGAGACCATCCTGGCCAACATAGTGAAACCCCGACTCTACT 

TGTAGTACCCAGCTACTCGGGAGGCTGGGGC^GGAGAATAGCTTGAACCCGGGAGGTGGAGGTTGCAGTGAGC 

CTGT ACTCT AG CCTGG CAACAG AACAAGAAAAAAAAATTAAT AGT AT CTAG GTTGTTG TGAGGTTAAATGAG AAG ATC CG TAAAAG 

GCTT AG CACT ATG ACTG ACACTTAT T AAATGTTAACT ACTGGTAATGGT AG T ACT AATAAT AG AAAAATT ATT CACATTTTGTACT 

TCCTACCTTGACTTGGCCATGGGACTCTTTTCTTTTTTTTTTTTGAGACAGAGTCTCGCT 

AAGATCTCGTCTCACTGCAACCTCTGCCTCCCXSGGTTCAAGTGATTCT 

ATGTACCACCATGCCCAGCTAATTTTCTGTATTTTAGTAGGGATGGGGTTTCACCGTGTTGG 

GTCTTGATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTATAGGCG^ 
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AT AAACATCTCACAGAATT ATAGTTGTGTTGAACTAACTTTGAGAAATT CTG CCTTT ATATT AT ACACATT AT ACTTCTGTATTTT 
CTTCT ATTCTGT AGTTG CTTAATGAATCATGATATCTTATGTATT AT CTTAAATACAATGAu"l"l"l'iU'AAAGTTTCRAAGTAi"l"l"i"r 
AATCCTAATTTATTTTCAGATTGCTATAGGACAGCTC 
TTCAGCAGCAAAGCCAGATTTCTATACAGCATTTCTTT^ 
5 QCAGftGTCCATTTTGTTTTCTTGTCTTTTCTCATC 

AATGAGAATGAGGCAAGTGGAGGTTTTAGTGGGGCTCTTTC^^ 

GTTGTAGAACATTCACCTAGAAGAAAAAGAATATCCACACTGAGGAGATTGATATT 

ACCTTCAGTGCTITAGTATCTGTTGACTCACTAGTTACTGTCT 

AGGCATAGTTACTTTGGAAACATTGTTTGATTCCAAAGCCAGTCTCTCAGACTCTGC 
1 0 TCAAACTTTCGATCTAGAAATGAAGTAATATTTAACTTT^ 

TAT CTTTTATATTATTTTTTCCTT AATGCCAGTTATTCAAG CCAA CTAGTCTCA 
AG ATGTTTGAAAACTGAATTTTCTAT CT CCAGCTCCTT CC CA CTG ATCTGGACTTG AGGT AGAATTTT C 
TCAGTGATACAATGGGTAGTATATATACTGCCTTCAAGGTGTTTTGAAGGTTGAATGTATACCATTACT^ 

CT ATCTAGAGACAAT AAGAGGAGGGAAGG AATTGTCAT T CT CAAAGATGCTGCTCTCTCCTCCTTTCTGTTGC CTT AT CT AAAAAG 

TG ATTTGACTGT AGAGGC CAACTAAAACATT AAATTACTG TTTGGGAG CT 
TTAATGAAAATTAGAAGGACTGATGGTCATTTGTATAAAATGAGAAAATACATAAATTGTCT 
AATATTTCAAAACATGGAATTCCTTAGGGGAAAC^GAATGAAAAGAGTCCTTGGTGGTGAA 
GAAAACCTTGACTTCCTCTTTCACCAACCAACCTGAAGGTGTGAAACTTAAAGAGGGAGCA^ 

2 0 TGGCTGTGTTAGTGGCCCTACTCCTTTATATATTTTTGCATGATC 

AT TG AC CTGAG AG CTAGAAT GATGGTTTGATT T ATTT TG ACTTT AAATTTCTAATCACTT AAGGAGT CTTTTCCTTG CAG C CAAG A 

TCATATTTTAAAGAAGCAAACAAGACATCATCCTAATGCAAAAT^ 

AAGCAAGGATATTAATGTTTTCAGAAAATCAATGAATATTTGTCAAATAAAATTA^ 

2 5 TGTACCTTTCCCACCTTTGCTTT^ 

TATTGG CATC CAG CCAACATGACAGG ATG T CACAGCTCG CTGAGCAG AG ATTTT CAGACT CAGTTTCCACC CGAATT AACT ATT AT 
TGAAATAGTTTATTGAGATTGAAGTATTGATGATTTTCTCCTAACCT 

AAGACACT AAATTTCAT TTTT AAAAATCT CATTTATTTAGC CTT TTTTT CTTTCT C CTTTATTGTTTTATTGCTTGG 

AAAAGGGTAT CTTGGTTTTATTTTTTATGATAGTAT AATTAT ACT TGTT CT 

3 0 AGTATATTGGTTTGGCCAATATTGCTATACAAACAACTTTTGAGTCAATTGAGTCTC 

GAAGTAAG AATTTGCTGAATCTAAGTGTTTCAACTTCATCTC CT CT CAG AG AAAAAAAT AGTTTTGAAG TGGCAAGACTTCTTTCT 
AATACTTTAGTAACAATTTTCTATTTATAAAGACGTTTAAAGTCCCCAGCAAAAGTGGT 
CACTTCCGTGTTGTAGGGTGTAAGAGATCTAATTTCCATTTGAATCTATTCCCTGCC^ 
ACATAGTGTCTGTCACTTTTGGATTCTGGATTCCTGTATGTAGTTCTGGGTGCTGAGTCTACAGACATATG 

3 5 ACCCAGAGGAGCAATGGAGAGAATAATGAGGGATTGGGGAATTGTGTCTGT^ 

TTCTGAGCATGTGCTGTGTGGAGGGCGTGTCAGATGTTGGGGACAGCCTAGCCTCATAAAACTTCTGTCT 
TCTGTTCTCAGGTATGCACAGGACCCCACAGTCTGGGGCCACCTGTCT 
CTGATGGCTTTTAGCITTTTCCTCAGGAAGGATCAGGAAAGGCCTCAT 
AATTGGCTAACTTTTCTCTGAGAAAGAGAAGATTAAAGGGCTCCATGACAGCT 

4 0 GGAGCAGGTTTGCTTTGTCTTCTTCTGAATACTAGACACAAAGGGCCTGTCTTGAATGTCTCTGTGACAGTCGGC 

CTCACAGGGCCAGCAC^TGCCTTACCTACTGTAGGCACT^ 

CCTAAGX3 ATGGACACAATGTGTTGTTCATTGCTGGGTGACT CTGGGAAGTGGTGAACTCCT CTACCTTAAGCAG AATGATGTGGCT 
GTTGTTCAGAACTGCTGAGAAAAGAATTCCCACTCTGACCTGGACATGAGATGACTGCACA 

4 5 GAACTAAATTCAGCATTCAGTTACCATAAAAACCTTATATCAGGTGTTGCTTGGTTTTGTTT 

CATCCTTTCACTGTGCCAAGGACTATTCCAAG 

CTAGTGGGGCAAACAATAATCAAGACAATAAACACTGAAGTAAGATTATAGAA 

AT AG AC CTATT CTATAGGTCTAT AGATTCTGT AGACCTATTCT ATAGG T CT ATAG ATTCTGT AG ACCTATTCTAT AGGTCT ATAG A 
TTCTGTAGACCTATTCTATAGGTCTGTAGATTCTGTAGAC^ 

5 0 TAGATTCTGTAGACCTATTCTATAGGTCTGTAGA^ 

TCTGTAGATTCTGTAGACCTATTCTATAGGTCTATAGAATAGACT 
ACCTATCTGAGGAGGTGCTGTTTGAGCTGAGACCTAAAGACAGGATGG 

CAGAAG T AAC CACTCAGG CGGAAG CCCT AAG ATTAAG AT AGG C ATG AACTTG TG T AATTGGAGT AG AATAAG CTCTGGGG AAGGTG 
TAGAGATGGACAGGGCTTGATC^TGCTGGGTCTTTTAGTCCTTAGTAAATCCTTGCTAAGAAGGTATGTTACCCT 

5 5 TGAGAAAGATAACGTTTGGAAGGGCTAAGTGGGATGCCCAAGATGAA 

AAGCCAAGGTTTTTCTGTTTGCCATTCACTCTCTGAATTTCCT^ 
GTGTATAAATTTGAAAGTCAATGATGTATCATCTCTAGGTGTTAAT^ 

GGCTGACAGTTATGACAACATAACTTGTTATTT GGTCTAC CCTTTGGG T GT C CTT ATTTACCAAAAACTTAT AATTTAATAATTGA 
ATCCCTGGATTAAATGTATGGTTCTGCTAGCAGGGGTGGTCTATTTCTTGATTTATAAGACACGTTA 

6 0 CAGTATTTGATGTGCTCGAAAAT03TAGTCTCTTGGTTGGCATATTGCA.CAAGCAAA 

TACTTAAATTG CTTTTTTGGAAAATAATT CACTGGAAAGT CCAAAAGG CATTTACCG CGT AT AACTTTCTG C CACAGTT TTTTTTT 

CTTTGCTCTGTCATTGGTTACTTTGATCTAGGGGACCATGTTTCTGTTAAATTGCTGTCTTACTAGAGTGAAATATGT^ 

TGATTTGTACACTAACGTTGAATTGGAGGAGCA.GTTTGCAAAATT^ 

GCACTAAATTTTTTTAAAAAAACATTAATTTGCTTTTTAAATGTGACTTTATATCATGTTGTCTTCTTC 

6 5 CTTTTCCTTGTCAGTATGGCAAAAAG^ 

ATAATAAAATGAAAAATGTTCACACAAAATTAGAACTTGCTTTAGGGCACAATCAATATAGCA 
GAAGATATGTATTGTGTGAGGTTCTCAAACATACTGATTCTATTACGAC^ 

CCAATAAACATAAAACTTATT AAATCATAATT AAATTTT CAGAAAAGTT AT CTAAAAATTATG AAT AAAAT AT ATTTT ACATGG CC 
TCAG CAACAAAAT ATAT AG C CAATGAAACTT T ATAAGTGGTGGG GTAAAAT ACATGGG AT AAAATT ATAGAATG AATTTT TACCAT 

7 0 AGACA.CATGTACCGAGTTAAGTTTGTCTACACCATTTGTTTCTCCTCT 

TGCAGAGCACAACAGCTGGGGTAACTGACCGAAGGGGCAAATGAGAAGTTTCCCTG 

AAGCC CTACCT GC CCTTATACGTGGAAGATGC CAAG AGGT ACT CAGAGTCACTGTGGTTG CATTT ACATTTCTACAGTGG GCG C CA 
ATCACAAGCITCATGTTATGGCCCCAGCrrGTGTGGGAATGTA^ 
AATGGAAAC^CAGATAAATGAATGGATTTCCCTGAGGATTCAGACCAAGTCAAGGAT 
7 c AATTCTT A^T CCTTTCC CAATG C CATTCCTG G T AAATAATT ACATTTTGGCT CTAAGAG GAGCTAG AATAAACAGAAT ATATGCAA 
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ACCTAAGAATTTCTGCCAGCAATCTCTAGCAGA^ 
TATAGATGACCAATATGAACAAACAGCTGTCTCCCTTCCTCCCT 
ATGTCAGAAATGTTTCTGCTGATCTCTGAACCT 
XTT1T1TTU M 1 N 1TCACTGCCATGATTTTGTGAGTTA 
5 TGGTCTTTATAAGGA7VGTTAGTTTCCCTCCTCTCACATAGACCT 
GACACCGTTAAGTTTTAATCTTCTAAATGGATTGGAGAATA^ 
rrGCCTAATTTTTGAGGGAGAATGTTCTA^ 

AATTGGGTTTTAAAGT CCAAATACTTTC CTT CTGTTGT CCAG AACAT AGCTCCTCAACCTAGAGGCATAGTGTCATGTGGA.G TGAA 
GTCACTCAAATTCCTTCTTATTATTGGTTTGAAGATGACTCT 
1 0 TTTACGTTAAAGGAAAAAGT CTCTC CTG GTTTCTTT^ 

TGCACAAAATGAAG ATAAAGTGC CT CTCTC CAG CCTGACAAAAG CCAT TTCT TTGCAGAATGGTAGCTG CAGCG AAAGTAAATAAA 
TGAATGGACAGCAATCACAGCCTAGTTGATGAAACTGTTCAATTA 

ATGAGC CCCTTTAG AGACAACCAGTGTTCAAATTG ATTTCAG CATTGATT^^ CTGAAGCTG AT AAATTT CTG CTTGTT AGT TAA 
AGTAATTTTG CAGAAGACTTTGT ACTGCAGTATTGAAGTGTTCATTT AG CTG^ 
1 5 TGTT CTGAG ATGGGACAATT C CCCAATCATCTTTT ACTTTTAATTTT^ CAAT CAATAGGTGATTGAAACTG 

ATT AAAACTTAT C CAACCAGCTG CCT ATGGCAGTTT AATT AG AAG CTGGATGATT CTT AAAT T AAAGTTGTCTT ATTTTCATTGCA 
AAG CC1ATCTT CCAGAAGAGTAAGGCAAGAACAGTGCAGAAAACTTAAAAG ACTGG CTTTG CT AAGGTAT CTTGCAAATGTTAAAT C 
ACTCAAT AGCTATT CCT AAG ACAATTCAAAAATGT ATC!AGAGAGTTTG CTTCT 

CAG CCAAGGAATGTTTGTTTCATT CGGTCAGAGAGATTAC CT ATTGT AT AATGAT ACATGTGAAAGAC CAG CTGTTTGGG AT ATAT 
2 0 CT ATTAT CT C!ATAAGCAAG ATGAATGCTTGAGCTTAGTTTT AAACACTG T AATGCAG G ATGAAACT AGCCATGATGAT ATCCTCTG 
CAT CTGC CC CCTGGAAACC^TCAGCITTGAATTT AT ATTT TT ATT CTGTTTTGAGGTCTTG ATT AAAAT AAATCTC CATTGAATGA 
AAAACAAAT ATTTTT ATAT AAGTTCATGTTGAC TTGT CAACIACT CRGGATTAT AGAAG AGAGATTTTTGCT C CTCTT C 
AGCTCCCTAGGGAAGCTTCTCCATACTCATTTTATCTAGTGTGGA 

TCAT AGG GG ACATATT ACAT CTGATATTACGGGG AAGAAGGGAAGAAATGG CT CACTTTT CAGAGG TGCATTTACT CTTTGACCCA 

2 5 CTAGGGTACTATTTAGTGTTCTAGAAGAGGTAATTTAGTAA^ 

GAG C CATT CAAT CG ATTTTCCCTATTGCTTTTAAAAAATCAACACTG ACCAT AT C CAG AG AATGTTGAAGACAACTGAAATT AGAG 
GATCTAGAAGCAACCAGTTTATTTTCACACAAAGCTTGTTG 

ACTCCAGTACTTACAGATAGGAATCGTGAAGCATTCAGTAACCrrCACCAAGCCTTGGTTTCC^ 

T AAT ATTT AC CT CACAAATTTTTGTG G AAATCAAATAGCTT CACAAT AG AAT A 

3 0 TTAATTATT AGAAT AAGGAGGTTTTCTG AAGTAG CCATTT ATG TAT CTG ATTTTG GAAAGT AATTT ATTGTTAGTTTTC C CAG AAA 

CTGTGAGCAAAGATTAGATGGTTATCACGTTGTTTCARGAGAGAGTTTCCTAGCTAGTCAATC 
CACATATATTGCCTAACAATTCCAGTTCTTTCATTCCTTCTCATCATTAGGCTTTATATCTCT 

TTATTT CAGAATCCITGATCAAGAGATCAGAATGACTGTAGTTTC CTTGATCTCAGAGGTATTTAGACCATATATTAAAAGACATT 
ACTCTAGTTGATGTGGAGAGTGTTAAAAGCTTGAATCAGGCCACATCAACTTGTGGAAGA 

3 5 AACTCTGAGGCAGGTATGGGAGTAAAAAGGGGTATATAGCCGGAGTAAACAGAAAGAAGAAATGGCAAATOT 

TTTCTCTTATTGAAAATCTTTCTAATCTAG 

CTTTCTGAGACAGAGTTTTGCTCTTGTTGCCCAGGCTGGAGTGCAATGGTGTC 
TCAAGTGATTCTCCTGCCTGAGCCTCCTGAGTACCTGGGATC 
40 AGAAACAGGGTTTCACC!ATGTTGGCCAGGCTAATCT(XAACTCCTGACCTCAGGTAATC 
GAATTATAGGTGTGAGCCACTGTGCCCGGCCTGTTTCTTAACCACAAATTATTT^ 
TTTAGGCAGAAAGAAGGTrGGCAAAGACTGCTATTTGAACT 
TGGATTTCCTGTTTTAACAGCTAGAGCCTGGCTAATGTGTTTATCT 
ACTATTTGAAGTATTTCTGTGGATGGCTACTTTAGTA^ 

4 5 TT AAT AAAATC T ACCTT CAAATAAAC CTAAAAAGACCAAGCAATT AG AGGATT AAATTTTAAT TTAAAAATTT AAATTT ATTTAAT 

TTAAAATTT AAAAAT TAT ATTTTT CATTAGCTGAACCACAG CITGGTTT ACATTTAGTTACTG AAACCAT AT CATTG CATTAAAAGG 
TGACA.TCT AGAGGGTTTTAAAATTAATCTCGCAAGGTACTAAAAGATTTT TGTTTGTGT CAGTATTTGTGTACTCATGAGCTTTAT 
ACAGCAG AGAAGCGTTCTG CATGTTTTGTATTG G CTT CAG AGT TGTTCAG CC TT AT AATCTTTTGGGAAAATGTAAT ACAATCAAC 
TCTCAGATTCCTTGTTTTAAGTAGGAGACTTATAGTCTCATGTGGGTATCTCCTAATAGTTTAACCCCCT 

5 0 CTTTCTTTTTGTTTTAGTTTTCCTTTGTGCCAATGCTATAAAGTCTTTGAAATC 

TGTT ATT C TAAATTTACATAAAAG CAGGG G CTTTTTTGGTT AATAAAT AAAATGT CACACAAAG AGCAAAGAAGC^ 
GAATAGAAGCATCATCTGTAAGTATAGAAACTGATGATCCTACAGTATA^ 
GATTTAATAATTGAACCTGAGTGAGACCAACTCTTAACTTTCTGTGCCAATGAGGAGAATC 
TATTTGTCTTGTCTCGATCAAACTTTTTCATAATGAATCAGATCTCAGAGAGCT 
55 TCT CTG ATGGACGTCCT AT AAAT CC CTAGGG CATT CAG AGAATAG AG AAT CAGGGAATACAGG ATAT CT CACAAGCTGTG AAATCT 

TTG CATGG AAAACAAAG AACTGATGTT AT TATTTGAG AGAGTG AACT AT CT CTG CAATG AG CTAAT AACAAAT ACAT TTGT AACTT 
TTATTTATTTTAAAATAGCCTTTGTTC!ACCGTTTTTCT 
CTTTCTTTCTTGCTTACCTAGAATAAGACCATCTCTTTTT^ 

ATTT TAGTG ACTGG ATTATTTT CT T AGGTTTCCTATCCATTTGG AG AACCTTTCAACAATTGAT AACACTCTAT 

6 0 GAGTAATTGTGTCCCTCTTTTCACTAACCTTCTTTGAAGTACGTTATGTGAGAAGTAGAATTTAAGTCAG 

TGGTTAATGAAGGAAGTGCCCTGTGTCCTTCATAGAACAGTGGCAATACTATCATTTC^ 

AAGTCATTATACATGC1AATTACGCTCCAGGTCTTTTCTCCAAGAGG CAAAAAAGGGGTATTT CCGTGCTCATTTTGATAACTATTT 
TG CT AATAAATTGACIATTTATGACATAGGATATTGTCAGTT ATG CTGAAAT CTG AAAT ATTT ATAG T AAGATTATAGGCCAGC CTC 
CCCCATTCTTTCTTTAATTATATCCTACCTC!ACAATCACTAGGAAAGCACATATGCT 
6 5 TTAAAATAAT TTG ATTGGAATATGTTT AGAAC TTAATATG CAG AAT AGTAC CAT AGAAAAT AGATG CCT ACTTTAATTGT ATG AG A 

TTGTAC CAG CTTT AT CAGAAAATGTATGAT CAGCTCT AGGTT ATT TATTTTGTGAAC AAG AGT CAG CTGGGATG ATTCATAT TTTT 
TGGTTCTCCTAGAATTTGAAATTCTAAGCATAAATAATTTT^ 
TAAATTTGTGTGGGAGTCCAGATGTTGGAATCTGAACACAGGTTGTTTCTCCm 

TTCACATATTTATTTTCCTTA^ 
70 ATGGCTTACTGTAGCATCAACCTCCCAGGCTCAAGTGATCCTCTCACCTCACGTCCCAAGTA^ 

C^TGCCCAGCTAATTTTTTTAAATTTTTTGTTGAGATGAGGTATCATCTATGTTGCCCAGGCT 

CAATCCTCCCACCTCAGCCTCCCAAAGTGTTGGGATTACAGGTGTGAGCCACT 

ATTCTAGAAATTCAGTTTATTCTTTGTAACGATGTATCnTTTAATAAATTCGCATTAA 

AACCTTTTATGTATTGTACTAGACTCATCTTTAACATTTCAAGGAGTAAATAATATAAT 
75 TT TGAAATCAGTCACA^TT TT CTGAACACTTTGTGCAAAGTACTTCGCTTGG CAGTGGGGG AG AT ATAACATCAACTTT CATGG C 
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CTCAGGGACCCCATTCmGGGGTCTC 
ACCTCTGAATTTAC3AGAGCCTCAGTTCCAGTCCTTACACCT 

GG CTTT ACATAT CATCTGT ATGCTGAGGiU^T CCAGATGT ATCTCTGT AGC C CTGGCCTGTCTCCTGAACTTCAGACTCAT ATCTG 
ATTACCAGTTGGATGT C CTCTAGGCATTT C7VAAATTATCTAAAACTGAG CTCTITATT^ CT 
TCTCCCATCTCCGTAATTGGAAACT^ 

otgctacccta^ca^Iac 

GCTTTCCTGTAATGTGTTCTCrTCACrrCAGCAGCCAAAG 

tatcacttggaataaaagtcagagtcctcaccatggcctaaatggtgctac^ 

CCCCAGCTAGCCCTCTTGCTTArc 

CTACCTGGAATCCCXnrrCCCCTAGATGTCCGAATGGTGTGCCCCCATACTC 

cttcotccccaaccctaatatcaaatagcatgtccctaatgttgatcagtctctatatcot 

ACCCCCACCTCACATATTGTATATTTATTTGATTATTGTCAGTCAGCCCCT 

TCT CAT TTGT TCTCTGCTGTATC CACACTGCCTAG CTTATTGCCTGGCAATCA^GACATGCATTAAGT ATT AAATAT AAAAAG AG 



CTTACAATAAAGTAGAACATAACATAGACCCTATATTAAGAATTTACAGCATA 
AAAGTGAACACCATTTCTIX3TAATCCCAGCACTTT 

GGCAACATAGTGAGAACCTTGTTCTAAAAAAATAAAAATAAAAATTAGCCAGC 
AGGAGGCTGAGGCAGGAGGATCATTTGAGGCCAGGAGra 

AAAAGTAAAATTTAGCTAGGCATGGTGGTTCACTCCTGTAGACTCAACTACTC^ 

AGAGTTCAAGT CTGCATTG AGCTATGATTGTCACACTGTACT CC^GC CTG AGT G ATAG AGT AAGACCCTGTTG CTT AAAAAAAATA 
TACTAGTAGGCCATGTGCAGCGGCTCACGCCTGTAATCCCGCACITTGGGAGG 

CGAGACCAGCTTGACCAACATGGT CAAATGCCATCTCTACT AAAAATACAAAAATTAG CCAGGTGTG ATGGTGGGCACCTGTAGAC 

CCAGCTATTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGAGGCAGAGG 

TAGCCTG^GTGACAGAGCGAGACTCTGTCTCAGAACAGAAACAAAAACAAAAAACCACTACT 

AAATAACTGTTGAAGAGAAATAGTAATTTATAACACXSAACATATAAGGCAATAGAGTAGTTG 

AGGGTAGGAAATCTAGTAGGTAGGGAAGTAAGTAGAAATAAAAGAAT^ 

GAAATGGTGACCATTTAAGTAAATCTGACTGTATAAGACAACGATAGTAATAAT^ 

ATAT GTT AAGATG AG AAGGGCTTG ATTG AAGTT AAAACATT CT AAG AT TTTT AT ATTGTTTAGG AG AAGG ATAT AGAT AACAATT A 
ACTTTATGTTTTGTTAAGTCAGGTATGATTGTTAAAAATTATACGGG 

ATAG ATGG AAT AAAGGG AG AAT AATT CAGTATAAT AG AGG AAGAGAAGT CGATAGGTGTAAAGTAAATGAATT AGGTGGT AG AAAT 
AAATCCAAATATATCAATAATTACAAAAAATATAAGTAGAGTAATATTAGCAGCT 
TTATGTGCTATTTTAGACTGTATTGGCAGAAATTTCTAAAATACCAAGAC^ 
CCTGGCAAAAACTAATCAAAAGAATGCTGCTACTGTOT 

GAGAAATGTTATGAAAATACAAATATTTACAACTGAACCACAAATACCATGTATT 

ATTCTTAAAGGATAGGTTTATATACACACCTTAGTAAAGAATAACT 

AGGTGGGACAGGGAATAAACCX^AAGAAAGTGGAAGGAAAGAAATACT 

AAGTTAATTAAGTGAAAATTCAAGAAAC^TTCAACAAAACCAAAAGCT 

AAGCAGAAATAACAAAATT AGGAATAAAAAGGGAACAT AATT CTATACA 

ACTTTATGACAATAAATTTGAAAACACAAAATGGCCAGTTTCTTAGAAAAACT^ 

AGACCTAGAGCAATCAAATCATTATTTGTAAAATGTGCCTGATG^ 

GAAAGTGTGTGTTTTCATGGAGCTTCCITTGTTGTGGA 

GAAGAATAAAGAAAGTGGCCGGGTGCGGTGGCTTACGCCTGTAATCCCAGCATTTrGGGAGGCC 
TCAGGAGTTCGAGATCAGCCTGGACAACATGGTGAAACACCGCCTCTA 
CCTGTAATCCCTGCTACCCAGAGGTGGCAGTGAGCCGAGATCACGCCACTGCACT 
CAAAAAAAAAAAAAAAAAAAAGAAGATGAAGGGAGAATGTGCTGAAGGGAGAATGTCCT 

AT AT AG GACCTG AAGGAAG CAATG GAG C AAAC CAG GTAT AT ATTGG AGAAAGTGG C CCAGC CCTCAGG AGTAGGTGCAAAGG CC C C 
GAGGCAGGAGCACGTTTGATGGAACAGAGTTGAGTATTGTG CATAAGGGAGACTGGAAG AAG CAGAGATCAGTG AGGTAACTAACC 
AGGGCCCAGATGATGAAATCCTTATAAAGACTTTTGATTCAGCAGTTAGAAATCTACCCATGAAAT^ 
CTACTCATAAGATTATGAAGTTAATGAATTAAATGCATGTGGATTT^ 
TCCAGAAGTAGTAATTTGTCAAACATTTATGAAGTGAGGAGCACCrA 

G ATG ATT AAAACATG GCAG CGCTCACTG C CCAGT AC TGG GGAC AGACACATAAAT ACTTTTCTGTAGTGTGC CAAGG G CAAT AAT A 

GAGCATGTAACATTTGAAGAGTAGGAGAAGCATCAAAGTTTAACTOT 

CTTACGCCTGTAATCCCAGCACTTTGGGAGGT^ 

GAAAGCCCATCTCTACTAAAAATACAAAAACAAAATTAGCCGG 

AGGCGGGAGTATGGCGTGAACCCGGGATTTGGAGCTTGCAGTGAGCCAAGATCATGCCACTGCA 

AGACTCCGTCTCAAAAAAAAAGAAAGC^TATTAAAGGGAGGTATATAGGTAT^ 

AACCCAAGGAGAATGAAAACTTGGCCTTCTCATGATTGTTTAGTGACGTTTOT^ 

CAC CT AT CTGTAGTGTGAAT AAACTT AT AGT AGTGGGCAAGAG AT AAACATGGT AAGAAATCT AG AAGTTG GAACAT GGT AT ATC C 
ATACCAGATGGCTCTTTTAATTGTCTCCAAAGT^ 

T ATATTG CAAACTTTTGCCA.GTTCTTTGTTGTT AAAG CTTTATT AGAGGGT CATTTGTAGAG CATATAGTCT AACTAG CACGTAGA 
GACACTACTGCrrCTTTAAGTCTGTTTTGGCAGCTTTCATTCT 

TTTTTTTTAGACGGAGTTTTGCTCTTGTTGTCCAGGAGGCTAGAGTGCAATGGTGTGA 

GGGTTCAAATGATTCTCCTGTCTCAGCCTCCTGAGTAGCTGGGATTACAG 

AATAGAGACGGGGTTTCTCCATGTTGGTCAGGCTGGTCTCGAACTCCCAACCTCA 

CTGGGATTACAGGTGTGAGCCACCGCCCCCGGCTACTTCCTACCTTCCT 

ACAACTCTATAAGGGAGAGACTGCTTTCCCTATTCCAGACTAGAGATAAOT 

ATGTTTAAGACGTGGGAGAGCTAG^CCAGGCGCAGTGGCTCACACCTGTAATCCCAGCACT 

CAAGGTCAGGAGATCGAGANNNNNNNNNNNNNNMWN^ 

GGTCAGTACCTTTGTAAGTTTTGAATTATATGTTTCCCTGCCCTA 

TGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGAGTGTGTTTGACGGAGTTTCGCTOT 

CAATCTCAGCTCATTGCACCGTCTGCCTCCTGGGTTCAAGCAA 

CGCCACCAGGCCTGGTTAATTTTTGTATTTTT^ 

AGGGTGATCCACCT^CTTGGCCTCCCAAAGTGCTGGGATTA^ 

CTTTAGGTCCTAAAATATGCATATTCTGTCGCGGCTCATCTTGGCTTTGC 
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GGCTTTTGGGTGCTTCTGCTGTGAAAGJAGG^ 
CTGTACCAAGATTACAAGTTAAGACCAACTTCTTTGTCIX3OT 
GAAAGTATTCGCrTGCACTGGGCTCTTTGTTATTTTAAA 
GTATGTGTGTGTGCTCAATTTTAAGGGCTGAGACTGTTCCCAGAAGGCCTGT^ 
5 AAGCACTAAAAATATCTCATAAAAATGCETCCTCCATGCAGTCATTGA^ 
GTGTTAGCTGTTAATTAAAACATTGCAGAGGTGTATAGTTC^ 
TTACAATTAATGTGTTGAGGAAGGTCAAGACAATTGAG 

CCATTTGGAAAGTT AAGGGACACACTCGACT CATGCCTGGCATTG CCTTTGTCAT C CAGTTCT CCTTGATGATTAAATGTTAAATT 
GGTACTGTCATTG CACTGTAAACCAAACCACATAG CATCTTTATTC CT CGAAATCGAAC CCAAACTTCCACGGATGAATT AG CGTG 

10 CATCATAATTAAATTATCAAGCCTCT 

CAAGCTAATTAAGTTGCCAAACTGTCCAGTATGGTATGTATATTAGACTATCTC 

AAACGTCCAGGTGTTGCITATTCGAAGTAACAAJCZACTGAT 
TTATATTTCAGTTCTTGATTAGTGATTCGAATATATAATATTTT^ 
AATAATATATCACAAAGTCTTACGTATTA^ 
15 CCAAGTGCGGTGG CT CACACCTGTAGT AGT CT CAGCAGTTTGGGAGGCCGAGGCTGGT AGAT CACTTGAGGTCAGAAGTTTGAGAC 

CAGCCIX3GCCAACZATGGTGAAACCrrTGTCTCTATGAAA 

AGCTC CTTTGGGAGGCTGAGGCAGG AGAATTGCTTGAAC CTGGGTTT CGGAGGATGCAGTG AGCCGAGATTGTG C CACTGCACT CTA 
GT CTGGG CGACAG AGCAAGGCTCCATCTT AAACAAG CAAACAAACAAAAAACAT AATGTAGGGGAT AGGGTTC CACTTAT AAAATG 
TGTATCACTTTCTACAAAGTATAAAAAATTATTTTGCriX3GCACA^ 
2 0 GGGAGGATCGCTTGAGGCTTGGAGTTTGAGACCAACTTGGGCAACATGGCAAGAC^ 
CAGGCATX3GTGGTATACTCCTX3TAGTACCAGCGGT 

ATG AACT AATAAGATCACAG CAGTG CACTC CAG CCTGGGAGACAG AATGAG ACCTGTCTCTGAAGAAAAT AAT AAT TGATTAAGTT 
TT ATT ATT ATTTTCTAT AT CTTTCAGACAGAGT CT CG CTCTGT CAC C CG GGCTGTAGTG G AG TGGTGAGATCAT AG CTCACTG CAG 
CCTGACCTGGACTCAAGCGACCCrCTTGCCTCAGCCTCCCTGTA^ 

2 5 GATGCCAGGTAATTATTTTATTTTTTT^ 

TCCTCCTGGGTTGGCTTCCCAAAGTGCTGAGATTACAGGAATGAGCCACTGTGCC 

AG AAATAACCAATGAGGTTGTTATTTGAGAATATCTTT AG ACACIA CTT TGGT AAAGTAAACAGTTGT C 

AC CCGGTTTTTGT AAT CTTGCT TTTTCTTT AAATGAAAT AT ACTTG AATT T CTGTAT AATT AATTG AT AT AAT AAAAGTT AAGTTT 
ACCTTCCATGACTGGCTTGACCTTTGTCCACTCAAGTTGGA 

3 0 GTATATACATATGCTTTATATATGTAGTTTCCTAGTCTTGCCTGAATTTGTAAC^ 

TATGCTCTTAATAACTAAAICATT^ 

AAAAAGTAG AATGTACC CTT AAAACAAGTGAATGTTATCAT ATGTAAAT TAT AT CT CAAT AAAG CTGTTAAAAAAT TTTGTCAGTC 
CTTTCTGCTCGTGGACGCCACAGAGGAAACATCATTAAAOTCTCTGTTCTCCCTGCTGTCA 
GCCTGAACAGCTGAGGAAGCTCTTCATTGGAGGGCTGAAOT 
35 G AACACTCACAGACTGTGGTAATGAGAGATCCAAGCACCAAGTGCTOT^ 
AGTGGATGCAG^CATGAATGCAAGGCCACGCAAGGTGGACGGAAGA^ 
AAAGACCCGGTCCCCACTTAAOTGTGAAAAAGATATTTGTTGGTGGCATTAAAGA^ 
CCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGCAG 

CCGTCTCTACTAAAAATACAAAAATTAGCTGGTCGTGGTGG CAGGCACCTGT AATCCCAG CTACTCGGGAGGCTG AGGCAGGAGCA 
40 TTGCTTGAACCCAGGAGGCAGTGGCTGTGGTG AG CCAAGATCGTG CCACTGCTCTCCAGCCT CCATCT 

AAAAAAAAATAAAAATAAAAAATAAAAAAAAGACACTGAGGAATATCACCTAAGAGATTATTT^ 
TGATTGAAATCATGACTGACCAAGGCAGTGGCAAGAAAAGGGGCT^ 
ACTGTCATTCAGAAATAACCATACTGTGAATGGCCACAACGGTGAAGTTAGGAAAGCCCT 
CAGCCAGCCAAAGAGGTCGAAGTGGCTCTGGAAACTTTGGTGGTGGTTGTGGAGGTGTT^ 

4 5 GGACGAAACTTCAGTGGTCATGGTGGCTTTGATGGCAGCCGTGGTGATGGTGGAT . 

GATTTAGT AATAATGGAAG CAATTTTGGAGGTGGTGGAAGCrACAATGATTTTGG CAATT TCAACAAT CAGTCTTCAAATTT CAG A 
CCCGTGAAGGGAGGAAACTTTGGAGGCAGAAGCTCTGGCCCCTATGGTGGTGGAGGCCAATACTTTGCCAAAC 
TGGCCATGGCGGTTCCAGTAGCAGCAGTAGCTACGGCAGTGGCAGAAGATTTTAAT^ 
GGAGAGCTAGAGAAGTGACGGGGAAGCTACAGGT^ACAACAGATTTGTGAACTCAGCCA^ 

5 0 TACAAAGAAGACATGTTTTAGACAAATACTCATGTGAATGGGCAAA^ 

TTTAGTTCCTGTTCCZATGGAAAGTTTAAAGCATTCCAACAAAGTGATCTAATGT 

G CT AAATG CAAT AGT CTGATCATGATGCTGAATAAATGT CTTTT TT AAAAAAATAT TTGTCAGAGCAAC CAAAT TTTCTG CT AAAC 
AG CAGAAAATG ATG TTT CCATGTTG G CAGT AGTGGATTATGT AAATTTTGGTTTTATTTTTGTTTTAA 
GCTCATTAAATATGTTTTACAACTGGGCAGGTGAAGATGAGGAATTCCT^ 
5 5 TT CCTAT CCT AACACCAG ATG AGAG GAAGAG CCGTCTGAT CTT AG CTAGTGT AT AAAAT CCT CTATAGGG CAAGATTTTT AGTTCT 

TCGAAACTTACTTACTTCCTTATATTAATATTTTTATGGGCTTGCAATAACAATGCTT^ 
TTTTATTGTATAAAAATGTGTTGCAGATTC 
TTCATTTAA 

60 HUMAN SEQUENCE - mRNA 

AGCOTTCGCTCAACTC^GAACCTTCCGACCTCCGCT^ 

ACCTC^GACCCCCGACAGCCTGGGCAGGCTCGGTGCC!TGCGGGTGCGTTCCTGATCACCCCT 
CCATTCCCTTGTTTTCIACCCTCn'GTCCTCT^ 
TTTCCCATACCCCAGGATGTGAGCCTCTTTAACCTX3TAATGCTGT 
65 CTGGTAATTCGCATCAAGATCCCAAATAGTGGCGCGGTGGACTGGACA^ 
GGATGTGATAGGCCAGGTTCTGCCTGAAGCAACAACTAGAGC^ 

GTGATGAGGAAATGAAG GCAATG CTGTC^T ATTATTATTC CACAG T AATGGAACAG CAAGTAAATGGACAGTT AAT AGAG C CTCTG 

CAGATATTTCCAAGASCCTG CAAGCCTCCTGGGG AACGGAACATACATGGCCTGAAGGTG AATACTCGGG CCGG ACCCTCTC AACA 

CAGC^GCCCAGCAGTCTCAGATTCACTTCCAAGCAATAGCTTAAAGAA^ 
7 0 AGATGAATGAACAAGACATACGATATCGGGACACTCTTGGTCATGGCAACGGAGGCAC^ 

G GG AAAAT ATT AG CTGT AAAGGT CAT ACTACTAGAT ATT ACACTGG AACTT CAG AAGCAAAT TATGTCTGAATTGGAAATT CTTT A 

TAAGTG CG ATT CATCAT AT AT CATTGGATTTTATGG AGGATTTTTTGTAGAAAACAGGATTT CAATATGT ACAGAATT CATG GATG 

GGGGATCTTTGGATGTATATAGGAAAATGCCAGAACATGTCCTTGGAA 

TGGAGTTTAAAGATTTTACATAGAGACGTGAAGCCCTCCAATATC 
n * anTTARCACTCAGCTGGTGAATTCTATAGCCAAGACGTATGTTGGAACAAATG 
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AGTATGGAATT<2ATTCTGATGTCrTCGAGCTTAGG 
• CTTCTGCAGTGCATTGTTGATGAGGATTCGCCCGTCCTTCCAGTTGG^ 
TATGCGAAAACAGCCAAAAGAAAGGCCIAGCACCTGAAGAATTG^ 

5 GCCCAGGACCAGTAACCAAGGAGAACAACCCACCCGTCX3CCCTTCTCCGTAT 

GCTTCCCTGCCCTCGCCTTCACCTCTGTCAGCAGGTGGCCTTGCCTGGGGAGCCCCA 

ATACCTT CTGGTTTGAAGGCGCTGACACTGGCAGAGAGGTAAAGGGTGGGG CATTG AGAATGG AGGCT CCCAGGGTC C CTG CCCAC 
TTCTGTTTTCCTAATGTTTTTCTCTATAAAGGGTCAGGCCCGTCA 
CCTCTGCCTGAAAACTGGACAGAAGGACCCAGAGGTGTTCTTTCA^ 
10 GTAAAGG CTCTACCACCAT 

HUMAN SEQUENCE - CODING 

ATGCTGTGGCTAGCCCTTGGCCCCn^TCCTGCCATGGAGAACCAGGTGCTGGTAAT 

GGACTGGACAGTGCACTCCGGGCCGCAGTTACTCITCAGGGATGTGCT 
15 C!AGCATTTGAATATGAAGATGAAGATGGTGATCGAA 

TCCACAGTAATGGAACAGCAAGTAAATGGACAGTTAATAGAGCCTCTGCAGATATT^ 

GAAC^TACATGGCCTGAAGGTGAATACTCGGGCCGGACCCTCTCA^ 

GCTTAAAGAAGTCTTCTGCTGAACIX3AAAAAAATACTAGCCAA 

GGTCATGGCAACGGAGGCACAGTCTAC^AAGCATATCATGTCCCIGA^ 
2 0 T ACACTGGAACnTCAGAAGCAAATT ATGT CTGAATTGG AAAT TCT TTATAAGTG CGATTCAT CAT AT AT CATTGGATTTT ATGGAG 

CATTTT TTGTAGAAAACAGGATTT CAAT ATGTACAGAATT CATGGATGGGGGAT CTTTGG ATGT ATAT AGG AAAATGCCAGAACAT 

GTC CTTGGAAGAATTG CAGT AGCAGTTG T TAAAGG C CTTACTTATTTGTG G AGTTT AAAGATTTT ACATAGAG ACGTGAAGCC CT C 

CAATATGCTAGTAAACAOVAGAGGACAGGTTAAGCTGTGTGATTTT 

ATGTTGGAACAAATGCTTATATGGCGCCTGAAAGGATTTCAGGGGAGCAGTA 
2 5 TCTTTTATGGAGATTCAGAAAAACCAGGGATCTTTAATGCCT 

TCCAGTTGGAGAGTTCTCGGAGCCATTTGTAC^TTTCATCACITCAGTGTATC 



AGGCGGAGCCAGCAGGGGCCCCCGTGA 
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Table 95 

MOUSE NOMENCLATURE 
ICSGNM Cd28 
Celera roCG15201 

HUMAN NOMENCLATURE 
HGNC CD28 
Celera hCG16886 



10 MOUSE SEQUENCE - GENOMIC 

ACATACACAC^CATATATATACATACATACATATAT^ 

TTTATATCTGTTGTTTAGAATCTTAGGTGCCTATGTGCrTCTAAATGGCTGCAGG 

TTATTTTCCACCXSTAAAGTTCAGGTCTAAAACAGTCATTATT^ 

TGTCACAGGAGCCCTGAGCAGCTTGTTCAAATGTT^ 
1 5 ACATGTTTTGAATAGGAATTCAGATGATGAAGACrCAGGAGCCCTAAGATOT 

TATATCAGGGTAGCCAGATTGAAGGAGTCACTGTAACTTAATTTATTTGTGTAATCT^ 

AAAAATATTATGCTCATGTAAATAATTCTTTTATAACT^ 

AGTTAATAAAGATAACTGCTCTTTCGGTAiCTAGAATAATC 

CTTCTGTTTAATCAATTTGCTCTGCCATGGTGTCCrATTTCT^ 
2 0 CTGATCTCAGGCTTATGAGGTTGTGTGTTTAAG 

TAGGCAATGGCTTTCTAGACGTCTTAAAATTACCATTCACCCTCAACACIAGG 

TAAAACTCTGCCCTATCCTTTTGTCTCCTCCrGATTGTCrcrGT 

ATCCATCACTTCCTTTTCTCCAAACAGATATCATTTATTAACTTACATTATCAAC 

GAACITGX3GTGGGTTTCCACTCIATGCGCACTGACTGT 
2 5 AGTCAGTCCAGCGCTGGTGCTTTGCTCATCCTTTTGCAG CTCAGG CCTTGCCAGCT<^CATCTGTTATGGCAACAG CCTCACTGCT 

CTGAATTGTCCTTAACCTT C CATCATACT AT CGTGCAAC CT CTC C CCG CAGGGAGTCGGGAGAGGTGGGACTC CATTCTTAC CTTA 



C TATTTCGTCC!TT AAATAAATAAAATTC CTTGTTACTTTCTG AT AAAC CCTTGCAGT AAGGT CATTT CCTT AATCTTGTCCTATCT 
CAATTTGCTGTAGACTTCATGGTCCAC^TCAGGACTGCTCTG 
3 0 AACTGTCAATGACTCTATAGAATTCATGCIACTGTCCATGGGTT^ 

TCCATTTCCAAAGAACCAAATAATCAGAGCTCATTATTTAAACTTGCTGAATTAATC 
GAGAACAATTTTCTGACTAATCAAATTTTGTTTTCAGAGTGTATACATTAATTAT^ 
TTTTATGTCTCAGTACTCATATGGTTTTTTTGTTTGT^ 
ATATAGGACAGTAATATTTTAAAGAGACAAAGACAAGTGTC^ 

3 5 TGGGAGG AG AG ATTTTATTTCTG CCA.TGATT TCAT AT AAAAAT G AGAATTTCAC AAAAG CAGTTAATGGAAAATGAG AG AAAATG A 

CCATAGGTTTTTTTTTCTTGAAAATTTACCTTGCATTTAAAT^ 

AAG CACTGTAGGTCAT AGAACAGAGTT AAGGGT ATAGCTTTGATAAGT AATGGT CCAAAGTG T CATC 

ATATTTCTATTTCTGTTTTCATTGGCACCTATAATCAATTTGTA 

TGT ACTATT AAT AAGATTT AGTG T ACTTGAT ACAGTTG AT AG C AATG ACATACAGT CTTGG ACAAGG TT AG CCGGTTAGCT T TTTT 

4 0 TTTTTTTTAAATCAAC1AATAGTCATGCTTTGTCACIAGTGGTCTCT 

CAGT GCAT CTTTCAT AAACAAG CCTGCCT AT GGGTG CCAG TGAC TT AACAG G C CAC G TATTTT C AGTTGGAATCCCGAAAG AGACA 

ACATACATGCTTGTATATCATATTGTTGAAATTCATTTAAA 

ATTGCAAGAAAACAGTTGATGGAAAATGAGAGGAAATAACCATAGTTACTTT 

4 5 CCCAGTATGACTTAC1AGG CTCAAGAA<2AAGCACTAGATTGGCT CAT CAGAG AGC CTGTAC CTT ACAG AGTCTGAAAGG AAG CTAGT 

GGTCAGGGGGCTAGAAGGTGAAGGATATCCTAGTCCCAAACAGGAAATACGTTTGGAGGAAAATTA 
TGGTCAGGACCCAGGCATTCTTTGCCAGGGCAACCAATGACGTCT^ 

ATAGC CT CG CC CC CT AT CTAACATGTT ACT AACACAT CAAAT TCTG TT CACTT GG AATTTTCAAAAG AAAACAC CAGAATT AGGAA 
GCTATAACTTCATTTTT CC CCCAAATG T AT AGAAGTGTTG TG ATGG AAATAAATG G AAT CCATT AGAATCT CT AGAAGAAAAAT OA 

5 0 ATTTCTAATTAAG T ATGTATAAT ATCTTT CTAAT AT CATCTTTAG CATG C CAG TCT ATG CTTCAGT CTTGAGTC ACACCAG C CTTG 

GTGGGCCACITTATTTCACTATGATGCTTATGATATAAAAAAAGAAAGA 
ATCTTTTTAGTTAAGGATTCAGCCATTAAAAGTTGGACTGAAA 

AAAATGAAATGGAAGAG CT ACAATGGAT AAG CACTAATTGTAAAG TAGAAATTCTTTATG AAAAG CAGTTT GT AT AAT AAT AGTGT 
TTGTAGGTCTCATCTGAGATTCATCTATGAATCTGTAGGTCTCTCAA^^ 
55 TCCATGGAGAACATGTTCCAAGACTCTCTGTAGACACTTGAAATCCTAGATAGTAAAACTCTGTAAATGA^ 
TGCCAAGCCGACCACACACTGTGTOTGCAACn'TCTGCAGTCAGAG 

ATCCT AAAT ACCTCAG CATTTG ATCTTTTTTT T^ CACCGTT CGTT CACT G CAGGT 

CATTATTAAGTAAAAT AAAGGTAGTTTG AACAAAACCAG TG AG AT ATCT CAGTGGTTGATC TG AT AGCTCAG GGAACAAT TTCATG 
GCTAACTGGTGATTGTGCATAGTAGAACAACCAACTAAAGGGATAATTCA 

6 0 GTGATGCCAATT CAAAATTT ATGAATTGTTT AT AT CTGAAATTTC C CATTTAATGATCTGGGTTT ACAGATGAATATAGG TAATAG 

AGACTGTGGAATGGAAAATCACAGACAAGGAAGAGCTAGTTGGAACCATACTTCCTCT 
TGTCGAATGAACAAAGTTGTTCTTTTTTTCTTCTGTTCATTTATGATTG 
TAGTATTTTGTCTGACTTTGTTTGAAAAAAAAAGTCATTTAT^ 
AGATAATTTTCAAGTTCCTTCCAGCTGAGA 
65 TACAGAACTCCACTAAAATGGTGGAGAAAATGAAGTTTACTTTGG 

AATTCCCTTGATTCTCACATGAAGATCTCAGTATCAGCTCTTTGATCAAGTCCAGACTCT 

TTATGTTAGAATGTTGATGCAGTGTGTCAGAGGGAAAGGAACAATAGACCAAAAT CTGTTGGGAGG CTTGAGTTCAAGTCTCAAAG 

GCTGCTCATTTGTCTTGAACTTGGACTCGTCTTATAGGCTCATGATTCTAAAC 

TATTCTTTGTAAATACACAATTTAGCTTAACTGGGAGGTGTGTGTTTGTGTGTGTGTGTC 

7 0 TATACATCCTTTTAAGGTCAAAGTTCTrTTGATC 

TTC CT AAG CAT AGT AAT TGACCCACTTAG AATTGATGAGAAGG AAAG AAGTTGTTTTTGT AACTATT CATTCC CTTGGAAACACA 
ATCAATTGTCAACCTTCTAGAAGCCAGTTCTACCACAAGTTTCACAGACTATTG^ 
GGCTGCAGATAAGTGCITTCTCACCTCTGCCCTTAGTCTTCATGT 
GTTACTGCTTCCTGTCTCCACAGCTTTGGCTTGTTTAGCT^ 
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AACCTGATGATTGGTAGACATGCCAGCCAAG^ 

GGGTGTAG CCTAGGTAAACCACT CACAAGAAGG AAAACAGAAT C CT AGCATGGTT 

CACTAATGTGGTGACCCTTTAGTACATTTCCT 

ATTTTGCTACTCTTATGAATCACAATGTAAATATCTGTGATGCAGGA^ 

GTTGAGAACCACIGGTCT AAG TCTAGGT ATGTCCTT AAATTTT ATAA^ CTT AAATTATTTGAAAT ACCACAGATCCC CAG A 
AACTCCAAAAAACAAACCAAAATACAT AG CCAAGCTGACCATTGT CATGTGCTTCAGTTCATCTGTAT C ITTG'IGCTGTACTG CTG 
TAGGGTGGAGATGGATCTCTTCACGCATGCTCCATTTTT 
AGTGAGTACACATGGTCAATACCATTTAGGACAACTGTAATCTTTCTT^ 

AGAGAAQATGT CGATTCTGAGG TCCTGTCAG GTAG CTTCTCACGG GGAAT AAATT AAGGTGGGACTG AAATCT AGAAATTGTATTA 

ACAT TT AAAAAGAGGTCITATCAGCTCAAAT CCTAAG CTAGG AAG CTGTTAGC CACAT ATCAG CACAAT ACAAGAAATT AC C TTGA 

CT AT AG CAT CT CT CTTTAAG CATCAG CT AATCT AAAAATT AT CTGTCGACACAG AAG AGAGAAATGTG AGCAC CGACGGAT AAACT 

GCCCATCrCCTTTTCACAGTTTTCAAACTCTTCCC^ 

GTGCrTGATCTCCAGCACAATGGCAGCCA^^ 

AACGCTGAGGTCAAATGGGCTTGGGAGAAATCAGACAACCTCA^ 

G AAGAAAAG C TAC CTTTGAGTTCT CAT ATTACTCCTTGTG AAGAAAGACCGT TTCAGAGAATGACTG CAG TCTCTT C CTCTG AAAG 



TCTATTCTGGTAATGAAGTAGAAATGAGTAW 

TGCTTCCTTTTCTCTTTAACCACACCTAAATCACTGTCAA^ 

TTCTAAGCCAGTGTAGTCTCTCACCrCTCACCTCTGATGAATCTTGCCCACTCT 

TGCTT CCACTTTGGTTAG CGTGAT CT ACAGAAATGAAAAG CTGTGAGGACCTGCTTTT CGG CTT AAG AG CCTT CACTTGGTCT CTA 

TCACTGGCAGGAGATCTTGGTAAACATTTGTCAATGAATGTAGGC^ 

AAATCAACTGGTCCCACAACCTGCAGACTGTTGTCTACTGGATCAAAAC^ 

CTAGACTGGGAACTGCCCAAGGAATTCATGTGTAGGAAATCTGTTTTCCTTCT 

TC^GACAGGGATTACTAAAAGAACATTGGAGTTTGGATGTTGGAGCTTGT 

GTTCACAGTAGCATTGCTCATAACAGGCAAAAAAATGTTGA^ 

AACTGGAGTTCTGATACTTTATCACATCAC!ACACGAAACTTGA 

CTGTGATTTCAAACATAGGATATGTCCTGAATGGGCAAGTAGAGACAA 

ATGACGGGAACTGATGGCTCCTGCGATGGACGTTTCCTTTTCGTGTAATGAAA^ 

CCCATTGTATTAAAAAAAATCCCTTAATCGTGTTTTTAACAAGAAGAAC 

CAAC CCAAAT AT CAACACTAAC CACATCTT AACTT AT AT CAAATATAT C AG AATTT CTG AGAGTAAG AGGTTGTGG ATATG CGTTT 
TCTTTTTAAGTTCTATATGTATTGAAATACTGATAATTATATAGATGATAAAAGTTAGTAGAGATTTAATT 

CT ACC CAAAGCAAAAAAGAAG AAATACAGG AAG GAAG AAAG GAAAAAC AAACT CAAACAAACAAACAAACAAACAAACAG AAAC CA 
AAACCAAAACACCTGTGTACATGGATTCAATCCCAGGTTAAATGATAAATTTC^ 
GGGTGTGGTACAAATAACCTTTTAAAAATTAGAACTAGCAGGCTGAAGAGATTGCTCGACAATTAGAG 
ACTATAAATAGGTAGCTCTTAGCATGCTTCCCCAGTAAAAATO 

CAG CAATTTGTTT ACTACTGAAAAATT CAGGT CATG TCTGAGAAAGAG GATGC CT ACATCCTATC CGT CTTTGAT CT ATG ATG TTG 

ACTCTCTGTGCCTAAGCGGTCAGCTATTTAGGTGGTGTAGCCTTTGGTTTCAGTTCrGAT^ 

TAGATGAACCATTCATGTTTCTAGACCTCATTCAGTCATCTTTT 

AT CC CAACACT CAGGAGG CAAAG G CAG GT GAATT AGTGAG TT AT AGG C CAG CCTGG TCT ATGT AGTAG ACAG C CAGTG AATAC CTG 
TTTCTGTTTCTGTTTTGTTTTGTTTGTTTAGAAAAAAAAAAGAGA^ 
GGGTGGAGGGAAAGAACCAATCCTACAACTTGACCTCTGACCTCCATTTGCATGCATGTT 
CTCTCTCTCTCTCTCTC^CACACACACATTTAAAATGTAATACTTTCTTTTAAAAAAGGAGTGTACTT 

G AAATGGGGGTGCATTGGAAACAGT ACGAG CTTTGTCT CC CTT CT CC CAAAGG AACTG AGAGT AT AGTT TTAT TTT CTTTGCCTTT 

AAGGGGAGAAATTGAATTAGATGGTATCGAGAGCTTTGATTGTTGCATAAAGATAGAACT 

CAACGTTTTTGGGAGGCTCTTATCTGAATTTGCTCTGGAAGAGTAATTTAAGCCAGCCCTT 

TTACCCCTTTATTGACTGCT CTTGGTTTGGTCAAT AAG CAAAG T CAG AA 

TGCAGAGTTGAAAAAAAAATCACTTGGACATCTGTC^ 

CAAGGCCTGACTAACATCATAGGAATCTCAACTGGACT 

ACATTTAG CTGGGGTACTCAGTG CTCACAGCAGCT CACATTGTGCTGTCCAGGTGG AAGCACGCTCCT 
AGATGCCAGCATCATTGAAATATAAACTTGAGAACTTTCAGTGTAGTCATCATTCC^ 

CAAG GGACT AAC TTTG TTGGAGG T CTGTTCAGTT G GCT AATTAATT CACTTTG ATTT CAGGG CAATGGAATT ATTATTC TT ATGCT 

CCTAACTAAATGTTTTTTTCCCTTCAGAAAACAAGAm 

CTCAGCTGCAGGTATTCCTACAACCTTCTC^ 

TGTCGGGAATGGGAATTTTACCTATCAGCCCCAGTTTCGCrrCGAATGCCGAGTTC^ 

TGACGTTCCGTCTCTGGAATCTGCACGTCAATCACACAGATATTTACTTCTC 

GACAACGAGAGGAGCAATGGAACTATTATTCACATAAAAGGTAA 

GCTGAAGGCAGGGTTGGGATAAGTGAAGCTGGGGCTCTCGCATGTGATCTGAGACTCGAAGGCAAATCTCATAAAC 
TTACTCCIAGCAATGATTCCIAGCTTATTTGGAATGCCTCTATCTTTCCTCACTGA 
TATGAGATTTTCCTGAATTCATTTGACCTTTTGGATCAAGTATGAAATTGAGCAGAGTCCT 
TCAGGACTCAGGAAGAATGGGTTGGATACTAAACTAATTTTTTTTTTG 

C CAAAACATAAAAAGG ATG CT GGTCT AGAAATTT AG GGGGCATGCAATTTG AG ATCTTGGCT AAGTT ATCC^ CC 

CCAAACCCAATATAATTAAGCAGCACTCATGAATCAGCAGCAATAAGTACATTATGTTTACG 

CTGGGAAGAAAAATAATAGGATTCTCAAACTCAAATATAGTTATTGGCATTAACATTAAAAACTGAG 

ATGT ATATG AGAATGACAGATAAGAAAATAT AT ACTCCAT TTTACATTCAACTTTAG CACAATT ACAAT AAATG CATTTT CTTTTA 
TG CCAT AACAAAGGTGT ACTT AAG AAAATTT AAAT AT CAAACAAG AAAAGAATT ATT ATGTT AACATGACAGTATC TTT CATGT C C 
TGG CAAAAACAG AGGCAATT AAT ACACAAATTAAG ATTT CTAAT AT CAGT AATTCT CAG G CAAAATATTT AATTGG ATTGAGTCAG 
AGGATAAAAAGATATACTATTTTGGAAGAGTAAAACACTTTTCCATATAAATGATACATTAC^ 
GCATACATATCGATTTAGGAGAACTAGAAAAGAAGTCAGGGATTCTGTACTTGGCAGCTTTC^ 
TAAACCCTAAGTAGGTAACTGGAAAAATGCCTGTGCTCAACTTTAAGTT^ 
GATAATGATTTCATACTCTGCCGGATAATTCTAATGTGTAGCCAGGATTAAAACCT 
TATTTTTGTATTTTATTACAACTTTATTAGAAAGAACTTGGAAATAAAAAA^ 

CAAAGTCTATACATCTCAATGAACTGTGAAGTATTCTGCTTAGTTATAATCTTCCACGAGTTTTAATA 

TCCCCATCCCCTTTCTCATACCAAATAAATAACCTGACTCATC 

TTTO^AAATTTrCCAGGCTAGAGTTCTGACTG 
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TTATTTCMAGAGTTTAGGGTATAGGTAAATTAT 
TTAGCACTATGCAGTGCTIAACAAGACCCAGTGACTGAC^ 

TGACAAAGTATGATCAG TGGTGGGGTTGAGTAAGTCCCAAAGAG CAAGACTCGATCAT CT AAGCTGGTG GT ATTTTG CATGTTTTA 
GTACTTTCAAATTCG CCTCTGATGTACAGGACAGAAATGT ACT CAACTGCTGTTT CTTG TGTCCTTTAACC CACATG TTGTGTT^ 
5 CAGAGAAACATCTTTGTCATACTCAGTCATCTCCTAAGCT 

TTGCTAGTGACAGTGGCTCITrGTGTTATCTGGGTAAGAGGA 

ATATTAAGACTCrrcGCCTGTATCTTTTCrACGTTAAAGCAAAT^ 

TGTGAGTTCTGTTTCCCAGAAGGCACCAAGTAGCCAGGCAGAAT^ 

GCTTTT CTIT ATTCTT AATTTTCTAACAT AAT AA1TG CATATCAAGCAG CTTAAATG CAT ACATAAGGT AT AAAAT CTAAATCTCAT 
10 CAGGATTAGGCAACTGAAGCCCGTGCTTTGGCTTGGGGTCAAA 
AAGAAAGCAGGTCTGTTTATGTCAGTCCn'ATTTTCrc 

TATGAGACTGT AAG AGGAG AGGG ACTGTTC!AGAGCTCTC CCCACTC TACTAGTCTCTAT AGATTTTT C TG CAATAAG 

TTTAATAGATATTrTCTTCCTCAATC 

GAGACAATGACTCACCATGTGGACCAGATTCGTCTGGAAGCTACTATC 
1 5 TT CATCCTCTTG AGGG CTTATAGGTGTAT AG CACCATGTCTGAC!TTTTGAACATATTT AAAGAT^ TG A 

GTAAGTGTTTTGCCrGCATGCATGTCTGTGCACC^ 

AAACTGGAGTTACAGATATTTGTGAGCTACCATGTAGGTGCTGGGAATTGAACCCCGTTCCCT 
CTGCTAAGCTATCTTCCTTATGCTTGCTGAACACATTTTGAT^ 
TTTCTTAGAAACCCCTTTGTGTAGGAGTAACAGAA 
2 0 GGGTG AATAT ACAGGT AAGTTCZTT CAATGAGAAATGT ATG CTG CACAAGACAAAAAAGAC CC CTT ATCTTG CTACT CTTCAC CCAT 
CTACTACGAAGCACACTTAGGACTTAGTGAATGCTTATTC 

T AAG CCTGTTTGAGAT ATG AT ACCCATTT AAAGGC1AT AATC CAAATATTT AAAG AGACAACT TATGAT CAGGG ATTT CT AGAACTC 
ATGCCATGTGATG C CATCTTATTT ACT ATAG CATTGGAG AGGGG ACAGGG CAG CTCTAGCTCTAAGT CATCAAGT TGTATCTT C CC 
ACCAGAGATTTAAATTGTTTATTTGCTTAGCCAAAGTTGCCT 

2 5 TGGGGCAGGTGGACATTTGTCAGGCTACCTTTTGACGAGTCTATAACTTTTA 

ATTGTTTCATCTTGAGAATTCCAAAAGTTGTCAATCTGGGTATC^ 
CTCTGATATTAGTTGAGAACGTCCATCGCTCTGGACrTTGGAGTCTGCACT 
ACTTTAGAATCCTTAGGTTTGACTTGGTTTTTCTAATCCT 
ATTTCCTAATCITAAAGACCAGGTGCAATAGCCCrcrGGGTACCCTTCCAAG 

3 0 AGGAATTTACTAGCITCX3TGGGAAGGACTTCTGGAAACT 

AAATATAT ATCAAGGAAATCAGCTTCT AAGATGAGGC ACCAATG CACTGAG TGACTGAAAT CAC C C CAAAAGCAG CTTCTCACTTT 
CAATGAGTTAAAACAATTCACCTTTCAAGTCZATC 

TTCATCTTCTCTGCCTTGGGAAAACTGTTCTTGACTTATCTCACAGGCTCACTGTC 

G ATTT TGGGAGTCAAAC CAGGC tatct AACAAAAATGCCACT T ATTTGGAC TTCAAACC CAGGT CTTTGT ACTCT CTGAC CAAGTG 

3 5 GGGGTATAAGAATAATTTGCACCTCACAGAACAATGTAGC 

TCTTGGTGAGCTGGGCATTTAGGGATCCCXSAGGACTTAACCACT^ 
TGAAAGTTAACTGGGTTCCAGAGCCTTCTGAGAATCCATGOTGTGCAACTGTAG 
TGCTTGACCTGATGTTGGCATTTGAGTAAAGAAAAGGCATTTCTTGGTCACCT 
TTriTTTTTTTCTAATTCTCTGTTCATTCAGCTG 

4 0 TTGTTTTTGTTTTTGTT TTAAC ATG CTTT ATGCCAGTATGTAC C^CAATGTGTAGGT AATGCG TATTT AGGTAGC ATT ATCTG CC T 

TTAAATCTGCATTCTITGACCATTTTAGTACATTGCTTATGTGT 

GG AGGAAAGG CAAAGGAT AG AGTGG C CT TG AAGTCATGCATCTT T AACGGGCATTGGTTGGTGAGGGACACAG AT ACACAGACACA 
AGGGGTGGGTGTTCTATGCATATACATTGTGAGAGTGCAC^TGAACACCAT^ 

ATTGGTTAAGAAT CAGCTAAACCCACTG T CTG AT AAT AGTGTTGGGG T AAG CATGG G AG CTGATTGTAACAGATTTTCTTTAT CT T 
45 TGTGTGTAGCTTGGAGAACCTCAAAGTCACAACACCCTTCTTACT 

C^CAAGGGTTGGCAAAGGAGACCCTAAAAGCTAGCATCAGTACTGAAAACTGCT 

ACTTTGAGGCCTTGGCATGTGGGTGTGAAAAGGAGGCCAAGGCTGTCCTCAACCACTCATTATCAATCCA 

AT ACCTATTGT AT ATAC TT AT CACATCTGACTTG TCTGCTT AG AAT AAGG CTTTGTGTTT AAAT AACAGAGAAT C C CGACTC T G CA 
ATC C ACATGACTTT CTG CTTTTG ACAACT ATG CAAACATTACCAT CACATTG AT CTAG CAATG ATG C CCT AG ACCACAGTCAT AAA 

5 0 ■ AAACATGGTCAAGATGCAAAGTG AGTTT ACCTCTAGTTCCAGG CTTAGTATCATTCCTCTTTCTCTGCT CCTGGTAGAATTTGGTG 

GTTTTTTTTTGTTGTTGTTGTTTATTTGTTTGTCTTGTTGTTGTTTAATT 

GG AGTGTTTAAT CT AT CCAAAGACT AAAGGCTTTCTATTG CATTT ACATAT CAATAGTGAATATGTTCT AAAAT ACATT CTTG GG C 
TT CGGATT AG ATG ACAACATTGAGAATAATGG TTGCATTTTT ATAAAACAATTT CC T TTAG T AGAG CTG TTGAT AGT C 

AGGCTTTGATTACTCATTATCCTAGTCCAAGGCAC^CAAAAAGGCTCCT 

5 5 CCTAAAGTCTTAGAACCCCTCCAATAATACCTCCTCTAAGTCCT^ 

AAT AGTAG AAGGAACAGACTCCTTCAAAGTG ACTACATGAACATGACT^ C 
CTACGCCCCTG CCAGAGACTTTGCAG CXHACCGCC CCTGACAGGG ACCCCTATCCAGAAGCCCGCCGGCTGCTA 
CTCATCATCACTGCTCTGGATAGGAAAGGACAGCCTCATCTTCAGCCGGCCACTTTGGACCT CTACT 
TTTTAGAGTGTCTAGATCTAACATCATGATCATCCTGAGACTCT 

6 0 TGGCTTG AC CCAAACTCAAG CIT AATACATCT ATTGACTT GATTG GGGAAGTTAG AG T AGAGCAAT CAAAAAGATCATT CATTCAG 

CCTTGGGAAGTCAATCTGCAGGCTCC^ 

AAAGATGTTTTGGACAGAGAAGAAAGGATAGAAA^CCTTCTCTTTGGCTAA 

GT ACTTAACTATTTG AAAAATAATGAAAACACTTTTTT CACT CATG AAATG AG CCACTTAGCT CCT AAAT AG TGTT TT C CTGTT AG 
TTTAGAAAGTTGTGGACATATTTTTTTAATGATTTCTGACCATTTTTAATCACATTGACTCATGGAATGG 

6 5 AGTGCTTCTTTCCTCATTCCCG^TCATGGGAACTCAGTATTATTAATAGTCA 

CCAAGAAGAATGTGAGAGGAAGTAAGGTCACTCTATGTAAAAAAAAAAAAAAACAAA 

ACCTATGTGCACACACACACACATATACATA(^CACAAAATGCTATGAAGAGTTATCTGCTTAGTAGCCTGTTATAGT 

TCTAAGCTTCAACTTCTTACAGTTC^GCCACCTGTTGTCCTTTGTGCATGGATATCT 

ATACTGTGAACAAAAAGGGTAGTGTTAGTATTTGTCAGGGTGG 

7 0 GTCATT(CTTCCCATGTATGAAAGACATAGATGTTCTCCAGAAGGCACCATTGGGAGCC 

T TTAG AGGGATTTTATATGT CCT AGGACT CAGGACTC CAGAACTTTGTGGGCTCAGCT GCTTCAT AC CATGGGG ATACATTG ACAT 
GAACAATTATTTTGGAATGTGTCTTTAGGGACGACATCAAAGTTCTCAAGTACCTACAAGACCTG 
TTCTTTTTTGCTTCCAGTTCGGATCAACTGGAATGTATCTGGGGACCTTGAAGAACGGCTGTCCAGCTGTCT 
GTG CTTTG AATTATT CAG AGGTTTT AAAGT CIAGGAAGACCTGG TTTAAAAAACACTTCATTATG^ CAAGT 
7 5 CACTGTTCATCCAAGTCTATGACTCCTCAACTGTAAGATGGCCACACT 
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GTTGGGATTTCAGGAGTG CACAATCATG ACC CAAACTC^ CCAG CT AATTGGAATAT C CT 

GTAATGAAACATGAGGCATTTCCCCTCCCCCCACTGTTTT^ 

GAAAAGATAGGAATCTTTTAATTCTAGACAGAAGATATTG^ 

ATTAAAeCAGASAAGGCOU^ATTAGCCCTACTTC 

AGAGTTCTTGAATCTrAAGTTATAAAATT^ 

CATTTTTTTTTAAACGGAAAGATT^ 

C^CACCTGAAAGTTTTCATACCTCCTCTGAACACAACCGAAACATAGGT 

GGGTT AAAAACACTCGGTCAGAGT AAAG CTTTTG CAGT TTCAG ATGTAGTGTGTATGAAGAAAACT ATGTC^CTTG CTGCTATTAT 
TGTAAG AGTCTAAGAACTAAAGGTGTGCCrrGTAATTTCTAATT ATG AG CT CACCT ATTTGGTAC CGAGCATG CCAATTTTAAAGAG 
ACCCGGTGT AC C TT ATAG CT ACAT CCAATGAT AAAATTAC CACACT AG CACATGC CTGTGTTT AAACTCGTGCTTTAATGTTTTTC 
TTAGGGCAGGTATGCACCCCCTTTGCAGTGAGTTGGGAGAGATT^ 
CTCTCTCTGTGTCrCTTTGTCrCTGTCTCTCTCTTTCT 

TCCCTCTCTCTGTCTTTCTCTGTCTGTCTCTCT^ . 
TTTTTCTATGGAAGAAATACAAGCTGTG^ 

GATTTCAGTCTGGGTTTACCCAATGTATTCTCTGTAATGTGACTGCTGGGGAC^ 

GGTCTT CAAT CATTT AGTCAACGGAGTG AG CGACT AATCTGCAATGG CT ATTTTAAT ATG CAT ACTGATGGTCAAATGGATGTCTG 
ATCTCTCAT CCCAG CT TTCTGTACTACCATATGGGAACTATATGTAACTTGTATACT 

GTAAGATGGAC CAG AAATTGCAATGTT CATGT CCATATAGCCAC CATTAACC CAAG TT AAG CACAGT AGTGTGGGTTCT CT CAGG A 

CTTGTG AATGAGTTT ATG CT CTCT ACAAAGAGAGGTGAAG CTT AAATCT CTCT CAAATAT CAAG AT TG 

TTT CTG T AC CAGGG ACTT AAC^CATTCTATTCAT ACTATTTTCC CTGTCT ACA CTACT TGGTTTGCT C 

TTACTTCCTTGACATATTTGCCCAAATGCCACCrT 

CTCTATAACTCTACTAATAGGTATTTTTCTTATTTATTGGTT^ 

GAAAGAACTCCATTTGCTTTGATTTTATTGCT 

CTAGATGG ATTT CACGTGGGTTT ATGAGTT AGTGGCAAATCTG AATGATCTGCT CACTT CTG 

TTTT CCACTT CCT ATT AGAGACAG AAG AGGT AAAG CAGACACAG AG AAAT CTG GT AGAAGGACATAT CAT TGAT ACAAATT CATTT 
ACAG CTT AG AAATTTT CATGCAT C CTG AAAGATG AAT AC!AGT CGG AGT CATTCTAAAAATGTTATTCT AAAAACAAC CAATTAGGA 
ATAGGTTGATTTTCCATTTAAAGTTCTCAAGTTCTCTCTCAATAGCTTC 

ACCTTCTTTTAAAATGTTGTTATTGGGG CTGAAGAGATATCCCAGTCATTTAGAG CATGACATGGGCTATTTTT'CCAG^GGT^CCA 

AGG CIAAAACACAC^CATAAAAAAT AAATAAAAATAACATTGTTGACAAT TT C ATG CATTTGTTCGCTGTAT CTGACAAT AC C CAT A 

CCCAGCTCCTGGCCACCATGGCCCTGGGCAGTCCCCAATGCACAACCCCCTCTTGTTTTCACGTCCTCCCCm 

TTTTCATTTTGGTAACCCACCTAGTCCAGTTAGTGCCACTTCCT 

TGTACAGGT AG C CAT GGCTCCTGTAAGTT CAT AG T ACAATGG CTGTG CAGTGG CTGTGG CAT AT CT AG AAG ACAG CATG TCACAG C 
ATGCCACACTCTCTGGCTCTGACATTCTATAAGTTAGTTC^ 

ACAGATT C C CATTTAGGGTTGAAGAAAACACAAAAGT CACT CATT CTCACCAAT CTG ACCAG TT TCTG C CT AGT GCAG AAAAGAGG 
CTT ATTTGG CAAAGG CTGAG AG CAG CACTT AC CTATGGAT ATAAAAAG AAAT ATTT AG AAGG AAGTCITAC CTGT CT ATTCAGT AA 
AACAAT AG CAGTGGG TT C C CTATT ATG AC CTATGACTTATGAAC CATGAG C CACGG ACTTTTG AC CAGGTTCACAG TAT CAGG CAT 
CAATTCCTTCCTATAGTGTAGGACTCAAATCTAATCAGAAAGTGACTGGTTAACCC1AT 
GCACATCTTCTCTGTCAGGTTGATATTGTAGCCTCCAGGGACCGGTTC^ 
TGCCAGAAAAG CACCT CTCGGCACTATGAAAGTTAACC1AGCAGGAAAGAAATTCT 
GTTCTGTAATTAAAGTCTGTTGTATCTTCAGCAGCAGGGCITTACC^ 
CTGCCTTGCC^GCCTCCGCAGTCTCTATGACATATCCCATGCCTC 

AAGCATTTTGTCAATGCTTTAAACCC^TCTTTAGCTTCTGGTTTCTGAAAATATGGCTTTTGTAGTTAA 

GAATTACTGCTC7ITCTCTGTGGAGTGATTTTGTTCCAGTC 

AATGGTCAGGACAGGAAATGCCAGTCTCTTTACCT 

CAAAAAG AAGAG CAAACAAG CIAAAATACGTGGCATTGGGAGCAAGCTG AAAAT TT AT AC C ATACTAGT ATTTAACACACACAT CAG 
CGGACAGTGGCTGGGGGTGGGGTGGGGTGGGGGGCAGTGCAAAAGAATCCTGTGTGTGGA 

AAGAGCTTGTCTGATACTTGGTAATCCATGTAGCTGTTTCTATTTGGCCTTAAGTTTTCCTTTGCAGATGACCTGGAGT 
GTACATGATGAACTCATTTTTAAACCCACTCTTTGT^ 

AAAGATGAGCTCACATC CAGGCT ACT ACTTAAG AAAG TCTGGAATCAG CTAGTCTCG TGG ATGT TTCAACAGG GAAAT CAGAT CAA 
ACTGG AAAGGTACCAG ATTAATAGATGGAAAAAC CT CTGCACAG AT CTCAGT AAGGATGTG TTT TGT A 
TAAC CTGT T AAAAAGTTGAGGT ACC TCCAG ACTCAT C CTCAAGG T AT AACACAAGG CAAATTCT ATG AC ATCACT AG ACAG G CC C C 
TC T ATCATGT CT TAAC C CCAGAAATGTGAG AATTT ACAAT CAACAAACAACAG C CTTTAGTG CACTTTGTAGG AGATTCAGTCCAG 
GGCTCCATCTAGGCAAACAACTGAGCTGCATCCCCAGrc 
TTGTGTATGTGTGTGTGTGCATGTGTGTGTGTGCATAAGCACATGTG 
TTCTGCACTGTACTTTTTGAGACAGACTTTCTCACTGAATGTAGGACT 
CCCAGCTCTAGTACGTCAGGTGAGCCCCTTAAAGCCTGACT 
GACAGGTCCCTTACTGACTGAGCCIACCrCCCTAGACCCACATTCCTATTTAAGC^ 

ACAG AGATG AT AGAT AGTTG AG ACAAAT CAAAGAAACTCC TGGGTT CATTG CAACAG AAATCTGACCC C CACAGTAAAAGG CAG CT 
ACAGATGGTCTGGGAACCACCTTAGAAACCTGCTGATGGTGTG CTCTGTGTTCTC CTCAA CAG 
TGGTACTCAGGATCTTGGGTTTTGAGAAATACATCTGAGAGATTGGACCTCCTTATGTTT^ 
TGTGTGTGTGGTTTTTGTTTTTGTTTTTGTT 

AGTATGTCAAACIACT AG CTT AAGTGCCTTTTGCT CATTT CAG GGTTGAC^ 

CTGTGTGTCCAAGAATATGTCAGACTTTCTGAGCTGACCTCACTCTTGGGGCAAAAT 

CCATGCCCTATGCCATTTGCCCACTTAAACTCTGCGAAAGGATTGGAA^ 

AG CCTGGATTTCACT CAT ACTT CATT ATGT AAAG GCATCTCAATG ACAG AT AAT CTCC^^ 

GGGTGGAAATGTCAGTTGAGACAGAAGAGCAGCAAACAGCTCTGTTTTCCTATAGTTGTGGT^ 

AAGATGAGAACAGGTAG CATGAACTCCAGTCATTATCTGAGTGAACATCTCTGC CAACGCTGAAGGAT ATGTGTGGGTGTCACCTG 
CAAGCACAGAGATGGGGAGGGAGGGAGGGAGGGAGAGAGGGAGAAAGGGAAGGAGGGAG 

AAGG AAGGG TTGAGAAACAGGAAG TT AACTG CAT AG AG CAT CT AG CAATTCTACAG CAACAACAGGTGAAAG CTAG CCTTTACAAA 
ACTTAACAGCACAGTTACTAAATATTAAAATCAGTTGTAT^ 

ATTCAGAACAC C CAAACCCACAGAGGACTGTG AGAGT CAG CGAGG CAG AAAAC CAAAACGAAAAAACAAAACCCTGAGT ACAGAAA 
GGGG ACTCCAG CAGGATTGAAGGG AT CAG AT ATAACTGAG GGAGGGAGAGGGTAGTGGATAAACATGTAAAAGC CACACT ACGT AT 
AAAGCATGAAAATG T CATGATAAACCTATTATGAATG T ATGT ACCGTGTATGTAAATAAAAAAGTGAAG ATGT AAAAGATTAACAT 
fiCCTCTTGAAAGAGAAACTATTAGCTTTTCCTTAAAAAG 
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AAAAGCTTAATGTCTTGATTAACTCAACAAATATAGATTTAG CAACTTTCGGTGGTCAGATCTCACTGTGAGAAGAGACAGGAAGG 

GCTGCTGGAACCCAGGGAATAGCCAGGATCTCAAAGCT 

AGGAGGTGATTAGGGGATAGCTGGTAAAGCCTGTGTTAGGGTGGTAGC^ 

GCT CCTT AAGATTCAGGAAGT AAGCAACTCAGGCCCCTTGCAG AG AGCTC CAGGATACAG^ CTT CATAAG CCAAG CTGGGATT CC C 
TTTTCAGAGATGCTGTTG ACACT AAAAGAC CCCTGAC CCTT AAG CCAGGGCCCTAAAACTTATTGGGAAGTT CAGGAAG CCAGTTT 
TAGGAACTGCACAAAAAAGGCAAATGATCCTAGCCCCGCAGATGTTACAAAGATA 

T ATTGAAAACTGGGG ACGTCATT CTC CTCTGAGTGAACC CCCTTCT CACTTCCTACCCTCAA CATGAGGTC 
TGCAGCCTGCACTCTCCGGCTCCCTGQTOTTTCTGG^ 

GGCATTAAATAAAACTTCTTT AG CTCTTT AAGCTCAG CTAAGTTCT CACCTTIT C1TT TTCCCTACAAACTCAACTCTTACATATT 

CATACACATGCTITTTGTAAGTAAATCCACAGGGTCTCAGGTTC^TCAAGTCGGAOT 

ATTTGGGGTAAATAGGTATTTGTTCATTTCTTCAGGGAAGATCAC^CACAAGA 

TGAG CTCAGAGCACGTGTCTGCACATGCATGTTCTGGAGTCCCAAA 

TGCTGTAAAATGACACCTTTGATACAAGGGAGAGA^ 

TCCAAA3AGAGCAGTGGTGGGGCTGGACACAAACATGAGGTGATTGGTGTGATCCAAGTGACT 
AGGAAGCCATGGTGTAGGAAGTCATGATGTAGGAGGTCACAATTTAGGAGGTCACAGTGTAG 
ATGTAGGTAATCACAGTGTAGGAGGTCGCAGTGTAGGAGGTCAAGGTACAGGGGGTCACTC 
CACAGTATAGGAGATCAAGTTGTAGATGGTCATGGTGTAGGTGATCGCAGGAACCCAAGTCTGGA^ 

GCTGTG CAAAGTT AAAGAAAGGATGGG ATGACTTCX3GGTTTGGG CCAG CAC CA3TGGAAGGATATTTTACACTCTGAAATGGAG AA 

AACGTGGAAGGACTGTACTGTCATAGGATGGCAGGTAGAAGGTGTTTGCTTTGGAGTCT 

TTGTGTTCTTAACTGACTATGAGATTCTGGAGTTGGAAGGAGTGAATG^ 

ATGTGGAAGJVTATTTTGACAAGTGACTCTACAAAGATGTAGG 

TTAGAAGGGCAGATCTTCAGACCCAGCTCTAGACTTACTGGGTTGACAAGG^ 

AGCCACAGCCCCACTGATTCCCTACAGAAGGAGGTTGTTCCAAGCACTGAGTCCTC 

GCTAAGAAGGAAGAGAAATCATGAAACCACACTAAGAAGATAGATTTCTGCT 

ACAAAACCTT AGATGTTAGAGAGCTTG ACAGCTGC CT CAAAG CAT CCT CT AACAGCGCAAAGGGCCGAAG ACGGATCTG ACAGTTG 

CGAGAAACTCGAGTGTGGAATTAATCTCCGTGAAACTCATGATAACTGTGGGAGCrrTTGTATT 

GTGGGAGGGGAAGGCAGCTTAACAATTGGCCAGACATOTGGA 

ATTCCAGATGG ACTAGTGG C CTGGATGTGATCAG GGATTGCTTG AAG G AAAATCT AAGATAAT ATCTTT ATTCCTGTGTG AT AGTA 

AAGAACTCATCAAGAAAATAGATGAATAGAATTTGAACCCAGTAAAATGT 

AGGGTTAAGGGTTTCTAGTGGCCCCAGAGGCAAAATTTGATATTTGGAATGTCAGAAATATGAA 

GGGAAGAAAAAATTCTAGAAATACTTCAATGGGAAATGCAAAGGGAAGAGCATGG 

TACACAAATG CAAACAGCCCGG CCTTCCCTGCCATGCTGGCTGCAGGAGGGAAACCCCCCAGTTGTA 

MOUSE SEQUENCE - mRNA 

ACACACTCTGCCTTGCTCACAGAGGAGGGGCTGCAGCCCTGGCCCTCATC^GAACAATGACACTCAGGCTGCTGTTCTTG 
AACTTCTTCTCAGTTCAAGTAACAGAAAACAAGA 

CAGCTGCAGGTATTCCTACAACCTTCTCGCAAAGGAATTCCGGGCATCCCTGTACAAGGGCGTGAACAGCGACGTG 

TCGGGAATGGGAATTTTACCTATCAGCCCCAGTTTCGCTCGAATGCCGAGTTCAACT 

ACGTTCCGTCTCTGGJUITCTGCACGTCAATCACACIAGATATTTACTTCTGCAA 

CAACG AG AGG AG CAATGG AACTATTATT CACAT AAAAGAG AAACATCTT TG T CAT AC T CAGTCAT CTC CTAAG CTG TTTTGG GCAC 
TGGTCGTGGTTGCTGGAGTCCTGTTTTGTTATGGCn'TGCTAGTGACAGTGG CTCTTT 
AGACTCCTTCAAGTGACTACCATGAACATGACTCCCCG 

AGACTTTGCAGCX3TACCGCCCCTGACAGGGACCCCTATCCAGAAGCCCGCCGGCTGGTACCCGTCTACCT 
CTGGATAGGAAAGGACAGCCTCATCTTCAGCCGGCCACrrTTGGACCTCTACT 
ATCTAAC^TCATGATCATCTTGAGACTCTGGAATGAATGACAGAAGCTTCTATGGCAGGA^ 
TCAAGCTTAATACATTTATTGACTTGATTGGGGAAGTTAGAGTAGAGCIAATCAA 

TTGCAGG CTCCTGGATGAGCCCTGCCCCGTTTTCACTTG CCAGCACATTTCAGTCATGTGGTGTGATAGCCAAAGATGTTTTGGAC 
AGAGAAGAAAGGATAGAAAAACCTTCTCTTTGGCTAAGTTCGTGTTTGGGG 
AAAAATAATGAAAACACTTTTTTCACTCATGAAATGAGCCACTT 
ACATATTTTTTTAATGATTTCTGACCATTTTTAATCACATTGACTCATGGAATGGCCT 

ATTCCCGGTCATGG G AACT CAGT ATTATTAAT AGTCACAACATG ATTT CAG AACT AGAT AG CCCT CCCACAC CAAGAAGAATGTGA 
GAGGAAGTAAGGTCACTTTATGTAAAAACG 

MOUSE SEQUENCE - CODING 

ATGACACTGAGGCTGCTGTTCTTGGCT 

GCTTCTGGTAGATAGCAACGAGGTCMCCT 

AGGGCGTGAACAGCGACGTGGAAGTCTGTGTCGGGAATGGGAATTTTACCITATCAGCCCCAGTTTCGCTCGAA 
TGCGACGGGGATTTCGACAACGAAACAGTGACGTTCCGTCTCTGGAATCTG(2ACGTCAATC^ 
TGAGTTCATGTACCCTCCX3CCTTACCTAGACAACGAGAGGAGCAATGGAACTATTATTCACATAA 
CTCAGTCATCTCCTAAGCrGTTTTGGGCACTGGTCGTGGTTGCT 

TGTGTTATCTGGACAAATAGTAGAAGGAACAGACTCCTTCAAGTGACTACCATGAACATGACTCCC 
AAAGCCTTACCIAGCCCTACGCCCCTGCCAGAGACTTTGCAGCGTACCGCCCCTGA 

HUMAN SEQUENCE - GENOMIC 

TCTCCATGTTGGTCAGGCTGGTCTAGAACTCCCGACCTCAGGTGATCTGCCTGCCTCAGCCT 
TGAG CCACT0TGCCCG6CCACATACAG6TTTCTTACAT6 

CCAAATAGTGAGCATTGTACCCCATAGGTAACTTTTCAACCCTCACCTGCCTCCCACCT 

TTATTTCACTCTGTATGTCCACGTGTACATTGTTTAGCTTCCrCTTTTAAGTGAGAACATGTC 

TATTTCACTAAGGATAATGACCTCCAGTTCCATCCATGTTGCTGCAAATGATTTCA^ 

TATGTGTGTATGCATTGCTGCAAATGATTTCATTATTTTTTTG 

TGTGTCTGTGTATACATTATACACCTTTTGTACGTGTGTGTGTCT 

AGCT ATAGC C CATATTCT TT ATTCAATCCTCTGTTGATGGACT CTTGTGTTG ATTC CAT ATCITTGC CIACTGTGAATAGTG CTGTG 
CTAAACATACGAGGGCAGGTATCTTTTTGACGTAATGA 

TG CT AGTTCT ATT TTTAGTT CTTTG AG AAAT CT C CATACT ATTTTCCAT AAAGGCTGT ACTAATT TACATTC CCAC CAGTG GTATA 
CA^GCATTCTCTTTTCTCCTCATGATTATTGGCTGTTTTTAGACTGTTTATT 
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ACTGTGGTTTTAATTTGCATTTCTCTGA^ 
TGAAAAATGTCTGTTTAACAAGCAAATGCTGAGGGATTTT^ 
ATATGGAAAGGAAAAACTGGTACCy^CCACTGCAAAAACAC^ 
CAGAATAACCA6CTAGCATCATGATGACAGGATXIAAATTCACACATAA 
5 TTAAAAGACACAGACTGGAAAACTGGATAAAG AGTCAAGATC CATCGGTGTG CTGTATT CJ^GG AG ACCCAT CTCACATGCAAAG AC 
ACAT AGG CT CAAAATAAAGGGATGGAGGAGT ATTTACCAAG CAAATGGAAAG CAAAAAAAAGCAGGAATTGCAATC CTAGT CTGTG 
AT AAAACAGACTTTAAAC CAACAAAG ATCAAAAAAGACAAATAAGGG CATTACATAATGGTAAAGTGATCX1ATGCAACAAGAAGAG 
CTAACTATCCTAAATATATGTGCCCTGAAAACAGGAGCACC CAG ATTCAT AAAACAAG CT CTTAGAGAC CTACAAAGAGACTTATA 
TTCC CACACAGTAAT AGTGGGAGACTTTAACACCCCACTGT CAATATTAGACAG AT CAACAAGACAGAAAATTAACAAGGATATTC 
1 0 AGGAGTTCAACTCAGCTCTGAACCAAGTGGACXrrAATAGAC^ 

TTAG CACCG CATAG CACTTATTCT AAAAT CG ACCACAT AAT TGG AAGTAAAACACTCCTTGGCAAATGAAAAAGG AC^ 

AACAAACAGTCTCTCAjGACCACAGTGCAATCAAATTAGAACT 

AACAACCTGCTCCTGAATGACTACTGGGTGATAACX3AA 

ACAACG TACCAGAATCTC CGGGACACAGTTAAGCAG AG TTAAGAGGGAAGTTT AT AG CACT AAATACC CATATCAG AAAG CAGGAA 
1 5 AGATCTG AAATTGACACC CT AACAT CAAAAGTAAAGT AAAT AGAGAAGCAAGAGCAAACAAAT TCAAAAG CTAG CAGAAGACAATA 
AATAACTAAGATCAGAGCAGAACTGAAGGAGATAGAGACATGAAA^ C CAGGAG CTGGTGTTTTG 

AAAAGATTAACAAAATAGATAGACCACTAGCCAGACTAATAAAGAAGAAAAGAGAGA 
GAAGGGGATATCACCACTGATCCCAOVGAAATACAAACrACCATCAG 

CTCACG C CTGTAATCTCAGCACTTTAGGAGG CCAAGGCGGGCGGAT CATGAGGTCAGGAG GTTGAGACCATCCTGG CTAACATGGT 
2 0 GAAACTCCATCTCTATTAAAAACACAAAAAATTAGCCG 

AGGAGAATGGTGTGAACCTGGGAGGCGGAGTTTGCAGTGAGCCX^GATT 

T CCGTCT AAAAAAGAAAAAAAAAACCAAAACAC CT CTATG CAAATAAACT AG AAAATCTAGAAG AAATGGATAAATT C CTGG ACAC 
ATACACC CT CC CAAGACT AAAC CATGGAGAAGTCAAATCTCTGAATAG AC CAAT AACAAGTT CTGAAATGGAGG CAGT AATTAATA 
CCCTATCAACCAGAAAAAGCCCAGGACCAGACAGAGTCACAGCT 

2 5 CTGAAACTATTCCAAAAATTAGAAAAAGAGGGATTCCT 

CAG AGACACAACAAAAAAAGAAAATTT CAGG CGAAT AT C CT T G ATGAACATTG ATGC CAAAATC CTCAATAAAAT ACTGGCAAACC 
GAATCCAGCACCACATCAGAAAGCTTATCCACCATGATCAAGTCGACTTCATCCT^ 
TCAATAAATGTAATCCATCACATAAACAGAACCAATGACAAAAACCAC^^ 
ATTCAACACCGCTTCATGCTAAAAACTCTCAGTAAACTAGGTGTTGATGAA^ 

3 0 AC CCATAG C CAAT AT CAT AC TG AATGGG CAAAAGCTGG AAG CATT C C CTTTAAAAACCAGCACAAG AC AAGGATGC CCTCTTTCAC 

ACTCCTATTCAAC^TAGTACTGGAACTTCTGGCCAGGGCAATCAGGCAA 

AAGTCAAATTG TCTTTGTTTG CIAGTTG ACATG ACTGT AT ATTTAG AAAAC CT AT CTGTATTTTTAG AATGCTGTAT AT TT AG ACTG 
TAT ATTTAGAAAACCAT CTCATG C CAGTTAG AATGG CGATT ATTG AAAAGTCAG^ AAACAACAG ATGCTG GCG^ GCTGTGG AGAA 
AT AGG AACTCTTTT ACACCATTGGTGGGAATGTAAATT AGTT CAACCAT TGT AGAAGACAGTGTGTCAATT CCTCAAGGAT C TAG A 

3 5 AC CAGAAATAC CATTTGACCCAGCAATCCCATTACTGGG TAT ATAC C CAAACGATT ATAAAT CATTCT ACT AT AAAG ACATG CACA 

CGT ATGTTTATTGG AGCACTATTTACAACAG CAAAGACTTGG AACCAAC C CAAATG CC CATCAATGAT AG ACTGGAT AAACAAAAT 
ATGGCACATATATGGGCTGGGTGTAGTGGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCCAA^ 

TGTAATCCCAGCTACCTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCA 

4 0 TGTACTCTAGCCTGGGCAACAAGAGCGAAACTCTGTCTCAAAAAA 

GGCACAT ATATACT ATGG AATACTATG CAGC CAT AAAAAG AATG AGTT CATATC CAT CGTCTCAG C CCAAAAACTC CTT CTG AT AA 
GC^CTTCAGCAAAGTCTCAGGATACAAAATCAATAGGCAAAAATCACAAGCAT^ 
O^TCATGAGTGAACTCCCATTCACAATTGCTAO\AAGAAAATA^ 
CTTCAAGGAGAACTACAAACCACTCCTGVAGGAAATA^ 

4 5 GAATCAGT AC CATGAG ATTGG C CATATGGCCCAAAGT AACTT AT AG AT TCAAT ACTATTAC CAT CAAG CTACCATTGACTTT CTTT 

GCAGAATTAGAAAAAACTACTTTAAATTTCATATGGAACCAAAAAAGAGCCT 
AGCTGGAGGCATCCTGCTACCTGACTTCAAACTGTACTATAAGGCTACAGTAACCAAAACAGCATG 
ATATAGACCAATGGAACGAAACAGAGACCTCAGAAATAACACCACACGTCT^ 
AAGCAATGGGGAAAGGATTCCCTATTTAATAAATGGTGCTGGGAAAACTGGCTAGCCATATGCAC^ 

5 0 C CTTATAG CTT AT ACAAAAATT AACTT AAGATGG ATT AAAGATTTAAATGTAAAACCTA7VAAC CAT AAAAAC CCTAGAAGCAAAC C 

AAGGCAATACCATT CAGG ACAT AGG CATGGG CAAAGACTTCATGACTAAAACAC CAAAAG CAAT CACAACAAAAG C CAAAATTG AC 

AAATGAGATCTAATTAAAACTAAAGAGCTTCTGCACAGCAAAAGAAACTACCATGAGTGAATAAGCAAACTA 

TTT TTG CAAT CTAT CCAT CTG ACAAACGT CTAAT ATC CAGAAT CTGT AAGG AACTTAAACAAATT T ACAAGAAAAAAACAACCC CA 

TAAAAAGTGGGCAAAGGATACGAACAGACACTTCTCAAA^ CTG CAGC CAACAAACATGAAAAAAAGCTCATCAT CA 

5 5 CTG ATCATT AGAG AAATG CAAATCAAAACCACAATG AGAT AC CGT CT CAC GCTCGTT AGAATGGCAATT ATTAAAAAG T CAGGAAA 

CAACGGATGCTGGCGAGGCTGTGGAGAAATAGGAACTCTTTTACGCTGTTG 

CAGTGTGT CGATTC CTCAAGG ATCTAGAACCAGAAATAC CATTTGAC CCAG CAAT CCCATTACTGG GTAT ATAC CCAAAGGATT AT 
AAATCATTCTACTG TAAAG ACATGCA(^CCT ATGCTTATTGCAGCACTAT TT ACAACAG CAAAAACTTAG AAC CAACCCAAATG C C 
CATCAATGATAGACTAGATAAAGAAAATATGGCIACGTATATACTATGGAAC^ 

6 0 TTTGCAGGGACATGGATGAAGCTGGAAGCCATCATTCT 

CATAAGTGGGAGTTGAACAATGAGAACACATGGACACAGGGAGGGTAAC^^ 

GGGAGGGAGAG CATTAGGACAAAT AC CTAAT ACAT ATGGGGCTTAAAACCT AGATGATG GGTTAATAGGTG CAG CAAAC CAC CATG 
GCACACGTATACCTTTGTAACAAACCTGCACGTTCTGCACATGTATCCCAGAACTTGAAGTAAATTTAAAAAA^ 
AAAATGTCTGTTCATCTCCTCTGCCTACTTTTTAATGGGATTGTTTTTTTCTTGI^ 
65 TATTAGCCTTTTGTCAGATGTTTAGTTTGCAAATATTTTTCCCATTCTATAGTTATCTACTTAC^ 
GTGC^GAAGCTTTTTAGTTTATTTAAGTCCCATGTGTCTATTTTGTTGCCTTTGCTTCTGAGG 
CGGCAGATGTAAAAAAGAGTTTTTTCTTC-GTTTTCTTCTGGGATTTTTATAG 

T ATGTT AATTTTTGTAT ATGGTGAGAGGT ATGGG C C CCGTTT CAT TCTTCTGCATAT AG CTAG CCAATTTT C C CAG CACCATT TAT 
TGAATAGGGAGTCCTCTCCCCAGCATATATTTTTGTTAACTTTG^ 
70 TTCTCTATTCTGTTCCATTGTTCTATGTATCTATTTT^ 

G AAGTT AGGTAATGTGATACT TCAAAG CTTTCTTCTTTTTGCTTAGG ATT G CTTTG ACTATTCAGCT CTTTTTG G ATTC CACACGA 
TTTTTAGGATTGCTTTTTCAAATTCTGTGAAAAATGACATTCTTTTC 
ATCTGTAAAATGGAGATAATAATAATCTTCCTCACAGGTTGT^ 
CTATTATTTTTACTGAAAATGTTGTAGGGTCTC^GTATCCTGTGTATTGCCG 

7 5 ACCTCTTGGTATATGTGAATGCTATTGGATTTCAAAAGCAA^ 
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CATGTTAAAACATAGGAAGGCAAATCTCAA 

T AAGG CCAATAATATTT AATTATTGTTTTGTGAATTCGTAT AAATGAG CTGACCT CT 
AAGGGAATTAGGCCAGATGAGTtSAACTGTTC 

CTATGTAGTCATCTTTAATATGCTTGAAGACAGTTATTAACTCACT 

TCTTTGTTTCTTCTGTGTATATTCTCAGTATGTATAATGAGTCTCAGAGCCTCTT 

GAATATGGATGACCTTTTAGGAAAGGTCACCTTGCCATCTGTAAATTTT 

TTCCCCTAACTGGATAGAGGATCAGAGCCAGGAAGACTGTGTCAAATTCTAGAAG 

TGTCACTTACCTATAGTTCAGAAGTGTGAGTGAATCATT^ 

GGGTTCACAAAGACTGTGTTTGTGGGAAAATTGAAGTGTGAACA 

TATCAGATCAGGAGGGAGGGACATGAAGCCTGCCTCCTCTTACAGCT 

GAAGAAAAAAAGGTAACAACACTAGGGGTTTGTTAAACAATTTATGATATAA 

AATGTTTATAAAGATACTTAAATGACATGGGAAAATGTTCAAGATATAACAAATAATAGTACATAT 

AACATGATATTAAAC^TGTAAAAATCTTAACAGACCAACAGGTGGAAGGCAGTGGAC 

GGTAAGAATGTGGATGAATCTTGTTTTAATGCTTATAATTTTTCTAAACATTCT 

TAAAAAGCAATACAAGT TAT CTTTTAAAACAACCCACATACACAAAACCCCCAACT AGAGTAATTG CACAAAGCTTAG ATTAT CAA 
AAAGAAGAAAAAAATTACTCGAAAAGTTTTGAAGAAGTCA 

GCCGTCCTGGTGGAATAACCCTCTCTCCAAAGGGCCTGGGAGTTGAAGAAGGGT^ 

CCTGCCACGGAGCCTCOTTTGTGCCCTATTACTTAACC^ 

TACTGTCCCACTAAGAACATAGAATGTTGTTTTGACTTTCCCTT^ 

TGCACGTAGAATGGGTTTTGCAAAGTTCCTAGAAGTGAGTTGGAGGAGGCTTC 

GAGGGCTACCTTATGTCCTACACAAATGTTACATTTCTAATATTTGTAACTCC 

CAGT AGAATT CTT ACCAT TAT CTTTTTG ACACAAC C TGT CCC CATC CTATGAAAATCT CTCCTCTGCT AG AGACTTTATT CTTT CT 
TTCTCTCTCTCCCCCTTCCTTCCTTCCTTCCCTCCCTCCCTC 

T TTCTTT C1TTCTCT CTTTCiTTCTTTCT CTTT CT TTCTTTCriTTCTTT CTTT CrTTCTTTCT CTTTCT CT CTCTTT C TTTCTTC 

CTTTCTTTTTTCTCTCTCaxrETCCTTCg^ 

TTCTTTTCTCATTCTGTTGCCCTGGCTGGAGTCCAGT 

CTCCTGCCTTAGCCTCCCTAGTAGCTGGGATTAC^GGTACCCACCATGATGCCT 

GTTTCACCATGTTGGCCAGGCTCGTCTTGACCTCCTGGCCTCAAATGATCCACCCACTT^ 

GTOTGAGCCACTGCACCCGGCCTGTTCCTTCTTAAGAACACTTTGTCTCCCCTTTAATCTCTC 

CACAACT CT TGAT AT C CATCAAT AAAGAATAATTC C CAT AAGC CCATCATGT AGTGACCGACT ATTTTTCAGTG AC AAAAAAAAAG 
TCTTTAAAAATAGAAGTAAAAGTCTAAAGTCATCAAAACAACGTTATATCCT 

TGAGTGCCTTGATCATGTCCCCTAAGGGGATGGTC-GCGGTGGTGGTGGCCGTGGATGACGGAGACTCTCAGGCCT^ 

TCTTTCAGTTCCCCTCACACCTCGGGCTCCTCGGGGAGGAGGGGCT 

CTCTTGGCTCTCAACTTATTCCCTTCAATTCAAGT^ 

TT C CAATTGG CT TAGT TTATTTTAAATTTCT AACAATGTGTGAAATTTG AACATTTG A^ TGT AGTTTTGCTGT AAT AGGGCAATG 
TGTTATTTTGAAAATCATTGATTCTCAGACTACAATATAGTATCTTAACAAAAGTC 

TTCT CTAG CT ATT AG TTG AT AGTGT C C CAAGAATTTATT ATTATT ATT ATTATT TT AAATTG AGATGGGG TTT CACT ATGTTGCCC 

AGG CTGGTCT CAAACTTCTGAGCTCAAGCAATCCACCTGC CTCTGCCTCCCAAAGTGCTAGGGT TACAGG CATGAGCT ACCATGCC 

CGGCCAATAATTrTTAATCTATGTATCGTTTCTTTCTTTCTTTCTT^ 

TCTGGAAGCTCTGAATTTAGGTTGCCTATGTTCCTAACACAGGTCCAACTCTCACTAGCT 

CTCTCCAAACCTCAGTTTCCTTAACTGTAAAGTGGGAATAATAATAAACCTCACCTCAAG 

CCATGCGAAGGGCCTAGCTCAGTCAGGCTTCAAGATAAATGTGAACTG 

tgaattctggactcaccaca^^ 

TAGATTAAAATTCTTTAAGTTC CCAAT GACG ATGATGAGG AGG ATAATACAT CT AGCACTTTGAG CAGCACACAGTT CT CGT AAC T 
AGAAGGTAGGTATTACAGTTAACTTCATTTCAGATGAAGAGACTGAGGGGTAGGCACATTAAGGGATTTC 

TGGCAGCT ATAjGGAGGCAGGGTCTG AACCACCTTAGT CTGACTCCAGTGCCGGTGTTGTTGATT AT TATGCTATTCCATCTAACAT 
GGTGTGATTTGTAATTTCATGGACTCAGGTAAATCCAGGGTTAAATTAACTTCG 

ATTTGTTACTTGGGTGGGCTGG AGTAAGATGCTCATTCAACGTGTTCTCAG AGTT CTTGGTTCAAAAACTG 
GACTGAT AGGG GT CAACAGTCATG CTT AAATAAG G ACAGTT ATTTTTC CCTGAAAGATACATTGAAAAG C CAG T AT CCTCAATTTT 
(CTCTCTTATTTTGGCAGTACAGAGACTGCACTATTTGCTGCTATTCTTAAACATTAA 
CAATCTGCAGAGGTACAGTAGTTGCATATATACCCGTTTATTTTATGGTCTGA 

AAACGTGTTG AT ATAATGAAAG ACATG AGTTG GCAATGAG ATCT GGT ACCAAGCGTTTACAGCTACCAAATAT ATT C T ACAAG AAT 
CTTTAACATTTATCTTAAAAAGGTCAAACCCTATCTTTAAGAGATCTCTCAAACTTA 
GACAGTCAAGGGAAGCCTACCCTGACTGAAAACTATTTAAAATTGGTAATTC-TACACTAAATACT 
ACCTATCTGTCATCAACTTATTCACTCTCCTTTTGAAGAGTTTTTGGTAAGACTCAAAAC^ 

TAAAAGGGGACTAGCT AGAT AAGTGGTTGGAGAGTT CACACACAT AGGG CCTTTGTTTT CTACCT TGACATATATTTACTTTT CAA 

ATGCCAATTC^CTTTTCTAGTAACITTTAGCATTTGATACTGAGTG 

CACTTTTACAGTCTTTTCCAGTAAGAAAACTTACGAGCAAGAAAAGAA 

GGCTCACGCCTGTAATCGTGGGACGCTGAGGCGCGTGGATCACTTGAGGTCAGGAGTTGGAGACCCGTCGGGCTAACATGGTGAAA 

TCCCATCTCTACTAAAAATACAAAAAAATTAGCIAGGGTGTGGTGGCA^ 

AGAATTGCTTAAACCCGGGACGCGGAGGTTGCTGTGAACTTAGATTGCACCCCTGCACT 

AAAAAAAATTATGCTGCATTCTAATCAAATAGTGCAAAGCTTCTTC^ 

TTATAGATAGGGAAATTCTCTGGTCAAAGTGTAACTGTCCTTCCTAGCTCT 

TCTCTGTGCGTGCATGTGTCTGTATGTATGTATATGAAG^ 

TATGAAGAATAGTTCTTTCTTCCATTAATTTAATAATGTAACCTATTC 

TAAGT C T AAC CTGATTC CTT ATGAGTTGGATGACAGTGTAGAG AGAT ATT T ATGG CTAAGTTAAAG AAGTGTGATTT AGG AAG AGT 

GAGGAAACCTTAACTTAGATAAGCCCTTTGCATGTTTTCTACTAGACTCGGAT 

GCTGGACTTTGGGATAGGAGTCCAGCCCAGAACTACAGAGCCTTA 

CACAAAATACTTGACATTGGTCACATCTTTCAGGCCATACTTGCTATTCTTTAGCA^ 

CAGCATATTACCACTTACTTTTCCATGAGATCATTCCTTTTCACTTCTCCAAAACT^ 

GGACAAAGCTAATTTCTCCAGCGCCTTCCCTTAGCGCTTATTTCT 

TAAAAAGTCATCAGCAAAGCAATTAAAAATTTTATTTCTACTCTAGTCTT 

AGAAGTAATTTGTTTCTCACAGTTAAGTGTTGCTCTGAGACTGGGCTTGG6AAACT 

CAGATTGCCGTATTTACATGACTTCCAATATGCTGTGGCCAGT^ 

^aaaTTTGATCGATATTTCAAAAGTATAGGCATTGCTTAAGGATTO 
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TTTTTAAGGCAACCATCCAGCAAGGAC 

TCATTGCCACAGATAGACTAGAAATAGAAATGAATCTGTGATTGTAACT 

GAGAGGATCT CGCTG C CTCCC CTTCTTT AAGACATAGACCCT CCTCCTTTTT AACGT CTTATGC CTT TTrTCAAGGGTGG AG AAAT 
GCCATATGATTAC CATTTGG CTTTTATCTTGTGTGG AT ATCTT CCTC CAAGTATTTCCTGCAAG CAGTGTG CTGACATTTACAAAA 
5 GGAAAATTTTTATGCCACCAACATCCCAGAAGAAAA 

AGGCATTGATGAGGATACGCAATTCATGTCAGACTTGGTTGTCT 
CCAGCAGGTCATGGCAAGGGATAGAATCTTCTTCTCTTTCTCA 
TATTTATGGAACTTCACAGAGGCTGGAACTTTTCTTCT 
TTGTGACTGACTGAATGGCTCTCTGGTTAAGCTGTGTTAAGATATCTG 
1 0 AATGACAGGGTGTT AGCACXZAAAAACC CAAAACCAAAAAAACCAAGACAATTCACTCAGAG CTTTGAG CCTGATAATTATAG CATG 

AGAATTGGTGCCTGAAGCTGGGGGGAAGGTAACCCMGCCCCACAGAGAGGTGGTT 
ATCTCATTTTGAGGACTCCAAACACTAGAAGAGAAGTGCACT 
AAAAGAGTTTAGACrTTCTCACTATTTCAATGTTAATTTCAAAGGC^ 

TT C TGTTTT CATTG CTGGAAGAGAAC CTG CAAGGTGGTG GAGAAGTTCTT ACTTAACTTTGAG CTG CTCC CTT AT CT CTG CATT AC 
1 5 CATTACTGTCATCTTTTTCTTTGTCATCATCTrCATTATCT 

GT ATCATT CAGAATTACTCACT CATAGTT CAC TTTGG CCTCTGAT CT AATTGTGAG C CCCTTTCTGCAATATCTGAATCACCT CAT 
TGGCTCACAATCCACCATGTAATCTACTTGTCTTT^ 

CTTCAGCCTTTATTGTGCTCTTGG AAG AGGAGTTCTG ATGG AAGTG CT CTCATATTT 

TAATCTCCATCAGAGTTTCAGAGAATATAAAAAGAAATATAGGCCGGATATGGTGCCT 
2 0 ACCAAGGCAGGCAGATCACCTGAGGTCAGAAGTTTGAGACCAGTCTGGCCA^ ■ 

AATTAGCCAGG CGTGGTGGT ACACAC CTGTAGT CCCAG CT ACTGGGG AGG CTGAGGTAGG AGAATTG CTTGAACCTAGGAGG CAG A 
GGTTGCAGTGAGCAGAGATTGTGTCAATGCACTCCAGCCTGGGTGACAGAGCGAGATTCCGTCT 

AT AT AAAAT CAGTACACAAGTATTGTGT AGT ACAAGT AATTAGGTAGTT AACTGAGTT OT CTGCTTTTACAGCCC CTATGATTT 
AGTTGAGTAGGTTAT ATAT AAT AGGAGG ATGT AACAT C TGGAAACAG AT AAAT AACACTTGG CAGTCACT AC CTCAC CATGCTAT T 

2 5 TGATG ACT AAAGG ACT TTGGTTTG C CT ACATT ATTTT T ATAAT CTGAAATGT C CTCT ACACCTCT CACCTGAAGAAG CCTT CTCTG 

CCACTAGACTATGAACTCCTTGAGGGCAGAATCATGTCTTATTCATCCT 

AGTT ACTCAATAAAT ATTTGTTG AAG AAAGAAATGAATGAAT TAG TAACAAAATGAGGAAGCATAAAAT AAAT ACT CGTAAG ATTA 
ATGT AGAAAGGT AAG CCTTT ATGGAG AGTTGG GAATT AG ACTGGACAGT AGACAAGATTTGAATGAACGTTGTTATTAAT AATAAT 

3 0 TGCTAGCTAACATTTATTTTGCACTTATAATGTGCCAGGTAGTGTTCTGCCTGGCTTTCCOT 

TC CT ATG AGGTAGTTACT ATG ATTG CT ATGATT AT ACT ATTTC ATAG ATGGGG 
TTATGGTCACAAAGCTACTAAGTATGTAAATCCAGATGGTCTTGCTCTGGGGTTCATGTTCT 
AGGAAAAGGTGGATT-AATAAAGAAATTAAACCAGGAATACTGTTTCATGTTCTCAGAACAGATGG 
ATGGTGAAGCAGCGGAGAATCCAGTTAAGCAAAGGATAGAATACTGGTAACAGAAATTCAA^ 

3 5 TAATGGCCTGGGTAAGAAGAAAATATTTCAGAAT CCTTAT CT TTGG AATG AAG CTGGAGTTATGGGAATGGATGGCTAGT CGAAAG 

GAGAGGGCATAACTGTAGAAAAACAGAGAGTGGAGCTGAATCTGGAAACTAACTGGCTCTGATGG^ 

TAG CTAG TGAAGG AGACAAAGAAAGT GGTG AG AG AG GCAGGGGGACACACTG AGG CTAGTGGAATC CCAATGAGGG AA 

AGAGTTTCAAAAAGATAGTGGTTAATAGCATTAAACATAAGAGGGAAAGGAGTGAATAATGGCT 

CTGGGACTTTTGAGTTTAGGCAGAGACATGGAGAAGGAAGATAAGACTTCACAGGATTAAGA 

4 0 GACAGTGAAAAGGAG AGGAAAT AGATAAGGT AAAG ACAATT AGG AACTT AT AT CAC CT AAAATGT CT ACCAT GGTATTTTGTG CAT 

GTAATAATAAATGTTTGCAAATGGCTGAGTGACTTAAGTATGTTTGACATGTC 

AG AAGGG T ATT ATT AG GAAAGAAAGGAGG AGG AGGG AG AGG AAAAAAAG AGGGTGGGG AT AGTTTT CT CAAG GAGATAGGG AGGGA 
G ATTTGTTG CATAGGAGATGAGTGTATT AGAG AG AGG AGGACAAAT CTCT AAG AAG ATGG GGAGGAAGAGAAGAGACTTATAGGGG 

4 5 TAGAGATAAGTTAGAATGAAGCTATGGTAGATGAGTGATTCTATACTGGTGAACCTGAGGTTAAAAACAAACAAATCTCT 

AGGTCATGATGTTAAACCTATATCAACAAGGGCTACTGGGTAATACATGCCTTCTCTCTGAAG 
TTCTTGCTTC^GTCCTCAACAATGCACACTATGGCTAGGCTTGACCTCAGCTCAC^ 
CAACCCTGCCCAACATTATGGGCGGGACTCCAGTTTCCAGTCAGGGATGTT 
ATATGGTTGCTTGACCTTGGCTTG^GGAGGCCAGTCCGCTATTCT 

5 0 ATTTCTCAAGGTGGGGGATGTGATTAGGAGTATTAGTCTATCATACTGTGA(^ 

ACATGAAGAGTTTGGGCTGGAGGATATCTCAGGTCTACTTTCTGGCTTA 

TTTTCTCGGGTCTAACAAGGCAGCTCAACTACCCCATCCCCTTGTGGATGCTCTTATGACCTGA 

AAACCAGCCATCTCTGAAATCCCTTGCAGGGATTGCTAACCAGGATCCTAAATGCTGCTAATG 

TCX3AGGGGGCAATGCTGCCTTTGCTATCTCCTGTTTT 

5 5 ATGACATTTACCTGGAGGTT CTGTATATGACAAGG TGAT CCTG AG CAG ATT CT TCAGAGGT ATGTG ATTT AGG CAG AATGG AACAG 

AOVGGAAAAATGTACTGTTGTCTTCATCACAGAGTTC 

CTCAAATTTGGTTACTAGATTAAAAC^TATATTTAGAAAATAATAAATTAATAGAGGAGGTCT 

TGATCCGTTAACTGAATTAC!TATAATAATATATATGTACAGAAAGCTATAATTCCCTGGTTACCTCC 

CTTTTAGAATAGTGAGATTTGATTTTGGGGAGTTGTCTGGTGGCTCTTGGTATGGCT 

6 0 GCTATGTTGTCCCCTTAGATTGTCTTTCTAACATTAGGAGATCAGAGGAAAATTGCAGACCT 

TCACACTGCATCTGAGTTTATGTTGCTTTGGGTTAGGAAGTTTTGGTCCCAGAGCACT 

TG AGTGG T AGGG ATGGGTT ACAGCACAGGTGATG GTTTTT AG CAATTTT AATATT AACTGT CAT CCTAATACATATG AACTTCCTA 

ATGTACATTTCTATCCCCAACTGGCTTCCTCGAACTCTAGTCCTATACTTCT 

ATTTCCTCTTAACTTACCTCATTTTGTCAATGGTGCCTCCACT^ 

6 5 CTGTTGACCATCAGTCIGTAAAACTGAGCACCAATGC 

TGAACTCACATCAGTGCTATTGCATCAGCCTCCTGATTGGTCTCTGCCTCAGCT 

TTGCTTGCTGCACTCCTCAAGG CTGTGCTAACCTTCCCAAAGCAG CATGTACCACIAACATTCTCAACTCAGACATCTrCAGCATCT 
CCCATGGCCTTTCTGGTAAAATCJTATGTGCCTTTTCTTGGCTCCCCATGACTGCT 
TAAATGGAATACTCTGCCAGTTCCCAATAAACTC!TCACACTAAGGTCA 
70 TTATGGCCCAGCTAAATGTGAAGTCTCTACAAAAGCCT 

TGACCCTTTAAAGGGTTCATGTACCACGGATGGCTTCGGAGAGTAGCTGATTGTGTACC^ 

CAG CT C TG TCTTTT ACATCTTCC CAT CT CCTAAGAATC AAGTACATTAAGT CC CTTATTT ATT TGATG AATAAATG AATG AATG AC 

GATAACTTTT'CCCTCTC^AAATGAATGC 

7 5 T ACTTTCTG ACAG TTACATGT AG CACAATAATTTTTTT AAG AG GACAAGAACAAGTGT CCT CT ACATT ATAGTGTCTGC CTGTGTA 
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GGAACACAAATAAAGGTTAAAGGTGGGGCTTAGATTAGTGCTGAC^ 

GGGGTCTCACTTTGTTGCTCAGGCTGGAGTGCAGTGGTACAATCACCACTC^ 

CCCACCTCAGCCTCCCAAGGAGCTGGGACTACMGCACATGCCACCAC^ 

5 GCTAGGAGTACAGGTGTAATCCACTGTGCCAGGCCTTTAGTAAATTTATTAATC^ 

AATCATTTGTCCTTGTTCAAACTAAAACCTCTTTAGATTTTGT^ 

ATAACTGATAGTTGTGATATACACAATCTTG 

GGTGGCCTCCACTACACAACTCTCITGCCTTCTGGTTC^ 

ATTCTGCTGCTGCAAGTCACAGGCTCTGGCACrrACCC 
1 0 AACCTTCCTACTCTTGCCCAATTTTGTGTGTGCCATCTGTTTTCATTTG 

AT CCACGGGTTATG CATCTTTGGATTCAATGAACCATGGATCAAAAACAGT 

CAACAATTGCATGACTTGAGCTTCCATGGATTTTO 

TATATGCATGTTTATCAAGCTTATTCAAAGCCC 

GAAGACTTTCAAGATAACAGTCAACAGAAAACTAGATGAAATGACCA 
1 5 ATTATTACCATTTTTT CATTTTAACATGAATTTCTGGGAG AGAGAG C CATTCAAAGCATGGTAGGTCATGAAT AGCTTT 

GTTTTTAT CTT CATTG CAG CTCAGAGAGGATGGTGAAAGGTG CT ACAGTCTGGTGAATGTTTTGT ACTTCT CTTTCT 

AGGAAGCCAGGCCACCAGCTAAGGTTATCAGAGCCATAGACTGTTGAGTC 

TTTCCTGAGCAACCACCXXXSTTGTCTTGTGGG^ 

TTGAAGGGCATCCTGGGCCTGAACAGGAAATATACTCAGAGGAAAGTTACAGTAACT 
2 0 CT CTTCTATG CCAGGGAAAT CAG CAT CACCTTCACAGTGTG C TCACAAATATCACAGGG CAAAATGG CTGAAAGTAGAAG C TT TCA 

TTGTTTTTGGTTCCTACCTAACATGTGAAATTCTGTTCAGTCCGAATTTTTAAA 
ATTTTTCAACTGCAAAAGAAAGTCAGACTATGATGTGC^^ 
TTTTGCTAATTAATCATGTATTCCAGCTOTCTAACTG 

GAAAAGGAAAAACTATGACAAACAAATGTAATT^CAAAAGCCCTGAAGTTAAGGA 

2 5 ATTTCATGTGTTGACTCATGTCTATGTTCAATCCCTTTTGT^ 

ACTTGTG CAT CAGAAATAATTTAT AAGATT ATTTG ATGTG AT CAT ACTGT CTGT AAG CCTTAT CTGGG ATCCATCTATGACCTTCA 
GGAAAT CTTCAAAATTGTAAG CAAAATGTTTTGT AT ATGTG CATT AGAGG CAAAAAATTTGGAAATGTATTTT TTT CTAG ATATG A 
GCATG AAAT AAATTCAGTCTTG CTGAATGAACAAAGT TGTTT TTCTG T CCATTT ATG AT TT AAAACATTACTCCT AGATGTCT AAA 
TAAATAAATTACATCTTAGCTCTTCCATTATTTC 

3 0 GCCTCTTCTTTAAAATAAGGAATTGGCATAGATCATCTCCAAGTTCCCTTCCAGTTGA 

ATAAAATAGTGGGGAAAATGAAATGTATTTTGTTCTGGTCCTCTTCCCAT^ 

T AAC CCT CAC T CCAG ACAT CTGGT ACCAT C CT CC CT ATCAAGT CCAACTTTG AG AGG AT CTAGAACCAATCACTGCAATTGTG TTG 
AAATTTTGATGCAACATGGCAGAAGG AAATGAAC!AAT AG ATTAG AAG T CAT AAG TT ATG AG CTT AAATTTCAAGGATCACTT ATGT 
TTCCTGTAACCTTGGACAACTGATCTACCTCTTTGACCTCAGTTTCCCC^TCTGTAAGAT 

3 5 TT TACTTCACAGTTGAATATAG AGTTCAGATG AG ATAAG AG ACAG CATTTT ATAG C CCTGTG AATGT AAGC CAT AGAT ACT AAATT 

AAGCCAATGTGTTTCCCATATTGCATTTTGGGCTTCATTTTCTT^ 
AAGGTGAGGGTTCTTTGATGCTGGGAGCCACACCTTAGAGGTC 

atggtaattgacccgtttataataggtgtaaacggcaagagttgttt^ 
tcatggaagttcaagcaactttcaattttcctg^ 

4 0 tcagtaccttgctcacaagcagcagataagtaccttcrcacctccaatctcactcttc^ 

gagggaaagaggtttgaggttagtgcttcctgtctccacaccrrttagcrrggtttag ctgctaggaaagg ctttgaaaagcaaagca 
ctaaattcaccttcaaaacaacttgtcaattcagt^ 

gatgggttgacaatgt1tl t 1 t taacctagtaggtttgacctaggtagaccttacctgaacact 
tcacagggaaagggctgggtctcctgatacccaagtagcctaagaagaatgccttccactcct 

4 5 aaatcaaacacaagttaggacgagggagct acat atttgg t ct agatatg t c cttacatttg taatatgtcctt 

acattatttgaagtactgtggattttcagaaactcagagaccgaaaa 

cag ctactttg attt ct c catgtttttacttc at agag caaggggtgtg qa.tttttacacatgccccattttaaaagaaaataaga 

gaaatcagaattattgtgtttactcaaagtcacaatgatcac^ 

accaaaagtaaatttctctcatttactgtcttgc^ 

5 0 t ctgaatt attgac atgaaccaaggtgggac caaatgct atgagg atataaat attg agaaacgagattt tat cag ccaaccttg a 

actcagcttgaactctaaactaggaggctgttagctacatgttaacagat^ 
gagagccagctaatttaaaaatgatctatgttgacaaagaagagggaggaaagt^ 

caaatgg c cttatcgaaaac aaatag ttttcaagctcttttc cag caaaat ctt ag cacttttctgcat aaaag ccttcagt agat 
cttcatcaccttcagaagagagcccaacctctctaat 

5 5 ccagcaaaaagaaggtagctgtaattccccaagtcggccacactct 

acacgtcttagccaggctaacttttactragtaaaagttggtcaaatc catgtatgcatactcctt caggactcaggtcagccgtc 

atctcctgagatccttagctggtttccctgttgcccccggaagttttgtgc^gatcactgtgtcactatg 

tttatatctttcttccccatcagatcatgagctgccgtaggacx3ggaattgagtggcattcatt 

6 0 cc c atgt gcat ctt aggaacatgt cacacc c tgagtt agacagg acatat t cag caaac cactgg act ctag atgc cacag ctttt 

cttccttcctctctctcccttccctctctccacct 
tctccagccc^ccc^cactccctttttttgtctccacaaagaagtcct 
aggcagatcaggaagaatttgaatgtgcattaaaacagaatccaaataattagcaatag 
ggagaatacaaatgcctaacctgcactcactct 

6 5 attagaactgacctaatgactccttctaagtaaaaaaaggaaacaaacaaaataaatcaaaaacaaaaaaa 

tattaagacaggtcgaactttttgattctttgcatgtttatttttcagtc 
agttaaagttgttgacacaggtttccaaaaaggatttaatgagatc 

t cacc^t ctttcact ccacaatt attttatgagagtctattagtc^ t atgt aaagaag cataacatgtggt c cttt ct 
gtcaagagacttggaaaccaggggcctaagggaaatgc^catgaat^ 

7 0 t t aaaag gtag aactcaaatgtcaag tgag cg atacag atagtaat cattttg ag acctgagaag ggagtggtg ctctgtgc caag 

GAGTTAGTAGACAAGGCATGGAGGTAGGAATGCTCACAGTCTGTTCTTGGGATGTTG 
GTTTGAGGAAATGAGGTTGGAAAAACCTGTGGAGGCCGATTTGAGGATAAGAAGTTAG^ 
AATTTGACTGCCATCTAGAATTACTAGAGAAACTTAAAAAATATGAATGGATG 
TGGCCTTGGAAAACCAAAGGCTTTTAGATTTTTCCAGGTGATTCTAAATATGTTGT^ 
75 TCAACC CAATTGAAT ATT AGAAT CAAGTGGGGGAACTTTTAAGAT AAT ATGAAAGACAAT C CACAG AATG AG AGAAAACATTTACA 
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AATCACATATCTGATAAGAAACTTGTATCAAGGCTGAGGCAGGCAGATCATCTC 
TCCy^GAAACCCCATCTCTACTAA^ 

GGCAGGAGGATTGCTTGATT CTGGGTGGCAGAGG CTG CAGTGAGC CAAGATCATG C CACTG CACTCCAG CCTGGGTGACAGAGTGA 
GACTCTGTAAAAATAAAAAAAAAAAAGTATAAAA^ 

AGTTAATAAATGGGCZAAATGATCTGAATAGAGAGTTCTCTAAAAAGAATAGACCAATGGCT 

CATCATTAGTTATTAGAGAAATAGCAAATCAAACCATGATGAGATATC^ 

AACAATAACAAGTGTCGGCAAGGGTGAGGATAAATTGGAGCCCTCACT 

GCAACAGTTTGACGATTCCTCAAAAAGTTAAACATAGAGTTACCATATGACCCA 

TTGAAAACATACGTCCACACAAAAATCrGTATGCAAATGCT 

ACAAAATGTAGTATAGCCATACAATGGAATATTATTTGGCCATGAAGAGTGAAGTTCT 

GAAACATGGATGAAACTTGAAAACATTATG CTAAGTGAAAGAAACG AGACACAAAAAG CCACAT ATTGT ATGATTCTACTT ATAAA 
AAAAGTCCAGAATAGG CAAATTCATAGGGC CAAAGT AGAT CAGTGGTTTCCAGGGATTTAAGGGAGGG AGGAAG CAGGGTGATGGT 
AAATGGGT ATGGT ATTT CTTTATGGAGTAATG AAAATGTTCTGGAAT CAGAT AGTGGTG ATTCATT CATAACTTTGTGAAT AT ACT 
AAAAAATCTCTGAATTTTACACTTTAAAACAGCAAACTTTAT 

AAAAAAAACTTAGT AT CAATC CTTC CCCC CAG C CTAGACCAATTAAAT CAGAATTTCCGGAAGTGGGAGGACTTGGG CACAGAT AT 
TTTTTCATTTTAAAGTTCTTGATATAA7\GCCAGCAATGATT^ 

GGCTGGTCTTGAACTCCTGGGCTCAAGCATTCATCCCGCCTTGGCCTCCCAAAGTGCT 

CCAG CCAG CATTATTG ATT AT AT AGGAGACAAAAAG CTACT ATTAATTT AGTTCT ATG CCATTTCCTTCTTAC CAAAACCAAAACA 
AAACACCTGTGTATATGAATTCAAATTCAGTTAAATTGTAAATTTC^ 
GGTATACGGGATGTTGCTTCAAATAAACTTAAAAGAAATGCAAGTAGG 
AATGCTTTTGTTT'GCATTGAAATTTGAATCAAAGAC 

CTCACACCTGTAATCCCAGCACTATGGGAGGCCGAGGCGGTGGATCACCTGAGGTTGGGAGTTCGA^ 

AGAAAC CCCGTCT CT ACTAAAAAT AG AAAATTAGC CGGG CATGG TGGTGCATAC C TGTT AAT C C CAGCT ACTCGGGAGGCTGAGGC 

AGGAGAATCACCTGAACCCAGAGGGTGGAGGTTGCAGTC^CTGAGATCGCACCATTGCAOT 

CTCTGTCTCCAGAAAAAAAAAAAAAAGAAAAATTTTGGAAGGTGTCTTGGTTGCAAAG 

GGTCTATTTCTCAGGTGGTCATAGCCAACTTGGGTTTTAGTTCTGGCTTTGCTC^ 

TAATGTTTCTAGGCCTTATACAATCATTAGAAAAGAAAAAAAA 

GAAATGAGATCAATGTGTTCTCTTCACTTATGGAGTTGATAAACTAGTTTCAGGTTCTTT 

TCCAG ATTCTTTG ATCTTTTATTGGG ATAG CAGTAAAGGTAG ATGATGGACCAG AGAGAT AAC TC CCCT CTGGAGT CTTT CAC CTT 

TCATGATATAAGGGAATAGATCTATGAATTTCTTGGGGAGGCTATTCTCTGAATTTGGTGCAGAA 

GGGTTATATTTTTTOTATCTCTTCATCCCACTCTTGCAAT 

TACAAAAACCTGTATCACAACTTAAAAACAAATCTATTTGTAGGTTGGGCTTGGTGGCTCATGCTTGT 
GGCCAAGGTGGTCAGATCACTTGAGGCCAGGAGTTTGAGACCAGGCTAGCCAACATGGTGAAAC 
AAAAATTAGCTGGGCGTGGTGGTGCATACCTGTAATCCCAGCTAOTAAGAGACTGAGGCAGGAGAATTGTT^ 
GGAGGTTGCAGT6AGCTGAGATTGCACCACTO 

AAACAGATCTATTTTTGTGTCTGTGTCAGGCACTAAAGCCACCTCCAAATATATCACTCAGTC^ 

TAATATTAATGTCTCAGGAATATAGATTTAAAAATTGTGAGTGCCTAGCTGTGGTG 

TTCTGGGTGAAAACATGCTCCATTGTTCCATTCAAAAGAGGATGAAAATAATATTGAAATGTTAACT 

TATCCCCTGAAGAGGTCCTACATCTTCTGCCAAGAGAAAACCTTTGCTGGAGATCTGTTCA 

TCAGGAAGAAAAATTATCCTTATATTCTTGTTCTAAGCAAATGATTTTTTTTTCCCCC^ 

CCCATGCITGTAGCGTACGACAATGCGGTGAACCTTAGCTGCAAGTATTCCTACAATCTCTTCT 

TCACAAAGGACTGGATAGTG CTGTGGAAGTCTGTGTTGTATATGGGAATT ACT C CCAGCAGCFrCAGGTTTACTCAAAAACGGGGT 
TCAACTGTGATGGGAAATTGGGCAATGAATCAGTGACATTCTACCTCCAGAA^ 

AAAATTG AAGTTATGT ATC CTCCTC CTTAC CTAG ACAATGAGAAGAGCAATGGAAC CATTATCCATGT GAAAG GT AACATACAAC T 

TTACCAGTGTACCACCCTAAAGTAATGGTTTTCAAATGCAGTCCTGAAAAC^ 

AAGTGATTTGATACTAACAAAGACAAATAATGTTTTCAGAAAAATTTTT 

ATATCTTTATTTTCCTTTGCTGACATTGAGCGGGAGAGTAAGTG 

CAGTGGATCATGGCAGAAAAGAGGTTACCATTARAATATAAGCCCAGGTGC 

CAGGAAGACTCACTTCTATTTCTAATTAATTCITrrTTTC 

CCTTCCTTCCTTCTTTCCTTCCTTCOT 

TGTCGCCCAGGCTGGAGTGCAGTGGCGGGATCTCAGTT CACTG CAAGCT 

TCCCGATCAGCTGGGACTACAGGTGCCCGCCIACCACGCCTGGCTTATTTTT^ 

ATTTTTTTTTTGAGAGGGAGTCTTGCTCTGTCGCCCAGGCTGGAATGCGGTC 

JVGGTTCATGCCATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGTA^ 

TTAGTAGACAGGGTTTCACCTTGTTAGCCAGGATGGTCTCGATTTCCT 

GGGATTACAGGCGTGAGCCACCGCGCCCAGCCATTTTTT^ 

CAATCTCCTGACCTCGTGATCTGCCCACCTGGGCCTCCCAAAGTGCTGTGATTACAGGCATAAGCC^ 

TTCCATTTTCTAAAATATACAAAGAATGCTGGACTAGAAACCGGGGGACATAAAATTTGOT 

AAGTCACCTAACTTTTTCATAGTCAAAAACTCAGTACAACTGTTAAGCAGTAT^ 

TTGTTGAT GT TAT GTT C TG CCT AAATGTTAGGG AG AAAAATCATG AT TCC C C AAC T CAG AAG AAT ACAG TATTGGTAG CAACAAGT 
AAAGTTTG ATTTTTTG GT ATACTTTG TGGAT ATATCATAGCTTTT CATTTT TGTGG AATG AT AAT AAGAAACACAT ATGTTC AGTT 
TTG T ACTG AATC CT AGCAT AATG C CAATGAATGGTTTTT C TT CAATG CTGGAAC AG AG C CATG CTGATG AAAAA.T AG GAT ACT AAA 
TAAGGAAAGAATTGTTAATGTGGCAGATAAGCTTTTGTGTTCTGGCAAAATAGAGACAATTAA 
GTCCTATTTAGATTTCTAATATCTGTAATATCCAAACAGAATATTTTAATO 
GCTTGTAGCTAAGAGTGAAATATTTTTTCCTATATGAAAGGCATGCTACTTTAGGATAGTA 

T ACACATAT CATTTATGTTAG AACTGAGAAGG ACAC CAATGATCCTGTAC TTAGTAATT T TCAAT CCT AT CTGTATATT AT AAATC 
TGAGTAGGTTTTAAAAGAAATACCAATGCCTAGTTCCAGCCCTGAGATTCTGATGTAATTGATATGGGTTGAGGAAGGG^ 
ACATCAGTATATTTTCAAACTTTCTCAGATAATTTATTGTGCAGCTAGGATC 
TTCTAGTTCTAATTTTCTCCAATTTTGAATAGAATTCTAT^ 

TGTTTGT CATTAT AT AATCT ATG AGACAGGGTTGG AACT AGAAATTTAT CCT CAGTGTTCTG ACTGCCATATT 
AGGCTGATGATTTTCTTAAGGC!TTGAAAACATGCATAT^ 

TCTGATGGGAGAAAGGGATAAAGTTAGGTTAAGGCCATTGGAAGTCACCGTTT^ 
AATACAAGTCTACCAGTGTACCATCCTAACGTAATGGCTTTCAACTGTGGTCGTGAAAACTGA^ 
TTGGGTAAGTCTCAAAGAGGAAATCTATTCACTCTAAGCTGGTGATATGTTTAATATTTTTATTTm 
TTCACAAGGAAGGAAATGCACTCAATTGCTATTCCTCTATCAm 
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AQTCCCCTATTTCCCGGACCTTCTAAGCCCTTTTGGGTGCTGGTGG 
AGTGGCCITTATTATTTTCrcGGTAAGAGAAGCAGCACT^ 

GAATTTTGCCTATG TGGTTTATTTTCTGTTTAATATAG GATGATGATTTTCTTTTCCCCATGCTTGAGT AGGTT CTCTTTTAG CTT 
GTrCAACTCTATCTGAGATAAAAATCTGATATGAGTTAG^ 
5 AATGGG TAAATT AG GGTGGAGGGATTAGGAG CAGTTG CCTTT CTTCT C CCCATTTTG CAAT AT AATT ATGACACAT AAAACAli i"! r 

TAATG CGT ATATATGTATAAACACCT AGGCAACTTGGGAAGCTAGG CAACTGAAG CCAATGGACAATGTTGG CTTGATCAAAAAT C 
ATGAGAAACTTGGCTACTGAAATTGTGTATTCAAACCTAGCAAAGAAACTA^ 
CTGAAGCAGTTGAATTGTGTGAGGCCAAGTATGGTGCTGTCATTATGAAACT 
CTGAATTGAAGTACTAACTTTTGCAAATGTTTACGTAA^ 
10 CTGGGTTTGAACACATTTTGATGAACGCACCrACAG 

GTACATAGTATGGTACAtaAATGTATTGACTCCATGTTGTCTTT^ 

CAATGAGAAATGT ATTCT CAAACATAAAAGACTATT^ CTCTTG ATAT TCT CCATTTGAT 
ATTAAGTACATTTGGTGCTTAATGAATGCTTATTGGATTGATGAATTGATAACCT 

T C TGCCTG AAGTATGATACTCATCTTTAAGG CATAATCTGT ATATGT ACCTAATTCAT AGATGGGT CAGTGATTT CTG AAAGCCAT 
15 ATCATGTG AAGTCATTCTGTTTACTGT CAG CTTGGTGACTG GTACAG ACCTGGATCCACAG CCATGCCTTCCTATTATCCAGT 
CTTAGATTCTTTrAGACTCTTCTTGGCCTATTGCC^ 
CTAGGGCTCAGATGGACATTTGTCAGGCTGCCTGTTGCTTACr 
TGGAAATTCTTTATACTAGGTATCATCTTAAGAATTTGCCTGGCT^ 

TATTGT AAAATGTTGCCTGAATTCTG C CTCTGATGTTAGTTGACTACTTTCACCTTTCCAG ATCTTGGTATCTT CAT C CAACTTAC 
2 0 CTTCTAACTTTGATCCrCTGGTGTTTGCTGATAATTTTCGAGTTTT^ 
TGGCTTTTATGCCTACCATATGGCTTTTCAATCCTC^ 

TGGATACCCCTCCAGGAAAGGCATTCCCCTGTGATATGCTGACCTAGGTCAAAGTAATTTACTAGCT 

GAATTACTTTAACCTTCAATGGCCAGAGGACACTGCTCATTATTGAAAA 

ATGAGACATCAATTTGTTGAGTGACTTGAGTCACCCAAAGGGACAAAA 

2 5 TCAACTCAGACACATCATAGACACACTGGGTTTTT 

AAAATTTTATTTTTGTTTTACTTTGTCTTACAGACCCCCT 
GTTACCCAATAAAAACTACCACITATCTC 

CTATATTTCGTAGAATAACATAGTATTTTTACTCTGGGCTCATCTTTTCCTGCTGT^ 

TTGGGCATCTAGGC CATTATGGAT C CCTGGAGTTT AGCTGTT T C CTGGGTAATTTT CH^ AGATC CTATTTTTGG CTGGTCTGCAGC 
30 GTGCCTTGATGTAAGTTGCTGTTTTCCGGAACTTCCTGAGGGCCTGATATGCTACTGTAGGCAGTTCTGGGTT 
GGCTGAGCGGGTTTCTGGGAGCAGCTGCAGATTCTATCTCAGTTCTTGACOT 
TGGTCAAGCTGTCACAGGCC^CACCTGTCACTGTCTTTCAATT 

AAAGAAATCTTTTTAAAAAACATTATGTTCC CCACTGCAG ATCATAATGTGAGGAATGTAG ATAGTGTGTGTACAGGTAGTGT CTG 
CTTTTTATAATAATCCAGTAGATTCTTGAAGACCAC^ 

3 5 AGGAAACAAGTGGGTAAGGCTGGGATGGGGAGAAGGGCAAAGGAGGAATGGTC 

TTGAAGGGGTGGAGATGACACAGACACATAGG^TAGGTACTTCCT^ 
C^TTCTGCGTGGTGTTTGTTCCCAGCCATGCATCTGTCATT^ 

CTGC CATCTGACAATCACTT GAGGAT AAG CATGGG AGC TG ACTG CAAC AGAC CTTT CATT ATTTT ATGTG CAATGT AAAG AAC CTG 
AGAATCAAGACACCTTTATTGCTGTGGCCTCTCTACCAGCTATGATTTGGTT 

4 0 TTTGGCAAAGGGAGTCCTCAGGGTTGGAGTAAGTACTAAAAATGCTTC 

GACATTTCCTCGGCCTCACCCCTGCCCTGCTGTGATCTTGTGCTCAAGAGAGCT 
CAAAACAATTCTTCATTGCATTGTAGGATGTTTAGCIAGCATCTCTGACCTC 

TGATGATCCAAAATGTTTTTAGACTTCACTAAATGTCTCCTGGGACAGGGATTAAGGGAGCTTGAGGTGTCAAGGATGAGGT 
AGGTCTGTCTTGGGQVGCTGTATATATGGTAGTGTCATTCACrCAGATGAG 

4 5 ATCTTG AGTTTTATTTT AG ACATGTTGAGTTT AG AGTGGCTTTG AGG TTT AGT C CAGAGAT ATAAATTTGGGTGAG TAGGTG GTAA 

T TGAAG CTGTG AATGTGG AT AAGATTGCC!TAGGGAAAGAGG AT AGAG AAG AAG^C CT AGGAC CAAGCAGTGAG AAATTCT AACGG C 

TGGTTAGAGAAGGATGAGCTTGCAAAGGAGGCAGGGAAGGAGTGACAAGACAGGTAGGATAA 

AGAGAAATAGTATCACAAAAAGAAGAGAGTTATCAATAGTATAGAATACTGCTGAGAGTTG 

TAGCTGTGTTTAGTAACATGGAAGTCAGCGTTTTTCAAAC CTTTCTTTAGCACTTACT CATG C C CCTTTGACAAGG AAGGTG TGAA 

5 0 GTGCTCAGGCAGGAGTGTATGGAAACAAAACCAGGGCTGAAATCATTCACT 

AG CATT ATCTACTTAGGTAT CAG C CTGTGTGC CAGGAACT AGG CACTTTTTT CAT AAT AG AG AATTCT 
GCTTTCTGTTTTTAGATGTTTTCrcGGTCATTTGT 

CCCTTAGTCAAAAATTTTGCTAAGAITCCATGGTGGGTGAAACTATTGCCCTTACT^ 

G GAGC CAAG CCTATGAAAGAG AGATGAGTGT AGAAATACAT AC C CAT AAAGACATAGG AATGT ACTAGTTG TCT AG AACTGT AAAT 
5 5 TGCCATTAGTTTGGACTCCACCTTTACAAATTTGTGATTATTACAGACTCTGGAGGAG 
AAAACTAGCTTGTTAAGCAAATGCTAATGTAAGAAGATCACAATGTGGGTAG 

GTATTTACAT ATCAAC^GTGGATATGTTAATTAGTGTGCATTTTTAATTCAGGT CATTGAGAGTAACAGGGATG CA 

TTTCTCTAAAATACTTTCTTTGAGATTTTTCAAGTTCAATGTGAAGAGT 

GGTG CTCAAAAAAGGTTAGTGTTTTAGTGT CT CTGTCATTTG ACAGTTAAT ATT ATGAAT AGTTGTGC CCACAGT CTTAGAACT CC 
60 TTCCATGACATTGTCCCTCCATACITGACACTTCTCTTTCCTGCAGGTGAGGAGTAA 

GAACATGACTCCCOKXGCCCCGGGCCCACC^ 

GAC^CGGACGCCTATCCAGAAGCCAGCCGGCTGGCAGCCCCC^T 

CC^GCCGGCCACCTCAGGCCCCTGTTGGGCCACCAATGC^ 

TCTGAAATGAAGTAAAAGAGATTTCCrGTGACAGGCCAAGTCTITAC^ 
65 ACTTGACTGAGAAGTTAGGGTAGAAAACAAAAAGGGAGTGGATTCTGGGAGCCTCTTCCCTTTCTCACTCACCT 

CAAGCAAAGTGTGGTATCCACAGACATTTTAGTTGCAGAAGAAAGG 

TTTGGTTAGTGGGTTAAACGGGGTAAGTTAGAGTAGGGGGAGGGATAGGAAGACATATTTAAAAACCATT 
CTCATGAAATGAGCCACGTAGTTCCTATTTAATGCTGTTTTCCTTTAGTTTAGAAATACATA 

T CATATTTAGT CATTTTGACCAAATGAG G GATTTGGTCAAATG AGGGATTCCCT CAAAGCAATAT CAGGT AAAC CAAGTTG CTTT C 
7 0 CTCACTCCCTGTCATGAGACTTCAGTGTTAATGTTCACAATATACTTTCGAAAGAATAA 

ATGT CAGGAAATAAGGTCACTTTATGT CAAAATT ATTTGAGT ACTATGG GACCTGG CG CAGTG GCT CATGCTTGTAATCC CAGCAC 

TTTGG G AGG C CG AGGTGGGCAGAT CACTTGAG ATCAGGAC CAG CCTGGT CAAGATG GTGAAACTC CGT CTGTACTAAAAAT ACAAA 

ATTTAGCTTGGCCTGGTGGCAGGCACCTGTAATCCCAGCTGCC 

GGTTGCAGTGAGCCGAGATAGTGCCACAGCTCTCCAGCCTGGGCGACAGAGTGAG^ 
n c; CAA C A ACAACAAACCACAAAATTATTT GAGTAC TGTGAAGGATTATTTG TCT AACAGTT CATT CCAATCAGACCAGGT AGG AGCTT 
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TCCTGTTTCATATGTTTCAGGGTTGCACAGTTGGTCT 

TAGGAGAAGTGAGAATACTGTQAAAAAGGGATGTTAGCATTCATTAGAGTATG 

GACGAATAGAATGGAGTAATGAAATTCTTG C CATGTGCTGAGGAG ATAG C CAG CATT AGGTGAC^AT CTTCCAGAAGTGGT CAGG C 
AG AAGGTG C C CTGGTG AGAGCTC CTTTACAGGGACTrTT ATGTGGTTTAGGG CT CAGAGCTCCAAAACrCTGGGCTCAGCTG CTCCT 
5 GTACCTTGGAGGTCCATTCACATGGGAAAGTATTTTGGAATGTGTCTT^ 
GCCTGACCCTCAAATGACCZATGGATATTTTTCTACCrACAG 

TCAGTGGC CCTCACCATTTGTTCATGCTT CAGTTAATT CAGGTGTTGAAGGAG CTTAGGTT TTAGAGGCACGTAGACTTGGTTCAA 

CAAATGTCTCTTCCTATAGTATAATCTCCATAAGGGCATGGCCCAAGTCTC 
1 0 TG CC CGACATACAATGTT AGCTATTGGT ATTATTGC CAT ATAG ATAAATTATGTATAAAAATT AAACTGGG CAAT AG CCT AAGAAG 

GGGGGAATATTGTAACACAAATTTAAACCCACTACGCAGGGATGAGGTGCTA 

TGTTTCTTGAAAT AGTTT ATCTTGTAATGAAATATAAGG CAC CTCCCACITlTATGTATAGAAAGAGGTLrrriTAATl"rTrrrTTA 
ATGTGAGAAGGAAGGGAGGAGTAGGAATCTTGAGATTCCAGATCGA 

CCATGGATTT AAT CAGTC C CAAGAAG ATCAAACT CAGCAGTACTTGGGTGCIX1AAGAACTG TTGGATTTACC CT<^ 
. 15 ACTTGCCAG CTTCTTGGG CACACAGAGTTCTTCAAT CCAAG TTATC^^ 

AAATAGGCCTACT AAAGATGAAT CACACTTGAG ATGTTT CTT ACT CACT CTCCACAG AAACAAAGAAG AAATGTTATACAG GGAAG 
TCCGTTTTCACTATTAGTATGAACCAAGAAATGGTTCAAAAACAG 
TGAAGAAAACAATGTCATTTGCTGCTATTATTGTAAGAGTCTTATAA 
2 0 ATTT GGGTTAAG CATG C C^TTT AAAGAG ACCAAGTGTATGT ACAT TATGTTCT ACATATT CAG TGAT AAAATTACTAAACTACT A 

T ATGTC TGCTTT AAAT TTGT ACTTTT AATATTGTCTTTTGGT ATT AAG AAAG ATATGCTTT CA 
AGGAATTTGG AT AGAACTTGCTATTTAAAAG AGGTGTGGGGTAAAT CCT CAGTTTAG C CTTTTTTGAAAAAG CT 

AGACTTTCAAATACTAATTTCACTTCAAGCAGGGTACGTTTCTGGTTTGTTTGCTTG 

TGG CTC ACCTCTTTG AG ATGTCAGGCTAGG CTT ACCT ATGTGTT CTGTG T CATGTGAATGCTGAGAAG1TTGACAGAGAT C CAACT 

2 5 TCAGCCTTGACCCCAT CAGTCCCTCGGGTTAACTAACTGAGCCACCGGTCCTCATGG CTATTTTAATGAGGGTATTGATGGTTAAA 

TGCATGTCTGATCCCITATCCCAGCCATTTGCAC^ 
CAACTACATGCTGGAGtfVTTAGAGATGGTC 

TCAAAGAGAGTGACACACACTCTCrCAAGACCTGGGGTGAGGGAGTCT 
TCTTTCCTATAGATATATGCATACTTTCTG 

3 0 GCCTTTACATGTCCTGTTTACTCTGTCTAGAATC 

GTGTCATCTTCTCCCCTAGTTAAACTACCCCACACCCTGTCTGCTTTCCTTGCTTA 

CATT AG ACATTTTT CTTATTT ATT TGT AGTTTAT AAG CTT C ATGAG GCAAGT AACTTTG CT'J.TGTTTCTTGCTGT AT CTCCAG TG C 
CCAG AG CAGTG CCTGGTATATAATAAATATTTATTGACTG AGTGAATGAGTTTTACATGG C CTTATGAATTAAC GGTG AGTCTAAA 
TGTTATAATCATACTTTGAGAGGGAGAAAGCCAGTTCTATTTTCTACTITCCTCTTAAGA 

3 5 AACAGGAGGTAAAATATATACZACAGCCAAATCTAGTGAAAGAAAATCTGATGAA 

GCIAGTGAGGAAAACCAGTACTGTTAGAAAAGTCATGCCAACT 

ATT C CCT CTTAATGG AGTATTGGCTATAT CTTAAAC AT AT CTTTAGTTTAAGACTTCTGAAAATGTTGT CTTTGTCTAAAATGTTA 
AAATCC CATCCTACTACTGACCTG CTACTCTTTTTTCTGGGGTGATTTTACCCCCAACAAATT CTATAAATGCCTTCTCTGGCTCT 
GATTCAAAAAGTGCTCACAAATG^AAATGCTGGGGACAGA 

4 0 TCTAATAGCACACACACACACACACACACACACAC^CAGACAC^CACACA 

GGGAAAGGCGCTTGAACAACZAGACACAAGAGCTGGTCTAATAATrCA 

AAGGTGACTCAGAGTCAATTAAATTGGTGTGCAATGAGTTCATCTCTAAACT 

CTTAATAAAGGTAATTAAACTTGAAAAAGATGAGCTTATATGTCTCTT^ 

GGAT AGTT CAATAAG ACAAAT CAAAT CAATTGGAAAGGAACGAG ATGGGT AAAAAAAAAAAAG CCG TTAAATG AAT CTG CT CAAT A 

4 5 AGCACTCCTTTACCATGCTAATAAAAATAATGTAACCCAGTTCTGTAAAAAGGTTCTAGTACTTTGAAA 

ATAAAATGCAAACITCGCTTCACCTAACACCTCCCACA^ 

G AATTCACTTTTTCCCTCCAGTATGATAAAACAG ATTGATTCCCTTTT AAAG C AATACCCTTT CTT CTCT CC CAT CTTGTCGGT AT 
CAGA.GAT AATGG AT AGGTGAGAAAAAATTAAAGAAGCTT CTGGATT CATTGTAAT GGGG ACCT AAC CAT CACACTATG CAG CAG CA 
AATGGGATGGTCTGGAAAGCAGACCAGAACCTTGTCTGGTAACTTTGCTTCT 

5 0 CTCTTGJVCTTTTTGACAGTGGTGCTCATAAGAAAGTTTGGGGCT 

CTAATGCAG^AGGATCACTTGAGTCTAGGAGTTO^GACCAGCTTC 
TAGCCAGAOSTGATGGTGCAGGCCTGTAGTCCraGCTACTTAGGAGGCT^ 

CACATGTGAATGGTG^GGCCTCTCAGCTTTCTCTGATTTGGCAACCCAGAAGAAATATGTGCTTTCT 
CCTAGCACTTGGAAACCTAGGCCACTCTCCAGAGCAGAAGGTC^CACAATTT 

5 5 CACTG GG CCCAACACT TGAAAATTGCTTGAG AT G AT AAATTG CTTG AGGTGATCAAAGAAT AT ATCAGCTCTT CAAAGCTG ACCTC 

A1TCTTGGTGCAACTCTCAGGGCCAG AATGTGTGCTTCCCGAAGCTCATTTT CCTAT CTGATTTGCCTGTTGAAACTCTGTGAAAG 
ACCCTTGAATGCTAGGAATTTGAAGGGACAAAAAGTTGAAGAAAG^ 
CACTTCATCAGGCAGAGCTATCTCAGTGATAAATTGAAACTCT 
AGTTGAGAAGAAGAGCAGCAAACAGCTTTGTTTTCCTATAGTTGTGGTTTTACATC 

6 0 CAGCATGAACT CCAGTTGTTAT CTGAGTGAGCATCT CTG C GAACACTG AAGG AT ATGTG TGGG TGT CACCTG AAAAC!AAAGAGAAA 

GACAGAGAGAGAAAAAGAAGAAAAGGAGGCCCTGTTTCGGAAAGAAGAGCCAGTTAATCAAA 

TTT ATTTCATG AG CTTT AT ATGTGATTT C CTT C C C CT TG CTAATCT CGT CTGAATTTATTCTCTGG ACT AAAT ATATTTGCATCTG 
GTGACTGAGGAAGGGGTGGAAAAGGTGACCTTG AATGGGCATCAGGGTAGTTAC CATAGTT ATTTAGTCATTT CTTCAGTCTCTTG 
GGCACACACTTAGCTTCCCAAATGGAGGTTTCTGCTAACTTTTGAGAA^ 
65 GTTGTTGGGGACCCTCTTTTATCCCCAACCCTATGCAATGCTTGTTTCATTTGCTTTATGTTTCT 
TCATTCTAGAACTAGTTAATGGGCACCTGCCCAGCACTAAGATGTC 

TAGGGGTAACCACTTAGTCTTGATTTTGGGGCTTAGATCTTA 

TCTTTGACATG TT C CTATTT AACATCTC T AACT CTTT CAATGT OVTAG ATCAGGAATCAGCAAATTTTTTTCT 

7 0 T AGT AAAT ATTTT AGGCTTT GTGACTAT ACTGT CTCTTAT AGAT AAT AAGTTl^ CACTGTAGT GTGAAAG CAG CCAT AAACAAT 

ATAT AAACAAATAAG CATGGCTG CATTC CAATAAACCTTTATTT ATGG AT ATTGAT ATGTG AAGTT CGTGTGATTTTCATGT CATG 
CAATATTACTTTTCCTTTGTTTTTTTCCCCCAACCATT^ 
AGTGGGCTGGAC^TGGCTTATGGGCTACGGTTTGCTC^ 
AGTTCCAGCCACCTTCTACATTTTTAGGGCCAGAAGGAATT^ 
7 5 CATCTCTTTTGAAAGAAGGACATGGATCAATAAAGAAG 
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GAAGTT ATAAGCAAAGGGAATGAGAAGCTGTTTCAGG AAGACAGAG T CTT CTGTATATT ATTATTATTAATTTTGACAG AGTCTTG 
CTCTGTCACCCAGGCTGGAGTACAGTAGCATGATCTCAGCTCACrACATCCTCTGCCT 

GTCT CCTCAGT AGCTGGGATGATAGG CACACG CCACCAC TCCCGG CTAATTTTT TTATTTTTT ATTTTT ATTTTT AGTAGAG ATGG 
GGTTTCACCATGTTGGCCAGG CTGGTCTCGAACGCCTGGCCTCAAGTGATCCACCCACCACX3G CCTCCCAAACT 
5 GGCATGAGCCACTGTGCCTGGCCTATATTATTGATGAAAGCCTAAACA 

TACTCTTTTCCCTCTTTTCTAGG CATCACATC CTTGTAATCTCCAAGGAAACAAATTT L^L'lTCTTTCITlTl^LTrT TTTTTGAGAT 
GGAOTCTCGATCTATCGCCCAGGCTGGAGTGCAGTGGCACTATCTCCGCTCACTGCAAGCTCT 

CTGCCTCAG CCTCC CGAGTAGATGGG ACTACAGGCAT C CGCCACCCAC CACCACGCCTGGCT AATlT , ri"lGTATTT TT AGTAG AGA 
CAGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCTCC^ 
1 0 AGGCATGAGCCACCGCGCCCGGCCAGGAAACAAATGCACAGAAGGAAGGGGGCAG 

CTTGACTTCCCTGCTAATTCTTCCCTAAGTAGTACATCCAAATCCAACTCTGGGTGGGTA 
TGGCCTCCTACTTTCTACCTATCTCATGTTTCCAGTCTGCATTTGGCT 

TACATT AAGG CTCAG CATTTCACAAGTAG CTGATG AAAT CAGTATT ACT CAG AT CAG AAAGG CAC CGAC CT AACCCTTACCATCCC 
TTGCCCCCATTCCTCACTCCAGTCTTGTTATACCCAAGGTACAAATCGTTC 

TTGCCTCTAAAAGGTCAGATGATTAAGATTATTGTA 

TTTC CTGACTTT ACACATGGTGTTCCAGTTCTGAGTTGGGGTG AACAG CTTTGGGAAATCCCAAG ATrrr 1' CTACTCCAGAAAAT A 
GTGCTACT ATGTACATTTATG CACTGTGTGCCTTGCACAAAGG CC CCT AGTGAAGGGAATG AAGAGGGCTGAAAT C TAGTCAAGCT 
CCC CTT AT CAAGCT CTGCACCTGCTT AATGAAGGAGAGGG CG C CTTCT CTAATG CC CG CATGTAT GG GCTATTGAGG CC CTG ATAG 
2 0 AATGCTAG CCTTTCTGCCATTTGGTTACrrGAGCr^ 

TAGTTAAAACAAAATAAAACTTACCCCT CCCCCACAAAATGGA^ CAC AAAAATTTAT CTC CTCGGTC CCTTCTT 

TAGTCTTCCCTTCACGTTAGCTCTCCATGTTTT^ 
AGTAATGAGAGAAGAAACTTCCTAGTTCGTGTCCTAGTTTGGCTCTT^^ 
CTAGGTAATATAGCTTGGATATTTGTCCTCTCCAAATCTCATT^ 

2 5 GTGTTTGGGTCTTAGGGGCGGATCCCTCA.TGAATGGCTITGGTG 

CTCTATCTCTCACTTCCTGCCATGTGATCTTTACAT^ 

TGCTAGAAGCAGAATAGAGTGCTATACTTCTTGTAC^ 

AGCCTCAGGTATTCCTTTATAGCAATGCAAATGGACTAAGACACT 

3 0 GTT AATGTAGTCTACTCTTAT TTCTTAGTT AT CAGT ACATGAGGGGAC CAGACTAAAT AACCT CAGAGCTCATT C CT ATAATTGTT 

TT CATACTC CATATTAT AACTCTCCT AGAATT ATCTTC CCATCATG AAT AC CATTGT I'lTGGG TTTCTC CACT ATCACATTTCT AA 
GTCTTACAGATATAAATATTTTATGTTTGAGATTTGTCAACCAGGATTGT 
TACTAATTTATCAAAATTGTTTCCATCACTTTTTCTGGAGAAAATAATGAATTCT 
TTAAAGTTTAATGAAGTTTAAGGTTTAAAAAATGATGTAATAGTTTACT 

3 5 AGCTTGTCATTTTCCAAATAAACTTCTTATACATTATTGAACT 

ATCTTAATGAGTGAATCAATGGATGGATGGATAAATGAATGACTATCTTTC^ 

ACAAATT AGATG AAACAG GTATTAT G AAAAGACTTGGG AAAAT TG AGG ACAAATTAGTT AACT AG AT ACT ATGAAAAGACTT GGGA 
AATTCAT ATTTTT AAAATTATTGAAATGT ATT AATT AAT ATG CATTTTAAG ACATTGT TT AGGCATT CAGAAGCAAAAACTGTTT A 
TCCTTCCTTGGCAACAAACAGAACTCTAATTGTGTTCAGT^ 

4 0 ATT AGTTTAAG CCAGAATGGTAGT C CTAACCT TCT CAT CAGTG ATTGG TTT AGG CATG GACACATGATAAAATT CTGAC CAATAGC 

TGCCATCTTGGCTAACTGCZAACCTCACTC 

GGACTACAGGCATGTACCACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGAC^GGAT 
GAACTCCTGACCTCAAGTGATCTACTTGCCTCGGCCTCCAAAGTGCTGGGATTACAGGTGTGAGCCACCGTGCT 

4 5 GCTCTTTAAAAAGAGATGTTTGGAAAGAAAGTTTCCTTTCTGCTATAGAATC 

TGCGACGAGGGGAGGAATTGGCCTGAGAACATGCTAAGGCAGGAAGAGCAGAAAG^ 
TGAGCTGCAGATTGAGCCGACTTGAATTCTCCTTGCTTCCT 

TTCTTGT CATGTGAGACAGT AAAT AACTT TATTGTTT AAG C CCTT TTG G G TT AG ATTTTCTATTACATGAAG TTTAAAG CATT CTG 
ATAAITT T CATGTTTATTG ATGTGT ATAAATACTAAT ATATT CTAT T ATTTGGGGGATACAAAGAATTTGACTCAATAGT ATCTTGT 

5 0 TCTATTGACATGTTCAATTATTAATCACTGATAACCCTGAATAACTAAATGTTATTACCACATT 

GACATATGAAATTTG GGCAATT TG CTT CTTTG GGAAG CATT AT AAATCA GAG AACAGCAGAGG AT ACAAAG AC CCCCACAAG AG AT 

CATCZAAGATTTTCTTCTTCCCTTCTTTTTGTCATACACATTGCTGAGAAGGA 

TTCTGTTATTTTTAGTTTTGAGACCCXaGTCTC^ 

5 5 ATGTTTTCAGTAGAGATGGGGTTTCATCATGTGGCCGGGCTGGTCTCGAACTCCT 

CAAAGTGCTGAGTTTACAGGCATGAGCCA.CCAAGTCCAGCCAGTATTTGTATTTTGAATTAAAGTATGTA 

ACTGGCTCACTTCCTGGTAATATAAAATACAACATATCAGTAAAATCTTCC^TGATTCTCCT 

TGTGTTACCTAATGCTGGCCTAAACTCCAGCXJGC^ 

TGGTGCCTCATGCCTGTAATCCCAGTACTTTGGGAGGCTGAGGCGGGCAGATCACGAGGTC^ 

6 0 ACCCTGTCTCTACrAAAAATACAAAAATTAGCTGGGCGTGATGGTGCGTGCC 

GAATGGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCC 
TCTCAAAAAAAAAAAAGTCACGT 



HUMAN SEQUENCE - mRNA 

6 5 TAAAGTCATCAAAACAACGTTATATCCTGTGTGAAATGCTGCAGTCAGGATGCCTTGTGGm 

aggggatggtggcggtggtggtggccgtggatgacggagactctcaggccttg 

tcaattcaagtaacaggaaacaagattttggtgaagcagtcgcccatgcttgtagcgta 
gtattcctacaatctcttctcaagggagtcccgggc^tccc^ 

7 0 g gaatt act cc cagcag ctt caggtttactcaaaaacggg gtt caactx5tgatggg aaat tggg caatgaatcagtgacatt ct ac 

ctc cag aatttgt atgt t aaccaaac^gatatttacttctg caaaattx3 aagtt atgtatcct cctc ctt ac c t agacaatg ag aa 
gagcaatggaaccattatccatgtgaaagggaaacacctttgtccaagtcccctatttcccggaccttctaag^ 
tggtggtggttggtggagtcctggcttgctatagcitgcta^ 
aggctcctccacagtgactacatgaacatgactccccgcc^ 
75 cgacttcgcagcctatcgctcctgacac^ 
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TGGATAGGAAATGACCG CCATCTC CAG CCGGCCAC CT CAGG CCC CTG TTGGGCCACCAATG CCAAi u i u i"1TCTCXjAGTGACT AGACC 
AAAT ATCAAGAT CATTTTGAGACT CTGAAATGAAGTAAAAGAG ATTT C CTGTGACAGGCCAAGT CTT ACAGTGCCATGGCCCACAT 
TCCAACTTACCATGTACTTAGTGACTTGACrGAGAAGTTAGGG 

CTCACTCACCTGCACAT CT CAGTCAAG CAAAGTGTGGT AT C CAGAGACAUTTTAGTTG CAGAAGAAAGGCTAGGAAATCATTCCTT 
5 TTGGTTAAATGGGTGTTTAATCTTTTGGT^AGTGGGTTAAACG^ 

AACCATTAAAACACTGTCTCCCACTCATGAAATGAGCCACGTAGT^ 

ACATTGTCTT TTATGAATT CTGATCAT ATTTAGT CATTTTGAC CAAATCAGGGATTTGGT CAAATGAGGGATTC CCTCAAAG CAAT 

ATCAGGTAAACCAAGTTGCTT^CCTCACTCCCTGTCAT^ 

GTTCTCCTACATGAAGAAAGAATATGTCAGGAAATAAGGTCACTTTATGTCA 

1 0 GCTCATGCTTGTAATCCCAGCACTTTGG^ 

TCCGTCTGTACTAAAAATACAAAATTTAGCTTGGCCT 
ATCCCTTGAACCTGGCAGGCGGAGGTTGCAGTGAGCCGAG 

TCAAACAACAACAACAACAACAACAACAACAACAAACCACAAA^ TATTTGTCTAACAGTT CATT 

CCAATCA.GACCAGGTAGGAGCTTTCCTGTTTCATATGTTTC^ 
1 5 GTGGGTTGTGG AAAGAG CGT C CAT AGGAGAAGTG AGAATACTGTGAAAAGGGATGTT AGCACT 

CAAGAAGGTTCITTGGAAGGAGGACGAATAGAATGGAGTAATG^ 
AATCTTCCAGAAGTGGTCAGGCAGAAGGTCCC 

AACTCTGGG CTCAGCTGCTC CTGTACCTTGGAGGT C CATT CACATGGGAAAGTATTTTGGAATGTGTCTTi l TGAAG AG AG CATCAG 
AGTTCTTAAGGGACTGGGTAAGGCCTGACCCTGAAATGACC^ 
2 0 GGGGAC C!TTGAAGAATGCCCTT CAGTGG CCCT CAC CATTTGTTCATGCTT CAGTT AATTCAGG TGTTG AAGGAGCTTAGGTTTTAG 

AGGCACGTAGACTTGGTTCAAGTCrrC^GTTAGTAGTTGAATAGCCT 

AT AAATGG AG AT AATGGTT ACAAATGTCTCTTCCTATAGTAT AATCT CCATAAGGGC^ C C CAAGT CTGTC TTTGACTC TG CCT 
ATCCCTGACGTTT AGTAGCATG CC CGACATACAATGTTAG CTATTGGT ATTATTG CCAT AT AG ATAAATT ATG T AT AAAAATT AAA 
CTGGGCAATAGCCTAAGAAGGGGGGAATATTCTAACACAAATTTAAACCCACTA 

2 5 TTT AACTT C C ATCATTTT CCTGTTTCITTGAAATAGTTT AT CTTGTAAT GAAATATAAGG CAC C TC C CACTTTTATGTATAG AAAGA 

GGTCTTTTAATTTTTTTTTAATGTGAGAAGGAAGGGAGGAGTAGGAATCTTGAGATTCCATATCGAAAATACT 

TTTTTAAGTGGGCTTCCATTCCATGGATTTAATCAOT 

TTTACCCTGGCACXSTGTGCCACrrGCCCAGCTTCTTCGG 

CAGAGCTGGAGAGTTTTTTG AAATGG CAGTGGCAAAT AAAT AAAT ACTT TTTTTTAAATGG AAAGACTTGATCT ATGGTAAT AAAT 

3 AAGAAATGTTATACAGGGAAGTCCGTTTTC^ 

TAGTT T CAG AT ATG GTAG TTATtlAAG AAAACMTGTCATTTGCTGCTATTATTGTAAGAGTC TTATAATTAATGGTACTC CTATAA 

TTTTTGATTGTGAGCTCACCTATTTCGGTTAAGCATGCCAATTTAAAGAG^ 

GATAAAATTACTAAACTACT 

35 

HUMAN SEQUENCE - CODING 

ATGCTCAGGCTGCTCTTGGCTCrCAACTTATTCCCTTCAATT 
TGTAGCGTACGACAATGCGGTC^CCTTAGCTG CAAGT ATTCCTACAATCTCTTCT 
GACTGGATAGTGCTGTt^AAGTCTGTGTTGTATATGGGAATT^^ 

4 0 GATGGGAAATTGGGCAATGAATCAGTGACATTCTACCTCCAGAA 

AGTT ATGT AT C CT C CT C CTT ACCTAG ACAATGAG AAGAGCAATG GAAC CATT ATCGATGTGAAAGGGAAACACCTTTGT CCAAG TC 
CCCTATTTCCCGGACCrTCTAAGCCCTTTTGGGTCCTGGTGGTGGTTGGTG 

CACCCGCAAGCATTACCAGCCCTATGCCCCACCACGCGACTTCGCAGCCT 

45 
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Table 96 

MOUSE NOMENCLATURE 
ICSGNM Sept9 
Celera mCG19301 

5 

HUMAN NOMENCLATURE 
HGNC MSP 
Celera hCG29396 

10 MOUSE SEQUENCE - GENOMIC 

GCTACTTAAGAGGGX:CCTGCTrAGGTGTCTTTCCX3AGAGC 
GTCGGGTCCTACTGTGGTTTCACAGGGTTAGAGCrGA 

T CAGTACT AC CCGGTACT CAGT ACT AC CCGGTACTCAGGGGTTAAGATCGlT'ri' TATTG AAAGTGAAGTT CATAGT CTTGTTTGTG 
AGATTCCCAGAAATGTGGGGTTCACTTCTGGGAACAG^ 
1 5 CATTGCTAATTCTGCATTCTTATTCAAAAATATCTCTCTATAAAAG 
TAATTTCCCAGAAATTTTGGACCGGAGACTTCAGACTCCAGA 
TTTTATACTGCCTGTCCrATCCCAGGACCTCCTGGCTGGGGJlGCCCACAGCAT 

CC CACCAGT AGG G TGCTCAGGTG^TTCATGGG ATACG^CCTGCAT A(^TATG^GTG CCTTACCAGGTGT C CAAG AG AGATGGTGAGG 
TTTGAGAGCCACTCTGAGACAAAGGACAGATAGAGATGGTCACAGATCTGGACTGGCr 
2 0 CAGCATGAGGGTGTGAGACACCACACCTCATCTTAGAGATCACAGGGATTGGAGCC 

CCCTGACATATCTTTTGGAATCAGAGTCACCCATGGCCTTG CTCCAGAGTCTGGGGAGTGAGACTAAACCGTACTACTC CAGGGGG 
GATGCTGTTGTGTAGAAGGCAGCCCTAAGAAGAGGACCCTGGGAGATCCAT^ 

AATGG GGCTGGAG AGG TGTCC CAG CTTTGCAGTGTGAGAAAGGCCAGGC CAAAGGAAAAGGAAGGACCIAA CAGGAAGGACAG 
GTAGAAAGTGCTAGAAAACAGATTCAACCCAGAACCTGCAGTTAGGAACCTGTGGACACCCT 

2 5 GG CTCAG AGAGATGACAAG C TGGTGT CCGTC CATTAGTCAC CAAACACACAG C CAGGAAG GTGACCGGATTCTGGAAAG CCTGG AG 

GAATTCTCTGGTGTGACTG CAGCTCTGGAACTTTCTAGATG CTTCTAG ATTACTAC 
CACCTCTGGTGGGAAAGAAACTGGGAACAAAACTTCTAGTTCCCCTTTTCAC^ 

TGGGTGT T CIT CTCAGGCTTGCAGTT ATACAGATAG GGT CATGAAAAGGT CACAG G C CCGAGTT CT CACT CCTTATCTG AAATGCT 
CTTCTCCCTGGCAGCAAAAAGCCCTCAGGATGTCIAGTC 
30 CCCCATCCTCACTCCACCCGACCCCC^GTCTCCCTCTCTGAGCCCCACCT^ 
GGCCTGTGACAGGAGGTGCCCGTGGTCCATTCCTCCCTTCTCGG^^ 

TCGCGTCTCTCTGAGTTGAGCGGGGAGTGGATGGACCAGCTCCATGCAGCCCAGCCACTGTGATC^ 

GCCAAGGGGCACTCTGACACTTGCCACCG^ 

(^AA*3CCTTTGCCCCTTCAAGTCCCTTGCTTATG 

3 5 TGACTCTATCAGCTGTACATGGTACAAGTGTGTATGTGCCAGCGCACTTGA^ 

TACCCTTGAGGCAGGGTCCCTCACTGGGCCTGGAGCAAGGCrrGGCA 
GCACTGGGGTTCTAGGTGCCCGTGTGACTTTTATGCTGGTTCTGGGAGTCAAACT 
CACCTGCCGAACCATCTTCCCAGCCTGTCGTCTTGACTTCTGTGTTAGGGCTTGCGTCrrGTGAGGCAC 
TACACAGGGTTGAAATTCTCAGTGGGGACAATGTTAATGCTT^ 

CTATGACC^CGCCCTCraCCCTCCACCCTCCT^ 
GGCATCTTGCCATTCTCCCCTAATGTCCGCCTTCCTCCCT^ 

GGGACTCTCACACTCCTCTGTAAOTCATAAACAGGGTTGGTAGCCCCGGGCCAGGC^ 

CAAGGGGGAGGTGGCGAGTGTCCCAGCTCTGAATGATACCCTCCT 
45 AAGAGAGAGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATCCACATTTGGTCCTAAGCC^ 

GCCACAC^TCCAGGAAGTCTCCTGGATGACAGGGGTCCTGCCCTCTGTTGATGGGGGGTCC 

AGTGCTGTOTGGTTTAGAGTGAGACTTTACTGAGTCCATTTTTAGAGCCCAAAGGGACAGCTCTCC 

TCACCCAGGCCTGGGGTGCCTTTGAACACACTGAAACAGGTAGT^ 

GCCHTTAATCCCAGCACTTGGGAGGCAGAGGCAGGCGGATCT 
5 0 GCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAAAACAAACCAAACCAAACC^ 

GCACCTGAGGTATGTAGATTCTGTCCTGGCTGCAGGGTG^ 

CTCTCTACACCGAGGAGCCCACAGGGCTAGACACCTGGATGCTTTTAGCCTCGTCTGAGCAGTGGAA^ 
GTTTGGCTGCATTTGGTCGGGGTGAGGTGAAGGGCCTGAAGGATGGGGTC 

AAT CACACTGAGGTAAGACAGTTGGCTGCAGACTCCIAGGGTCACACCTCAAGT^ CACAGAAGACACGAG 

5 5 GCTTACGAAGGCTGACACGGGTTTTGTGAGTTTGAGACAA 

CCTCAGCreGAATCACAGATGTGCIAACACCATGCCCTGCTTA^ 
CTTCrrCTGACTGGGGTAGGGGTGGGGCTTCTCAGAAAGAAGGAAATGGGCCCATGCT 
GTTACCTAGGAAACCAAGAGGGGGCAGGGCTTGCTGAGTTAGCGGCCAGGCCTCCAGGCCT 
GGAGTTTCACCTGTGGTAGTGAATGAACXKK3GAGAGTGCT 
60 ATAGGGT^TCTTAGGAAACAGTGCTGGCTGAGGTTGGATGTGCCAGGGTGCA 

TACTGGAAGTTGTTGGGGGGGGGGGGGAAGGAGGGAGACAGGAAGTAGACACCACCGAGGAATCGAG^ 

CACCAGCTG CATGGGGCGTGGCAGG AAGGGGCTGCTGTAAAAT TGCAAAAGCTAAG CATTAGGGCTGGGGGGACAGGGCCCTGGAA 

TCTGAAGAGGAAGGTCCCAGGACTGAGGGGTGCAGGCAGGGCCTGAGCA^ 

AAGAACATGGCCTGTAGAGGGAGGAGGGGTGTGGCCTCAGAGAACCCACTGGA^ 

6 5 AGAAACACGGAAGAGGCAGAAGGGAGGGGAGATCACGCCCGGAAGCCAAAA 

CCGCAAGGGCTGCCTCAGGCTGGGTGTt^CTGGAGA 

AGAAGCCAGATGGCC^GCCAAGAAGTGAGTGGGTGCTGGAAAGAGGGGAAC^ 
ACAGTTGGGGGAGGCAGGGGGTCAGAGAGCCTGCAACTCTXSCTCTGGAGAT^ 
CCTGAACACCCAGAATCCATTGCAAGCCACTGTTTTCAGTGGTAAGATGCTCT 
70 CTCCTTAATCCTGCTrCTGGCCTTCCCTCAAGGCC^^ 

ATAGGACCCCAGGAC^CCCATACTTGCCACACAGCAAC^CTTGCAA^ 
GGGTGGACGGGGGGAGGGGTGGTGTTGTTCTTGGCATTCAGTTGX3AAGCATCT 
AAGACCGTGGGCAGGAAGGTGAGCATTCTGCCTAACCAGAAGAGAAC^GGATGC^ 
GTGTGTGTGTGTGTGTGTGTGTGTGTATGTATACACGTGGACCACGAC 
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CTGCATGTAGAGTTGAGCGTGGGTGTCCT 

TTCAG GGGAGGTGGAGACAC CTCTGGTATCATCGAGAAGGACC1TGTGTGCTGTC CCCAG AG CCTGGGM 
GCGGGGGGTGCAGTGGGGAGTGGG<rrTTTTAGCTGCC^ 

GATAAGGCTTGGTAGATGTGAGCCCGGCITGGAAGTAAGATCCCAG GAGCC CTCTGAGGATC CAGAGGGATGG CTTGAGAGTG AAC 
5 CTGCCCTGCTGCCTAACCTITAAGTTCCTGTGGGCCACATGAACriT 

TGTACCCACAGACATTTATTGTGTTAGGGTTATGAAGGTCTGAAATCTAAGCC^ 

TGTAGGG C CTC CAGAGAT CTt5TCCCTGGCCTCTT(JT TGCT CTGGTGGTTC CTGGCACCTCTGG CTTGCTCTG ATGATCT CTTGC CTC 
TGTCGCATTGTGGTCACCTCATATGCTAATCAGGCCTCCT 

TGGGTCACACTCT ATGT CACT CTGTATTCAGACCCAAAAGTTGC CTTGATGAGT AAAG TCATGG C^CTTTCGGGCTGAGG ACAGGT 
1 0 CATTGGTCAGCCTGGTGTGTCTTAAGACGTGATACTTCTAAGGCATAA 
CATCTACTTCTTTAAAAAATTGCTTTTAAAG^ 

CTT CCTG CTTCTAC CTC CAGAACATTGAGATTATAGGGATTTGC CACCACAGOTGGGTTGTGTGGTG CTG GGAGG ATTGAACCT AG 
GGCTTCTCGCGTGCTAGCAGGGTGTCCTACCAACTCAGCT 

TTTT CCAGCTGGG AGTG GT AT ATAC CTGT CAT CATGGCAT CGGGG AAACTGAGGTTGGAGGAC CCAGAGT CTGGGCT AT CCTGGG C 
1 5 TTCCAAT ACT CGTCCAAT<3T AAAG CAGTCCCCCCAC^ 

TTCATCTCATAGAGTtSGGAACCCCACATCCTAAGGAGCCA 

TTTTTTAAAGATTTATTTTATTTATTATATGTAAGT^ 

GATGGTTCTGAGCCACCATGTC^TTGCTGGGATTTGAACTCAGGACCTCTG 

TCTCCAGCCCTCrt3AATTATTTTTAAAGTGTACT 
2 0 GGCTATACATGGAGCGGAAGCCAAAGTGTGGATGGAGACCCAGGC^ 

CTCCTCCTCTCTTGCTCTTTTCTGCTraSGTACCCCCCTCCTTCT^ 

CTTCCTCACGCCCAAGCTTCAGGGGCCTCTACGCAGAAATGGCTC^^ 

CAGGCCAGCAGCTXSGCTCCCCCTGGGTTGGGCTCCTTCTC^ 

GAGCTGATGGTTGTCnTTGGCACAGTCTAC^^ 

2 5 TGACZAGTGAACCTCGGGACACGGCTCrGCATCTCTGTGATTTTTAT^ 

GGCCTCCAACTCACCATGTAGCCGAGGATAACCCTGAACTCCTCCTCCAC^ 
CCTGAATATGAAGTGTTCAAACCAAGGCTTCATGAATGTCATGTAAGCGCTT 

GAGGAAGGGAGGGAGGGGGAGGGAGAGAGATCAGGTCTCAAGCTGGAGAGATGGCTCAGCAGTTAA^ 
AAGGACTGAGGTTACCCCAGCATCACAGGGCCGCTCACAGCCATCTGTAACTCCAGTTC 

3 0 CTCCTAAGGCACTGCACACTCGTAGTGCACAGACATACAGGAAGTTAAAATACCCAAATACATAAAATA 

TAAAAAGGACAAGGTCAGC(^GGCGGTCGTGGTGCACACTTTTAATCCCAGCA 
TTCAAGGCCAGCCTGGTCTACAACGTGAGTTCCAGGACAGCCAGGG^ 
TtMGAACCTGAGGCTGGCCCTGAACTCCTGCA^ 
TGGTTGACCGACTTTTTTTTTTTTTAAAAGTATTTGCGTGTG 

3 5 GTCCCAGGCACTTGTGAGCTGTCTGACATGGGTGCTGGAGACCAAACTCCCAGCCCTCC 

CACAGCCCCTCCAGTCCCTCAACAGGAAGGATGG 

CTACAAGTGGCCAGGCTCrCCAAGAAGACTTCTGAGTCA^ 

TCTAGCCAGCTCATCGGACTGTAATGCCCCAGTCrAGGG^ 

CCAGTGTCTTAGCACGGAAAGGAAGAAGTATGTGAAATATCTCTGC(3ATTTTCACGT^ 

4 0 GCTCTGTGAGATCAATTAAATCGGGCCACGTGGTCCCTC 

ATCTGTAACTTGCATCTTGGGCTTTGTGCCAGCGGGTCTG 
ATGGCCTGC!TTG<3GAAAGAAGCCrCAGAAAGCCCCACCCTGCCCCCTTCATGCT 

TGAATGAGACTCTGAGCTG CACAAAGTCACCATGTACGGAGTGCCATGTGCTACGC CCAGGCTGG CAATCTGGAAAGCTCACTGGA 
GCTGAGCACAGGGGCTCCCAGAGAGCTGGAGGATTAGTGTGTCCAAGGTAAAGGT^ 

4 5 AGGAGAGGCAGTAGGGCAGTTGACGGTCCGTTGGTCAGCTT^ 

GCTGGGTGATAAGGGAGGGGTGGAGGGGGAGCGCTTCCTGGTTTCAATGGACCTGCTC 
ACTCCAGACTAGGAAGGGAGATTCCTAAGATGCX3GCGACCTGGGGAGGCCTGGGG 
TCACGCTGAGGATGGAATAGGAGAGTGTGCACACACGAGGCrCTGGGGCATTCGTTCCTGCAGACT 
GAGAAATCTGTCACTGGCCCTAGGCCCCTCCAGAGTTACTAAAGGCTCCTCT 
50 CCCCAGCACTCCCAGTGTGGGAGCTGGCCTGGGGCCACCX3TGGC 

TAGGGTCTGGCAGATCCCAGCATCCCAGCATCCAGGCCCAGCCCCAGCCCCAGAAGTATCCG 
TCATCACTCTCAGGGTCCCAGAACATCATAGGGCCTGCAAGGAGTTGGGGGAGGAGACT 
GGCTC^GGAACAGTGGGAAGAAGCAGGCXSCCAGAACACAACCGGAGTCCCT^ 
AGCCCAGCCCCAGGTTCTGGCTTCCTCTCITGCTTCrrCCCTCCTTTGGTTGC^GCCA 

5 5 CTGTCTCAGCTGCCCAGAAGCCTATGGGGGGGTGAGGGGATTGGGA 

CCAAGATGC&TGCCTGTTGTCTGACGGGTTGGGGTAGTGCC^^ 

AATGGAAGACCCTTCTGGCTGGGAAGAGGGTGCCGTGATTCTGGATTGGCACAGACTTGGGCAAAC^ 

GCATGGCTCCAAGGACCATACTGTAACTCTCTACGT AACAGCGC CTCTO CAG ACTTG 

CCCTGCTGCTGTCX3CCTGCAGCCGGTC 

6 0 CTCCCTCCG<^GCCGTCCTGAGCCCTCCTACCTCTCTGCC^ 

GATCCTCTGCTCC^CTCCGAGCCATGTGACCCGGTGGGCGG^CCGCAGCTCT 
GGTGAGTCCTCGCTCTTTGTTCTTCCAGCCCCTCACAGTTTGCT 
GGGTGCATTGGGACTCGGATCCGGGTAGCCCCGGGGCCTGGAACTCTGGTTGGC^ 
GCGGGATGCCGGGTCTCCACGCGGGAACCAGATTTTCGATCGCAGAA 

GGGCAAGAGCAGAGGTGTATTGAAGGCTTCTTGCAAAAATCT 
GCTGGCTGGAAATTAGTTCCTGCGGGCTGTTACAAAGGT^ 

GGACCCTTGTATTCTCTGGCT^TTGGTGATGTGCTGAGCCCAGGAGGGTGTTGCTGGGTGGGGTGGGGGTTTGGTATGCATTTGTT 
TTGCTAGAGAATGTGCAATGGGGGTGTCTGCCTTCTGGAGCCTTGTGTTGGAACCTGTGOT 

7 0 TGTCCTACAGCAGGCTCTGCAGCCAGGCGGAGACGAGAACCTGGGCCAG 

GAGGGAAATGCCTTCCCCTCCTGGAGACCGCCTGAGTTTCCGACCTACTGTGCGCACCCCCACCCCCACCCCAOT 

CTGCCGGCTCTTGGTCZAGAAGTCTGTAGAGAAGATCTGCTCCCCAAGAGAAGTG 

ACCCGGAAGCAGCCCAGTAGTGGGTTCTGGCCGCTCT 

AGGACAGTGAAGACGGGATCTCCTGCAATGACGTAC^CCCCCCCTCATTCACT^ 
7 5 CTCi RGTTTACCTTAGTTTCGCTTGGCAGCCAGGX^ 
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CAGCCAGCCTCCACCCCAGGCCTCCTGGG 
GGCAGCTTTGAACTTATGGCTGCCCCACTGTCTCAGCCTCCT 
GGGAAGGAACCTGASTTGTAGCTGACCTGGCTGACTCrc 
CTACTGGAGCCTGTTCATATGTCTACCGGGTGGGCCCT 
5 TCXSTGGGGTGAGGTGGTAGCCACXrrCGGTGGAAGCCAGCrCCAGAA 

ATAAGTCTCCATGCCAGGTCTAAGTGGGACCATCrrTGGGCCTGTGGOGTC 
ACATGGGTCAGGAGTCCCCTACCCATTGGGCAACC^ 
CTCCCCAGCrrTCCTGTTGTGTCATTGCTCCTCT 
CAGCCTGGGGCAGGCAGCACAGATTCCCTGC 
1 0 TCCTCCACACACTACCGTCTCCACTGGCTTTCCCAC^ 

TGGGAAGAACAGCTTTATTTTAATGCATGTAGGCAAAACCCCAGTGGCC^ 

TCCCCCTTCCTCCATAGACTAAGCCGATGTTGCCAGTTGAGCTTTAGGAAATGGTCTGGGAGTGOT 
CTTCCTACAGTTTGGTCTTTTGAAAAGTTTCCAG 

TCCCAGACTCTCTGCCTTCCAGCACTAAGCTTTGATTTGTTTATTGAGC^ 
1 5 CCAGGAGCGGCAGAGCCTTGGACTCCTTCCCAAGGAAGA 

AGTGGGGTGGGGCTGGGGGCTCCTACCTGCATCTGGGGAGGTGGGCATTTCTGTACTC 

TCTGCAGCCACAGCTGCAGAGTCTCCACCCA^ 

CCTGGCTCTTCCITTCCATTCTCTG 

AGGACTGTGGAGCCTGATCTTGTCTTCCTGCAGCC^ 
2 0 CACCACCACTGTCCCCAAGTCACTGGCACCGGGGTTGGGGTGTGGGGCCTGGGCA 

AG C C CTTG AAAGACACAGAGG CTTAGAGTTTGTGAGAAGGT CATCAGGAAAAG ATG CTG TCGTG ACTTT ATTC C CAG CACTTGGGA 

GGC AG AAG CAGG CAGCITT CTGTG AGTTCAAGGCCAG CCTGGTCT CTATACAAAAAACC 

CTGTCTTAAAAAAAAAAAAACAAAAAAACAAAAAAAAAAACAAC 

ATCTACTCC^TGTGGTAGCCGTCTOT 

2 5 GACACACAGACACACACGTTAAAATTGTGTTAAAAGATAAGCTATTGGCTGGCTGTGGA^ 

CTAATCTGTGTACAGTGCATGGATCTTGCCCAGGGCT 
GGCATCATTGAGCACCTCAGGCTGCTGGCrGGCCACATGAGGACT 
AGTTGTATGTCCCCCATCTCACCATCTTACTTTGAGTTTTTGAGAGAGGGACT 
TATACTTCTGCTCTTTCCAGGGACTGAACTCAAGACTT 

3 0 CTCCTTTTTGGTCATGTGAGGCGAAGTCTGATTCTGTA^ 

GTGGCACCGTGCAGGGTTTCTGCCTTGCTTTTGG^ 
TTCCACAGTGGAATGTACTCAGAGCTCAGTGCTCCAGG^ 

GGTGGAGGAGATAAGACAGCCIX3GTCCAACTGGGGAGTGAACCCCTTTGCTGTCCTATTGG 
TGGGCGTGCTGAGGCTGGCCCTCCTGTTCAGTGGTGATTTTAGTGTCCrCTCTGTTCTCAGTG 

3 5 GTGACCCCATTCTGGATGACCTGGGTGCTTGATT 

CTTTAGTCCAAATGTTTTCTGGTTTCACCAAGGC 

GAATTTT AAGGAAAG AGGACAGATAGGGTCTCTC CCTCTCCCTCT CCTGAGG AGGCGCTGGGG CTGGGGAGTTGGCTCACTGGGCC 
AAAGGCTTGCTGCACCATCATGATGATAGAGTTGGATCCTTCAGAATCTACAGCAAAGGACAGGTGTGTGACACT 
CCTAATCCCAGCACCAGGGAGGAGGCAGGTGGAGTCTCGCTCCTGGCCAGCCAGCCTGGTTGAATGGGTAAGCT 
40 ATGAGACC CTATGTCAAAATACAAAGANNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNl^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 

4 5 NNAGGACCCCCGTGGGAGCTAAAGGTTTGGGGTACAGGAGCGGGTCTTCTTTGCAGCAC 

GCAGCGGGTAGGATGTTTGTTGTGGGTTCTGAGACACrGACGTCTGTACAGCGGTCCGGTCCGT^ 

CTGGTGGTTGGCGGTGGAGGAATGGCGGAGAGGGATGCCGGTAGCCACTGGGGAGCAGAGGACTTTG 

TCTGTGACTAGTTTATACCCACCGGCTCTGAATAAGTAT 

GACCTGGTAACCGGTGACTGTAAACTTGGTAGTTGATGGCTGAGGTCTCTGTAGGGGTGCCTGAGCCTTTTCTGGTGTGTGTGTGT 

TTGGGGCTACCTCTTTGTATTTTGACATAGGGTCTCATTGC 

GATCCAACTCTATCATCATGATGGTGCAGCAAGCCTTTG 
GAGAAGTCCAjGCTTAATGTACACACTTGCTTA 
55 CCCTCTGCAACCAGAGGGACTGATTGCGAGTCTCCACCT 

TGGCATGTATTTAGGAGGGTGTTACTTCCTCCTCTTGGTGAC^ 
GAGTTGGACCGTTGGCTGTCAGTTCAGCTGCTTGTGTACCA^ 
GTCTTCATCCAGCTTTAATGGGTGCCCCTTGACTCTTATACCTTAGCACCCT 
GGTGGCTCAGCTCCCAGAGAGTGGAGCATCCTGTGTAAAGCCT^ 

6 0 AGTTAAGAAAGGGTGCCTGCTTTGGCTGTGACTAAGGCCTGTTTAGAAGATGAGGTGGAACA 

CATACTGCAGGGCGGTTGGGAGACCCACTGTGCTCATTCATGATGGGAGACCTATTATGGGTCAGGTTGG 
ATGATGAGAGACCCAATGTGGGTCAGGATGGGAGACCTACTGTGCTCAGGATGGGAGACCTACTGTGCTCA 
TGTGGGTCAGGAGACCAAGGTCAAAGTTCTGGTCTGCCTTTGCTATTGGACATGCAGCTGGCATATTGTG^ 
CTGGCTTTGAGACTAACTCACTGCAAGCITTTAACACT 
65 AAGACATGAGTTTGAGGCTGGCCCGGGCTACATAGTAAGATCTAG 

CCCATCTGCACAGGATTTGGAGAGTAGAGGTCTGAGTGTCCTGGGCTGCGGCTGCTGTCGAGAGCCAAGCATT 

CCAGAAGTTTCTGAATCTTGGGTAATATGTCCCAGAGCTTA^ 

GCCAAGGTAGCTCTGGTTTGAGGCAGGAGGGAGGGACCGCTCAGGATTTGGCAGTGTC 

7 0 GGGAAGTGTGGGTCCCTGGATCATGGAGATGTTACAAGTAAGGAA^ 

TACCCACCC TCAGACATTGCCTGCTTCCTGCTCGCCTGG CAACAGTCAGG CTGTTCCCAGGC CGACG CTTAGCTGTTCCTTTCCAG 

GTGTTGTGGCCACCTTCCCAGGCCCCTCCATGGTTAGCTGTGGA 

TCCTGAGGCTGCCCTGGGGGCCACAGCGACATCCTCC 

ACACATAGGATGTGCCATCGTGATTGTGACCCAGCCTTGGTCTGGGAGAAC^ 
7 5 TTTTTTTTCAAAAGGAGGGGCATAGATGACCn'CACTGTGGGCTTCA 
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CTAGTTCACAGGAGCTGCCCCrrGAGAGGCTTGGAGGTTTGGGAGTG^ 
CCTTTGTCCATCCAGAGGTCTTCCXCCCTTTGGGGCT 
GTGACTTCCTGGGAACGTTCAGCACCCAGGTTTGAGGGCCCAGCAGCCCCT 
GCAGGACTGAGAATAGAGCAAAGAGGACTCTACACGT^^ 

5 GGCCTCTGGCTTCGGGATTCCCTTGCCAGTCCAGGCCCGGGAAGGCACCAGAAAGCAGAAGCAGGA^ 
CTTGCCGGTAATTGGTGGG CCTGAGGGACTATGAATTGGATTCCTGATA CAGGAGAGT AGC CACAGGCT CACAATG 
AGGCT AGCC CCTT C CX5GTCT CTTC CTCACACTGGAGGCACTTCTTG CCT^ CAAG ATG 
CCTAAATGGGGTGCGGTGAGGTGGGGATAGACACCGTCCCCTC 

CGAAGGGACGTGGAATGTATGCAGGCGAGCATGGTGCCCCCGGGGCCACAGTTTATGGAGAACA 
10 CACCACCAC CACACAAATT AAACCAAAAATTAAATGTTAGC CAC CAATG CTTTAAGCAACITGCAGGAAATGTGGAGTCTT CTTC CC 

TGCACTGCCCCCATCCCTCTGTGACTGTATCCTAGCACTGTGTCT 

TGGGCACAGCTTCCAGAAGTTTGCCAGCCCCCCCCCCCCTTTGTCCCCTTGTCCTTGGGAGAAAT^^ 
TCCCAGCGCCTGGGTTGGAGAGGCCCTGGGAGAAGACTGGCTCCCAGGCTGCCGTTCCG^ 

GC CAGTTTT C CTGGACATCGGTTGTCT ACAT C CCAT AATAG AT CT AG CT CAGT CCTGGGTTTGAGAAGAGGTGG AAGAAGAGGCTG 
1 5 CAGGTGTAGGGATCTCAGCTGGGAGGAGGCTGTGGGACCCCCAGCCACTTGGGACCCCGCCCT^ 
TGAGAAACGTGTTATTCH'GGGCCTTCTTGCGATG 
TACTGTGAATGAATGGCTTGTGGACTGTAGAGAGTTTG 

CACTCAGGTCGGGAGTTTCCATTGAGTTCCTGGTTGCATCTCAGACACTAGGT^ 
TTGGGCCACCAAAACATGCTGTCTTGTATCCCCCCACCCACCCACTCTGGCTACAAAATATTGAAAG 
2 0 TGG CAGTGGAGTTT CTTCTAG ACAGCATTT AGAGATGGTAGCAAG AC CCAGTC CTTG ACATGGGC CGTAATAAGC 

AACAGAAGTGACATCGCCAGTGGAGGCCCAGGTGAGGGCCCTGTCCTCCGTATCTATATCTGCCCCTCGCTCTGCTCT 

AGTCTGAGCCIGGCATTAGGACAAGATGGAAAAAAGATA 

GCCTGACCTCAGTGGACCCTGATCTGAGTATCCTGGGA 

TGACAATATCCAGAAATTGTTAAAGTCACATTTGGGTGCCGGGACAGAGATTCATCATGCAC^ 

2 5 AGAAAGGGAATCGGGGAGTACCTCCAGGCCCCCATTGAGTTATCCAGCCAGGTTCACAGTGACCT 

G^TGAAGAGTCACGGGGTGCACTTACTATAGCCTGTG 

GAAGACCGAGGAGCAGGGCAGAGTACAGCAGCAGGGAAAGAATGTTC 

TTTTTrT . J rc TTTTTTT T rriT T AGACAGG^ 

GAACTCAGAAATCCGCCTGCCTCTG CCTCCCAAGTG CTGGGATTAAAGGCTTGTGCCACCACTGCCGGGCTAGACAGGCTGACTCT 

3 0 TAAGAGCCrTTTCTCAAGAGCTGAGAGGAGTTTGGAAGG 

TGTOTTACAGCTCAGATATTTGCTCAATCCCACTGTTCCATGGTACGTC^ 

ACCCTGGGTGCrTGGGGACACCGGCCATTGATTTGAAGCAAAGACAAGGTAGGACTAGGATATGTGTCAAAACTG 

TGGGGAGATGGCTCTGAGTAAAACGCGACCCAAGTTTC 
35 GGCATTAGTGTACCGCCAATGGGAGATAGAGGGATAA 

GAT AAAACAG AGGGGGG AC CATTGTGGTATAG CTGG AGACAGCATTCC CAAT AG TG GGGG CAG GGG CAGGGTAC C CAG AGTTCATT 

TACACT TCCAGACAGT TCTTGCAT AT AT AG ACTTGG ACAACAG T AGCT ATAG ATACCAAACACTT AGTG CAGG ACCTGGTATATGG 

AGTCTTGATACATGCCTGTTGAACTGTGATGAGCATTTGTCT^ 

TCCATCCATTCACCTGTCTGACCATCCATCTACC 
40 CC1RTCTAOCOQTQTCTOTCCATCTACCCM 

CTATCCATCCATCCGTTGTCCATTTGTGAGTCTGCCT^ 

GGTCATTCATTTCAACAGTGCX5TCATAGGCTGCCCTGAGGCATTTGCTCTACTTGG 

TGGTGTGAGGATCTTTCAACATACACTGGTGCCGAGGGATTCGTGTTCCCTGGTG 

CCCAGGACAGGGATCTGTAGCTGCAAGAGTGAGTGGCTC^TCTCAGTTCTTCTGCCTGTCCT^ 

4 5 AGGTTTCHTCn'TCCTTAAGGTCCTGTGGTCACGCCCrrCTTG 

CCTTCAGTCTCGTCCTGTCCCATGTCATGCCCTGACCCCC^ 
GTATCAGCTGACCTAGGCAGAAAAGATCTGTATCT 
TAGAGGGGTAGGGCGGGGTGTCTACTTTTTGTTATTTCAAA^ 
CAAGCTCTGGTGTATGTCAGCAGCTGACAGCAGCTGATAGGACA^ 
50 TTAGGGGACATCCCTGATAAGAGGAAATTCTTGTAA^ 

CCTTCTTTCTCCATAGACATTTGTCAAGAGCAAAGGGAATCT^^ 

AGTGGCGTCCTGGGTAGGGCTGGATCTTCAGGTGCAGCTGTTGCTGTTGGAGATTCCTGGGGGTGG^ 
CCCGATGCTGCTATGTAATAGATGACCCCAAGGATGGGCCCAAGCAGGGTGGTCAGTGGACCCC GGAA CCGGAACCGTAT 
CCCTGACCCAGAACTTTGGAGGGCrrCTGCAGAGGCC!TTAGTGTGCTGTTGGAAATCTGCTGTG 
55 GGGCTTGCAGGTTGCAGCCTGCCCTGAGCAAAGGTTGATG^ 

AACAGAAGGGGTCTCCTCGAGGGGGGTGGGGCGCCCCTGGACTTGGTCCA^ 
GTTGGCAGAAGTACAGGGCTCCTTGAGAGTCCTCCCTACAATAGAGGGGAGGT^ 
GGGACCCTACTCCCTTCTTTGGGATGCCTAGGGCCCCrrGCCTCTGCACACCTCCCCACCCT 
GGATGGGATGAAGTCTCAAACTTCTCTGTCCGTCCCAGGCCCTTATCGG 

6 0 TGCGGAAACCACATCAGGGCAGTGTTAAGTCCCCTCTTGGAACTTGGACTGGGGA^ 

TCCCTTTGCAAATCCCTCTCTACTGAGGCCAAAGCCAAGGCCAAGCCTTTC 
AGGAGGAGTCTCAGATTTCTGGCATCCATGCTTTTAACCAGACTCCAAGGAGACACTTCC 

TT CGATTTG ACTT ACTT AT CAAATTAGATGAGCCATGTGGAGTTTT AAAAAGTT CGG C TTTTGAAA.C CGGG CACATGAATTTG CAA 
GATTTGATCATTTATATGCAGTTTCCATCTTTGGAACAGGAC^ 

6 5 TGAGTC^CAGGGGCCAGGGCACTGTGAAGGCAGGGTGAA 

TCTGCCC^CCCATGTGCTGACAACCTGGGTCGAGCCGCCCAGGGAGTCAGCTGCTAGAGCCTTGA 
AGACTTATGTCCCATGCTCCTIX1ACTCTGTATCCCTC 
TGTGTGGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTX3TGTCCATACGTGTG 
AATATGCTGACCTTCGTCAATCACCTTTT^ 

7 0 CTGTTTCCACAGCCTCCAGCTCTGGCACATAGAG 

ATTCCATAACAAGTGTTTTTATCCACTGAGCCAT^ 

GAGTCTGTGGGGGTTTGTCGTGGCTCCATTTAAAATCTTACTCTGTAAAGCTTC 
CCACTGGTCAGTTTCTTAGCACAGTGTATCTGCTGACTGACCT 
TACTGTCX5CTCTCTCTCCAGGGCGAGTCCTGCTCCT 
7t; RTTTTATTTTTTTCCTCACCTTTATTTCCCCAGTGGGCrCATTAGGCTCCTCTCT 
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ACCTCTGAGCCTTTGTGAGTCCCCCTGTCTCCTCGC 

CCAGGTGTGGCTCCAATGGAGAGTTCTAGTCTGGCCTCCTTGTTCTAATGTGT 
GCCCTACTTGAGGCTCAGTAGCTGGTGGTAAAGAACA^ 
ATTAAAGTGATACTACCCC^GGAAACCIXKXSAAGGG^ 
5 CTGCCTTCAGACTCTGGGGGTTTAAGTTOU^GCCCCAACACAGGG 
GTGGTGCATTTTGGGATGCTCATCTTACOGCAGATC 
TGTCCTTGGCAAGCCTGTTAGCCTGAGCCTTC 

CCCTCCTAAATTGTX3GCCTACGGCCCTGTCCCAGTAAA1X5TGAAGGGTGATATAA 
GGGAGAATGTGTTACACAGTGAGTAAGTCTGCAGATGGTGGTGGTTGGACCCACCACCC^ 
1 0 GG TGTCTGGGG CAGGAA.CAGGATGGGGTAGATGGGC CTGCACACACAGTAGGAC C CACGGTCT ACCAAAAGTAG CTTTAGCTACTT 

ACCACCCCTCAGAAACTCCTCrGGCTGGGAGATGACTGGCTGGCT 
AGAATCTCAGCTTGAGAATCTCAGCTTGAGTGTCAGTGTCTGGGGCTCA^ 
GCCTCTCT^TCCAACTAGGCTATAAACAGCTGCCCGGTGTGGCCCCCACCCCT 

TTT ATTCATGATGTCATGT AAC CATTGCrCCnn*CT AATTCGGTGCATTTTATCAC C CTGGAGGG AAACGT CATGCTGG ATAAAGA 
CACTTGCTACACTTATTAATGCAGCTGTGAACAACTTAGCGCTCTCT 

TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTATGTGCATGTACCTACTCCACAGCACTC 
GGGAGTGTCTGTTCTCTAGTCCCAGAGACAACGCTCT 
2 0 AGTCATTTCTGCTTCTTGCTGGGCTAGGACTCAAGGATGCC^ 

GGGAACACACTCCTGACCAGTAGGCAGGTGTC^GAGCACAGAACTCATGTCCAAG 

AAGGTCTTGCCTCTGCTCATGTGACCATCCAGTGGCATTTCAGGAAAGAGTCAGCGTGTAGACCTC^ 

GAGTGCCAGGCTGCCTGATACACGTGTCAGGCTCACGTAGGTATCTAAGACTGAGATGAGCTAAAATG 

TTGCTG AATTAG CT CG GG AGTCACAGGCAG ATGCACTTGGCTGGTT CCTCAGAAATAAAAAT AGAACGG CT CACATCTGGATGT AT 

2 5 GTCCCAAACCATCGAAAGAATGCTCTTGCTAGGCATGGTTGGTGTATGCCTC 

TCATTGAGTCTGAAGCTAGACTGTATATAAGAGAGAGCCTCAGGGAAGGGGGGAGAGAGGGAGAGCTAGCCT 
AGCTCTAAGGGTGTTAATTATTCTGTATGAAATAAGGCAAAAGCCAGAGATCAT 

GGTCGCTCCTGGAGCTCCAGGTCTGGGATCCCAGG CAACTCTGGGTGGAGGGGTCTAC CTCCTAACCGGGCAGGCAGGTGGATGGG 
GAGGCCCTCCCAGGAGGAAGGGTCACAGGGGTAGG CCAGAGATGTCCAGAGCCCCTGG AGAGCATGGGG CTCTACTGTGTTCTTAT 

3 0 TATTTCTCAAGCAGGCCCTCTGGCAGGCTGGATGTTTAGCTTGGTTTGGGCT 

CCTGGGGCCCGAGGTGTGCCTTCCTTGACTCGGGGCGATTAGGGAGAGTTGGGGGACAGCC^ 
GAGGTAGGGGGAGGGCAAAGTGAGTCTCAGACTCATAAACAGATGGAAGCAATTTTCAGTCATGTG 
ACGGCAAGGCAGGGCCCCGTCTCGTTCTCCCAGGAAGCGGGCCCC^AAGCCACAGGTTCTAGG 
TTATTTAAAGATGCCCGAAACACAGGAGAGTATTTCTGAGTGGAGAGCTGAC^ 

3 5 TGGGTTGGGGATGCTCCTGATGGCAGGCACTGTCCCTTAGGACTGTCTCT 

CCTGCTGTGTTTACAAGCATTCTCTTGTTCAA 

CTGCACCCTGTGGAGGCCGAGGAACAGGAAGTTACTGAC CAGTAAACTAGATGCCTGTGGGGGTGGTCCCCACATCTC CTTCCTG A 
AGCTAGCTTTGCAAGCCATTTCTCCTGTCCTTTCTGCCACTC 

CCGCTGGACATGCTAGTAAAACGGAGAGAAAAGCTGCCAGGCCAGAGATGGTCAGGGTTGAGCGAGCTGAG 

4 0 TCAGCTTGAGT TT AGTGGTTT CC C CGGACGTGG AG AGACAGTTTAACCAAAG CCTG AAAAAG CC CCTATGCTACAGACCCATCATT 

TTGCTGCTTCCTGGCCCCTGCTCACTTGGAATGGTGCC^ 

AGAAGGCACAAGTATTCATTGAGCCTCTTTGAGGTTAGCGTGCGTGGGAACAGACATAAGGA 
ACCCCTCTGTCCTGGCTGTGTTTGGGGGGCTGGGCTGTGGGGTCATAAGTTGAGACTGCTGACATTTTCCAT^ 
GTGCCTCTCTAGGGCTTATCAGCAATGCATCAGCTCTACCTCG^ 
45 TATCrCAGAATAACAGTTAGGTATACTGTAAGC^TTTTGTTCAACCAGACCGGGTGGAACCAGCCTTGCACA 

GAGCACCTACGGTCTCCATCTGAATCATCCCCCAGG CTTGGGCTCTGACCATGCATGCACTCAG CCACAGG AGGCTCTAACCCTGA 
CCTACTGAAGGAAGCTGAAGTCTTCAAGTTTAACCCCTAGCCrrGTCCACAGACCTACAAACC 
AGGGCACTTGCTGTCGGCCGTAGCCTGTTTCCTGCCTATCC 
CCAGTGAGGATGAGTCAGGTCTGGAAGAGGCAGGTAGAAATCA 

5 0 CGTGAAGGGCCCTGAGTGCTAGCTACAGTTACTGTAAGCTCTGAGC^ 

GCCAACAGAATATGGAACAAAGTCCTTGCn3CCCACCTGGAACCAGGGGAGCCCTGGTTGGCA^ 

ATGGCTGAGTGGTGTGGACCAGTTGAATGCTACTGTAGGGGATTT 

CGACCCCTGCAGCCTTGGTTTCCCATCTGAGTGCCATGCTGGCTGCAGATCCCA 

CCGAGGTGACTCCCAGCTGTGTTGGCGACATGGCTGACAACCCTAGAGATGCCATG CT CAAG CAGGCCCCCGCGTCACGGAACGAA 

5 5 AAGGC CCC CATGG AGTT CGG CT ATGTGGGGAT CGACTCCAT CCTGG AGCAGATG CGCAGG AAGG CTATG AAACAGGGCTTCG AGTT 

TAACATCATGGTGGTTGGTGAGTGTTACAGGCGATTCX^GGGGCTAGCCT 
GAGTTCTGGGATTAAGAGTGCCACCATATCTGGCCTATCAGCTCTCCAG^ 

CCTAAACCAATAGAGTCAGATAGGCTTCCGACTGCAGGGCCTCTGTCTGGTGAATTGTCAGGATGTCTGTTCrr 
CACAGCCTACACATCTCTTAGGGGCTGTTGCAGCTACCTAGGC^ 

6 0 CTTCCTAGGACCAGGAAGGAGCTCTGCTGAGTTAAACCAGGCCCCAAATCCCCACATTGGGGTTTA 

CTGAGGAGGTGAAGCCTCAGGCAGGGCTATCCAAGGATGTCCTGG 

TTCCAACCGG CCTTCCCTTCTCTCCCCAGGGCAGAGCGGCCTCGGGAAGTC CACTT^AATCAATACCCTCTTCAAGTCCAAAATCA 
GCCGGAAGTCGGTGCAGCCCACCTCGGAGGAACGCATCCCCAAGA 

GGCTGGGTGTAGGGAOTGCGTCTTCCCAGGGTGTGGGGTCAGAGCATAGCACTAGAGAGGTC^ 

6 5 ACTGACTAATAGTGGGCTGGATTCCCATGGGCCCACCAGGCTGA 

GCTGGCACCACCCAGAAGAGGTCACTTGCTCCTTGGCATGGCCTGAGCT 
AGGCAACCTGAAGGCTTGGGGTGACCCGCCTGACTTGGGGTGCTATGGTT^ 
CTGACAGTGATTGACACGCCGGGCOTCGGAGACCACATCAACAATGAGAACTGG 
AAAGCCCCACTGGTCTTGGGGCTCCATGATACCACTGGAGGCCAGACCCCCCTGCCATA 

7 0 ATTTGGGGTCTAACATTCTAGATClAAAATGGGGTCACCCGGCrCCCATCrGGGGTG 

GGGG AGT ATGG CAG AAAATCTGAAAT AAGGCTGGTTTG T CAGCATT CCTGT ATG AAATCAG AGGCTGGTGT CAA 

GCCAGTTCTGGGAATTAGCCTGTTATAGGGGCCTGGGAGGGCGGGCTTAGAGAGGACGCGGG 
GCTAACCCGGCTCTCCATGCAGCTGGCAGCCCATTATGAAG 
ATCAACCGGAAGAAGCGCATTCCCGACACCCGTGTCCACTGCTGCCT 
7 5 CTGGGCCCTTTCTCTTGATGGTGGGAATCCCCAGTTC^ 
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GAGCTGTGTCAGCAACTGTCATGGGGCCTCTC 

CCTTCTTCCOTTGTGCACAGAGCTGAGGACAGGGCAGACTCTG^ 

AGACTCTCGGGCTGTACATGTGCCTGCCCCCATTTCCCGTCCTTGACAAAGATGCT^ 

GTGCCTTTCAGAATCTCCCTTTGGAATCTCCCGGGGCAGCTAGACTGTCAG 

GTCTCATCTGTTCCCAGTACATCCGTGTTGTTCCTCTCCTTGCTGGATATCCTGGTCGOT 

TCTAGTCCTGGGCTCAGAGCAGCAGGAGCCGGAGGTG<OT 

GCACACTCCCCGGCTGTTCCTGGCTCATCCCTGGGTGTAAGAGCGTGGG 

GGCAAAACTCTCACTGGGGTCTGGTTGGAAAAACC 

CAAGAGCCAGTT CTGGGAATTAGC CTGTT ATAGGGGCCTGGGAGGGCGGG CT T AGAGAGGACGCGGGCAGAAGGGAGCAT TCAGGG 
TGCAGGCTAACCCGGCATGCCATTGG CCTCAG CTGGGG CTGGCITGTTTGGGGTTCTGT AGCTGCTCCGAGCACAG CCTCGGGCAA 
GGCCAAGGCTGACAGAGGAGACATGXSCrGACATG^ 
ACTAGGGCCCTTCCTCCAGCCGCTCAGTCCCAGGGGACAGTAG 

CAGCCTCTGCCCCATCCCTGGATCAGACCCTGGGTCTCTGCAGCCAGGCTGAC 
CTTCTGGATACCGTCCCTCGTCTCTCCTTTTCCCTGGC^ 

GGTGGAC^GTGTCCAGGGCCAGTTGGACTCTGAAGATOTGTGGATGAGGGAGCT 

CAGCTCCATGAGGGGAGAGTGCTCACCCTACTAAGGCCCAACCTTACCCCTCCA^ 

AGTCTTTTGCACTGTTCTGTTTCAGGCTCAGGCCCCTGGACATTGA^ 

GTCATCGCCAAGGCTGACACGCTGACCCTGGAGGAGAGGGTCTACTTCAAACAGCG 

TCCCCTCCAGGACTGCCCAGAOUVGTGACAGGCTTGACAGGGGCCTG 

GCTCTTCACAGCATGTGTTCAGCCTTGGGGGAGCCCCTGGCCTTTGTGGATGTGGGG 

TTCAGAAGGGAGTAG ATCAAGCAGATGCAAATGTGTCAGGG C CATTGGAC C CAG CATTC CTG AGTTTT ATT AAGTGGGACAGTC CA 

CCTCAGTGGCTTCCTCCTTCTCTGCCATGGTCAGGGCCACTGCTGTCT^ 

TAAGCCCTGGGCCTCTTCTGTGTGAGGTGACTTTGGAGCCTOT 

TTGCAGAGG CTACCAGGGCTT CCTTCCCTAGC TGT GTGTTGGGGATTTT C C CAGCATG CAT AGG GGAGGGT AGT CAGGG ATG CAT C 

TGCTGGAGAAGAGACCAGGATGCTCCCCGGTC^ 

CACCCTATATTGACAGCCACGTGGGCTGTCAGTAAGTCG 

GCTCATCCTGAGAGGGTTTTCGGAAAGAGGGCGCAGCAAGCCTG^ 

AG CGCAGGGAGACTCAGGACAATGG CTCGAGGGCCCCTCGGGACAGGCGAGTGCTTCTTGATGCCTAAGGCTTCAG AGC 

CCTCGG03GGGGTGGAACTCAGGCGGTAGAGTGCTCTCCTACCATCGAGAGGG 

AGCATGATAACTGTCGCCTCCCAGGATTAGGAGGTAGAGGCAGGAGGTTCACAA^ 

TGGGGCTAGCCTGAGCTACAGACTTTGTTrCCAAAACGTGG 

GTTCAATTCGCAGCACCCACGTGGCAGCTCACAACCAGCTCCAAGAGAACT 

CATACATGTGGT ACACATGCAGG TG ACACACTTGT AT AT AT AAAGTAAATGTAAAAT GAATCTTT AAAT AAAT T AG ACATCCTCAG 

GGTGGGCCCACTCAGAGCCTGCCTTTGTTAGGGGATCATGGCTCAGGCACCTGAGTGGCTGATAGCCT 

CAGATCACTGCAGACCTGCTGTCCAACGGCATTGACGTGTA 

CGAAAAGTTCCGGGTCAGTGGGTCAATCTGCCTCTGGGAGGTCAGGGGCGGTGGACAGATA 
GGTCCTCTGAAGGCAAGGCCTGCAGTTTTCCAGGCCCAAGACACAGGGTGT^ 

TAACTCTTCCGTC CCTCTTTCAGG AGATGATCCCATTTGCTGTGGTGGGCAGCGAC CATGAGTATCAAGT CAATGGCAAGAGGATT 

CTG GGAAGGAAGACCAAGTGGGG CACTATCG AAGGTATTGGCTG CTATGCTGCTG CAGTGTTG AAG CTGTAGTTGGG CCTGGGC CG 

TCACCCATTGCGCTGTGGGTCCCTGTCTCTTGTAA 

TGACTTTGCCAGACTTTTGTTACTTTGTCCCACGCACTGAGG^ 

TGGGTCCAGGGATTTTCrrGGCrCTTCrcrCTACTCTrCCACCCT 

ATGAAAACCCACTTTGTTCCTTGTGACCAGTCTGGTCA 

TCCCAGTTCTCTAGGGCCCCAGTCTTGCACCCTTCATAGGTGCAGGCTAGACCT 
CCCAGGATGACCTGGTACTCAAGTCCTCCTGCTTGGCCTCCGGAGCIACTGGGATCATAGGTGCCAC^ 
CCGAGGAGCAGTTTACTCTAAGGCTCTGGTCTGGTTGGTGCTTC^ 
CATAGTGGATTACCATTTAACCCTTTCAGTTCCCCAG^ 

CAGTTCCTCATGCTGTGGTGACCTCCAACCATAAAATTATTCTCGTTGCTACrTCA 

TGAAC^TCTGACACTAGCAGGATGCCrrC^GGTTGTAAGTTTGCAACTGTTACAAATCGTAAACAT 

GTTGTGACCCGTAGGTGGAGAACCACTGAGTCACTGGGCTAGTAACCATTGGTCAGG 

CTCTGCCCCCCAGAAACCAGGTTTGCTGTTTCTGTGCTGTTTCTGCGGGATCTATGA 

GGTCAGGCACTCCTTCATTTTCACCCACCCrrACAGTGAGCTCGGAGCC^ 

CAAGGGAGGGAAAGCAGAGTAAGAAAACAGAATGAATCGACAGCCAGG^ 

CCTTGCACTGGAGACCCTGGAGTTGGAATTGTTGCCTGGTCTTTCrGAGGTGACCGCTGAAGAGGGCC 
TTCCCCTACTCTCCCCTCCTGTGGGTCTGCCAGGTGGGGTGTCCTGGGGACAGTCTCTGGGTGGTTT^ 

TAG CACTCCCCTCAATGTCAGGTGTTGACCCCAGAAGTGACTCTAG CTGTTCTCTCAATAGCTTGTCCTCAG 
CTTGGTTCCCCTCAGTCGTTCCTTCCCATCTTTGATGTCTTCCTCATCCTCT 

GAGGAGTAAAGTTTGGGGTAAAAGCCCTGGGACTCTAGGAGAGGCATTGGATACTAACIGTGTCTCTTCTGGCC 
TTGGGTACTGCATGCACTCAATGAGAACTCATTTAGCC^GGTTAGATTAOT CCCAG CTAAGGATCCAGGCAGCCTCTGGGTAC 
GCCCCCTGGTGGTCGTTCCCACATCCAGCATCCTTGTAAAGTGAACCT 
TTACTAAGTCCCATCCAATGGGGCCTACTCCTAATCCTTGGTACGCACCATCTGAAGT 

AGTACTAC CTGGAGG ATG GAAAG CT ACT TGGTTCCTCAAAAG CGT C^C AAAGTCAG AAT ACATCCTTGG CT CATTTGGT CTTTTGT 

GCCCTTAAGCTCCAAGCCATGCTGATCTCCCXSTTTAAATTTCrrCT 

GGC^CGCAC^TGCCAAAACACC^TGTAGTAGTAAGACGACAGCTTGCAAGAGTTGG 

ATCAAACTGCAGTCATTGGGTTTGGCGGCAAATGCCOT 

GCCAACAGAACTCACCCCAGTCCAGAACACAAGGAATGAGGGGTTTAT^ 

GGTGTCATGGCGGCACGACAATAGACTGTTGATCGCTGTGTAT 

CTTTCTTTTTTATTCAACTGGGAAA^ 

CATAGACACAC TAGAGGCGAGTCTCCCTGTTGACTCCGAAACC3VTCG AGTGGCTGTCAATC^ATC CTG CC CA GTGGGCGGGGCTTTC 

TGGAAGCTGTTGGCTTTCCCXSGGGAGGAGAGGAG'GGGCT^ 

CCAOTCACTGTGAATTTGCTTACCTGCGGGATCTCCTTATCAGGTG^ 

GTTGGTGGTGGTAGATGGGCGGGAAAGGCCCCACCCTGGrC^ 

AACAGAAACCTGGGAATGGATGAAGAAATGAGGATTCTTC 

CAGTGGGCTAAAC^TGCTAGATTCAGCTATGGCTACrrGTCCACACAGCACAGCCTGG 
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TAAAAGAAAAAATTCATTTGTGTGATTA^ 

ATTCOlGGAATAAAGCCTTCCGGGCTCOTCACCTTTCCCACrCC 

ATAGATAGAAGTTGCCTGGC^GGCITTAAGGCTCT 

CAGGGTAGGCAGCGCTGGGGACTGTGGTCATC^CTGCCTCCT 

AGTCCTGCTCACTGGTGAAGAAGGCCTTGAGACCrrTCCACAGCT 

TGACT AG C CAATG TGAACACAG AGACGGGGGTACGGGTTAGAATGTGAGG CCACAGAAGGTCTAAAGTTACCAGAGAGCGAGTGAA 



GGCTAAGTGGTCZACCCCCTCCCCCCCAATGCTGGCCATGTTCCCTGCAGGACGCACATGCAA 

AT CCACT T CG AAG CCTAC CGAGTGAAACGCCTC!AACGAGGGCAACAGCG CCATGG C CAACGGGAT CG AGAAGGAGC CGGAAGCCCA 
GGAGATGTAGAATGCGTCCCGCCCCTGGACCCCACCCCCAGATCTITTTCATCAT 

ATATAATTATCT CCTTGTCACCTG CCTCCAT CCATCTCTTC CCACACTTTG CCAGGT AACAAGAGAGGGTTTACCTCC CAAGTGTG 

CTCTTATTGGCTGCAGCAGCAGGGTGGGCGGGGCTAAGCCTGGGCTTGCCTCTGTC 

GGGTTAGAAGAGCTATGTGTCCGTCCCCCGCTCTGAGT^ 

CCGTTAGACCACAAGACCCCATGGCCGCAGCCTCAAGCAGGTTAGAGACTGCCCC^ 
AAACCTCACCTGCCCCTCCGGGGGCGTTTCTTACAGCGCCCTCAGCTGCCTGCCCCTCAA 

TTG CCAAT CACTT AGACAAAGTGACAAC CGTGC C C CTCGAGCTTGT CC CAGAG CAGAAAGTG CCTTG AT CT ACAAG AGCCAGTCAC 

CTCTTCCCAGATGTCCCT3TGGGTGAAAAGCAGGGACGTGCT 

TCCCCTGTGCTGTAGATATCGCTACTACACTGGGCT 

CCGCCCCCAATGTAAG GAAGGT CAAAG CAAGAAGATGAGTCGAAGC CATGAGGGAGGAAG C CGTGG AAGGG AGGCAT AAGAGTGTG 

TGGGAGCTCTCCTGCCCAGGTGCCGCGGAAGGCATATCCGCXSTGGTCCTCAGTTTGGGCCAA^ 

CGCTGCCTCACCCTGTCCCACCCCACACACCCCAGGCTCAAGCCTTGATGATTCAGTGACr 

ACCATTTTGTTGTCTACATGAGCCTAGACTAGCCCTGTGCCCCAGAAC 

TCAGAGATGGATAGCCAGGCTCACTCATCTTCTCTCAG 

TCCTGCCTGCCTCCCATTCTCATTTGCCCCTCAGGAGAAAGTTTGGTGAGGTGCGTO 
TACAGACCCAGGGCTAGCTTTCTGCAGCCGAGAAGTG^ 
CTTTGGGCGTCTACTCCCTCATCCAAGCATGGAAGGGCCCCTCGTCATGCCCT 
TTCTTTTGCCAAAATCAGGACTTTGAAGGAAATCTCCAGCACX5CTGG 

GGGGATTTAGCAGCAACCCGGTCATCAGGATGACTTTGTGCCTCTGTAACCAGGTATTGAAACTC 

ACGTGGCCCTCTGCTGGGGGCTTCCTGGTGGGGGG'GCAGAAAAGTCTOTCTCTCT 

AAATAACAAAGTGTTGAAATGTATTTCCTGAAATAAATGTTTCAAATAC^ 

CTGGTCCATTAAAGGTGGTGAGGGGAGCACGGGAGTAGAGGCTCCCAGTGGAAGGAAGCT 

GGAATCAGCAGCCCAGTGGTGAAAAGTTAGCCACACCCACTGCCTGTAATCCAGCTTGT 

CTGGTGGTACCTGCACACATGTGGCAGACAGATATACACATAAAT 

AGGAAAGGGGAGGGTCCCTCTCraTC 

ACTAGTGTACACTCTCACC^TTCGGAAGCCCGGGA 

GGACTGCGTTATTTC^GGTCCATGCACTTTGCT 

TCTGTAAAGATCCTGTGTTCAAATAAAGTTTTATTCTAAGCTTCTGTGC^ 

TTCCTGCGTCTCAGGGX3TTCCAGAGCTACAGGGACAGCATGAGAGAACAACCCGAGGCT 

GGTCrAAGGAGTGACCCTACCTCTOTGGCCTGCCCACACrCTTCCGAAGGCCACCTGAAGACAAAC 

TTGTGCTCAGCGGCCTCATGGAACCAAGCATCAAGTTGTTGATTTTGGCTGTTGATGGCACTGG 

GGK2AGCTTGCTTGAGTCCCCCAGTTCTAAGTTCCTTGAC 

GCATCGCCAGTCCCCTCCAGCCCCAGGTGCTTAGGCAGTCGTCAGCTCATTAACCT^ 

TCCAAGCCAGTGATGGGGGAAACCTCACCTCCCACTCAAGCAGACACTGTAGGACCAGAGTGGCCAGCCGTTC 

AACGTGGGCTTTCTTGTGCCTCCTAGCTAGCAATGAGGGAATTCTCACCTTTCTACT 

AAGAGG^U\AAAAGTTCAGGTG CTGGGACCCACATT CAGG AACTCAAGCTT ATGC7ATGTGGGCTCCAGCCCTGAATGTAAATGTGG 
GCTTTC CATG G AACTAT CAGAG CT AG AACCAAAG GGAGG AGGGAGGGGAAG CTGGACAAAGG ACAAGGTGG CTT CATGGTGTG GG A 
CCCCATGCCAGTGACTTTCTTCCCTCCTATCCAACTCC^^ 

GTCACTAAGGC7VGGAGCGGGAAGACCCTGCAGCAGGCCCTGCTCTTGATGGCTTA 
GTCTGAAAATGCCAGGCATCCCCCCCCCCATTACAGGCCTCAGG 
TGAGGCCTGCTCAGTGCAGACCTCTCAGGCTGGTCAAAGTCTGAAGCGCT 
TGTTTGCAGGTATCCCnTGTGTGTGTGTGTGTGTGTGTGTCTGTGTGTGTGTGTGTGTGTC 

TGTGGAGG CCAG AGGACAAC C TT AG TTGTTTTGTTT ATTT ACTTG AG ACAGTCTGGG CT AGCCGGC CAGCAAG CCC CATGGG T CTG 
CCCGTTTGGCCACrTTGCTGCTGGGGTTATGTGTATGCCTCCATGCCTGGCTTTACCCGOT 

GTG CT CC CATAACGAG CACCTTG CTGACTGAGC CATCT C T C CACT CAAAAGATCAAAGTTTCTTTATTTAATGTGTGTAGATGTCT 

TACCTACATGTATACAGGTGTGTGCCTGATGCCCGAGGAAACCAGAAGAGGGGG^GGATCGCTAGGGCT 

CCGTGGGCX-jCTGAGGATGGAAATTGAGCAACTGGT^ 

AAAGATTTATTTATTGATTATTTCTAAGTACACTATAGCrGTCTTCAGACACT 
TTGTGAQCCACCATGTGGTTGCTGGGATTTGAACTCTGGACCTTCa 

CAG CC CCTCTGTACT CGT TTT AAAACACCACTTT CTGG CT ACAATT CT AGCC TTATTTAAT ATAG AAAATGGAGAAAAAATT ACAC 

AACAGAGAAATAGCCCCATTCACACCCAGTAATGTTGCCTATCCTGACACTGK3 

TGACCCCAGCATTTGAGACACCGAGGCAAGAATCTCAGGGGTTTGAG^ 

TGACCCJAGGCCCCTGTGTCTTCTGCCCAGAAGGACTCCTCAGCCAGGGGCTTGCCCT 

GAAGGAGGGGGGAGGGAGAAATCTTGTCCCCATCCTCACTTCA.TTCCCCTGCCTCCCT 

TTGGGGACCIAGTGTAGGATTGAGGTGCAGTTCCTGAGTTCAAGTGGAAATTCCTTCCAGATGCT 

GAGGGGCCTATGTAGCAGCTGAGTGCCCAT^^ 

TTGCTAGGAGCCCTTAGAGAGTCCTGGTGCAGAGGGTTGTCTGCCTTCATGCCCAGTC 
TATGCCGCCTCCAAACCTGGTTTCCTGCTGCCAAGGGCC^ 
CCAGCCTGCCTTATGCCAGCriTACGTCATCACCCAC^ 



AATGTCTTGCATTAGATCTGTTGCGTGATGTGATCTCTTAACGGTGCGCCI^ 
CTTATCCTAACAGTAAGACCAGAATTAGAGGCATCACACACAGACTA 

AG AG AACCAG AG C C CAGACTG GCT AT AT ACAAAC CCTGTC (^TGAAAG ACTAAAGTAAG ACTT AATAAAGAAAG ATGT AAAAT AAG 
ACTCACATCACTGATAACTCATACAGTACCTGCACACTATAAAACATTTGA^ 

*£ r T* r T , * r r | a Tr: AT&T ATT AAAATAAAAAATAACATATAATJUU^TATAAAAATAAAAATGTATG CATG TATGTGTGTGTAT 
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GCATGCCTGCGTCTAGGATGTGTGTGTGTGTC 

TGTGTTGCATGCATG CATG CATGTGAGTGTATGT ACT CACGGTGCAQATGGAGAGAGG ACAAAACAGGAGATCAAGAGTCT CTCAG 

tgccctgtgacaggtcttctcattgcgcxjagaagct^ 
tctgcatccagaagtacaagcagggattccttctagagccaaag^ 

tggatgtggagagggtgaggcttc 

cctaccacactgggcag tat ctccatcaact acaag ccataaaaa.caaracaaat aaaaat cct c ccttaagt ctcttctatcaga 
tatttggcaacaatgataaaat aactgtaattctttgtc ctcctgt ccagc ctt c ccttcctggcttagggatagcc 

agagc ctcctactgagtgagcgcctgcgtg ctgacagtgaatttt aggtcag aggtgtgatggttagt cctcattgt caactgg at 
1 0 ttgggaccacctaggacacagatgtgtttataagcacatttct agga cttg agtg 
aattaaaaagaaaaggaaaactggtgagcccttccaaggaccagcatgcattgct 
gctacctccatcrccrrtccaccatgatagactatcacctctccaaagcgggctgaggtc 

tcacat atttctgt cacagtagcaagat aggtagag ctg agacacaaaaccaccccag cggtg ccctt cctagattg cat acggaa 
gcattcctggaggcacaggttcaagctgggccagtggacaggc^ 
1 5 tcagagggcagagtagaagtggggtgtctcittctgcrgtgtggccttagacaagc 

ag g cattgt ttgatgccaatg aggaaggagggaactctgtgttt cattgctgtgataaag agt ccaag gaaaacagc tttgaggag 

gaaagatggattttggctcagaggttggaggtctcaagccacjaag 

ggaccttgcatcgactcaccaggttcaggtcn^ 

ATCCGTGGAGCCTGAGCATGTGTGTTCGTGTGTCCAAT^ 
2 0 TCTGCAGGATCAGATGGACCCAGATCACATCTGAAACACAGTTTCOT 
TTCTGCA.CCCAGCGCAGCTCCTGAGCA.CCTGGGATGCCAOT 
G AAAGAT CAT CTTCCTTTCCTTT CGTAT AAGGTGCTTATAAAGAATC TG AAGA 
CAAAGGGATCTTGATCTGTCTGCTTGAATTAAAAAACCATCACT 
ACCCACAGTGCC^CTGCCTGCTCTTCCCCCTTTCACTCCCT 

2 5 TCAGCTCTTCAGACAGCCCTGCCACCCAGACTATCAC 

AGAAACGGCTTGACCTCTGTAGAAGGGTCCTTTTGTGGGAGA 
CCTGTCCAGCCAAGGTGGGCAAGGCTCAGAACTAGGGTTTCTACOT 
CTTTTGCCTCTGCCTCTTTTTACAGAGTTGTGGCTGJATCTC 
GGAGGCTGGGTACGGGGACGTAGGATCTGTTTGTCTGTCTA 
30 TATATGGTAGGCAAGTGCTCTCCCCTGAGCAACAGCCCAGCCTCTaSAGCT 
TGTTCACATGAAGACCTGTCCTCTCCTCACTGTTCCCCCAATA 
TGTCTGTTACAGTTTGAATGTGATTGTGCCACAGGCCCCTGTGCTTAGTCCCC^ 

TTAGAAGGTGGAGCTAGCCGCAGGAGTGGCCACAGCTCTTACTCTGAG CTCTCTGATCAGCTCCATATAG CAGCTGTCCCCTCACC 
CTGGCCACCAGGAGCCACXaAGCAGCCATGTCTTCCCCAC 

3 5 GGAACTCTCCGTAGATCAGGCTGTCC1TCGAACTCACAGAGATCCTCT 

CCCCCACTGCTATGGCAGACTTCTTAAACrGAOTCAAAAGAACCT 

AATGAAAAGGGAAATTAACGTTGACAAATAACCCCCACATTCCTACCTCATGATCTCATGATCTTGTTTTC 

TCCTGCAACTTATTCCTCTGTCCCCAAATCATGCCTTTTAAAAAAAAAGTCAAA^ 

GTGTGGAGGAGAGAAATGGATGTTGCCAGAAACTCTGACGTG 

4 0 TGTGGTGOTGTGGATGAAAACAGCCCCAGTGGCTCATATATTTAAATGTTTGGT^ 

GGAGGCX3TGGCCTTGTTGGAGGAGGTGTGTCATGGAGGGTGGGCTTTGAGGTTTTAAATCCT 
NNNNNNNNNNNNNNN^^ 
NNNNNNNNNNN1«WNNNNNNNNN^ 
NNNNNNNNNNNNNNNNN^ 
45 CTGCTGCTCCAGCACCATGCCTGCCTGCCTCCTG^ 

AGCTTCCAAATAAATGCTCTATTCTATAAGTCCCCTTG^^ 

GTGATCGTGGCTCCCGGAAGCTGGTGTAGCCACACAGGTACCGTTG CTGTAGAG AGATGGTATG ACCTCCTGACATG CCAGATGTT 

ACAGATGTGAACTGTGAAATTCACACTAGGGTCATCCTGTTCACACTTACT 

CCCCCCCCCCCTCCCCAAGTCCACTGGCTGCA 

5 0 TCCACCTGGGCTGGCAGGCTCCACAGCTCCCCTGGAGACTCCATAGGATTCACT 

GCCTC(^TTGTAATAGGGGCCTCCGTTGACTCCCTGCCCAGATTCCTCCAGAATGTTCrGGC 

GGGGTTTCTTGTAATTCTTTGTAGTCTTTATCTCTCTTTGAACZAAATGTGCAG 

GGTATTTG CCACCCCTCCCCCACTCAGCTCCAGTTCATGAAAGTGTCTAGAT TGTGTGGAAGC CACTTTCCAGAATCCAAAAGGTT 
5 5 AATTTAAAGGCCAGCCTTCCCCCCAACCCCAGTGCTGGCTCCTGACAGTCAAGCAGTATCTCCZA 

GCTGTTGGTTGGGTAGATGGCGTGATCC^CCTTCGGCAGGGTCTGTGGTTGGGCC^TTCCCATTGCT 

AAGCAGTTACTAAATACTTTGGTTTTGAGACTCCGTTGGGAATCAGAGCCATCAAGGACG 

TCAGTCTACTGAGAAAAGGATGGCTGGCTGTCTAGGACGTCACA^ 

AACCACCCTGTCAGAATGAGCTGTAAGAGGAGGGAATGGGAGTCT^ 
60 TACAGAGGGCACCCAGTTTTTAGAAACTGTATTTGAAACAGCT^TGGGTCGAGGATGTGGCTCA 

AGTCTTGACTTGGTTCCCAGTGTTCCGTGAGTGTGACAACACATGCCTG 

GCAG CCCCGGCACATG CCTGTAGCCCCAG CACATGCCTGCAGCCCCAGCACATGCCTGTAGCCCCAGCACATGCAAGTTGCAGATG 
GTCTGTCCTCTGGAATCATAAACO^AGTTTGAGACCT^ 

GCCAGTGGGGCGGGGGGAGAAGTGGGGGAGGAAAAGGGGTCTAACATCCGGCCAGAGTTCCT 
65 CAGAGCTGCTGGACACTTTTCCATTCGGTCCCGGGTGAGCATGCCTGACCC^ 

TGGGC^TCCCAAGCTACCTTGCTAAAGCCCCAGGGCTGTAGGAGAGAGGGAATAGGGAAGGAGTTCC 
CACAGCGGAT CTAG ATGGAACAGAGACTCT CAATAGTTT AAAAG CTTT ATT 

GAGAGAGAAAGGCTAGAGAGAAAGAGAGAGAGAGAAAGAGAGAAAGAGAGAGAGGAGAAGACAAAGAGAAAGGGAAGA^ 
AAGTGAGAGGTAAGGGAACAAAGGAGTAAGAGAGTGAGGTGGGGGCCTAACAGCCCTTTTTATC 
70 TGAGGAGGAGTTTGGCCTGAAGGTCAGAAGCTTGGGCCATTC 

TGAGACATCTCCCTGACCTGGAATGAGGGATCCACTTGCCTCTGCTTACCAGGCCAGATGCCT 

MOUSE SEQUENCE - mRNA 

AATTCGGCACGAGGCGCCAGAACACAACCXSGAGTCCCTGAGGCTCCAGAACAGAGTCAGG^ 
75 GTTCTCGCTTCCTCTCTTGCT^CTCCCTCCTTTGGTTGCAGCCAGC^ 
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CGGCCTCGGGAAGTCCACTTTAATCAAT^ 

TCCCCAAGACGATCGAAATCAAGTCIGATCACCCACGATATTGAAGAGAAGGG^ 
5 GGCTTCGGAGACCACMCAACAAIX3AGAACTGCT 

GGAAGTCAACATCAACC<3GAAGAAGCGCATTCCCGACACCCGTC 
TCAGGCCCCTGGACATTGAATTCATGAAGCGCCTAAGCAAA 

CTCGAGGAGAGGGTCT ACTTCAAACAG CGGATCACTG CAGACCTGCTGTCCAACGGCATTGACGTGTACCCG CAGAAGGAGTTTGA 

TGAGGACGCAGAAGACCGGCTGGTGAACGAAAAG1TCCGGGAGATGATCCCATTTGCTGTG 
1 0 TCAATGGCAAGAGGATTCTGGGAAGGAAGACCAAGT^ 

CGGGATCTCCT TATCAGGACGCACATGCAAAACAT CAAAG ACAT CAC CAGCAACATCCACTT CGAAGC CT ACCGAGTGAAACG CCT 

CAACGAGGGCAACAGCGC(^TGGCCAACGGGATCGAGAAGGAGCCGGAAGCCCAGGAGATGTAGATGCGTC 

ACCCCCAGATCTTTTCATCATCCCTGGCCCACCCACCTACCCTGTCTTATTTTATATAAT^ 

ATCTCTTCCCACACTTTGCCAGGTAACAAGAGAGGGTTTACCTCCCA^ 
15 CTAAGCCTGGGCTTGCCTCTGTGCTCTATTTCCACCCGGGCTCAGCCCCTGAGGGGTTAGAAGAGCTATGTGTrc 

TGAGTTCTAAGCTGAAQCCTGTGGGGGCCAAGTCCTAGGGGGTGCAGAGGAGCCCGTTAGACCACA 

CAAGCAGGTTAGAGACTGCCCCAAAGGAGGATGGAGCrGGCCGGGTATTCCTGAAACCTCAC 

CAGCGCCCTCAGCTGCCTGCCCCTCAAGGGAACTAGAGGCGTCACAGCCAAAGTTGCCA^ 

CCCTCGAGCTTGTCCCAGAGCAGAAAGTGCCTTGATCTACAAGAGCCAGTCACCT 
20 GGACGTGCTGGAGAGAGGGAGGTATCTTTTCTCCCTCGCCCriTGGGTTCTCT 

GCTTTAATTATAAAAGACXSAAGCGTGAAAGATGCTCCCCGATGTTAGGAAGCCCCGCCCCCA^ 

GATGAGTCGAAGCCATGAGGGAGGAAGCCGTGGAAGGGAGGCATAAGAGTGTGTGGGAGCTCTCCTGCCC^ 

ATATCCGCGTGGTCCTCAGTTTGGGCCAAGATATTCTGGTTACATTGATGCTCCGCTGCCTCACC 

AGG CTCAAGC CTTGATGATTCAGTGACTGTACTGGGTGGGAGCCAGAAACCTGACCATTTTGTTGTCTACATGAG CCTAGACTAG C 

2 5 CCTGTGCCCCAGAACCC^TCAAAAATACCCCTAAAGAGGGAAAGATGAGGGGGTCAGAGAT 

CTCAGAGGG AACATTAGGGACCATCCATGCACAG C TGAC CAAAGC CGTGT CC TTC CTG C CTGC CTC CCAT TCTCATTTGC C CCTG A 
GGAGAAAGTTTGGTGAGGTGCGTTAGGTTGGACCCGCTTGGGGAAGGTGCCTCTACAGACCC^ 
AGTGCAGTGGGAGGGGTGGGGTGCAGACAGATGGAGACGAACATTGTTTCOTGCTTTGGGCGTCT 
AGGGCCCCTCGTCATGCCCTGTGGCCGAACAGTTCGCTTGCCAGCTTGCCA 

3 0 TCCAGCACGCTGGACCCAGGGC^CTCCGGCCTTCACCAGCCCAAGTGACCCT 

TTTGTGCCTCTGTAACCAGGTATTGAAACrrGTTCX3TCTCACCAGGCCGGTCCACGTGGCC 

GGCAGAAAAGTCTCTCTCTCTCTCTCTCTCTCTTTTTTTTTTTGTATAAATAACAAAG 

GTTTCAAATCTCGTGCCG 

35 MOUSE SEQUENCE - CODING 

ATGGCTGACAACCCTAGAGATGCCATGCTCAAGCAGGCCCCCGCGTCACGGAACGAAAAGG 

GATCGACTCCATCCTGGAGCAGATGCGCAGGAAGGCTATGAAACAGGG^ 

TCGGGAAGTCCACTTTAATCAATACCCTCTTCAAGTCCAAAATCAGC 

AAGACGATCGAAATCAAGT CGATCAC CCACGATATTG AAGAGAAGGGGGTTCGAATGAAGCTGACAGTGATTGACACGCCGGGCTT 

4 0 CGGAGACCACA.TCAACAATGAGAACrGCTGGCAGCCCAT^ 

TCAACATCAACCGGAAGAAGCGCATTC^ 

CCCCTGGACATTGAATTCATGAAGCGCCTAAGCAAAGTGGTGAAC^ 

GGAGAGGGTCTACTTCAAACAGCGGATCACTGCAGACCTGCTGTCCAACGGCATTGACGTGTACCCGCAGA 

ACGCAGAAGACCGG CTGGTGAACGAAAAGTTCCGGGAGATGATCCCATTTG CTGTGGTGGGCAG CGACCATGAGTATCAAGTCAAT 

4 5 GGCAAGAGGATTCTTGGGAAGGAAGACCAAGTGGGGCACTATCGAAGTTGAGAATA 

TCTCCTTTATCAGGACX5CACATGCAAAACATCAAAGACATCAC 

AGGG CAACAG CGCCATGGCCAACGGG AT CG AG AAGGAG C CGGAAG C CCAGG AGATGTAG 

HUMAN SEQUENCE - GENOMIC 

5 0 TCCCAGCCCAGCTTCTCTGTAGATCCACATCTGGCCAAT 

CTCGAGGTTGTGTTTGGGACTGGGCCAGGGATGACATTTCTGGCAAAAGCTTAAATGACACCATGA 

C ACACTT TCT TTCCT T TTAAAAAAAAAATGGAATT CTTAGAG CTT ACAAAACAAAAATGT AAAGAATAAGGT AATAAAACAC CTG TG 
TACTCACCATCCCGCTTAAGAAGTGAGACATTGCAGGTGTAGCCGCAGCCCTGGGTGACCCTCT 
GCGGTGCCCGCCCCTCTCGATTGGGATCTGGCCCCGGGCAGGCACATC^ 
55 CTTTTTTTTTTTTTTGAGATAGAGTCTCGCTCTATCGCCCAGGCTGGAGT^ 
TCCTGGGTTCAAGOSATTCTTCTGCCTCAGCCTCCTCAGT^ 
GTATTTTTAGTAGACACAGGGTTTCACCATGTTGGTC^ 
CCAAAGTGCTGGGATTATAGGTGTGAGCCttCCGC^TCCGGOT 

GCCTAGGTCTGTCTCCTCTTGAATGCCGTCATCAGCATCACATCCTTGGTG CCTCACTGGAGTCAGTGGCTG CCCTG CCTGTGCCC 
60 ACAACGTGACGGCGTGGAGACTTATCCATGTAGATCCACGAAGCCTTGGTTCACTTATTTTGGCAGCTAG 
TGGGGAGGCAGAGTTTGCCCGTCCCGTGGCCC^CTGGTGACATCGCCGAAGGTCT 
TTTCCCCAGGTGGATTTGGCTGGGCTGTGGAGATC^ 
TGGTAGGCTCTCCCATGTTCGCTCATTTTC!TGTGC 

CCCACGGTGCACGGAGCATTT(ZATCTCAGTGGTGCCTGAGCTTCCCrrCCCTCTG 
65 AACTCCTCTGCCTGACATCCCGTGTCGCTAGGTCCTGCTCTTCCATCTTCCAATACCTCTGGGOT 

CCCCATCATCCCTTCTGTCCTAGGTTGAACCGTGTCCCCCACAAACTCA 

GAGATGGGGTCTTTGCAGATGTAAGTAGTTAAGGATCTTGAGATGAGGTCA 

GTGTCCTTAAAAGAAGCAGAGAGGCCACAGACACACCGGGGAGAGCCACGTGAA^ 

AGGCCAGGGAGCGTCTGGGAAGAGGCAAGGAAGGACCCCCCCCCC^CCCCACCCCAGAGCCTCCCT 
7 0 CACCTTGATTT/TGGACCrCJTGGCCrCCAGCATGGGAGAGAACCCG CTTC CATTTTTTCAAGCCCCACAGCTGTGGTCCATTATGGC 

AGCCACAGGACACTGATGCACCTCCCACCTGGCCGCTGTGGATGGACGACCCCGCCCGCAGGAAGTCTCC 

GTCTCTGCCCTCACAGGGGACGCCTGCACAAGTAGGTGTGAAAAAAGGGCCCTC 

TCCCTTCATCCTATGCTACTTCCCATGCCTCTTCC 

ATTGGCTGTCACTTATTTAACTGCCACCATCCCCAGGAGA 
7 q TT CCCCAGGGCACCGTGCCACGTTGCCTCAATG CACAGAGTAGCTGCTCAAAGTGTCTCGGACGAAC^GGTGC CCCATGCCCAGCC 
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ATGCAGACAAAAGTGTGCCACGTGGAGGGAGCTGGTCGTGGAAGAG^ 
CTCTTGCAGGCACCCCTAGTTCATTACTTCCTCTCT^ 
CCCACCCAGGTTTGCAAGaVGGAAATCTCAGAGTTGACCTTGA 
ATCTCATCAACTCGCCCTCTGCATACTGACCCTTGGACAGGCT 
5 CCTTGGACCAGCAGGTGCCATCCrrCCCCACCCCrrGCCCTGCrGTGGCTCCrTCCCCT 
ACTTCCCTCCCTGTGTGCTGGGGACCGGGTGATAGGATCCCTCCACCC 
ACAGAGCCCTTAGTTTCCGCX:CCCAAAGCCTCGAGGGTGGATTCGTG 
GAGCCCCCTGCTTCTGGGCTCCTGGCTCTGAGTGCTCG 
CATGCTCCTGGACCTCCTAGCTGACrCAGGTGACTTTGG^ 
1 0 TGTCCTCCATAGCCTCGGACCAGAGCCACGGCXSG^ 

TGATAGGAGATGCTTGGGAGAATTGCCCAAAAAACTAGCTTAAAATG^ 
TTTCTCCGTTTGCCTCCrcGGTAAGAAATTGATGGCGAATGCAATOT 

AACGGGG CACACGCTAGG CATTTGTGACTATTTAAGTTTAAAT TATAACATAAATG CXB AAACTCTTTGC CTTGGTAGCCCTT CCAC 
ATTGCAAGCGTTCACTGCCACATGTGGCTG CCAT ATTGACT ACGG CAGATACAG AACATTGACAT CATTG CGGAGAGTTCTATGGG 
1 5 GGGGCGCTGGTTTAGATCATCTTCACGTTGGATCCACAAATTCATTGTTTATCCAGTGAACGGTT 

GGGCCCTTGGCCTCTGCACCCCCRGACCTCCCTCGCTCTGCAGGGGCAGTCGGGG^ 
CTCCTGTCAGGCTACAGCCCAGATGGTTGGCTGAGCACGAGGA.TGTGAGAG^ 

GTCTTGGTTTCCTrCCACGCTGACCrGGAAGGACTCCTAAAG CAACAGGCCTGCCCAGGCGGTTTGCACC^^TAAACCCTGCAG CG 
2 0 GCCCCAAGGGCAGAGATAGCTGGATTGTATTTATTTA 

GAGGAATGCAGCTGGCCTCAGCCTGGGAATCCGTCTTGCTTGGGAGGCTGTAGGCG 
GGCTGGCTCTGCTGGCAGGTACAGTAGCAGGGTCAGCAGGAGGGAGTC^ 

ACTT CAGTGG TTG AGT CTGAGGATG AG CAGGAAATG AATGGAAAAG T CTTTTTTT TIT CTTT TTTCTTTCT AGGCAGGGT CT'j.^1' CT 
CTGTCCCCCAGGCTGGAGTGCAGTGGC^CAATTVATAGCT 

2 5 CTCCTAAGTAGCTGGGACTACAGTTGTGCGCCACCATGACTGGCTAATTCTT 

ATGCCTGTAATCCCAGT ACTTTTGGAGCCG AGGTGGGA CTCCAG ACCAGC CTGGGCAACATGACGA 

GACCCCGTCTTGACTAAAAATACAAAAAGTTAGCTGGGCGTGGTGGTACAGCCCT 

CAGTGAGTTGTGATCACGTCACTGTCCTCCA 

GAJGGTGGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGC^ 

3 0 ATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCATGAGCCACTGTGCCT 

GAGATGGGATTTCTCTATGTTGCCCAGGCTGGTCT 
AATTACAGGTGTGAGCCACCAC^CCCAGCAGGAAAAGTaVCTTTTTAA 
CAGCATGGAGGATTTACTGGAACGTGTGTTAGGTGAGGGGACATGGAAGA 
GCTTTCAGCCAAAGATGCGTTCTCTCCACCTCCGCGCAGC^^ 
35 CGAAGACCCTTTCCCGAAGGCCCCTGCTGCCCACATCAGTGTCACCTGGACCX3C 
CAGGGCCTCCTGGCCTGGTGTGGGGAGCACAGTCCACCCT 

GAAGCTGTGCTTCGTGGAAGGGGACAGAGGCGCTGGTGCCCAGGATGGTGCACCTGGGGCGGCTG 

GGCCCCTGTGGAGCCCAGATTGTTGTGAGAAGGGGTGGGGAAAGGCAGGTGATCCGCACCTTTCACT^ 

GGGGCTTATCCTGCACACATATTCAGTTTGGAGTATGAATTTCCATGAAG 

4 0 CCGTGAGGGCCGGGCCGTCTTCTGGGGCAAGGACAGGA 

AGCATCTTGAAAAAGAAGCAGTTCCTAGGCCAAGCCAGAGACTGAAGAATGACCACCT 
ATCTCGCGCTCTCGCCCAGGCTGGAGTGCAGTGGCGTGATCTA 
GCCTCAGCCTTCTGAATAGCTGGGATTACAGGCATGCG<?C^ 
CCATGTTGGTCAGGCTGGTCTGGAACCCCTGACCTITGTC 

CTTCCTGTCTGTGGGTTCCGGCCT^ 

CAGGCCCAGCCTGTGCCGCTGAGCCCCACrrTTGCAATTAAGAGGAGCTGTGAA 
GGGTCTATGGGGGGTGAAGGACAGGGATGCCGCCCTG^ 

GGGCAGGCTTGGGGTGGCTGGGAGCTGGGCTCATAGGCTGGCACAACCAGAGTTTCCC^ 

5 0 CTGGGACAGCAACCTTTGGTCCAATTTGGGAAAAGGTGGGTTAAACCAAGCCCGGCTCCCT 

AGTATGCAAATGTGCCTCTGAATCTAAGAC^GGACC^^ 

CCCACCCTCTGCTGGCCTTACCTCACCACCTCT 

GGTGCTGCCAGGTTTTCCATTTCAAGTGATCCCCOGCCTG<3GCCAAAATC 

5 5 GGCCAAACCATGCTGGGTAAAACTGAGGGTCAAGGCT 

GCGTGCAGGAGGTGGCTGGGTGAATGAACTCAGTCATAGCCCGGC^ 
GCTGGGTCAGAGCCTCGTGGGGCCTTCCTGGGCTTCCCTGCCCC^ 
GTTTTCTTCCTCTTCTTTTTTTTTTTTTTTAATTTGAGAC^ 
AGCTCACAGCAGCCTCGAACTCCTGGCTTCAAGCAATCCTTCCGCCGCAGCCTC 

6 0 TTGCCCAGCrGGGGTTTCTGCTTTTTACTTTTTCTCITGTTTCTA 

ACCAACAAGGAATATTATTTCAAAATT 

GTGGTTGTTAGTGGTTGGCTTAACACAATGTTTTAGGAAGAAATAGAATAAC^ 

GTCTGTGGCGCTGCATCAGTTTCCTGCAGCTGCTGTAACGAGTGGCCACAAGCTGGCTG 

(^CAGCTCTGCAGGCCACACATCCGAGATCAGGGTCACCAGGCTGAAATCAGG 

6 5 GGGCAGGGGTTCCCAGCCCCGTGTGTCCTATTAGGAACTGGCCTGCACAGC^^ 

TCTGTATTTACAGTCACTCCCCATTGCTCGCATTACTGCCCAAGCTCTGCCT 

GGAGAGTGAACCGTATTGTGAACGGCACATGTGAGGGATCTA^ 

CCGTCTCCCATCACCACCAGATGGGACCGTCTGGTTGCAGGAAAATAAGCTCAGGGOT 

TATAAT TAT CTCATT AT AT ATT AT AATAAT AAT AGAAAT AAAGTG CACAATCAAT ACAACGTG CTTC C CGGGTG CAGTGGCTC 

7 0 ACACCTGTAATCCCAGAGTGAAACTCTGTCTCAAW 

TTTGGGAGGCCAAGGCTGGCGGATCACGAGGTCAGGAGTTCAAGATCAGCCTGGCCAATATGGTGAAACCCCG 
TACAAAAATTAGCCGGACACGGTGGCAGGCGTCTGT^ 

GGCGGAGGTTGCAGTGAACTGAGATAGTX3CCACTGTACTCCAGCCTGGGTCGCAGAGCG 
AATAAAATAATAAATATAATACACTTGAATCATCTCAAAACCATCTCCCCACTCCCTGGT 
7 5 CCGATCCCAGGTG CCAAAAAGGTTGGGGGCCJVCnX3CCTTCATCGCCTGCCCCTTTGAG CCTCTGGTGGCTGTGGGTGTCCCTTGGC 
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5 



0 



TTGTGGCTGCATGGCCCTTACCTTCAAC^ 
AAATCTCCCTCTGCCCCCGTOTGTATGAGCACCGAGACTGCCT^ 
AGATCTGCAACTGAATCACAGATGCAAAAACTCTTGTTCCAAA(^^ 
TGG^CCATTATTCAGCCTTCTCTCGTGACTATGATCCCTCCCTrCCT 



TGGT CCTCC C CACCCC CCAGGAAGATGTCCTTTCTGCCAAGG AGATTG CATTTGGGGTCCTATCC CAGGCAGAGAG CGGACGGGAC 

TGTGGCCTGAAGTGGCACTTCAGGGCCTTTGGCTCTC 

ACCTGGCTCTGTGGGGCAGGGATTGTAAACTCAGAGGCCTC 

GGGGCCTTGGGGAGCAGCACAGACTGAAAAGTGTTTTGTCT 

AGGCTTCCGTGGTGCGGCCACGCGTCCTCCCTTTCTCAACAGA^ 

GCCTTGG CACAAACACACATTCTCTTTAAACAGG CTTGGGCCAACAGAAC CAGC CTC CGTGT CTCGGGTTTGAATG AACCCTTGAG 
GTGAAGCTTAAAGACAGAGAGGATGCCCCTGATCCCCACCCCAGGGACCCACATTGTAAACCCC 
GCCTCAGCATGGACCCTGACCATGTGCTGGTGGGGACCCCCCAGGTAAGTAAGCAGC 
ACAGCACTGATCCCCCAGTGGGGCCCCGAGTTCCCGGAGAGGAAGACTCOT 

gcagactaa^ 



GGCACG . . - — - - - 

GGCACGCAGGAACTGGGCTTTTTAAACCCTTAAGCCCAAGGAAATCGTAGCATCGCGGGACAGGGAAAAT 

CGTCAGGAATTTGACTCTGTGAGTTGGTTTCCAAGACT 

GGGGCGGCGGGGGCTGGAGGAGGTGGAGAGAGGAGGCTGCCGGAAGCCGCACTCGGGACCTCTGCAG 
GCCGGGACTCTGGGACTCTCGCAGGCAGACCCGGTGGTCTGCCGGACTCCTCGGGGCCCACTTCGGGCCCTCTC^ 

TTTTTGGATTTCTCrCCTTTGCTCCCTTTTTCCT 

GCACATGGAAATGTGGTAATT CGGATG CATT CGTG ATTG CAACAGATTGAAGAAATT AGAC CAG ACAAAGAGTGTTTTT AGAGGAG 

GAGGAGGAGGAGGAGGAGGCTGAGAGAGGGAGGGCGACGGGGGTGAGAAAGGGGAGGCCGCCTCTG 

GCCGCTGCTAAATATATCCGTAGGAATGGAGAGGGACCGGATCTCAGGTACGCAGACAGCCCCCTCCCCATCGGC 

GCCCTGGACTCGGGGCTTTATTTATGCCTGGGGGAATAAGCATGCAAAGGGCGCTTT^ 

CTTTTTGACGGTTCCAAGCTCGTGGGATGAACTCCGTCTCAGAAGOTAAGCTCT 

CCTCTCTCCTGGCTACTTCCCTTTCATTTTCCAGTGTGCCATTCTT 

GAAACCTTGGGAGGGTGTGGGGGACCTGCCCCCCTGGCTCGGGTCTGGGGCGGCGGCGC^ 
CACTGTGTGGTTTGTTTTTTGCTTCCAGCCAAAGTCrG 

TCCTGGCAGTTGATTATAGAGGGGGAAGCTCCCGGCGCTCAGAGGAGCCTGGCGTGTGTGTGCACGTGTGTGTG 

TGTGTCCATTCACGCTGGCAGCCCCCrTGCCGCAGGCACAGACCCATCAGGGCGTGAACAGGCAAA^ 

CTGCCTTTAGTTCACCCCAGCTTTCCAGGCCCTCCCrCCCAGGCCCAGCCT 

GCCATCTCACGTGGCCAGCGCCGTCATCGGTAATTAGCAGTGATC^ 

GGAGACATGGGGTGGCTGAGCCAAGCTCCAGGCTTTGGGTATTCTGAAGTGACCTACGG^ 

GTCAACCCCTACGTGCCTTGGGCCCTGGTCTGTTC 

GCCCATCTCGGTGGCTCCTCTCTAGGACGTGTGGTCCCCTCCACCACTGCT CCCAGGGC CACAT CTTTTTTTTTTTTTTTCTTTGA 
GACGGAGTCTGGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAAGCTCCGCCTCC 

TCTCCTGCCTCAGCCTCCCAAGCAGCTGGG 

GTCAACCAGGATGGTCTCGATCTCCTGATCTCGTGATCCGCCCGCCTCAGCCTCCCAAAGTGCTGG 
GCGGCCGGCCTCAGGGCC^CATCTTGATTGTCACGTTGGG^ 

GCACAGCAGK3CGGACCTCAGGAGGGGGCCAGGGGAGACGGCAGAAGGCTTGGGTGGCAGTGAAACTGCATCTTA 
CGGGGGCTCAGGCCCGCTGGGACGGGTGGGTGTGGAGCCCAGCGTCCTGCTCTCCGCCCAGCAGGAGCCGCAGGGCAGGGAGCGCT 
GAGAGGGACCTTCGTCAGGCTTGCACAGTGGAGCCGTCCTCGGAGTGACCACCCCACCAGGCAG^ 
CCCTGCCTCCTGGGCCACCAGCCCCAGGAGGACGTCACTCCACCCGCCTGGCTGGCAGGG 

TCACTTTATAGATGAT CTAGCCGAGGCC CAGATGGGTGACCAG CAGGGGATCTG AGAGCTG AGCAGAGAGGCGTGGGTTTGGGGGT 
GCTGCTCCCTGGGGAGCTGGAAGTGATGGCACCCTCCC^ 

AAAG CAGCCAGTGCTGTGCCCCAGAGACCACTCTG ATCCCTG CTG CC CTTGGCTGAGG ACAGCACAGCCAGGGATGAAGCGGGGAC 
CTCCCCTCGGGCTGTGCCTTGATGAAAGATGCTTTCTACAAAATCCCATTCGGATGTGTTTTCATGTAM 

GAAACTCAGTTGTAAACTGGCTGCCTTCCGAATGAACACGACAGTCCCGGAGCTGTT^ 
ACTTTAGCGCTGGTCTTTAATTGTCACTGAACTCCCGCACAGTCCCTGGGGAAGTGACTTCCTTCCTGT 

GAGCAAGCCACTTGGCAGCTCTGGGCCXSCAGTrCCGTCATCT 

CACCTCCTG CGGGTTTCTG CGCGTGTCCTGAGTTTTGTGGGCGCCTCTTCCCTCACTGAGGGGCTCT^ 

CTCACGCCGCGTTGTCAGGCTGCAAGCCACTGCGAGAGCTGCTCTGATCCCCAAGCATGCTCTGGCCTGAGGAGGAGCCATGGGGA 
GGACGCTGTGCCTGTGTGACAGAGGCCTGCCCGGCAGATGCCGGCCCGCTGACTT^ 

CTGGGCTGGCGTGTGAGGCGTGTCCAGCTCAGGCTGCCCTACAGCTGCCCAGGAAGGGTTTGTGGGAGGGGGTGGTCCTACCCTGG 
TGAGTCCACGGTGGTCTGCCAGCCCCAGGGAAGG CTTAGGAG CCCCAAGACCTG CCCTCTTGTCCCTCCCTCGGCAGTGTCTGGCT 
CCGGTCTGGTTTGTGAACGGGGGCCTGGGTACTCTCCCGCCCTCCCTGGGGGCTG 

TGTCAGCAGGGCCGGCATGAGTCATGGAGTGGTG ACTGGC CGC TG AGGCATGGAAGGCCACTCCTTCCAGCCAAGGGTCGGAG CTT 

TTTTTTTGTTCTGGAACGGCTTTGTGGCTTTTACAAATTAAGCAGTTTTATCTT 

GTCTCCAAGAAGGGAACTTTC<^CTGGGGAGCTCTGTTGGGCTG^ 

AAGTGGCCGTCTGTGCATGGTCATGAATGCAGGCCCGGGGTCCTGGAGGGGGCTTGGCCACGCTGTCGCTGGACGTCTGTCCCCAC 

CAGGGTCCATCTGATGCAGTACGGCACaGGGTAAAGCCCAGCTCCTCTTACG 

GCCTCCTGGAGGGCGGTGCCTCAGCCCTCACCCCrCCCrCCTGTCCCTCGCTGTGCTGTCC 

AGTTCTTACCCCTGGACGCTCGGTGCCTGTCCGGTCTCAAC 

GACGCGCTCTGCACCCGCTTCCTGCCCAGCCCCGTGGGAACTCTGGTCCCTGTAGTGGTTTCCTGGGGCTGCCATCAC 
ACA^ACTGGGTGGCTCAAAACACCAGACATTTCTTCTCTCGCCGTTCTGGAGGCCAGAAG 

GGTTCTTTCCGGAGGCTCCGAGGGGGAGCCTGCTTCTCGCCTCrrGGCAGCTTCTGGCGGCTCCCGAAACCCTTGGCGTTCCTTGGC 

TCGTGGCCACACCACTCGACCTCTGCCTCCGTTGTCAC^GGGCCTCCTTCTGTGCCCTAAATCTCCCCCTGTCT 

GACAGTCTTTCTCGGATTTAGGGCCCACTGGGTAACCCAAAATGGTCCCATCTCA^ 

CCTTTTTCCAAATAGGCCATGTTCATAGGCTCTGGGGTGGATGTAAATTTTAGGGGCCACCTTT 

GTCTGGATGCACAGCATGGTTCCATGTCCCAGACTCCTTGCTG 

GTTTCrrGTGCTCCTGGTGAGGTGTCGGGGTCGGGCTTAGCCCCTGCAGGAATTCTGGGGACAGGATGGAGCCXS 
AGGCGTCTTCCTACCTGCCCAGAGCTCATGGTTGTGAGGGGTCTCTCCTTGTGGTTTTGATTTC 

GTTG AGCATCTTTTCGTGTTTATTGCTCATTTGCACATATTCTTG CAGAAATGTCTGTC CAAGTCCTTTACCTGTGTTTGATTTGG 
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ATGATTGTTGTTGCTGAGTGTTA^ CTGC CATTC 

CATGGATTGCCITCTCCCCACGTTGTTAGTGCCCrrTTGATATC 

CCAGGCTGGA6TGCAGTGG CATGATCTTGGCTCACTGCAACCTCCG CCTCCCTGGTTCCAGCGATTTTCCTGCCTCAGCCTCCTGA 

GTAGTTGGGATTACAGGCACCCGCCACC^CACCCAGCT 
5 TGGTCTCGAACTCCIX3ACCTCAAGTAATCCACCCGTCTCX5GCCTCCCAAAGTC 

CTGATATGCAAATATTTTAAACTTCTATGA 

QAAATCATTGCCAAATGCAACGTCAGGAAGCTTTTCCCCTGTC 

CTTTGATGCAAGTTGAGTTGATTTTTGCATGTGGTGTAAG 

TTTCTCACGCCCAGCTGGTTCCGCTGGGTGGCGG^ 
1 0 GCCAGCCAGGGAGTCTCAGCTCCTGCCCA^GACCTGGCTGGACGTG 

GGCAGGGCCAGCCCGGGGCGTCCTGAGGCCTGCCCTGCGGACATGCCCCTTC 

GACCAGCTGTCTGTGCITCAGGCCATGGAGGCTGAGTGTTTCCA 

AGCCCATTCTCTGCCTCCGCTTCrrGCCATCTGTGCCrTTGTCrGCTTC 

ATCTGGACACAAGTCTCCATCCTGAAGCCACCACCCAAACCCCTGTGCCT 
1 5 GACCAACTCAGCAGCTGATGAAGCrTCCCTTGGGGCT 

GTGGAAGTGGGGCGTGAGGGTGTG!\GCCTTTTTCT 

TTATACAGCACCCCACACCCACCTCCCCGCCTCCTACCCCTTCTTCCAAGGGGTCATCTCTC 
CATGTGGTCCGCCCAGCAACCTTTGACCCCCAACATGACAAAATAAACCT 
GTGCTCCCTGTGGACTTGGCGCTGGGGTCCCGTGGAGGACAAAGCCAGACACAGTCCTTGCCCT 
2 0 ACCACMCAGTAAACGTGAAACACAGGAAGTGACaGGTC 

AGGTCAGGAGGGCTGCCTCGGGGGGACACCTTCGGGCTGAGCGCAGAAGGATGAGGGG^ 
GAGGCGATCGCTGCAGGCAACCGGC^ATGTGTTCAAAGGCCCTGGGGCGCGGGGGGCTG 
CrTGGGGTTGAAAGAGACTGACTGTCATGTTGTGAAATATACACTTGGTTTTCATCTC 

TTGGAAT CTC CAAAGTGATGTCTTTTTGGATGCTCATGATTG A CAGCTGG CAG CTTCAGGATGGTTC C CAGGG AAGACCAG 

2 5 GT AGAAT CACAAGGTT CAGCACCACCCCG CAACCTCCAGGTAG GGG AGAGGGG CTG AAGG TTAAGCAGATCAT CAGCGG CCAATGA 
TTGAATCAATCATGCCTTCGTAATGAGGCCTCCGT 

GAGGCTCCTGGAGGGTGGAGCGCCTGGGGAGC^CATGGAAGCTCTGCGTCCCTCCCCCATACCTTGC 
GTATCCTTTGTAATATCCTTTATAATAAACTAGTAAATTCCATGAGCCCCAGGAACATGTGTAAAAGAAAAAAA 
ATAGCTGCTGTAGCAAGTTAATCCTACCAGCACTGGCTCrCATCTGCCG^ 
30 TGGGACCCCT CCGGCCCCACCACCXICTCCrGTAGTGGTCTCGCTGGCTG CTGCCCTCAGGTTAGTTTATTTTTACCCGG CCTGCTC 



CTTCTGAAGCCGCTCGCrrcTGTGCCCCCGCCTGGCCCCATGCATCCCGCACGGTGCrrCCCC^ 
GGCATAGATTCTGGGGCTGCCTTGGTTCAAGCCCACCCTGACTGGGAGCCT 
GCAAGGCTGGCATGCTCKMGCAGTGGGGGCTGGGTATGGGAGGGGATCGTGGGT 
3 5 TTGGAGACTGATTTGGGCTCCAGGCTACGCCrrGAAATTGGGACA^ 

TGGACGCCCTCATACCTCCATCCGTGGAGCCACCCTCCTCTTTTTTTTTCT 

AACACACCCTCGAAATCAGCGAAGAGGCACTGGCTCT 

AACAGACGGGTGTGATGGGGCTGCGGGGTTTCAGGTGGCAGG 

40 TGCCCAGGGTTTCCCCGTGCCCAGGGTGTCCCTGTATCCAGAGCATCCCCATGCCCAGGGTGTCCCCGTAC 
TCCCAGGGCGTCCTGTGCCCAGGGTGTCCCCGTGCCCAGT 
CCAGGCACTGCCTCTCCCC^CTTTCCTAATCT^ 

GAGTTCAJCTTGTGCTCTTAAGTTCCTGTTAGACACAGGCAGGCTGCTTTAGTTCT^ 

45 TCTCCTGCCTCAGCCTCCCAAATAGCTGGGATGACAGGCGTCCGCC^CCATGCCTGGTTAATTTTTGTATT^ 
GTTTCACCATGTTGGTCAGGCTGGTCCCGATCTCCTGACCTCAAGTGATCCACCCAGCTTGGCCTCCCAAAGTC 
GCATGAGCCACCX3TGCCTGGCCCGTCTGGTGAATGTTTATTGAGGCCTACTATGTGCAGCCCCT 

TG AGGAG CAACAAAGACACAG TCG CT GGTC GCTTGTTACTTAACAGTTGG AACTG AGAGG CAGTC CACAAATGAG CAAATCCATAG 
GTGGTTTTAGATGAGGACTGTGAAGAAGCTAAGAACAGGTTGCTGGGGGTGCATATCTGTGTGTGTGGCG 
50 CCCTTTCTGAGTGGTGACACTGAGCTGAAGCTGTAATGACCAGAAGGACCCAGCCAGGCCAAAATCCAAGG^ 
AAAGGAAACAGCAGGTGTGAAGGCrcGAGGCTGGAGCCTGGGGAAGGTGGGAGTGGCTG^ 

TGACAC C AATGCGACAGGCAGGG CCAGGTCGTGCCGGGCCTTGGAGG CCCTGGTGAGGGCTTGATTTT ATTCTCACTATGGTGGGA 
CCTCAGTGCCCTGTGGCTGCTGAGAATGG CCAGGGGCAGGCAGGGAGGCCTG CTAGGGAGGAGGCTGCTGCAGGCCCTG CAGCTGG 
AGAGGGAGGATCAGGATCTCITTAGAGAGGAAGTTGGAGAGGGAGGATCT 
5 5 CCCGTCAGCATCC^GCAGAGGCCACTGGATGCTGACAGGTC 

TGCCGGCTTTAGGAGGAGCGCTTTTGGTGTCTGAGCCCAGTGGTC^ 



60 



65 



70 



75 



TTCCTTCCITGTTTCGTGGGATGCTGAGTCTCTGGGCAGG 

CATCCCCAAGCAGGAACCCTGGTGCCTGGTGCCTGAGGACTGCTGTCCACCGTCCCACGATCCTTCAG 
ACTCGGGGCAAGGTTCAGATATCCAGAACATTCTTGGCriX^ 
CaCCAGCCAAGGCTTCGGGGTTTGGACACCATCCCTGGGAGGGGTGAG 
ACTGGCCAGGO\GGTGGGTGGGGGCTGCAGTCCATGCCCCCGCT^ 

CTGGGGCTCCCTGTGCTCACCTTCTGCTTCCTTCCTCTCCCTCGGAATGGGGGGTGGACACTCCAAGA 
TCCCTGCCTCCTGCCTCCTCTCCTTCGGGACAGACCCCT 



CATTGTGGAGCTGGTGTAGGAAGGAAGGTGCCGGGAGCC1AGCTCCAATTCACGTGCTGGCTC 

GAGGTGGTGAGACAGGGACTCATTGATCATGGGACTGACACCCCCT 

TGAGTTTTGGCGCTGCTCTTGGCGATGGCGTGGACTCAGCCAGGCTCT 

GGG(^GTGTTTCrGTTCACCTGGAAAGGATGGGTAGGACTTGGAGACAACAGGGATT 

CAGCrCACCCTGCAGAACAGACAATTTGGTTAAAGCCT 

TGACCATCAOSGGGACTGGGGGAGAGGCaCAGGCATGCCTGTCCGGCAGCA^ 
GGCGTTGTCTTTCCTGAGAGTTGCACTGTCTCCCT 

GGCTTGGTCTCCCGCAATT C^CCCTGCCCAGCCCCACACAGCTTGTCCCTCCAG CTGGGAATCGGAGCTCAC CTGGAGTCCCAGGC 
AGCAGATT CGATG CAGTGATGCAGGGCTGTCAGCCATGCCAGCAGGCCCTGCTG CTCACAGAGGGATGGGTATTGCTGTTGACAAT 
AATTAAAGAACrACGGCTTCAGGCTGGGTACAGTG 
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TGAATCCAGGAGTTCCAGACCAGGCTGGGCAAC^TAGCAA 

TACAGCTTCAGAATGATTGACTTCAGCATGTTT ATT^AAGTCAAGGGT CTT CAGTGT CCTGGGCCT ATGTCACTGCGTCAGAGGGAG 
AGAGGAAGAGGCTGGGGTGATGGGCACTGGGGCTTATGCTCCTCCTGGAATG^ 
TCGAGCGGGGGCGGTGTCCACTGAGGACTGGATGTTGAGGTGTGGG 
5 AGTGCCCCGAGATTGCAGAGAGGGGACCCGCCCAGCCTGGGCAGGGAGGC 
AGAAAGCCTTTCCTCCAGCAGGTGGCGCCGCAAGCTCT 
AGAGGCTGCCTGGGTGTAGTTTCCGCCAACTGTGACTC 
AGATGACCAGCCCCCGCACAGGCACX^GGGTGCCTCCT 

CTGGGCGGGGAAAGGGAGGGTTGGAAGCTGAGTTAGGGGGAAAGGGGGTGGTCTCCTGCCATCT 
1 0 TTTGGTTGATAGGTTTCAATCTTTTTTACATTTTTCCAA 

GAGTGTGGTGGCTCATGACTGTAATCCCAGC^CITTGGGAGGTTGAGG 

CGGGC^TATAGTGAGATCXZCATCTCTATAAAGGATACAAAAAATTAGTCAGGCATGG 

GGAAGGTTGAGGTGAGAGGATTGCTTGAACCTGGGAGGTTGAGGCT 

GACAGAACAAAACCCTGTCTCAAAAACAAAACAAAACAAAACCACAAA 
1 5 CACCTACCACCTAGATTCCACAGCAACCATTTTGCTGTTTGCTTCACCAC^^ 

ATGCACTTAAAATAAGCTGCAGAAATCAGCACACAACATCCTGTCAACAC 

GTCTGTGATTATTTTTGTTTTTCTTTGAAGTAAAATGTATATACCCTGAG 

AACAAAGGTATCCACGTGTATCCCAGCCTGACCAGAACATAG 

CATCACAG T C CCCACC TTCCCCAGGG CAACCACAATTGTGCTTTT TTTC CAC CACGAAATAATTTTG CCAGTTTG AGTGTTT CTTA 
2 0 GAGCTACCCATGTTGCTGCAGGTGCCAGCGATTTGTTCTGTTTTATT 
TAGTCATTCTCTTGTTGAAAGACCCCTGGCTGTTT^ 
AGTTTTTCTG-TGCATATGGGCGTTCATTTATTCTGTAG 

AACTGCCAGATCTTTTCCCAAAGTGGCCCTTCCATTTCACACTCCCACCAGCAATGGATACGAGTTCTAGTTGCTCCACATC 
CCAACATTTGGTGCTGTCACTCTGTGAAATCTTAGTCATTCTGGTGGGC^ 

2 5 TTGGCCTGATGACTAATGCGGTTGAGCACATTTTCATGATTGGCTCTTCATATATTTTTCCTTTGTGAAATG 

TTGCTCATTTTAAAATTGGGTTGCTTGACTTTTTCTTCT 
AGTCTGTGGCTTGCCTGTTCATTTTCITTCCTTTTTCAAGAACAAGGTOT 

TCAAGCrGTCCTCCTGCCTCTGCCrCCCCAAGAGCCGGTTTTACAGGCATGAGCCACCATGCCCGGCCT 
TATTTTCCAATAAGCAAAAGTGTGTCACTTTAATTTTTATG 

3 0 TTCTTTGTAAGAAACCTTTGCCTACCCrGC^ 

ATGAATTAACTTTTGCATATGGTGTGAGGTAGAGGGTCAGGGTTCATTTTGTTGTCGTTTTATGT 

TTCATTTTTTTTTAAAGCTTTTATTTTGTATTTTCTTGGTGACTTATGAGGTC 

CGTATTTTCCTTTGTAAAATTATCTGTCCACCT 

TTCATTTTATAATGCTTGAAAGAATGAACACCTTGTTCCCCAGCTACAGGCTTCTGCTCTTTGTCCT 

3 5 AGACTTGGGTTTTGTTTTGTTGCTGCTTTGTTGTTTCTT 

GGCACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGG 

CTAACACGGTGAAACCCCATCTCTACTAAAAATACAAAAAATTAGCCGG 

AXK3CCGAGGCAGGAAAATGGTGTGAACCTGGGAGGCGGAGCTTGCAGTGAGCCGAGATT 

AGAG CAAG ACT CTGTCT CAAAAAAAAAAAAAAAAAAAAATG TTGT TT CAT ATG TTTTTT AT AT ATTCTGG AT AT AAGTCCTTT AT C 

4 0 AG AT AT ATGT ATCATAAGTTTTTTTTTC CACATTGCTCC^ CAAGAGTATT 

TTGTTTTCATAAAGTCCAATTTATCAACTTTTTCCAGTTT 
AAGGTCAAAAGAAGTTCTCCTATTTTCTAGAAATTTTATAGTTC 
TACATTTTTTAACCTATTATAGCCGTCAGGATGTTGTATGATCC^ 
TTCTCGTTTTTCTATACTCATATCCAGTTCTAGCACCGTTTGTTGAAA^ 
45 CTAAAAATCATTGACTGTATATGTGGCCCTATATTAGTATA^ 
CTATATTAGTATATGTACGTATATACATATTGGCCCTATG^ 
TATTGACTGTATATGTGGTCCTGTATTAGTATATGTACATATATA 
GTATATGTACATTTATACATGTAGGCCCTGTGTTGACTGTATATCTGGTCCTGTAT^ 

CGTGTTGACTGTATATGTGGTCCTGTATTAGTATATGTACATATATACATGTAGGCCCCATGTTGACTGTATATGTGGTCCTGTAT 
50 TAGTATATGTACATATATACATATAGGCCCCGTC 

CCCATGTTGACTGTATATGTGGTCTPTGTATTAGTATATGTACATATATACATC 
ATTAGTATATGTACATATATACATGTAGGCCCTATGTTGACTC 

GCCCC^TGTTGACTGTATATGTGGTCCTGTATTAGTATATGTACATATATACATGTAGGCCCTATG 
GTATTAGTATATGTACATATATACATGTAGGCCCTATGTTC^ 
55 AGGCCCTATGTTGACTGTATATGTGCTCCTGTATTAGTATATG 

TGGTCCTGTATTAGTATATGTACATATATACATGTAGGCCCCGTGTTGACTGTATATGTGGTCCTGTATTAGTATATGTACATATA 
TACATGTAGGCCCCATGTTGACTGTATATGTGGTCCTGTATTAGTATATGTACATATATACATGTAGG 
TGTGGTCCTATATTAGTATATGTACrrATATACATGTAGGCCCTGTGTTGAGACACTATTGTGTTTTCr 
TTTGCTGATGCCACATCTTGATTAATGTGGCTTTACAGTAAGTCITAATATAAA 

6 0 ATTCTAGGTCCTTTGAGTTTCCTTATAAATTTAAGAAGTAGCTTGTTGATGTCTrACAAAAATGCA 

GTGTTGAATCTGTAGATCAATTAAGGAAGAATTGGCATTTTAAC^ 

GGTCCTTTTGTATTTCTTTTATTTCTCTACTGAAAAAAAATCTCTACAGAATTACATTGTT 
GCATGCCATTTGTTAAATTTATTCCTAAGAATTTT^ 

ATTTG CTG CTAGT ATGTAAAAAT AC!AAT AG ATTTTTGT ATGTAAAT CTTAT AT C CT ATGATCTTTT CCAAAGT CATTT ATT ACT CT 
65 GGTAATGGTTTTGTAGGTCTTTTAGGATTTAAATACATAGTTTTAC 

ATTGCAOTGGTTAGAGCTGCCAGTTCAGTATTGAATAGTAGTGATTAGTGGACATC 

TTAGATTTGTCATATATGACCTTTGTCAGATTGAGGAAATTCCCTTCTATTCTTAATTTGCTGAAATTTTTT 

TGATTTTAGTAAATGCTTTTTCTGCATCTATGGAAATGATCATTGTTTTGTCTT 

GATTTTTGCCCCCCGTTTTTTTTTTTTTTTTTTTTA 

7 0 TAGCTCACTGCAGCCTGAACTCCTGGACTTA 

ATGCCAAGCTAATTTTTAAT1TTTAAAATTTTTTGTACAG 
TAAGCAATACTCCCACCTCTGCCTCCCAAAGTCCrrGAGATTACAGGTC 

AACCTTG CATTTCTGGAAGAAGTCCAACTTGCTTATGAGATAT CT AAATTTGATTTG CC CAGTTTTTA 
AGGATTTTTGCGTGTTGATTCATGAGGGATATTGGTGTGGTATTTTCTTTGTTTG^ 
7 5 TCATTTTGGCTTCATAAAATTAGTTTGGAAGTGTTCCCTTOTCTTTGTTTTCT 
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AAATGTTTGATAGAATCCACCAAGAAAACOGTCTGGAC^ 

TCAATTTAATCTATAGAAAGAGTTATTCAGGTTT^ 

CCCTTTCAGGTAAGTTGTTGAATTTACTCATTTAAAG 

AAAGCAAATATTGCAATAAAG CACGTC^CACAAACTTTTTGGTTTCC CAGTACAT AT AAAAGTTATGT TTAGG CCAGG CATGGTC 
5 CT CATG CCTGT AATCCTAGCCCTTTGGG AGGC CAAGGTGGGCGGAT CATTTGAGGTTAGGAGTTTGAGAACAG CCTGGC CAACATG 

GTGAAACCT CATCTCTATTAG AAATACAAAAATTAG CTGGGTGTGGTGGCACATGCCTGT AGT CC CAG CrGCTTGGGAGG^ 
TGGGAGAATTGCTCGJUVCCTGGGAGGTC 

CTCCAT CT CAAAAAAAAAAG AAAAAAGTTATGTTTACACT AT ACTATAGTCTATTAAAGrc TAT AAG ATCATTATGTCTTTTTGCT 
AAAAAATGCTAATAATCATCTGAGCCTTCAGCAAGCCATAATCTTTTTCTTG^ 
10 CCAGGTGGTGGTTGCTGAAAGTTGAAGGTAGCTGTGGC^GCAGTTTCTT^ 
TCTTGCTCTATTGCCCAGGCTGGAGTGCAGT^ 

CCTCAGCCTCCCGAGT AG CTGGGACTACAGGTG CCTG C CAC GACGCCTGGCT AAl"l"iTiGTATTTTTAGTAGAGACAGG GTTT CAC 
CTTGTTGGCCAGGCTGGTCTCAAACTCCTGACCTCAGGTGA 
CCACTGCACCTGGCCGGCAGTTTCTTAACTGAAGTTTTTGACATG<^ 
1 5 ATTATGTTTGATAGCATCTTACCCACAATAGAGCnTCCTTCAAAATTAG 
TAAGTTGATGTAATATTCTAAATCTTTTCTTC 

ACCACTTTGTCrrGCTCATCCATAAGAAGCAACTCCTCATCTGTTCAAGT^ 
GGCTCCACTTCTAATTCTAGTTCTCTGC^ 

AGGGTTGGAATCAACTTCTTCCAAACTCCTGTTAATGTTGATATTTCAACCTCT 
2 0 T AG AATGGGG AATCCTTT CCAGAAGGTTT T CAATTT ACTTTG CCCAGATCCATT AG AGGAAT CACTAACT ATGG CAGAT ACTG CTT 
T ATAAAAT ATATTT CTTAAAT AATAAGACTTG AAAATCAAAT TT ATCCCTT CTCCATGGGCT G CAGAATGGGTGTTACATTAG CAG 
G CATG AAAACAACATTCATCTTT ACAT ACATCT CCGTCAGAG CT CAATG AGCAATAATATTTTG 
AAAGGATTCTTTTTTCCTGAGCAGTAGGTCTCAATGATTTTTTTTT^ 
CGGTGGCATGATCTOSGCTCATTGTAACCTCTGCCTCCCGGGTTCAAGTGATTCTTC 

2 5 CAGGCACACACCACCATGCT^GGCTAGTTTTTGTA 

CTGACCTCAGGTGATCCACCTGCCTCAGCCTCTCAAAGCATTGGGACTAC^ 
TTTTTGTTGTTGTTTTTTAATTGAGACAGAGTCT 

CTCCGCCTCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTGCAGG 
TTTTGTATTTTTAATAGAGATGAGGTTTCACCACCATGTTGGCCAGG 
30 TGGCCTCCCAAAATGCTGGGATTACaGCTGTaAGC 

TGCCTCAGCCTCCCAAGTAGCTGGAATTACAGGCGTGTGACACCATGCCTGAATAAT^ 

ACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTC^AAGTGATCCCCCCGCCCCG 

AGTTGCCGCACCAGGCCTCAATGGGTTTTTAATATTCAGT7VAACCATGCTGTAAACAGATGTC 

3 5 CATTTCTGAAGCATAGACAGAATAGATTTGGCAGAATTCTTAAGG 

CTTACGGTCAACAGCTGCGTTAGCCTCTCATGAGAGTCAGCCTGTCCTTTGAAGCTTTAAA^ 
GCTATGAAAATCCTAGATGACATCTTCTTCCAGTAGAAGGTGGG^ 
TCAGTGCTCTGAGCTAGATCTTCTGGGTAACTTGCTGCCGCTTCTCCATCAGC^^ 
GAGATGGCTTCTTTCOTTAAACTTCATGACCTGACCT 

4 0 TCATACCATTGAAGAGAGTTAGGGCTTTGCCCTGGATTAG^ 

ACCACTCAAACTTTCTCCATATCAGCATTAAGGCTGTTTTGCTT^ 
TTTCAATAACTTTTTCTTGGCATTCA(^AACTTGGCT 

GCCTTCCTCACTCAGCTTAATCATTTCTAGCTTTTGATTTAAAGTGAGAGACGGGTG^ 

C CATTGTAGGGTTATT AATTGG C CTAATTTCAATATTGTTGAGTC T C AGGG AATAGGGATGC CTGAG CAGAGGG AGAGAG AGCGGG 

4 5 GAACAG CCAG TCAGTG GAG CRGTCAGAACATACAACGTTT AT TGAATAAGTTCACTGT CTT ATCTGGATGTGGCTCG TCATGCC CC 

AAAAACAACTGCAATAATGGCATTGAAGATCACTGATCACTGGGCGCGGTGGC 
AGGCAGGTGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCT 
AGCCAGGCGTGGTGGCGGGTGCCTGTAGTCCCAGCTACTGGGGAAG^ 
GCAGTGAGCCGAGATCACACCACTGCACTCCACTGCACT^ 

5 0 GGGCCGGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGC^ 

ACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCC^G 

GAGCAAG ACTCTGTCTCAAAAAAAAAAAAACIAAAGAAAGAAAAAAGJ^ CACAG ATCAC CATAG CAG ATATAATAATA 

ATGAAAAAGTTTGAACTATTTCAAGAATCATCAAACTCCTACACAGAGACACA^ 

5 5 AGACTTACTTGATGCAGGGCTGCCACGAACCTTCAATTTGTAAAAAGTGCAGTATCTG^^ 

TGTTCCTTGTCACTGTAATGTCGGTAGGATC03TAAGGAAAACACTT<^TTCCT 
TCATCAGCAGCCTGGCTAGGAGGTTATCAATCTACrATCTTTTTAAAGAACCAGCATTGGACT^ 
TGAGTTTCTCTGTTATCTTCATTACTTCCTTCCTTG^ 
CCGGCAATTTGAATAGCAACTTGTAATATAAAGTGTTCTTAA 

6 0 ATTGTGATATAAGAAAGATGTATTTGGGCTGGGCTCGGTGCAGTGGCT 

GTGGATCCCTTGAGCTCAGGAGTTTGAGACCAGCCTGGG CAACATGGTG AAACCTCATGTCTAC CAAAAATACAAAAAATTATC CT 
GGCGTGGCGGTGTGTGCCTGCGGTTCCAGCTACTTGGGAGGTTGAGGTGAGA 
TGAGCCGAGATTGCACTACTGCACTCCAGCCTGGATGACAGA 
TCTTCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCT^^ 
65 CTAAAACTCTTGAAATCTCCAAAGTGATGGGTGTCTTTTTGTTTGCTAAGGAGG 

GGGATAAGGGCTGGTTGCCAGGGGACCCAGCCTTGTGATTAGAAGGCTAGAACTTGCAG 
AGAGGGGATGAAGGTTGAGTTGATCATCAATGGCCAAAG 
GATGAGGTTTGTAGAGTTTCAGGTTGrTGAATACCTGCAGGTC 
CCTTCATACCTTTCTTATGCATCTCITCCAGTTGGCTGTTCA 

7 0 AGTAAACTGATTTCCTGGCTTCT12TGAGTCACTCAAGCAAATTATTGAACCTGAGAAGGGAGTTGT 

CAAGTCAGACAGAAGTTATGGGTAACCTGGTGTCTTACCACTTGCGACTGATCTCT 
CCTTAACCTATGGGATCTGCACTAACTCTGGTTGGTGTCAGAATTGAACT 

TGGAGAATTGCTTGGGGGTCGG CGGGAGGAAACCACACATCTGGTGTCAGAGCTGAAG CACTG AGTG TTGTGAATGAGAGTGGAGA 
GATAGAGTTTX3TTTTGCCTTATGCAAGAGGACTCTAAGGAGTATAGAAAGA 
7 q TAG GAGGAAAG CTTCGTG ACATTGGAGTTGG CAATGCTTT CTTG GATGTG ACAC CG AAAATACAACAAAAGAAAT AAATGATAAAT 
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TGGAOTCACCAAAACTTAAAACTTTTGTC 

TTTGAAAATCGT ATAT C CGATAAAGG AC T AGT ATC CAG AAT ATAT AAAGAACTCCTACAACTAAG CCACAAAACAAAAAAAAACAA 
C CCAGTGAACAGACATTTTTCCACAGAAGACAT AC CAATGGCCAAG AAGCACAGGAAG AGATG CG GAGTGTCACTAATCATCAGGG 
AAATGAAAATCAAAGCCACAATAAAATATGATTTCACACCCTTTAAGGT^ 
5 CAAGGATATGGAGAAATTGGAGCCCCX3TGCACTGCnX3GTGGGAATGTA 
TCAAAAAATTAAACTCAGAATGACCATAACATCCAGTGATTCCA 

ACAGGTOTGCT ACATTCITGTTTGTAGCAGCACTTT^ CATAACCAAAAGATG CAGGCAAC CCAGGTGT C CACCAG CGGATGAAT 
GAACAGACAAAATGTGGTCCAT CTGTACAGTGGAATATTAT^ C CTT AAAAAGGAAG^AG AGGCCAGGTGTGGTGG CTCACG CC 
TGTAATCCCAGCGCrrrGGAAGGCCGAGGTGGGCAGATCACT^ 
1 0 CTGTCTCTGCTAAAAATACAAAAATTAGCCAGGCATGGGGGCATGTGCCTGTAATCCCAGCT 
TCGCTTGAACTAGGGAGGTGGAGGTTGCAGTGAGCCGAGACCA^ 
JVAAAAAAAAAAAAAAAAAAAAAAAAAGGAGATTCTGACACATGCT 
CCAGACATGAAAAGACAAATACTGTATGATTTCACnTTTATGAGG 

GAGGTTG CCAGAGG CTTGGGGGAAGG GAG AATGAATQGGAAGTTGGTGTTTAATG GGGAT AGAATTGCAGTTTGGGAAGATGAGATG 
1 5 GATGGTGGTGATGGTTACACAACAGTGTGAATGTGCTTCATGCCACCAAA 

TGTG ATGGAT ACTTTAAGG CATGT ATGCAAACACACAGAAG CAGTT ACT AC CACTGGGGTTG ATGG AAAAG AAACCAAGAACT CGG 
AGCGTCCTCCACACCTTGCCACGTCITACGTTCAGGCCTTG 

GGTGGGCAGGAGCTGGGTGTCTGAGAAGTTTGCTGAGGTCCCATTGGGCT 
2 0 GCCCCACTGGGCACACGCCACTGGCAGCTGTGCCTTCAGAGGAA^ 
AGGCCTGGCAGCTCCCCAC!AGTCTTGTCCCCrGTGGA 

TCGGTC CAGT ATCACG GAG CGG^jGCCTGG C!AGG ATGTATGCAGAGTT AAG AGGCAATACATGG GT AACTGG CAT ATTTTTC CAGAT 

TCTTTGTTTTTTTTGTTGTTTTGTTTTGTTTTGTTTTGTTTTGT^ 

TGCAGTGGCATGATCTAGGCTCACTGCAACCTCCACCTCCCGGGTTCAAGCCATT 

2 5 TG CC CACCATGCCCAGCT AATTTTTGT ATTTTTAGTAGAG ACAGG 

CAGGTGATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGC^ 
AAAGCAGATCCTCAGGTC r iTlTl Tn 

GCACTAGCTT(^GCCTACX5TATTTTAGTATGCTGTGTTTTCATTATCATTT 
CTCTTTGACTCATGGGTTATTTAGAAGTATGTTGTTTAATTTG^ 

3 0 CT AATTT AATCTTACTT TGG CCAGACAAG CAT ATTTTGCATG CTTT T AAAATTTGAG ACTTATGG C C CAGAATGTGT AT CTTG GTG 

AATG CACTGTTTTCACTTGAGAAAAAAATATGTATATATATTCTAG ACTT CCTGAGTATAG AGCTTTATAACTG CTGATTAGGTCA 
ATTGCTAGTTACCAGTGTTCTTCAGATTATCTGATGCTTGTTTCTCAGA^ 

GTGTCCCCCAAAATTCATATGTTGG AGCTTAATCTCTGGTG CAGTGGTGTTGGGAATTGGGGCATCAGGGAAGTATTGG CTCATGA 
GGGCAGAGTCATCGTGAATGGGATTAGTCCCTTCTGCCATGTGGGGACACIAGC^ 
35 TCTCACCAGACACTGAGCACCCTt^TTTTGGACTTCCCAGCCTCrAGAACTGTGAGCAC 

TAAT CCCAG CACTT CAGG AGG CCAAG G CGGGTGG AT CACCTGAGGTC AG G AGTTTGAGAC CAG CCTGGACAACATGGTG AAAC CTT 

GTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGGTGGCAGGCTCCTC 

GCTTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATTGCACCATTGTACTCCAGCC 

4 0 AAAAAAAAAAAAAAAGAGTCTCCACCTATGATTGTGGAATTGCCCATTTCTCCCTTTAATTCTTTCAGGT 

TGAAGCTCCTTTCGTTAGGTGCATACACATTTATGATTATAATGGTGTCTCGATGAACTGATCATTTAACCATGAT 
CTTTATCTCGTAATAATTTTATCTTGGACTGAACTTTATTGA 

TGCATAAGAAGGCTGGAGAGGCTTATGGAGAAACTTATGATGGTTTGACTTACAGTTTTTC^ 
ATGTATACAGTAGAAACTGTACTTTAAGTATGTGTGAAACCATTGTGTTTCT 

4 5 ATAGTCAACACTTTATTATAAAATAGGCTTTGTGTTTGTTGATTT^ 

GGTAGG CTAGGCT AAGCTAG AATGTT CGGTAAGTT AGATGTATT AAGTG CATT TTTG AG ATGATATTTATTTTCCACTT ATGATGG 
GTTTGTCAAGACATAACTCCATCATCATTCAAGCAGCATC1AGTATGTOT 

AAACAGCATCTT TTGT AG ATAGTAT ATAGTTGAGT CTTG CT ATTTTATT CATTCTGACAG CCT CCGCCTTTT AATTGGAAT ATT C A 
GG CT ATTT CTGTTT AACAT AGTTAT AGATATGACTGGGTTT AGAG CT ACTAC CITGCTCITTGATATTTAT TTC TTT CATCTTG CT 

5 0 TTCTTGGCU'TLTITTATGTTAATTAAGTATTTTTTAGAATTCTATT^ 

GTCTCACTCTGTTGCCCAGGCTGGAGTGCAATAGCATGATCTT^ 

GCCTCAGCTTCCCTAGTATACTGGGATTACAGACGCCTGCCACCACGGCCGGCTAATTTTTGTATTT^ 
ACCATGCTGGCCAGGCTGGCCTCGAACTCCTCACCTCAG^ 

AGCCACTGAGCCCGGCCTGTTTTGACTTTTTAATTGTATCTCTTTGTATTGCTATTTTAGTAGTTACT 

5 5 ATTCTTAGCTTTTCACAGTCTACTTAGAATTAACATTGTTTTTCT^ 

CATTTACCACTTTCCATCCTGTATGTGATGCTTGTTAGATA 

^ TT , mGC r mA ^^ GCCCCATG( ^^ TX J^ GA ^ 

CCTGTGCTCTTCATTCCTTCCTGAAGAGCTGAGTTCCCATCTGGTCT 

GTAGTGCAGGTTTGCAG ACAGCTAATTCC CTTTCATTTTC ACTTATCTGCAAATCTTTATTTTGCTGTCATT CTTTCTTTTTTTAT 

6 0 TTT TATTTTT ATTTTTGAGACAGAGT CTCAC TCTGT CT C CC CACAGG CTACAG TG CAATGGTG TG AT TT CAAAACT C CTG C C CAAT 

CTCAACTCACCAACCTTCTCCTCCCGGGCTCAAACGATTCTCCTGCCTCAGCCTCCCAAG 
CATACCCAGCTAATTTTTGTATTTTTAGTAGAGATGG 

TCTGCCTG CTTTGGCCTCCTAAAATGCTGGGATTATAGGTGTGAG CCACTGTGCCCAGCCTGCTGTCATTCTTGAAC CATAATTTC 

ACTGGTTATAGAATTCTGAGTTGTCAGTTTTTTCTTTCAGGACTTTAAAATATAGGGT^ 
65 ■ GTCAAG G AATCAG CATT CATCT AG CT ATTCCCCTGTAGAC CACATGTTGTTTTT'CCCTGG ATG CT T T T AAGACTTTTTAT CCAAGT 

TTGATTGCTCAGTACCTACCTCTGGGCAAAAAAGCCTCCAGAAACC^^ 

CTCTACTCCAGCTTCTACCTACTTGAGATATTCTATACTGACTCCAGGCTGTTGTTATTTTC 

CTATAATGTTTTCTGCAGAATTGACCAGATAGCAGCTACTCTCTGTGACCAGGAGCCTTG 

CAAAGCTCCTGCCCGTGGCCCTCAGTTGGCAGTGGCCTCTCTC 
70 TCCTCATTAGTACAGTCCAGAAAAGTGGAATTCCTGGTCTGGGTG 

TGCCAGCTGGCCGAATGTGGGGTCTCCAGCCTAGCACCCTTGGACTCTACCCC^GGAGTTATGG 

GCCTCAGCCCCTGTCCCCCTTCTCTTCT^ 

CTCCTCACTAGAGGGGCAG&GTTGGGAGGGAAGCCTGTTGAGCC^ 

7 5 TGGGGGGCTACCAGTGGCATGCAGGAGGCCCAC^TGACCACTGTTTGCTGGGGCCTT^ 
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GAGAGAGAGTGTGAGTCAGGGTAGTGT 

AGAGCTCTAGATCTGGAAQAAGCTCTGCCTCTCTTCTCTCCTCCAGTC 

GCATTACACCCTTCTACAGAGGAAATAATTC^ 

GAGAGGAGGAAACGGGGGCGGGGGTGAGGGTGGGAAC^GAGCT 

GGTGGAGATGACCAGAATGCTCCAGACAGACCTTTTATCAAACTT 

GAGTGAAAGACAAGGAAGGGCTTTCCTGCAGGGGGGTGTGTGTGCCTGTG 

GTGTGGACCTGTGTGGTGTGTGCCTGCACATGTGTGCACGTGTGGACCTGTC 

TGTGCTGTGTATGCACTTGCACATGTGTACACAGATCTCATTTTCCT^ 

GCCTCTCCACCCTTCCCAAG<OT^ 

ACACACACACACACACACACACGAGTCAGGGG 

CTGTAGGCCCCACCTTTGGCAGAAAACCATGTTTTTGTTGCTGCTCGTTGA^ 
TGAGAAGAGCAAATTTTGCCTATGGCACTGGGGACATTGAGTGGCT 



CTCAGAGGCGGCCCCAGGGCCCTTGGAG 

TCTGGTTGGCGCCCAGCCCAGCGGCCCTTCAGGCGGGCCCrGTGCA^ 

ACAGGTCCCCCAGGCTTGCCCAGAGCCCCGGCATCAGATGAGGTGC^^GAAGGGA^ 

GAGCCCCAGGACCACACAGGGGCCCCGACATGGCAGGTGCTTGGTCAGCA 

ACTGACACTTGCTGACOiGAGCCGGGGTGAGCATCTGGCT 

CTTGCCGGGCCACCCTGCTGAGGCCCTGTGTTACTGTCCT 

CGAAATCGGCCACGTGTOGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGTTC 

AC^CCATCCTGGCTAAC^TGGTGAAACCCCGTCTCTACTAAAAATACAAAAAAW 

CCTGTAGTCCCAGCTGCTCAGGAGGCTGAGGCAG^ 

ATTGCACTCCAGCCTGGGTGACAGAGQVAGACTCCGTCTCAAAAACAAAAA 

CTAAAGTTTCTGGAGCCAGAAATCCCAGACCGAGGTGTCTGAAGGTTCATTCCTCCTGGGGGTTCCG 

TTTCTCCGAG CTCTGCGG CGGCCAGCAACCCCTGGTATTCCTCGG CTTGCAGCTCGGCCTACCCCCCACCTGGCCTTCACCCTGTG 

TCTCTTCTTGTGTCTCTGTCCAAACTGCCCTCTCrGTTGTCTTTTTAATTTATT^ 

CTGGAGTGCAGTGGCGTGATCATGGCTCACTGCA^ 

TGGGACACAGG CGCACGCCACCATG CCTGGCTAATTTTTTAATTTTTTTGTAGAGACGAGGGCCCAGGCTGGTCTCAA^ CTGC 
GCTCAGGCAATGAGACTTTGCCTCCCAAAACACT 

ACCTGT CATTGGATT TAGGGC C CACTCTGATTCAAT ATGCCCTCAAAT AAC CCAATTACATCTTCAAAG AC CCTA TC CATAT AA 

AGTCACATTCATAGGTACTGGGGGGTCAGGACTTGCATATGTCTTTTGGGGGAACA 

GAACGAACGGCCTCTCCAGGGTGTGCGCAGGTGGGTGAGGGGCCCAGGAGAG^^ 

AGAGATTCCAGAAGGAGACTGGATGGGGTGGCTGCATGGAGCAAG^ 

GTAGAAAGGGAGCATTTATTTTGAGTTCCCCCAAGATATGAAGCTAGTACC^ 

TCTTTACAGAAAGAGGTGTTTTTCTCACAATGTTCACTTG 

CTGGAAGCATTCAAGGAGGGGCCATGTGGGAGTTGTTCAGGAGTGCATAGA 

CTTGTCTTATTTTACGACAACTTGAATTTCAGAATGTAATGTTATTGGGGATATG 

GAGGCAGCTT^ 

GGTTCTGGGCATGTGTGTGCAGATTGTCA^ 

CATGCACAAGAACACGAAGACCCGAGGCAGGTATAGTCCGTGAGACAGATCCCTCTAGGCTGAAAGA 
GAACAAGGCTCCAGCCTCGGCAGTCCCTCCCACTTCTGGTGGGAGGATCTCCACAGAGAAGAAGGGTCAAGAAGCACA 
TGCTCACGGTGTACCAAGCTCTGCITrAAGCTTGTGTCTTATTTTTTAOT 
TGCTGTTCTCAGCACTTTACAAATATTGATGCATGTTTG 

TGAGGAAACAGG CGTAGAGGGATAAAGGGACTTGCCCAAGGTOTGTCTCrACCTTACACG CACATGAACGCTCACTCATGCGGCAG 
GCGCTAGTTTCCCTCTGATGTGAGTCAGGCCTTGAGCTGGTTGGGGTCT 

GGAAGCTTCTAGCAGCCTCATCTCCCTGGTATGACCTAGCTCCAGGGACTGGATTTCCCCAAATGCTCCTG 

CCTTTCCAGCTGGGGGAGGGCTGAACGACCCCTCACTCTTGGA 

GGACAGGCCTGOTGTGGGTTGAGGCTCCCAGCAGCTGGG 

GGGCGAGCGTCCTCCTGAGGCCAATTCCCTGCCTCCCCCAGCGCAGGCGCGAAGCCATCTTTGCTTGTGTGTTT 

GGAGGGCGGTGCAATCGTGACGCCTGCTTGTTGCAC^^ 
TCAGTCATCTTGCCCTGACCTGGCGTGGGAGGCCCGAGCACTGCTCTTTT^ 

CCGGGCTCCCCCAGCCRATAA^ 
AGGGCCAGGAAAGGATTAGAATGCTTTTCTACGT^ 

AAATGTGGAATGAATGGGCCTATATGTCCCATTGTGTGACTCAGAGCCAATGACTCrGGCT 

CCAGCATCTCCCTTTGGGGTCTGTGACCTTGGAAGCAGTGACTCTCTCCATCCCACCTTGAATGGTGTCTC 

GCAGCTGGGGAITGCC^GCGCCTGTTGCTGGTGGGCIGTGCGCCAGATCCCGTCTT^ 

TGTGAAG^GCTGTGCGTGTAGGATTTGTGCAAATGTGGCCCTCCCTGCTGCCAGCCCATGGGAGGCTGTCAAGATTC 
ACATTAGCCCACACTCTCTTTOTGX3CCCACTGCTGTGG CTGTTCCCTTGTGT CACGTGGCTGCAGGCACAGCCCCCAAGTGGATGG 
GGATCGTGGGCTGAATGTGCA.TCAGATGCTCCTGCTTCTTGGTGGGTGGAGACCCTCAACCCATGCACCGTGGCTGCCGCATCT 

AAGGCCTTCCATGGTGAAGGACTTGGACCTX5AGC^ 
ACCTGTAATCCCAGC^CTTTGGGAGGCCGAGOT 
CCCGTCTCTACTAAAAATAC^AAAAATTAGCCGGGCCT^ 
AATGGCGTGAACCG&GGAGGCGGAGCTTGCAGTGAGCCGAGATCGCGTCACT^ 

CTCAAAAAAAAAAAAAAAATGAAGTGCTCCCAGCTCTTCCTCCTGTGGTGAGGTTCTGAGTTTCCATCCT 

TGGGGAGGGCATCTCTAGACCCGGACMAAA 
CTCACGTTTGGAAGAGGCGACACTGTGTCTCAGCCT 

GAGGCTTCCTAAGCAGCTCTGGGCACCGTGCACCTGGCATGGCCTCCAGCCCCATCTCCCTGCCTTGGGAGACCCGGTGGGGACTC 
CGGGCTGAGCTCCTGTGGCTGTGGAACCTTGGTGGTTTATTGTGGTGGCTGAGA^ 

ACTCCCAGTGGCTCCGTGAACTGCCTGGGAGCCTTGGGCAGCTCTTTCTCACTCTGTGCCTCCGCTTCCTGATGG 

CCAGGGX3GGAGCGACCTTGAAGGTATTCGTGGGAACCAAGCGAGGTGATAAATGTAGTGTCCTTGGTAGAGAAC 

CCACCCCCATGAAGGGGGCTGCTCCTCCTCCTTCTATGTTTCCTCTTGTTCTTATTTCTCCTCT 

CCTCCCNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNN^ 
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NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNKNNNN^ 
TGCCCCAACTGCCTTTCTCCATTGTCCTTCTGTAAGTGGTTCT 
GTGGGCTCTGGCAGGCCAGGCGCrcAGCAGCCT^ 
1 0 TGTTGGTGGGAGAGGGGAAGGCGCCGTGGTAGGGGAAGCTTTC^ 

CAGGT CACTAGAACACAG CTG AGG AAG CTGTCGGC CAGACGTCCAGAAGCATCAAGCTGGTGGCTGGGCTGT^ 
GGACAGGCAGACAAGGTGTCGGAGGAAGGGGCTGACACTCACTCAGTGTCTGTCATC 

GCTGGCCCACGCCACCTGTGGAATCITCCAGGTGTGGGGGCAGATGGGGAAGCAAGGGGGCTTGAAGACAC^ 
GTCCTTAAATCCCAAGAAGAAGAACAAGGACACATCCACAGGGGCAGGGCTGAAAGTCTG^ 
1 5 TGGTGCAACAGCTACAGTGGCTCCCTCTTACCTTTCTCTTCCCTCCATTCT 
AAGCATTTTACAGGGAAC^CATTCAGCCATC^CCTAGATTCT 
TCTATCCCTCCTTCGCCTGGCCATGAATCCATTTTATTTTTAT^ 
TCTGGAATCACTGTCAGAAATTCCCTCCCTTCCCCTCCCCTCTCCTTTCCTTTCCT^ 

AGG CTGAAGTG CGGTGGTGCAGTCATAGCACACTG C^GCCTCCAACTCCCGAGCTCAAGCAATCCTCCTGCCTCAGCCTCCTGAGT 
2 0 AGCTGGGACCGCAGGCATCCACCACCACACCCAGCTTCATTT^ 
CCACTTCTCACAAAGCGAGGGTGCTGC^ 
CAGCAAATCATTTTGCCTGTCTAGGTCCTGTTTTCATT^ 

C CAGTT ACCAAGCTC CTATTTTAGGC CAGG CACTTTT CACATGTTATT ATGAGT CCTTT CAATCTAGGTAT TGTT CTCAATGG ACA 
GCTTAGTCAACGGAAGCTCAGAGAGGTGGTGTAACTTGCCAAAAGTCCCACTACCCAGTGAATGTCC 

2 5 GAGTCTGACACAGAGCCCAGGCCTCAGCACCTGGCGATGTTTTGGGGGTGTGAGCAGCC 

GCTGTTCCTTCTG CCTCATGTTTGTGTTTGTCCCCTCTGAAG CTCAG ACTGTTATTTTCCAGTTCCAAATACCAAAACTGGCTAGG 
CACGGTGGCTCATGCCrTGTAATCCCAGCACTTTGGGAGACCAAGGCGGGTGGATCACTT 
CCAACATGGTAAAACCCCATCTCTACAAAAAATACAAAAATTAGCCA^ 
AGAACCCAGCTACCCAGCTACTGAGGCACGAGAATCAC^ 

3 0 T CTCT AGG CTAGGCAACAAGTGTG AGACT CTGTCT CAAAAAT AAT AAAT AAAT AAAAT AAAAT AAJVAATT AAAAC C AAAT GC CAAA 

ACCAAC CCACAAAAT AAAC CTTGGTTAAAGTAGCCTG TTCAT TTTG CTTT AACT ATGT ATATTGCAC CAGATTCTGC CAGGG G CT C 

AGCGTATAAAGATGAGACCCCTGCCCTGCCAGACAGGTGGGATGATGGCCATACCTTGAGTTAGTCAAGGCACCATCTGGCTGTGT 

AACAAAAGCTAAAACATCAGAGGCTGAGCAGCaCTC^ 

CCGGACCTAGTTCATTCCTTTGGTGCATCCTCGACA^ 

3 5 CTGCACTCCAGCCAGAGG^AAGGGAAGAAGGGAGAGGGGAAAGGGCAAGTOTCCOT 

AAACACITTCAGCTCCTTCCCACTGGCCAGAACCCAGTTGTATGGCCGCTCCTGTCATCA 

C ATG CACTG ATAG CTCT CTTTGTATGAACAGAGCTGTGG CAGGCC CTATGC CAGGGAG AAAG TAAGATTGG AAAAGAGCTT ACCAA 
GGAGGTGGCATTTGCACTGTGC TTAAGGGGCAAGAAAAACGTCTTCC^ 

4 0 CTTTCATCCTGGAGCTGGAACAGAjCGT 

TGTGAGGCTGAGGGTGGGAAGATTGCTTGGGGCCAGGAGTTTGAGACCAGTTTGG^ 

ATATAAAATGTAGCTGAGTGTGGTGGCACCTGTAGACCCMCCCC^GCTACTCGAGAGGCTGAGAT 

GGAGTTCGAGGCTGCAGTGAGCTATGATTGCACCACTGC^ 

ATAAAATCAAAGTTTTAAAATCGAAAAAAAATCIATGAGCACATAACTCCAG 
45 GTGGCTCATGCCTGTAATCCTGGCACTTTGGGAGGCCAAGGTGGGTGGATC^ 

CAGCCCCATCTGTACTACAAATACAAAATTAGCTGGGC!ATGGTGGTGGG CACCTGTG AT CC CAG CTACTTGGAAGGCTGAGGCAGG 

AGAATTGCITGAACCCAGGAGGCGGAGGTTGCAGTGAGCCAAGATCACACCATTGC^ 

TGTCTCAAAAAAAAAAAAAAAAAGCAAACCCAGGGCCAGGTCTGTAGCCACAT^ 

CMAAACCTGCCCCACATCACATCAGTG^GCAGTTGATTGGGTCAGCCTGCCAGTTGAACC 
50 GTAAGGGGCATTCTAGACATACCCCCCTTCCCAGGGAAGAATAAGCC 

TTTCCCAACTCTGTGTTTAGTGATGTCCTGTTTCTAGCTTGAAATTTGCC^ 

ACTACATATCAGGCCTTACCTCCTGCTCCCTACrAGGGTAGGTTGTTlTTlTTTTGT 

CATTCTCCTGCCTGAGCTTCCAGAATAGCTGGGACTACAGGCGCCCGCCACCACA 

5 5 CGGGGTTTCACCGTGTTAGCCAGGATGGTCTGGATCrCCKlACCTCGCAATCTC 

GCCACTGOSCCCGGCCTAGGGTAGGTTGTTAGAAATGTACCAGCCTGCCAGTTTCAGCCTTGTTTGTCTTTC^ 
TTTTTTTTTTTTTGAGACGGGGTCTCTCTCTGT^^ 
CTCCTGGGATCAAGTGATTCTCCTGCCTCAGCCTCCCAAGT^ 
ATT^TAGTAGAGATGGGGTTTCACC^CAT^ 

6 0 AAAGTGCTGGGATTACAGGCGTGAGCCACCGCACTGGGCCTCTTTCGTTCT^ 

TGATGTCTGTGTCAATAGGTACTGCCTGCGGGACAGAGCCT 

GTCACTCCCTGCTGCTCAGAACTAAAGATGTAG CTATGCCTGGGGTAGGTTGGG AAAACCTGTGGGAATCACAGAGCTTG CTGTCA 

TGCTGTGTTCTGGATACCTGAAGCTTGGC^GGCAGGTGGGATCGAGGGTGGGCATTAG 

ACATAGGTCCTGAGATTTTAGGTCCCCAATTGGACAAAGCTGT^ 

6 5 GGTCAGTTTTCTTTTCTTTTCTTTTCTTTTTTTTTT^ 

TCGGCTCACTGCAAGCTCCGCCTCCCZAGGTTCACGCCAT^ 
CCGCGTCCGGCTAATTTTTTGTATTTTTAGTA^ 
TCCGCCCGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGT^ 
GTCTTGCACACCTTGCTGGCTCn'GGAGACCATATGCACCAT 

7 0 AGGGCCTGTCGATGTGTTGACACCTGGCCATTCACTGGCCTTGAATGTCCACATTCACC 

GGCTCCATGTGGATTTGGGGGCAGAGGCCTGTACTCCAGATGGTGGGG 
GCCCTTTGGGCTCTGTAGCTCCAGATGTGTGCTCAAGATCTGTGGCCACATGGA 
TCCCTGTGACCAGAGTAGCCGTGGCCCTGTCTGGGAATGTT^ 
AGCTGTGCTGAGAAGTTGGCCCATCTGGCTTGC^GACTC 
7 5 CTGGGTGCrCCTTGTCTAGGTTCCTTGCAGCAAAAGAATTCT^ 
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CACCTCCTAATCCCAGCTGACTGGCTCCCACCTCCTC 

GC CTT CGCAGTC TGAGCTGATG CTG CTGTATGTT CACAGGGCT CACACGCTT 
GTGGCTCTCCTGGATGCCACACTAACTCAGTAAGG^ 
GGAGAATCCCTTCATCTGACGATCCC^GAAGGCTTC 
5 GTCAGGCCAGGG AGGGTGTGAGGX3GCCCA.CCAGG CAGAGGGAACAGGA^ 

AAGTGTGAGCCCCTGGGCATGACCAAGGCG 

CCTTG CTG AG CT CCACTCTGT ATG CACCAGGTGACGGAGGCTGCCCAGG ACGGTGGG CAGGGAAGTGACAGGTCACATGG T C ATCC 
CACCAAGAGCCGGGAGTCCAGCTTGGGGGGACCCA^^ 
GG AGAAGGAAACAGTGTTGCTTATGAATTATAAACAGACA.CTC CC^ 
10 CAGAGGTGGGAAGGTGACTTCTGGCTCTCCAGGCCTTGGGCCTC 
CTCTGTTGCCCAGGCTGGAGTGCAGTGGTGTAATCTTGGCTCACT 

GCCTCCTGAGTAGCTGGGATTACAGGCACCTG CCAC CACAC CCGGCTAATTTTTGTATTTTTAGT AG AG ATGGTGTTTCACCATGT 
TGGCCAGGCTGGTCTCGAATTCCTGAOrrCAGGTGATCCGCCCACCTTGGCCTCCCAAAGTGCT 
GTGCCCAGCCGAGCCCATCTTCAAGTTCTCCCTGGCCCT 
1 5 TCTCCCTTAAGACTGTCCCCAAGTGGACAGGTCCCCAGAACCAAGGAGCAGCCT^ 
TCCGTGTGGCCCTCWSCCTCTGGCCTGGTTCCX^ 

GTCCC1TGGCTGGGCCGAGTCATCGTGGCCAGTGACTCATGCAGGGGGAGAAAAGTGAGGAGGGGGAACA 
GTGTTAATAACAAACCCATTAGGTCCCTGAGGTCACTTCACAGGAGGGACAAATGGTTC^ 
CITCCAGAGACACTCAGGGTGGGGAAGATTTCATTATCTC 
2 0 CGCGCTTCCTGTCTGCCTCCCACGCAGAGCGTTCT 

GGGTGCGGCATGGGGGCACTGCAGGGCACX5GGGTGCATGCATTTTTATTTA 

GGTATATTTGGTGACATAAAACCTTCTGTCGATTCATGGCAGTCTGGAACGGAGTGGAAGCGTCC 

CACATTTGGCTGTGAACGTCGGTGCCTTTGCACTGGAGCCTCTTTG 

CTCCATCATCCTGTGGAGGTGTCAACATGTAG CCAGAGATGGTCAGTGGCAGTG AGGAGAGGGGGG CAGGAAGAGACCAGGGCTGG 

2 5 GGGCTTGCCAG CCCACAGTTCTACCCCTGAACACCCATGGCCTCCCTGTTCAGGT CAGCGAGG CTGGGGGCAGAGACGGGCAGGGG 

TCCCCCAATGTTGGGCATCTCCCCGCTCCCTCATACTCTGGATTTGCCCTCTGG^ 
CTGCTGGCTGAGGTGGGGAGCAGAGCCCAGCTGTGCCTGGAGGCTCCCC^ 
AGGCCCAGGCCGCTGAGGCTGGCTCCAGAAATTTCTTCACAACCACCACrGGATGGAAAACCAG 
AGTTTTGTTTCT^CTrCCACAGGT^CCCATAGTTA 

3 0 ATGTGGTTCGGTCTCCACTGTGGATGTTGGTGATCTTAGTCTCCTTCTCAAAACCAA^ 

CAGAAAGAGCCGCCACCCACCCACACCGCTACATCTTCTTTCCACTTTTCT 
AGAAATCCATCTACZAGCCTGACTGCAGATGTATTTTGCTCCCTTATTGAGTCrCT 
TTCTCTCTCTTTCTCTCTTTCTTCCCTCTC 
TTCTTTCTTTCCTCTCTCrCTCTTTCCCCCTT^ 

3 5 AAGATAGAATGAATGGCATAGGTAGGTACTGAGCTCCCrGTAACrGGAGGTGTTCAAGTO 

GG CATTG AGG AAGGAT C TC C CTGGG ATTGTGGATGG CAGGACAGG C CCTG TGG AG CCTTCTGAGCG GGTG CAGAAG AGGGAGGGCA 

TGTTCCTGCrTCCCTGCCCTTCCCGGGTTGCCAGGCC^ 

ATGGGCAGAGGCATTGTTAGAACATGGAGCCGCGGTCCACAGAG^ 

GTCACTGTGGTGACTTTGAAAATAAATGCTTTCTTCTCTGCTTGAGTCCCTGTTGTTCCTGCTGACGTGGAACTAGGGCAGGCTGG 

4 0 TGACTCAGCCCrCATCCCTGCACTGCATCGGTGGCAAGTTGCCTCCAGCCTCA 

ACAGTAGGGAGG CCAGGACACCAAGGGCCCCAGGGACAG CCTGTTTCTCCTGGCTGG CACG AGTTCTAGCCAAGCCTCCTGGTAC C 
CTGGGGGTGAGGAGGTACCAGCTGTGTGCTTTAGTCACrGGCTGCTTTGA 
CCCCTAGTGTGACTTTGCAGAGCrCCCGGAAGGGTCAGAGGGTTTTTTTCCATCACT 
CATTTTTGTACTCCAGTCTCAGACGGGTGCTAGGAAGTTGGAGACTTTAGCCATTTCACGA 

4 5 AACAGGAGTAGGAGGAACGCTCTGGTCATGGGACTGTGTGAAGAGAGAAGGATCAGGAAG 

ATACCTCAGCAGAGCCGCTGGTGGCCTCCCAGGCATCTGA^ 
GTTCAGGAGGCACAGCCAGGGTTCTGCTGGAGAGGGATCATCTTGGAGG 
CTGTGTGTGTGCAGCCCCTGACCACCACCATCCCTGCCCCCGCCACTTGCTCCCAGGGTTCCA 
AAATGTAGCAACAGAAGGTCCAGGCAGTGCTGCAGTC^ 

5 0 G CTGATTCAGTC CCAAGTGCCGGAGCGGGGAGCTCGGATGAGGTGGG 

TGCTGAAGGCCAGGAGCCTCCTTGGGAGATCATGAAT.TTAGAACACTAATACTTTGTTTAAAAAGTCCCCTTC 
GCTCAGGCCGCTGTGGTCAGGTTTTGCTGTAAATGTTCTGCACAAAGCAGGCTGG 
GGTCATGTGCTTCTTTGAGTGAAACCTTATGGTTGCATTGAAGCGGCCAGGCAGAGGG 
GCTCAGTOVGGCTGAATGGGGAAATGCAAACTCAGGTGGCCTGAAAGAGACTTA^ 

5 5 AGGTGATGTAAAGAGAGAAGGGGTTCCGGGGGCTTGGCCCAGGTGGA 

GAGCAGGTCTGCTTGGTGGCTGGCAGAACGC^CATGAAG 
CCCATCAGGAGTCTCCCGACTCTGCC^TTCTGCAAAGGAAACTGCGCTTTOT 
CCTAAAGCCACTGTCTCAAGGGGTCTCATGGGACAATGGGAGAGTTGCTGTGATGT^ 
GCTCACGTCTGTAATCCCAGCACTTTGGGAGGCTGAGGA^ 

6 0 GGTGAAACCCCATCTCTACTAAAAAAACAAAAATTAGC 

GCAGGAGAATTGCTCCGATCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATCACGCCACT 

ACTGTC TAAAATAAT AAT AATGCT AATAAT AAAT AAAAT AATAAT AAT AAAAT AA CAC 

ACCC^TGTTTGTAGCAGCTTTACTCTCAATTGCTCAAATGTGGAAGGAATGGATATA 

ATTCAGCCTTCAAAAGGAACGGAATTCTGTCACTACATGTGAACCTTGAATAATGAACCTTGAAGAC 

6 5 TCTCACAAGGACAAATATTGTATGATTTCACTTATGTGAAGTCAAATTCATAGAGAAATTGGAATG 

GAAGGAAGGATGGGGAGTTATTGTTTAATGAGGACAGAGTTTCAGTTTGGGAA 

TGGGTGTACAATGTGAATGTACTTAATGTCACTAACATGGAAACTTGGTTAAAATGGTA^ 

TTTTTTTTGAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCACT 

TTCAGGCGATTCTCGTGCCTCAGCCTCCCAGAGAGCTGGGATTATAGGCGTTTGCCACCACGCCCGGCT 

7 0 TAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGA 

GGGATTACAGGCGTGAGCCATGGTGGCTGGCCAAAAATGGTAAATTTTATATTACATATATTT^ 

CTATTAGAAAAATGATAGATTTTT ACAAT ATGAATT AAAT T ACAAGTT ATGAAGGAAAAAT T AAG CAAAACATT AAAAAATTACTT 
ATAAGTAATTATATTTAAAACAAAAGAT AATAAAT ACACAACTTT CAT CACCTC TTAGGTTTTTTG CAACATTTAACT AT TGTCTG 
TCCTGTTGCGGI^ATGTTCTTGAGGTTGTATTTGTATG 
7 5 CTCCAATGATGTCATACAGATAACTTGAAATTGGCCGGCTGGGCACGATGGCTCATGCC^ 
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GG CAGG CAGATCAT CTGAGTTCiUVAACCAGCCTGGCCAGCAGGGTGi^TCCT CATCT CTAGTAAAAATACAAAAATT AG C CAGGAG 
TGGTGGCGGGTGCCTGTAATCCGAGCTACTTGGGAGGCTGAGGCAGGAGAATC^ 
CAGGATTG CCCCACTGCACTCCAGACCGGGTGACAGAG1XSAGACCCT 
GGAATATTGACACCACAGGAATAGGCAAGCGCT^ 
5 TTGACCG1TAACTGGCACACTACTGGAGGAGGGAGGTTCAGACTGATTCGCA 
ATGCTGAGGGTGAGGGAAAGAGAGGAGTCAACAGTGGTGCrc^ 
TAAGATGAGGGGCTGTGGGAACTTGAGGGGCTGTGGAAGGCT 

TTTGTGTCAGGCCACGCAGAGATAAGACCATCTGGGCCCTGCCCTTGGGAAGTCTCTGTCT 
GGGGTGTCCCGTCTGGAGCCTCAGTGTTGCTGGGAGAAAGCCCCrCACCTTCC^CGCTCTCCGA 
10 GCTGAGCCTCTCCATGGAGGCTGTGCCTGGAGGGGGGTC^^ 

CCCCACTCCGTTCCCACGCTTTGTAAGCTTCATTTCCCCGGGGCCTGACCTG 

CCAGAAAGGCGTGCAGGGGTTTCCTATGGCrCTCTTCCCTGTCCCCATCCCCCrrT 

GCTACCCACAGGGTTTGTGATTGCTTAGACATCTTTGCTGTATGAAGCTTTGACCT 

15 AAGTCTGAAATCAAGGTGCCAGCAGGGCCAGCTTC 

GCCCCAGGTGTTCCTrTCGCCTGTGGCTGCATCACTCCGGT 
CTTTTCTGTCTCTTAGAAGGACACAGCATTGAATTTAGGGCCCACCTTA^ 

CTTAAACAGAATCAGGT CACATTTGAGGTTCCAAGTGGGCAT ATCTTTGTTTTGGG GGGGATCACCAG CC CACTGTAGGTCT CGT T 
GTGCACTGAAGACGCAGGCGGTCCCTTTGGCCAGGCCTGCAGTCX3GTCT 
2 0 GGATGCACAGGCCTTTATAGTTCTTTTGTTCTTGCCAAATTTCAGCTCAGTTTCCTCT 
AGTGCCCCTTCTGGGACTGTGTGTCCAGGCCCTGAGGCCCTAGA^ 

GCCT CATTAAAAT CCTGGAG CC C CAG ACTGG ACCCAGC C CTTAGCT ATGATCAAAG GACAG CT 
GAAAGATGATTTAGGCTGAGAAAGAGGAGGCCAAGGAGCAACTGGATTGTCCTTG^ 

TGGTGCC CAG CTGTTTC CCATCT CCT CTGAGG ACAGGAAAAAAAAGGAAATAAATG ATGT ACAATT AG CAGTTG GTTAGATACG 

2 5 AGGGTGAACTTCCTGGCTGGGAGTATGGCCAAGCCCTGATTTGG^ 

CTAGGCCTGGTCATGTGGCTCTGAGTGATTACATTGAAGGGGCAGTGGCTGGGGCTGGAGC 
AGGCATGCG<^OTTTCTGGCTGCTCTCCTGCCATTGTCCCTTGGGTTCCCATTTTGGTGAG 
GGGAACTAAATATGTTTGCTGCACTGTGCGTGGAGATGGAGAATGTACAATTGGCTGACCCTC 
CCAGCCCTGGGAAAGAACAATTGCATGGGTGTCTCCACACTC^^ 

3 0 TAGGATGGCTGTTGTGCCGGACCCGGCATCTTCCCAGGGGGGCTGTGTTGTTGGGCTGA^ 

CCCAAATAC^GCGTGGACAGGTGGCCCAGTAGGGGCTGGACTATCCGATAGGCCCAGGTGC^ 

GCCTGGCGTGGAAGATACGGGGTGCTATTAATGGCAGCAATGGCrrGC^TTTCTGAAACCCGGGCTCCCAGGCCGACGAGGGTGTGC 

ACGCATCIXSAAATCTCTGTGGTTTTGCAGTTCCC^^ 

GGAGGGAGTGCCTGCTTTTCTTAAAAAGGAAGGACITGATTr 

3 5 AGACCCCCTGCCCCCCX3CCTCTCCAGAATGGCTGGAGAGTCT 

GGCAAGTGGGCAGCGAGCGACCTCAGACCCAGGATGAGCrrGT 
TAGCGTTGCGGCCTGCTGCTTCGGGGGTGGGGGTGTTGTTCCATC 

AGGGTCT AAGGGACGGGGTGACAGGGAGAGCATGCAGGAGTGGGTTTCTGG CTTTC CAGGGCGAGTGGAAGAAG CGCCTCT CTCT C 
TTGTAGGTGACAGACCTGGGGGGCCCTTCTTGAGGATGAGAGCCTGTT^ 

4 0 TACCCCAGCCCCTCGGCCCTGCCTGCCrTGTGGATGATATAGTTTAAGGGTAGAGACCGCT 

GGTTAGGGCCCAGAAGGGAGGGAGAAGCCCTTGGGGCAGCTCCCTTTCTGCTC^ 

TTCGGAAACGTCTGCTCCTGAC!AAGGTCTACTTCCrGCTCT 

GCTTCTCTGACAGCCGTGTTCCATCCCCGCCCTGTGCCCCTTCT^ 

CGGTGGCCCCCGGGGCGGTGGTCGTGGTGGGGGTGTTAGCTGCAGGGGTGCCCTCGGTGGGTGGGAGTTGGTGGCCTCTCGCTGGT 
45 GCCATGGGACTCGGATGTTCGCCCTGCXiJCCCCITCG^ 

TTTAACCCTTGCAGGCGCAGAGCGCGCGGCGGCGGTGACA^ 
CTGCGCCCGGGGAAGGGGAGGAATCTCTTCCCCTCrrGGGCGCCCGCCCTCCT 
GGCCGCAGCX^GGCX3CCCGGGGGGAGGGGCTGAGGCCGCGTCTCT^ 
GCTCrcGTCOTGTGCCCAGGACTGTCCCCa\GCGGCC^^ 

5 0 CCAGTGCGAGACAGGG AGGCCGGTGCGGGTGCGGGAACCTGATCCGCCCGGGAGGCGGGGG CGGGGCGGGGGCG CAG CGCGCGAGG 

AGGGGCCGTCGCCCGCCTTCCTCCCCCATTCATTCAGCTGA^^ 

GGAGCJGCGGGCGCGGCGCCCCAGCCAGCGCGCAGGGCCCGGGCCCCGCC^ 

CTGCCCIACCAGCCACCATGTCGGACCCCGCGGTCAACGCGCAGCTGGA 

GCTGCTGCGGCaSCGGACGTGCTGGAGAGGACCCrcCGGGTGGGCCTGGCGCGGGAC^GGGTC 
55 GCTGAGGGGAGACGGGACCCCTAATCCAGGCGCCCTCCCGCTGAGAGCGCCGCGCGCCCCCGGCCCCGTGCCCGCGCCGCCTACGT 

GGGGGACCCTGTTAGGGGC^CCCGCGTAGACCCTGCGCGCCCTCACAGGACCCTGTGCTCGTTCTGCGCACT 

CTTCCTTTTATTGTTGTTTGTGTTTG CCAAGCGACAGCGACCTCCTCGAGGG CTCGCGAGGCTGCCTCGGAACTCTCCAGGACG CA 

CAGTTTCACTCTGGGAAATCCATCGGTCCCCTCCCTTTGGCTCTCCCCGGCGGCTCTCGGGC 

AG<3GAGGTCGTTCCTCACCTCCGACTGAGTGGACAGCCGCGTCCTGCTCGG^ 
60 GGCCGCCAAGTTGTGCAGCCCGTGGGCCGGGAGCACCGAACGGACACAGCCCAGGT^ 

GGCCCCCATCCAGGCCTGGGGATATCCTCATCCGCCTCCCAG 

GAGTATTTTGGCTCTCTCCTGTCTTCGGGGTTTAC 

CCGCGTCTCTGAGCCTGTGAGTGCGTGCGCTTTCCTGCGTCCT 

GTAAATACAGCAGGCGAAGGGGAAGCTCACACAATGGTCTCCAGCGCTCTC 

TTGAAATGAGCAAACTGTCCAGGTGTCAGGCCAAGCTGGGA 

CTGACATCTGGGTGTCCTGAGCCCAGCTGCCCACACGGCCCACCTGGGGTCAGCCCTATCTGAGTGGG^ 
GGGACCAGAACTTTGGCTGGACGCCAAGCAGAGTGCGAGTGGCTGTTCT^ 

GGACGGGAGGGGGTTGTGATC CAGTGGCCACTGGCX3CTGTGCAGAGTGTGAGCTGGAAACAT CGTAGTTACTTTGTCAGCTTAGTG 
7 0 GTGAAAGCCCTTTTTCAGGCTCTATCCCTTTGC^ 

TGCTGTTCCC^CCCCCCTCCCCCAOVAC^CCTCCrCATCTGGACATCTTTC 

TGTGTTCCGTTGAGCCTCCTGCAGTCCCCGAGTGAATGTGACCTCCCTGC^ 

TCCAGGGGCCTTCCTTGTCCCAAATGTCCAAGTGGCACGACTTAGCCGGTCTGACCACT 

CCTGTGTTGTGCAGAGCTCTCCTCCTGCCTGCGGGATCGAGGTCTCTGCTCTCAG 
7 5 GATTTCTGGGGAAGAACTCTCGTGTCTCAACGGGAGCCCTGTGGCGGC^ 
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TCTGGAGCTGCTTCTGCGGCTGTCT 
TGGTGCAGCAGTTGGCAATCCTTTGGCTCTGGGGGAAG^ 
TTATCAGTCTCCCCCTTGTGTGCCCTGAATGGTTTTG 
5 GAGAGAGTGTGTGTGTGCATGAGTGCGCATGTGCATGGGGGCTGACCTGGGGT 
CCTGGAGTATAGTTGGAGGGTGTGGGGGTGTGTATATGGGAGTTGG^ 
TGAAGGAGGTGGAGGAGAGAGTGTGATTCAGCCCAGCCAGGGGTGATGT 

CCACTAGCTAGCAGGGGAGAAGTCAGTATGGAGGAGGCGGACCTTGAGGGAGAGTAGGAATTGGATTC 

TCTGGCCAGCAGCAGCCAGCAGCAGTGGGGGAGGCTGGAATGAGCTGGC^ 
1 0 TGAAGATC^TATGGGCCAGGCTGTGGAGGGCCAGGCATGCCCGCCGGGAGTGCAGCT^ 

GAATGGGCGCAGGAGCAGCGCCGTGGGAGCACAGGTCTCTTTCC^ 

GAAAGGGTGGGGAGGAAACTGCCAGCTCCTTACAGCGCTGGGATGGTGGCC 

TCACCATGTGGGTTGCTGAGGGGCGGAGACTCCAGGGGCCACCCCAAAGCAGGACGAGCT 

TTTTCCAATCGAGCAGACGTCTAGGCCTGGAATCCTGTAACAGAGGCCACM 
1 5 TAGTGGCAGTATACAGGGTAGACGGCAAGTGACCTGGCATGGGGAAAGAGGC 

ACAGGTAGACGGGAAGCCOTCCGTGGGCCTGTGTGTGTGCTCGGAGTTAAAAT^^ 

CCCCTCCCTGGGGAGCACCTTrCTCTTCACCrTTTCCTTCCCCGC 

GGGGACCGAGGGCACCATGGCTTTATGTTCCATCACCX3ATGAGTTGCACAG^ 

GAGGCTCAGGGTTCCCAGGAATGGAGAGGGTATCAGTGT 

2 0 AGATAGGTCTCCGAGCTTCAAGGTCTTAGGGCTCTGTGGAGGCC<3CCACGTAGCAGCAA 

GATTTTTAGAGTCTCA1TTTCTACTTTTCCACTC 

GACAAGAGGGATGACCCTTTCCTCTTTCTCCCTTCTGGTTGGTGGAGGCACGGG 
GGCCAAATCTGTTATCGCAAACACGATTACAACTCGGGTCTTTG 
GCTCAGGGGTGTTTCCTGTAACCAGATTTGCACACCGAGAA 
25 ACCCACGTGCTCAGATCCCAGAGCTGTGTGGCATCTGAGCTTCTCCCGAGCCCCCA 

GTTCTGGAGGCAGAGC^GGCTGTGGGACGGGCTTCTGGGTGGGAAGGACCATGTGGATATGCCTT 
CTCCGGGACGTGGGAGCTGGCGCGCTGGCAGGATTOVGGTGCCTCTT^ 
TGCAGACCTGTCCCTAATTAATGCCAGTAGGCCTCTTGCGTC 
CTGCCTCTCTGCAGTTTGAGCTTTTGTGCT 

3 0 CTGGCAGCACTGACTTGATGCTCTTTGAGTTCAGAG CCCAGGG CCAGACAGACCCACTGTCCCGACTACGAGTTGGTrCATTTAGA 

GGGGGGCGGACACAGCACCCAGGCAGCAGATGCACT^ 
AAGAGGCAGGTGGAGGTAAACGCCAGGACACCCCTGCAGTC 
TCCAGGAAGGTTTTGAATGGGGCTAGTGGACAGAGTTTGCATCCAGAGG^ 
GGGATGCCCAGGCCGCCCTCACTCTGCCATTGGAAAGC 
35 AAGTGGGGTTAGAAACAGCTCTCTTTGAGGGTTGCTGGGGGACTCTGA 

ACGGGTGTGGTGGGTGTCGCGGCTGGTGTGGCATCTGGGCGGGAAAAGGGGGCATTTGCAAAGGACT^GGTAGGTTTC 

ATATG CAG AT CTGGGGATGGGAGGTCTCAGG CAAGGGCCTGTGTG AT G C CACTGT ATGAATGAGG CCGGACAG CATGGT CAC CAC T 

AATTATGGAAAGCACGGC^GACACTAGTGCCAGTAGCGCCCGC 

CAGCCATGGGACCCCCCTTTTCCTGCCTGGCTGACATTCTGGAGGCTGGCGTCATGTTG 
40 GTCCTGGTCTTCCOTGCCACCATCTACCCCCACCCCCACCCTGGAGTGTCACAGAT 

AGCGCTGACAATGAGGTGAGCACTTGCrGGATG ACCGGACCATGA CTGCGTTCGGCTCTGCAGG 

CTCTGCAAGGGCCCAGTCAAGTGTGGTGCAGAGCTGGGCTGGCAGCT 
GATGGGCCAAGGCTGCGTGCTGCTTGCCCAGCCCGTGGACCCATCCACTGAGCACC 
45 AGGAGGTGGAGGGGGGACTGGGTGCCTGGAAAAACAAGCCCACCGAGAGCCTGAGCCCCCAGGGGTCAGTATCAGGC^ 
GGTGCCCGGTACACAGTGGCCACCTGGTCTCGTCGGCTGTCT^ 
CCCAGGGTAATACAGATATGCAGACTGTGCATTTGT 

CCGTTGGGAGAGAGGTGCTGTTCCCTGATAGCCTGGTGCCGGGAGAGCAGAACTAGGGTCTCCTCCT 
TCTTACATGCCCAGGCATCAGGGGAGGTCACACTCAGGGATGAGTCTTGGCCA^ 
50 CCTTTGCCAGCCAGCGAAATGCCCGTCCATCATCGTGCCGGGGATGGGGCTGGCCrr 

TAGGCTGGAGCCCCGGACG ATTCCX3CCACC CCCTCCCCCGACTTTTTCTTTTTA CCAAAAAC 
CAAGTTCCTGTGCTCTCCCTTGACAAAGCCACTC 

CAAATTTGGAGGCTCTGAGTTGCTTATGAGACCAAAGGGTCCGGAACTCTCTGGA^ 
CTCTCCAGCCAAACAGACCC^CCAGTTGGCCCCTAGACACCCCCAGGCGCCTGGGGCA 
55 CTGCAGAGCCCCGAGGCTCCGTTTAGACTGGGCCAGCAGACAGTTCTAGAGACTTCAAGTTTGTTG 

AATGCCCAGAAACTTAGAGGGACAAGAAGATGACCrrCGATGTCAGCCGGGACCATAAACAGTGAGGCCCGCCCCCCC 

GCCCGCCTCCCCCTCCATGCCCGCCTCCC^ 
(OTCCCGTAGCTGGGTCTCCAGG^ 

ATTTTTTTGTTTTTCCTTGTCCAT CTTCTTTCTCGGAAATTT CTGAATA^ 
60 TCTCTGGGCCAACTGGGAAAGCCATCATTTGTCCAATATCCAGAAAATCAAGGGGCCTGGCCTTAAATAC^ 

CCCCACATGG CATCGAATTTCCTGTGTCCTCAAAGCCAGGAG CCTGTCCCTTGGATGTGGGCGATGGAGCCTTGCAGACACCTGGA 
GCGGCTTTCCTGGGAGAGTCTCCCTCGCCTTCCCATCCACCCGCTTCCCCTCCCCTGCCCCAA 

GGACTCAGGTCTTGAGGGACAGCGGAAGGGTGGGGTGGGGAGTAG^ 
65 TGGGCAGTTCCTGGGAGCTGGTTTTTCCTTCrrCACCCCTTCTACA 
GCAGCCTGTGGCAGGTCTTTCTGATTCTGTGC 

TCTCTACCTGTGGCCTTTG CAGG CAGCAGTTCCCTGGGG CAGGGCCTGGGGGTCCTGG C^CCrCCTTGTGGGACCCC^GGGTATG 

GGGAAGAGAGCATTCCTGGCCATGGGGAATGTGGCCTGGACAGAGCCACCTAGCCTCTCCCCTCCCCCAGG 
70 TGGAGATGGCTCTGCTCCTGCCAGGTCAACAGCCAGCAGTCTGGTGGATTTCCATC 

CCCCCCACCC CATGTGCAGGAAGCTGGTTCCAAGCAGGGACCCTTGGC^ CCCTGCGAGGG 
GCAGGCCAGGCCCCTOCCCTATGTCCCCCAGA^ 

CTGGAGGGGTGGTCTTCrCCTCTGTCCTCTGCCTCTTGCCCCCCATTAAA^ 
TTGGGTTTTITCCTTTGGTTTAAAATGAAGGGCTGGAATCCCTGGT^ 
75 AGG.V 2 CA a C TG CTGAATG CC CTG CCAGGCCAGGAGCATCG CTG GG GGTGGGAGAGG CAGTTGGC TT ATTTGTTT AT C CAACAG C 
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CATCCATGGOTACCCCAAGGGCCAG 

TTCCTGGAGGJUM3GCTTCCTAACCAGAGACCGGG 

TTTCTTTTTTTCTTTTTCTTTCTTTCTTT^ 

CGGCTCACTGCATCCTCCGCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCT 

CACGCCCAGCT A ATrr riTGT ATTTTTAGTAGAG ACAftGGTTT CATCCTGTTGGCCAGG CreGTCT CAAACTCCTT ACCTCAGGTG 

ATCCGCCCACCTTGGCTTCCCAAAGTGCrGGGATTACAGGTGTGAGCCACTGCGCT 

TAGTGTGTGTG<5AAGACTGTATGTCTGTGCCTGGCAC^ 

CAGCACAGGTCATGGCATGAGACAGGCTCCCGCAGCCAAGCGGGG^ 

AGGCTCrGCATCTOTCCTCCTCCCCAGGAG^CCTGGAGGGCTGGGAGG 

CTGTTGGAGAGGCAGAGGGAGCTGCAGGTTTGTAGCAGCGJ^ 

TTTGCTTTTTTTTCTAAACTACAAAAGAAACCCTTGG 

CCTTTGCAGGCCCAGTACTGTGCAGAGGGAGGAGGGTGGAG^ 

TAGCCCCCCTCCCCTGCCTGGTTGCTGCTCT^ 

GCACAGAACGGAGACCCTGGTGAGGGAGGAACTCITTCTCCAGCCCT 

TCAAATATTTACATCT CT CCTAATATTT AAATG ATG AAG CAGG CT C CTGCACACAGCTGCC CTGGAG CCAGCTGTTG CCTGCC CTA 

TCTCTCACCCAAGTCCAGGC 
ATAACCATTTCCTTTAATTGAGCACCrCCT 

GAGCCCAACAAGGTAGATTC^CTGCTGAGGCTCCAGCGCTGGAAGGCCG 

TCACTCCCCCACTGTGGAGACCTGTTCCCTGGCCCCTGGCTGAGCACGATGGAAATCT 

TGGCTGAGGGTCCCGACCCTGGGGTCTGACGGGGGCTGCCTCTGCCCCACTTCAGCCTGGCTGCTCTTCCA 

TGCAAAACCAGAATAACAGTGCTACCTGGCTGGTACTTTGGGTGGCTGGGAGGCGTGG^ 

GTGTCAGCATCGTGGCGATCTCCGAGGGTAGCCTTCCCAGGCCCA^ 

GCTCACATCTGGCCTTGCCCTTGCCCACACGGCAATGTGGGGGAGTCATCAGCCTCTCAGTGCCT 

GGG ACAAGAGT AAAT CACTGT CT TCAGGT ATGGTAAG CACTCAT ACACAT TG ACTATTG CAG AATG AACACAGTGAGCCCTT CTT A 

GGCTTCTACTACATGCCAGGCAGGGTGCCAGGCCCTAGTGGAGAGGGGACAATGATGGGCAAGCTGGCT 

TCGAGCAGGGAGGAAGGCGCCGGCTGCCCTGACTGTGACCCGGAGTGGGGTGATTCGO^GAGGCTGGCGGGAG^ 

GCTGGCTGGGGGCTGCTCCCTCTCCTGCCTGTGTTC^ 

TGTGGGACCTCGGGCAGGTTTTGGGACCTTGTGACTTTGTTCCCGC^ 

GGCGACTTCTGCATGTCCTTCAGTGTGTGACTGACCCTGGAGCTGAGACTTGGCAGCTGAGGCTGGACATTGTCCTTGGATAACTC 
CCGGGGAGGTCGGGGCAGGGTGGGGCTGTGAGCTGGCAACTCCGAGTC^ 

TTCTCGGGGAGGAAGGAGGGGATGGGGGAGATGAGGCTTGTGAGTCTTCAGGAGGGCAGAGGTTCACACCTGCAGCGCCCC^ 
CACTCCGGGGTGCCCAGGCTTACCCAGCCCTCCTGAG 

CAG AAATGATTCTCCTGAACAGT C CGAG C CAAGAATG ACATAT CTGG AAT CAGAAC ATCAGAACTACAG AGG AAT CGAG AG AACGG 
G C C CAAAATGTGTAGGGACCTCAG C CACATGGCG CCACATGGAG G AAAG CACC CAACATCAT CACTGT ATGAACCTGTTTAAACAC 
ACACACACACACACACACACACACACACACACA(^CACAC^ 
AGTTACCTGGGTTGTGGGTTTATAGAriTTTTCCCTTTGTT^ 

ACAGTCAGGAACAAAGCAATAGATGTTATTTTTTAAAGTTCATGGGCTCGTGCCCGTAATCCCAA 

GAGGATTCCTTAAGCCCAGGAGTTTGAGACCAGCCTGGGCAACATAGGGAG 

TTTCACTGTGTCCCAGGCTGAAGTGGCGGTGGCACGTTCTTGGTTCACTGCAACCTCT 

CTCAGCCTCCTGAGTAGCTGGATTACAGACACTCAT^^ 

GCTGGGGACTTGGAGTTGCCIAGCTCACATATTTTTTAGTA^ 

CCTCAAGTGATCCATCCATCTCGGCCTCCCAAGGTTCCAGGATTACAGGC 

T AAAACAAGAAT AAAAATAAAAAT AAAT AAAAGTTT ACTGT CAAAAAG AAAAACACAT CAC C CAACC TAATCTCCC CCTAGG ATTG 
AGAGATGAGGAAGCAGAGGCCGGAAAGTGTCACATTCCCTGGTCGCTGGCAGTCAGGGG^ 
GCTCCAGGCCTGGGGTGTCAGGTGGAGCCCCTGCTTTGCCCAAAGGTCTAGGTGCOT 
GGCC^CAAGGAATGTGGGCCAAAGGCAGGGTCCTTGACCrCTGCCCAGGAGC 

AGG CC CCrGCGGCTGTGCCTTTGGAAATTG<ITGGCCCGAGCrAGTGTG CGGAGCTGGGACAGACCTCACCGTGGCCCTCGTGGGGT 

CATTCCATGTCCGGGCTGGAAGGGATCTTTCGGTCCAGCCCTTCCrGTTCCTCA 

TGGCCCGGGGTGGGGTGTAGATGGCGCTTGTGACTCCTGCATGCCACGTGCAGAC^ 

AGCCATGCTGGGGATAACTTGG CTGAAACCTG CGATTTAATGAAGAGAAGAATG ACTC CTCCTGCAGCCCTGTGCCCCACAGGCTC 

CCTCCTGCCCCCACCCCTCCTGCAGAGGGGCCT(IATGCTGGTGGGTGGGTCCCTC 

AGTCGAGGCTTGGCTGTCTTCCTGCCCAATGTCCCCTGTAGCCCACGCT^^ 

CTTCACAAGATGGGTATCTGGGAGATGCTGATTGGCAGGCCAGGCTCCCGTGG 

TGGGCTGAGTTGGCAGCTCAGGGCAGTGGGGGTGGGGGTACCTTGCTCAGCCATGGGCCTCTCT 

TGTAAACAACTCCTTTCAGTTTTTCCTTCTCTCATAGGCCACCAGCAGAATTGAGGGG 



CTCTCTGGCTTCCCCACTTCGAAGGAAAATACGGGCATAGGGGAGACAC 
TGCTGGGGGCACAGTGCCTGCCCTGCCCACAAGAGGGGTCAGTTCTCrCCAGGATTTAA 

GCACTCAGAGG CCCAG AGGACTCATATCACGAGAGAGAAGCTGTGACCG CACAGGCACTGAAGGTTAAACAGAAGAGTGGGACACA 

CGGGTGACCTGAAAGTCCCAGGGTCAGTGTGGTTAGCCCOVGATAGC^ 

CCCCCGTGACAGTTTGCTGGGGCTGCTGCAACAAAGCCCCACG 



GTGCTCCTGGG CATGTAGATGCGTCACGCCACTCCCTG CTCCGTCTTCACATGG CCTTTTCTCTGTGTCTCTGTGTCTCX3TAAGGA 

TG CTTGT CATTGCAT TTAG AG CC CACCCT AAATCCAGG ATAC TCT C CT CTTGAGATCCTTTATTGCATCTG CAAAGAC CCTATTC C 

AACTCCTGTCACATTCACAGGTCCGAGGGCTAGGGCTCTGGCATGTCTTTTGGGGGTCACTATTCAGTCCACTGCG 

TTAGGGX3AAGCAGAATCTTCTAAGCTTGGGGTGAG03GAGGGAACGCT 

CGGATCrCCTCTGTCTGATGAGGGCGTGATGAGGTTCTCATCTCAGGTCCTCTGAATG^ 

CTGGATCTGCCCCTCCAGAAACCTC(ZAC!TTAGGTCCAGGTCCCAAGGTG<3 

GTGAGTGTGTGAACGGGGCTGGGAGCAGGACACTTGTCACTTCTGTGGAC^ 

GTGCCTGATCTAGCCTGACCCTCCACCCCCACCTCCCCCTTCCCGCCCCCCTTCACCAGCCCTGTTCCCA^ 

AAGGCAGGCGCCGTTT CCTGGACTTCTCAGCAGCTACCC CCTACCCCAGACC CCTCTCTTGTAGGCCCCTCTGGTCTCAGCGCCT A 
CTCCTCCCTGG<3CTTCreCCCCrCTACCGTGAGGCTTCTGGGCAGAAGTGTGGGGCCCCCTCGCra 
CCAGGGGTCCTCCCAGGGGCCCGCCCTGCCCGCTAGGTTTCCAGGGAG 
&TGrGTTGTCTCAGAGCAGGAGGGGCCCCGGCTGAGTCCGAGAGTCCCCTCAGTCA(^ 
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CCCGTCTTCTGGAGGTGCTGC CTCTCTCrGGTTACCCCTTCTGGTATCTTCTGCGTCCAGTTGCGG CCCCTCCTACACAGCACACC 

CACCACCCCCTTTTACCTGTGACTCTTCT 

CAGCAGAACCTAAACCAGGCAGATCCGTGTAGGTTCA^ 

TTGGCGCGCTTTGGATCATGTCATTCAGTGCCGGTCCTCCAAACCTCTCCCCTG 
5 GGTGCAGCTGTGACCGGAGCCTGAACTCTGCTCCCTGGGGT(^CCGTCCCCT 

AGGCCATCGGCTGCCCGCCTTGGGGCCACAACACIAGAG^ 

TTAGGCG CTX w lUTrTT T TTlTrm " rT TTTTTGCATTTT^ 

AGGGAGGGGTGGCTTCTCTGCCCAGGC^GATCAGGAAT 

AGCTTGTCCGGGAGGCTGGACCCTCACACCGATGCTGCCTGTGCCTGGCA^ 
1 0 ACGCTCCTCAGGGTCTCCAGGTCCGGGTCCTGGTCCCCGGCAGAGCTTCCCATCCATC 

TCCTCGCAGGGAGCTGGGATGCATTTCTAAGAGCAGCATGCG^ 

CTGTTGTTGGAGGGTCX3GAAGTAGAGGGGAAGGAGGGGCACGCGGTAGGCTG 

GGGGCTATGAGCAGACCAGGAGCCAGCATCACGTTTCTCTGTCTCCAG 

GTGTCTGGGGCCACCTAAAGGGCGTGGGTGGATTAAAGCTGAGGAAAGCCTCATTTT^ 
15 CTCAGAGCTGCAGCCTGTGGTTTTTCGAAGTCAATCACCTCCCAGGGCC 

TCTTTGACCCAAGGAGACCTGAAAGGTCATCTGGTCCCTCTCCTACCCCCAGGCAGGC 

TCCCACCCTTCTGCTGCCTTCCACCAG 

TTCTTCCTCTTTATCTGCCTCACATCTCTC 

CTGGTTTAGAACCCGGGGGTGGGGGGAGCGCrTAGACCAGTGCAGCACATCT 
2 0 GATT TAGT AAAG AAAAGAGGAAGTCCAG CAACTTAGCG CC^CTTTAAAG TCCGCCTGG AATGAC CCTGTG GCTG CCGGAACCAGGG 

GCACGCGGTAiGATGACGGGAGCAGGCACAGAGAGGCCTTC^ 

CAAGAAGAGGACCCAAGGCCCTGTGAGCGGTGGCCTCCTCTGGAGCCTTTACTGTGGTCCT 

GGCTTGAAGCAGAAGCTGCTACGTGGCTGTCCCAGGCAGGCGCCCGTAGTCCCAGCTACTCX5 

TGAACCCGGGAGGCGGAGCTTGCAGTGAGCAGAGATGGCACCACTGCACCGC^ 

2 5 AAAAAAAAAAAAAAAAGGAGTAGAGAAATCTTTCCCAGAAGGCCTG 

AAACTGCACATTTCGCTTTTTTTTTTTTTTTTTGAGACGGA 
CTCGCTGCAAGCTCCGCCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCT 

GCCCGGCTAATTTTTTGTGTTTTTAGTAGAGAC^GGGTTTCACTGTGTTAGCCAGGATGGTCTCGATCTCCT 
CCCGCCTCTGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCGCCGCGCCCGGCCACACGTTTCTGTTCTTGAA 

3 0 CTGACCCTGGGTGTGGTTACTCCAAGACCT^ 

CCTTGGGGGACACGTTAGTGAAAGGGACCCTTGAATGACTA 

ATGTATGGGCCTCTTGAGCCAGAGGAGACAACAGCAGAAAG CTCTGCGGCTGATGGGACCGAGCAGGAGAAGGGCCGGCGGAGG CT 
GGGGCAACAACGCACTTGCCCACTGTGAAGGGCCTGTGTGAGCTGATGGCTGATGAGA 

AGG CTAG AC CAGCTC CAGGACACCCAT AT C T CCTTTT CAGAACCGGTGACTCAGTGG CTAGAAGGGG CTTCAG AATTGACGGG ACA 

3 5 CTGAGGGAGTCGAATGGAATGGCTGGAGTGGCCACAG CAGGCTCTAGGCCT CCTCCCGTTTCTTTTCTGTGTTCTGTTCCTGGGGT 

CTATATCCTGGGGTTGCTGTAACAGAGTGCCACAAC 
TCCTTGGCCTGCAGACGCATCACCATCATCTCTC 
TCCTTTTCTGTCTTTTATAAAGATGCT 
AATTACATCTGCAAAGATCCGTTTTCCAAATGTC 
40 ACACTGTTGAACCCACTAC^TTTAATAAACCCAGAGTTGGTGCCAATGTCCCCTGCG 
GTAAAGGAATCCGTAGGACTTCTTTTCCCT 

TGGTTCAGCTGGTGCTGGCTTGCCAGGTTGTTAGAGAGCTTTATCCAGGGAG 

GTTTGGAATCCCAAGGGCTGGCTGTTGAGGTCAGCT^ 

CGTTTCTGGAAGGAATGCTGTGGGCTCCTGTGCCCCA^ 

4 5 GCGTGCAGAGGGAAGGAGGAGGCTTGTTTATTTCTCTTGGC 

CTCATTCTCACATCATCATTGCAGAGTGAGCCAGGCCTCTTGGGTG 

CTTCTTGGG C C CAGGG TTG CCAG AGG CAG AGAGAAAXIAAGCCTCCTCCTAACAC^ CATCAGAATCCATCAGA 

GTGGCCTCCTGTGGCTCTTCCCTGGGTGCTGACCCCC 

CATCCAGGGGTTAGTGGAAATGGTTCTTTTCCTCX:TTC 

5 0 GGGCCACGTGTGCAGATAAGCCTGGGCAGTCAGTAACCTTTGGCGACCAAGCTGGCCT 

GGTCTTAGCTGCTAGCACTGTGGCCCAGCCAGGCCTGGC 

GGTGGGGTGGTACCCTGGTTCTTCCCTCCCAGACCTTTGGCAGGTTTGAACTAT 
CTCCXSCCTTGATGTGGACTTTGCCTTCTGTTCACGT^ 
CCCTCCACTGACACTGCTGTTTCCTGGCTTCCCT^ 
55 ACGGTTTTGCTTCCTGAGAACACACCCCTCCCCATCACCCCAACTGTGTCCTCCTCCCrrATCC 

GAGCCACCTCCCTCCTCCACCAAGCAGCCCAGGCCAATTCTGGAGGGCTAATTTGTGTCCCTCCAGAAGCAG 
ATTTCAGTGGGAGGGGACACCAGGGGATGCCTGAGTGTGGGAA 
CTCATTGTGACCACAGGTGGCTGCCCAGCCCCCATACC^ 
GGGGTTGGGTAAGGGAGCTGGGGTTTTTATACTTCAGTTCTTGG 

6 0 TGGGCACTAGGAAAGG CTCTGGGTGGAG ACACGGAAATAGGCCATGTGAAGTCAGTGGGCGTGAT TGGTAGCAGTAAGGCCAGGAC 

AGCCTTGGTC^CCAGCAAGGTGACCACACATTTCAGCATGCACAGCCCCCACAGAC^ 

AACAGTCACCCCTGCCGCGTGCCATGCCTGCACCCAGGG 

CGCTCTTGTTGCCCAGGATAGAGTGCAATGGCACA 

GCCTCCCGAGTAGCTGGGATTACAGGCATGCACCACCACGCCTGACTATTTT^ 
65 GGTCAGGCTGGTCTCGAACTCCTGACCrrCAGGTGATCCACCCACCTCAGCCTCCCAA^ 
CGCCTGGCCTCAGGGGCACCTTATTACCCCATCACCTCCTACAGCAGGTGAC^ 
TATGTCACTCTCCTGTTTGGTTCTCCTGCT 

GCCCTGGAGCAGTCCTTAAGCTCCTTGAAGGCACACCAGGCCTTTGGGCCTCGTCCCCCAAGTCCTGGATAAA 
AGCCTTTGCAACTGGACCCCACTCTCCCAGCGCCAAGGCCCT 

7 0 AAGGTCCAGTGTCACAGGAGAGTGAAGACAAGGAGGTGCCCATGGCGTACATGCTGCTGGACTCACrrCTGTA 

TGGGGAGTTATCTGCCAGGTGAGGTTTTCCCX3TGGCTTTTTCCTGCAGGCTCAGGGTAGC^ 
TGGCACCATCTGTCCGGCTTCCCTGAGCCCTGACCCTGGCTCCTAGATAGCCA^ 
TGCTO^GAGGAGACCCAGTTTGACCAGGGCTAGCGTTCCTGGAACCTGAGCC^ 
TGCTCCCTCCTACGAGAGCTGAGCTGCGCATGAGATTATTCTGACCTTGGGCTCT 
7 5 TGGAGCAGGGCTCTGTGGCTCGCTTGGTGTTCCCTCCCCAGACTGAGGCCTGATATGTA 
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GGAGCTCGGAGCGGTTTGCCAGGCAGCCTCT 
AGGCCCAGGTCCAGGCTGCTCCTGTTTCGACCC 

TCTrTTTTGTGT AG CAGCTGAG ACCGTAGG'rG CTTGAGAATAGGAGAAGC CCTG TG C CCATCCTGGGCrrGTGAGATGTTTTTCCAGA 
GGTGGGGATGGGGATGGG CCAG CATGCAGTTAC CCACAC CCT CCAGCTTGTTGG CATCAAGGTCTGTGTATGTGTGTGTTTTTTTT 
5 TTTTTAATTTTGTTTTTCCAACATGGAGTCriTGCCA 

GGTTCCCAAAGTGCTGGGATTATAGGTGTGAGCCACCATGCCCAGCCACT 
TGC^GTGGCTCACGCCTGTAATCXJCAGCACT 
AACACGGTGAAATCGTGTCTCTACTAAAAATACAAAAAAATT^ 
AGGCTGAGGCAGGAGAATGGCGTGAACCTGGGAGGTGGAGCTTGCGGTGAGCCG^ 
1 0 AGAGCGAGACTGTCTCAAAAAAAAAAAAAAAAAAGAAjCAAT^ 
TATAAGCATCTCAGAAGGCAGGAAAATGTAATGAACAAC 

TGTG AGTGTGTGTGTG CGTG CAAACTG CACATGTGTGTATGAAATGG AATTAGGAT CTTTGTATACATCTTGTGTCCTG CTATTTT 
TACTTAATATT CTTCGG CATTTCTCAT CITCAAAAATGTG ATTTTAATGATGGTG CAGG CCTTTGACAT ATGGGTT T AG AAAATCTG 
C^GAAACCCGG ACTGTTT CCAGGTTTTGTCAGTTTAATGCTG ATG CAAATGTC CTCGTACGT AAAT CTTT CTG T ACTTCTGATTAT 
1 5 TT CCTTGGGAT AAATT CC CAG AAGTGGG CCAAAGCACATG AACGGTCTTCAGG CTTT TGGAG CAAGTTGAGATCGGT GGGGATTG G 

AG AAGGGACCCAGGGAGG CTTTTGGG AGCTTGCGAGATTGG AGTCTGG CTTT^ CACAGATG AAGGAGATG G CT 

GGCAAAGGTGGGAACCTCCTGCTCTATGGGTGGGGGGGTGACACTGCIT^ 

TAACTCCTTGCAGACAGAGGAGGAAGCTGTCCGGTGCTCCT 
2 0 TCTGTGCAGTCACX3TACGGGGTTCTAAAATCTGCAAAATGACAAG CTCACGAACACTCTCGCTTCTGTGGGTGAATGTGGAT ACC C 

CTTGGCTGCGTCTCCCTCCACAGGGAAGATGGAGACGTGTAAAGTGACAAGGACCT 
CTGTGAGGATGGTGGCAGGGGGCITCCCAGGAGGCCTGCACA^ 
GCAGTCTTAGGACTCTGATGGGGGAGQAGATGCTGGGGAG 

CTGGGTACAACTCACAATGTCCTCCGTGGGGAGGCAATTGCCCATGAGGCATCCCTGGC^ 

2 5 ATCCCAGCAATGAAACAAGAATCTAGAGTGATGTGATTCCTCCGATGACTTCT 

AGGGGGAAGGAGGCAGCTCTCGCTCCTCTGGGCTCCTGATGTGGCAGCTCTGGA^ 
ATTTGTCGAATGTGGTTGTGGTGGGATTGGGGCTGGGCTGGAGAGTG 
AGCCAGGGGTTTCATCTTCCCAGGGGTCTCGTTC 
AAGATTGATTCGCTCATGGTTCTGGAGGCCACAAGTTTG^ 
30 AGGACCCTTCCTTGCCTCCTCCIAGCTTGCGTTGGCCCAGGCGCT 
AGCCCTCCCCACCGTGTCTCTTTGTGTCCCAAGTCC^ 
CCCAAATCCAGGATAGTCTCATCCCGGGATCCCFAAT 

TGGGGTTAGGTATTAGACATGTCTTTTTGGGGGTC^CTAGTCAACCCTCTACA 
AAATGGGGGTGAAGGTCTAGTGCGTCCCTCCTCCTTGGGAGCTC 

3 5 TTTTTTTGAGATGAAGTCTTGCTTTGTCX3CCCAGGCTGGAGTACTGTG 

TTAAGTGATTCTTCTGCCTCAGCCTrCCAAGTAGCTGGGATTACA^ 
TAGAGACAGGGTTTTACCATGTTGGCCAGGCTGGTTTCGAACTCCTGACCT 
CCCAAAGTGCTGGGATTACAGGCATGAGCCACCACX3CCTC 
ACTCTGTC^TCCAGGCTGGAGTGTAGTGGTGTGATCATGGCT 
40 AGCCTCTCGAGTAGCTGGGATTACAGGCATAT^ 

CCACGCCTGGCTTAGTAGGTTTTCrrAGAAGCGCCTGAGGCCrGGTGA 

CCCGGGGAGCCGGAACACACCCCTCGTAGCCAATCTCAACAGTCATGACGTGGCCAGGATGACATGGGGAG 
GTGTGGGAGGCCAGCTCCCCGGGAAGGGGGCTGGAGTGGGCTTGGCTTTC 

4 5 GCTCGTTTGGGGGGATGGCGTGAGCCAGGGTG<KKZACCTCTTCATTTAGGTGTCACCTT 

CCTTGGGGAAGATGGACACCCCCAGAGTGGGGGATCTCCTGGGGAA^ 
TTCTTCTAGCACACAGTTGTATTCTCTTGCAGTTCTGGAGGCTGTGAGATCT 
CAGCGCCGTCCTGATGGTGGGGTGGCCCCTGCGGGTGGAACAG 
GTGGTGCTGGACCGAAAGACCCAGAGGGATCATAGAGCCAGGCAG 

5 0 ATG AAAAGACAC CAAG CATTGAAAG AG AGG AG AGGATG G TTG GG G AC CGGCACACATGTGCATTTTG CG CAGATG GC CAAGGTGG T 

ACGTGGGCCTGGCTGGAATAAATGATTGCATCCGCACCGTA 

GTGTCTTCGATCATGAGTTGGCCATGGTGGGGGTGACTCCAAGGGGACTCCCAAGGAGGGGGTTCACCGGA 
CATTTGTTTTCTTGGGTTGGACCTGGGACAAGGCCTT 

55 GGACACCTGGAAGCCGGCCCCTGCTCTACAGCACCCATAGGACAGA 

CAGGGAGCAGATCAG AGGACCGCGGGAGGCCCTGGGAACCGCACGAGGACCCAGGCATTGAGCTCTG CTGGCATCAG CCC 

GATGTACAGTCTGGCTCTGTGTCTACCCCGTACTTCAC^ 

TTAAGCGGGTGTAGATGTGATGAGCTTGGTAAGCCCATCTCATTACATTCAGTTCT 

AGAAGTGGAGATTCAACCTTTCCTTGGGGCCCTGGGGCTTCTG CAGAATCAGAACACAGCCAGC^TTTGCATGCAGGGTGCAGAG 
60 GGACAGAGGGACACCATGTCGGGAGCCCACACCTGGC^ 

TCAGCTAGAGGTCATGGCTGGACCCATAGGCGAATAGAGCCTGTGGAT^ 
TAGTTTTAGTTTGTGGAGACAGAGTCTCGCTCTGTCAGTCAGGCTGAGTGCAGTGAC^ 
CGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTATAGGCGAGTGTCAC 
TAGTAAAGATGGGGTTTCACCACATCAGCCAGGCTGCT 

6 5 CAGGAGATATCATATAAAGATCCAGATTTCTACT^ 

CTTAGTGACCCAGGATTTCGTCTGTGTCCACCAAGAAGGAGAGGTGCCTGTGGCT 
AGGAAGCTGGCAGCTGAGGCGGCCAGGGTTAGGGGGTTGGGCTTCCTCTTCCCCT 
AGACGGGTGAGGGGAAGGCACTGAAATTGGTCCTGGAGCGCTATTA^ 

7 0 CAGGGGAGGCAGTGGTGGGATCCCATGAGACCCGGACTCTGTGGCCTGGTGAGGC^GGGAGCGGCTGG 

CACAGGGGTGCGCTGTGGGTCTGGCCTGGGGAAGGAGCT 

ACCAGGAGCGGCTGGCTCTCATTTGGAGTGCGGTTGTGGGGACrcTGGAGGTGGCAG 
AGGGAGCAGACCCTGGTGGAGAGGCAGGATGGCTGCAGGGCGGGAGCTGGTGGTTGGCATC^ 
GGGAAGATGCAGGAGAGGCTGATGACTGAGAAAGTTGGT^ 
75 TTGGGCTCTGGTATCAGCCTGATGTACAGTCTGGCT^ 
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ATTCTTTACATCCCAGGGTTAAGCGGCTGCAGATGTGATGAGCTCX3 
CGCCCTI^AGTGTCATTGATGGGGTTTTGTTCCCGCTGCGGTTCT 
CCCGCTGTGGCTGATGGTAGAGGTGTGTTTGCTGGTGGTGTCCCAGTGTC 
GCGACAAGAAGGAAATCAAGGCCGGGCACGGTGGCGCACXaCCTC 
5 AGGTCAGGAGTCCGAGACCAGACTGACCAACATGGTGAAACCCCATCTCT 

GTGCCTATAATCXZCAGCTACTGGGGAGGCTGAGGTAGGAGAATCGCTTGAACCCG^ 

AG AG CAAGACT CCATCTCAAAAAAAAAAAGAAGGAAATAA CTGGAG 
GGGGAGGCTGTGTGGTTTGCGCCGTGGCCATGGCAAGACACGACGGGCAGCATCTTCC 
CTCATGATGGTTCTCCTAGGATGGGGCTGTCCCTTCCGCTTGCTGAGACATGAAGGACGGG 
1 0 CATCGCCTGCCTTCCTCTGCTGCTCCTTAGCAGGAAACATGCCGGAG 
TTTATTTTTCAGCCTTGAAAAGATCTTTTGAGGTCGAGGAGGTCGA 

CTACTCCG AGCCACTGTGGCCAG CTCCAC CCAGAAATTCCAGGAC CTGGG CGTG AAGAACT CAG AACC CTCGGC CCGCCATGTGG A 




1 5 GTGTTGGGCCACAAGACGCCAGAACCGGCCCCTCGGAGGACGG^ 
GGAGCCCCCTGCCTCCAAGGTCCCCGAGGTGCCCACTGCCCCT 

AGCCTGCTGAGGCGCCCACCX3CCCCCAGCCCAGCCCAGACCTTGGAGAATTCAGAGCCTC 
CTGGAGCCCAAGCCCCAGCCCCCrGTGGCTCAGGCTACACCCCGGAGCCAGGAGGGTGAGTCGCAGA^ 
GTGGTAATGGGGGGCCTGGTTGCTGGCTTTGCCACAGAATCTTGGAGGAAGAAGCTGGATATG 
2 0 ACCCAGAAAGACCGGAATGCATGGGGGTGGGGGTCTGCAAGCrrCAGGAGAGGTGGCTCTCTCT 
GCCCCAAGCGGGACTTGAATGAGAGTTTGGAAAGATTTTCTGGGTTTCAAGGAG 
GACCTCTGTCTGCTGTGTGTCCCTGGAGGGCCTGGGAGGCTTAGAGAATGAGGAGCCACGCA 
GGTGTAGGACAGGGTGGAGTTGAGACCCCTAGAAGGGCTGTCACATTCTGGTGGCTCCTGCTTGAGGAATT 
ATGCCCAGCAGAGGCCGAGGGCTTGGGGAAGGGAGGAGGGACGTATGCCTCCTGAGGCrrTGCA 

2 5 CTCCCCCAGGAGGGATGGTTGTGGCAGAGGGAGGGGAACATGAGGA 

AGGGGTG CCTGGACCAGGGGGCTGTGTGATAGCTGCACACGGTTTTATTTGGGGGTG 
GGCTCCAGCAGCGTCCTGGGAGGGCTCAAGCCCACAGCCGTGCT 

GACACAGGGCCTGTTTGGAAATGCCAGGCTGTGGGACATGGTTGGGTCCCTTCTCAGAAGCGTGCCCCCT 
GTGGCTCCATGGGGCTATAGTAAGGTGCCAGCTGGGTTAGAAGCCCCTCAAGAGCCAGGTCTGATTCT 

3 0 CAGTAATGCTCAGT AAG C CTTCTTTAG GTTGGACAAACAG CACAGT CTGGGTTTT TGATTT AG CTG CTGOTT C TT AT TT CTAACC G 

TGGTGAAATACACAACATACAATGTGCAATTTTAGCCACTT^ 

CAGCTGCCACCGCTGTCCATCTCCGGTACCCTTTTCATCCTCCCAAACTGAGOT 

TCCCCCAGCTGCTGGCAGCCACCATTCTATTTTCTX3TCTTTATTATGAATTTATTTGA 

AAATACAATAACTGCCCATTrTGTGTCTGGCTCATTTCACTCAG CATGACGTCCTAAAGGTTCATCCGTCGTG CTGTAG CATGGGG 
35 CAGAATTTCCTTCCTTTCAGAGGCTGAATAATATTCTGTTGTGCGATGCACTTGTC 
TTGGGTTGCTTCCACATCTCTCTAAGTCCCGCTTTCCATCTTCITTGAC 
CCTCCAGGCTGGCGTGCAGTGGTGTGATCACTGCTTACTGCAACCTCTGCCCCCTGGGCTCA^ 

CAAG T AC CTAGG AC CACAGGTTGCGCG CCAC CATGC CTGG C T AATTTTT AAATTTTTTTTGTAG AG ACAGGTT CT CACT AGGTTGC 
CCAGGCTGGTCTCAAACTCCTGGGTTCAAATGAGCCTCCTGCCTCGGCCTCCCAA 
40 CCCCTTTTTGGTTTTTCTTTTTGAATGAGAGT^ 

GTGGGTTTGGGGGGAGACCAGTGCTCCTCATGCTTTCATGGGCATGTGGAGTGCACATT 
TCTGCATTTCCTACAGAGGGGGCATCGGAACACACTGAGTAGCCAGAGGCT^ 

TGCGAGTCCCGTGTTCCCTCTCCCATCTOTCTAAGCTTGACTCrrGAAGAGCCACCCTGCCCCAGCCACCCAGAC 
GTTGCTGGTGGCATGGTGTCAGGCCAAAG&CATGGG 

GAGGCGGTTGTCACCGAGCCATCTAAATCTCGGTGATGGCTGGTGCTGGATGCACAGGG 

AGGGGG ATGT C CATGGGGATGT ACAAGAACATT CTCCTCCAGGG G GAGACACAGTT TAG CCCCTAAATTG TGC CAGAGACTGTG C C 
GGGAGCCAGGCCCAGGG 

50 HUMAN SEQUENCE - mRNA 

TGACTTCTCAGGGTTCCTCCTGGGCAGGTGCTCCGGAACCTTTTCTCAGCA 
CCACGTTGGGGTACAGGGTGAAGAAGGGCTGGGGCCAGCCCAGG^ 
AACCCTTAAGCCCAAGGAAATCGTAGCATCGCGGGACAGGGAAAATGAAAG 
TGGTTTCCAAGAGTCTAAGTTAAGCATCTCCAAGTGGATATTAAAAAGGAGCAGCA 

5 5 GGAGAGAGGAGGCTGCCGGAAGCCGCIACTCGGGACCTCTC 

GCAGACCCGGTGGTCTGCCGGACTCCTCGGGGCCraCTTCGGGCCCTCT 
CCTTTTTCCTCCCGTTTTGAAGAGACAATGCTACTTCAGTTT^ 

ATGCATTCGTG ATTGCAACAG ATTGAAGAAATTAG AC(ZAG ACAAAG AGTGTT^ GAGG AGGAGGAGG AGGAGG CTG AG 
AGAGGGAGGGCGACGGGGGTGAGAAAGGGGAGGCCGCCTCTGAGCGGGACGCCGGGACTCCTOCCGCTGCTA 

6 0 AATGGAGAG^GACCGGATCTCAGCCTTGAAAAGATCTTTTGAGGTCGAGGAGGTCGAGAC^ 

TCCAGACTCCCCTACTCCGAGCCACTGTGGCCAGCTCCACCCAGAAATTCCA 
CGCCATCTGGACTCCCTAAGCCAACGCTCCCCCAAGGC&TC^ 

GTCCCGGCGCACTGAG CTGTCCATTGACATCTCGTCCAAGCAGGTGGAGAACGCCGGGG CCATCGGCCCGTCCC 
AGAGGGCCGAGGTGTTGGGCCACAAGACGCCAGAACCGGCCCCTCGGAGGACGGAGATCACCATCGT^ 
65 CACCGGAGGATGGAGCCCCCTGCCTCCAAGGTC CCCGAGGTGCCC ACTG CCCCTGCCACCGACGCAG CC CCCAAGAGGGTGGAGAT 

CCAGATG C CCAAGCCTGCTGAGGCGCCCACCGCCCCCAGCC CAGC CCAG ACCTTGGAGAATTCAGAGCCTGCCCCTGTGTCTCAGC 
TGCAGAGCAGGCTGGAGCCCAAGCCCCAGCCCCCTGTGGCrGAGGCTACACCCC 
TGCX3TTGGCGACATGGCCGACACC 

CGGCTACGTGGGGATTGACTCCATCCTGGAGCAGATGCGCCGGAAGGCCATGAA 

7 0 GGC^GAGCGGCTTGGGTAAATCCACCTTAATCAACACCCTCTTCAAATCCAAAATCAGCC 

GAGCGCAT CCCCAAGAC CATCG AG ATCAAGTC CATCACGCACG AT ATTG AG GAG AAAGG CGTCCGG ATGAAG CTG ACAG TG ATTGA 
CACACCAGGGTTCXSGGGACCACATCAACAACX3AGAACTG 
TGCAGGAGGAGGTCAACATCAACCGCAAGAAGCGCATCCCGGAC^ 
CACTCCCTCAGGCCCCTGGACATCGAGTTTATGAAACGCCTGAGCAAGGTGGTC^ 
7 5 ACTCACCCTGGAGGAGAGGGTCCACTTCAAACAGCGGATCACCGCAGACCT 
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AATTTGATGAGGACTCGGAGGACCGGCTGGTGAA 

TACCAGGT<^CGGCAAGAGGATCCTTGGGAGGAAGACCAAGTGGGGTACCATC^ 

CTACCTGCGGGACCTTCTCATCAGGACG CACATG CAGAACAT CAAGG ACATCACCAGCAG CATCCACTT CGAGGCGT ACCGTGTG A 

AGCGCCTCAACGAGGGCAGCAGCGCCATGGCCAACGGCGTGGAGGAGAA^ 

CCACCCCCGGGATCCTGCCCCCAAGTCATTTCCGTCCCCCCCCCAGGCCCTCC(^ 

ATTTGTCATCGTTCCCCAGCCTTGAGATGCTTCACTTTC^ 

GTTCCTCAATGGCCTTTTGCTACGTGCCTCCCGAGAAATTTGTOT 

ATAAATGTTTCAAATGCAGAAACCCAAAAAAAAAAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGAGAGGGACCGGATCTCAGCCTTGAAAAGATCTTTTGAGGTCGAGG^ 

CCAGACTCCCCTACTCCGAGCCACn'GTGGCCAGCTCCACCCAGAAATTCCAGGACCTC 

GCCATGTGGACTCCCTAAGCCAACGCTCCCCCAAGGCGTCCCT 

TCCCGGCGCACTGAGCTGTCCATTGACATCTCGTCCAAGCAGGTGGAGAACGCCGGGGCC^TC 
GAGGGCCGAGGTGTTGGGCCACAAGACGCCAGAACCGGCCC 
ACCGGAGGATGGAGCCCCCTGCCTCCAAGGTCCCCGAGGTGC 
CAGATGCCCAAGCCTGCTGAGGCGCCCACCGCCCCCAGCCCAGCCCAGACCTTGG 

GCAGAGCAGGCTGGAGCCCAAGCCCCAGCCCCCTGTGGCTGAGGCTACACCCCGGAGCCAGGAGGCCACT^ 
GCGTTGGCGACATGGCCGACACCCCCAGAGATGCCGGGCTCAAGCAGGCGCCTGCATCACG 
GGCTACGTGGGGATTGACTCCATCCTGGAGCAGATGCGCCGGAAGGCCATGAAGCAGGGCTTCGAGTTC^ 
GCAGAGCGGCTTGGGTAAATCCIACCTTAATCAACA^ 

AG CGCAT C CCCAAGACCATCG AGATCAAGT C CAT CACGCACGAT ATTGAGG AGAAAGGCGTCCGGATG AAGC TGACAG TGATTG AC 
ACACCAGGGTTCGGGGACCACATCAACAACGAGAACTGCTGGCAGCCC^ 

GCAGGAGGAGGTCAACATCAACCGCAAGAAGCGCATCCCGG ACACCCG CGTCCACTGCTGC CTCTACTTCAT CCCCGCCACCGGC C 
ACTCCCTCAGGCCCCTGGACATCG AGTTTATG AAACGCCTGAGCAAGGTGGT CAACATCGTCCCTGTCATCGCCAAGG CGGACACA 
CTCACCCTGGAGGAGAGGGTCCT^CTTCAAACAGCGGATCACCGCAGA 

ATTTGATGAGGACTCGGAGGACCGGCTGGTGAACGAGAAGTTCCGGGAGATGATCCCATTTGCT 
ACCAGGTCAACGGCAAGAGGATCCTTGGGAGGAAGACCAAGTGGGGTACCATCGAAGl^ 

TACCTGCGGGACCTTCTCATCAGGACG CACATG CAGAACATCAAGGACATCACCAGCAG CATC CACTTCGAGGCGTACCGTGTGAA 
GCGCCTCAACGAGGGCAGCAGCGCCATGGCCAACGGCGTGGAGGAGAAGGAGCCAGAAGCCCCGGAGATGTA 
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Table 97 

MOUSE NOMENCLATURE 
ICSGNM FzdlO 
Celera tnCG51280 



HUMAN NOMENCLATURE 
HGNC FZD10 
Celera hCG21398 

MOUSE SEQUENCE - GENOMIC 
NNNNNNNNNNCCATGTATCCCITAAGGCCA 



TGAGCAACAGCCTCAACTTCCAGAC^ 

CACCAAAAATACAAATATGGCTGCCATGCTCCCCGTGGGTGGGTGGGTG^ 

GTGAGGTTGAACCCCCCCATAGGAGGATAAAGTCCCCGAGTAACTTCCCTCTTTCGC 

TCTTAGTACTTATTGCTTGAGGCCCCA^ 

CCCCAGGTGCTTACTTGACTGGCTGACTGGGGATGlA-CAGCTAGCC^CT 
TCTCCCCGGACAAGTAGGCATCTTGGCCCCCATCTCCAGTGTT 

CACTTGTCTCAGTCCCAGCAAACTCAG CTGCCTCTCTGGGCTGGAGAGGAATCTGGCATGACTCCCTCACTCTGAAGACAGCAG 

GCAGACTTCACAGAGX!TTCCCTTCCCCATCCTCTGCCCTTTGGCT 

CAATGGAGAGGCAGCTTCAAAGGTCTGCCCAATGTGTTGGCCATCTGTATGG 

CGATCTGATCTGGGGGACAGGGTAACAAAAGC!ATTGGGGGCAGAGAGGTAGAGGCATOT 

TTGCTATGCAAGCZACGAGGACCTGGGTTrAAATCCCCAGGACT 

GTGAG ATAGGAG GCAG AAGT AGAATTCCCAG AAGCT CGAGGGG CAGTGAG CTT AGCATT CACAG AG GCAAAACAAG AGAC CAC C CC 
ATTCTACCCCAAGCCGTCTCATATTAAGTGGAAGACAAAGACCTCT 

TAGCCCAACCTAC^CAGTTGATGTTTTGACCTCAGGTCCAGAGCTCACCCAGCCTTATTCTCAGCT 

CCTACTCCGAC CCAGG CATG ATTT AGCAGC CATAGATT TGG AAAG ACTGGAGC CAGAT AGAAGT ATACATCACTCCAG AG CCTG G A 

GAGCTCTGCCTTTCACTCAGAAAGTCATCACAGGCCCACCAAACCTGCCAATGAGGCCT 

TTTTTGTCAATTAGATACAAGCTAGGGTCATCTAGGAAGAGTGACACT 

TAGGGAGCGTTTTCTTCATGATTGATGTGGGAGGGCCCAGCCCTTTGTAG^ 

ACATACCTTTGGTCTCAGCACTCGGGGAGGTAGAGGCAGGCAGATCTC 

GGACAGCCAGGGCTACATAGAGAAACCCTGTCTGGGTGGGAGATGCAGGGAGCAAATTGAGCAGGGCGTGGGGTTC^ 

GCATTATTTCTTCACGGTCTCTGTTTCAATCCCTGCCT^ 
TGTTTTCTTCCCCAGGTTGCTTTTGATCGCCGTGTTTATCCC^^ 
CTCTGCGGCCTGTTGGGATTCACTCTTTCCTTGATTCGTTTCCAGTCTGATTTCT^ 
CAGCCCTTCTCAACCTGTGGGTCGTGACACCTCCAGGGGGTCCTG 

AAACAGTAGCCAAGTCACAACrGTTTTAAAGGGCCACAG CATTCGGAAGGTCGAGAACCGCTGCTCTAGACGGAGC CCTGAC CAGC 



AGGATGCAGCTACCTCCATCC^TCTCCTTTAGCCCCTTGGATCTTTCCACTTTCCCCACTCTCCCAGCAT 

TGTTCTCCATGCTGACATGCCTAGGCCCACCCTCTACCTCCTTCT 

CATCTTCCCGGCATCCTTCCTGCCCGCCTGCACCCCCCCCCCCAATCCCT^ 

GGTTACCCCGAGAATGTGACAAAGGGTGAGGCTGTGGGTCCCAAGGGTCTGCAGATAGAGGGACGAGCCAGCAGCCTTGGGTGCAA 
GACCAGGCCTTGGAGGGTATCTCTACCTTACCTTATACCACAGAGAGAAGTCTGGCTCCCATGTTGCCCT 

AGG CTGAG CACAAG CTTTCAGGAAC CATTGAATT CATTGT G AGG CTGAG ATGG AT CTG CGTCCATAAG G C CAGGAG G G GT AACAGT 
GTCAGACCCCAGCCCCAGAGGGCGCTCTGCGGTTGATTCAGTAGCCAGGAGCATCCAGAGAGCTCTTCTGGGGCCCTTG 
TCCCACTCTGCAATCCAGGGGAGATCTGC^CCCCTTTTCCTTGGCAACACTCA^ 
' CCACGGGGAAGGAGATTGGTTCCACTCCTGCATGAGGTTGTCGACGAAGTCATTCTG 
AAAGAGGGCCGAAGAAATGGCTCAGCCGTTAAGAAC^CTTGCTGCTCTTG^ 
GTGGCCCACAACCGCCTATAACTCCAGTTCCAGGAGATCTGACCACCrrTCTTCT^ 

AGGTACAGTTACAGGCAAACACTCAC^CACTCAACTAGCCAGGGAGTCTGCTGGCTAGTAAATCCCAAGTGG 
TCTAAATCCTTCCTTGTAGCrGTCTGATATTCACTCCATCCCAAGTCTTAAAAAATAAA 

TTTTAATGCAAAGGGAACAGAAAT CAGAT CAGAG AGGTT AAGT AACCAGC CTGAG ATGCCACAG CTG AGGTCATAAATAAACAT CT 

CTCCCCTTCAATCTCTGACTTTGAACAGCCCTTTTAA 

TCCACAGGGTCTGTTCCTGGAAACCCCAGCTGGCTGCTGACATTTCCTC 

CCTGAACCAGATGGGGCTTCTCAGATCACAGCTATAACGT^ 

CCTCATCTTGCTTCAGCAGTTCCTGTACAATCAAGTATCTTAGTTTGTCCCAG^ 

CACTGTCCAAGTGACTCACGTTGGCTTTTAAATGATCGATTTCTGGG 

CCAACAAGAAGCAACTATGTAAGGGCTTATTTCGGCTCACGGTTGAAGGGCAC^ 

TTTTT^CTTCAGTCCTATGGATGGTGCTGCTCACAGCT 

AACAGCAGCGGCCTGAGGAGCGTCTCCTACATGGTCCTGGGGCCTAGAGAGTTAACAAT^ 

TATATTTACTGTGCTTCACAAACTGTCATCT^ 

AAACCAAAGCCCAGAATCCGGAATAGCTCAGCCCTAGTCAGGTGA^ 

TGTAAATACATGCTGTGGTGACCTTCGCCCCATAAAATTATTTTGTTGCT 

TAACATCAATATCTTCTATGTGGGATAGCTGATACTCAACCTCTAGGAAAGGACC 

CACAAGTTGAGAACCAATGCTCTAGAAAGTGGTAGAGGGATTTTCAGAC^ 

TTGTGTCCAAGTTCCCCCCCCCCCTCCCCaSTTTACTGTGTCAGGGAGGATGACGGACC^GTTT 

CTGAAGAAATG AAT AAAG CTGAAAG AAGCAGCGGT AG CTGG GGTG GTGGACAAAGAAAGG CACTGGC CT AGCAGCTG AGG G GCAGA 

GGCCTCAGTCGCATGAGCCCATGGAATAACTOTAGAAGTGGCTGTTAATT^ 

ATGCAAAACAGAATCACGTTTGCCTGCn?TCATTTTGTGAGGTAAT 

TTCGTTTATTTTATTTTTTTAAATTAAACTCTATATACTAGGGTTTAAATGCTGCTCACATGGGGGGTTCCTG 

GGGCACACGTGGGTCTGCCCGTGTTTTAAACAGAGGTATCT 

a_Aj^ATTATTCCCCACTGCCCTCCCCA^ 
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TTAAACAGGGAQGAGCGATO^CAGAAGCCTGGGCTGTAAAAAGG CTAATGCG CTGTGGAAGAGT AG GAGTTT AATCTAATTT ACT 
GAAGCAGTGGGGAGCCGCGCGACGCCAGCGATCC^^ 

GAAAG CGCCCCGCGAAC CT CTGGTAATATTTCAATCCACATTT CAAGGGAAGTGGC CTTAAAAT AGCGGCCTAAATTGTTCTG CCG 

5 CGGCCOTGACACTCCTTCATGCATTTTATAACGCAGAGGGCCAAGG 
ATCTTACATGTGCTGAAGTGT AATTAAAACTTGTC 
TATGCITGGAGCATATGTTTGTGTGTCTTAACrAAACATCACT 

TTGTG AGGCAGTCGTGGAACCAAGC CTAGTGGGTTGG AAAACTGAATTCT CAATGTGTG CCTTTT AAAGAGACCATTAC CAGT AAG 
CAGACCACTTTCCTTACTGTAAGGAACAGTTT^ 
1 0 AGTT CTAGAAATGGCCTTGAAG CTACCAAGACAAACGTTAAGGAG CCAG CTCCCTGGTT CATCTT CAGGCTGGGAGACAGGATGTT 

<^TX3TCTTTCTATAATCTCTGCCCATACCC 
CTTTATTGTAGGAGACX3AAGTACTTTTT 

TCAAGACTTTATGGTTCCTTCACCCATACTGAGGAAGGCTGCAGC^ 
TTTTTTTTTAACTTCTGACTTTACCAATTAAACW 

GACAGTATTTTGAAATCT(^CGTAGATAGATATATTA« 

CATGTTT AATTATCAAG TT TCATT AAAAACC CTTG CT CAAATTTTGAGAAAC CT AAT AAG ATTTCCATT AATTT AAACATAGCTCT 
ATTCCCAGGCCAGTCCCCCCCACCCCACCCCAAAGCTTTCTCTTCCCTC^ 
AGTCGTTCTTCAAAGAAGACAAAAGAAAAAAAAGAGCCGAGGGGAGAGAGACAGC 
2 0 TGCCAGAGGGGGGCCGGTTTCCGAGCCAGGGCT 

AGGAGAGAGAAAAATAAAATATCrTTTCAGCTCGAGCCA 

CTCGGGGCCCTTATTTTTAGATACTAATGTCTATTTGTTGCTTTC CTTT CTTTTT CTATACT AGGTTGGGCAG ATGGAAGAAATT C 
TGTTAGCTTTTTCTTCTCTTAATGTATTTTAAGGGAGTCCCGGGGGGTTGGGGGAGA 

AAAAT AAAAGGT CGGAACACTGT AGAAGT AAC CAG AG CAT AAAAG AC CAATT ACGTGGGG CAGTCTCAAATAGCT CACACCCTGCT 

2 5 CCGGAGCTGAGCTGACCCAGGCTACTTCCACGCCCCCCACGC^ 

AGAGAGAGAAAACAGAAGGCAAAAGTGTCAGGAAGGACCCCTTGCAAACTAGTCTGTCT 
TACCTGACTTAAAAGGTTTTGGAAACAGCGTACTAGACTATAAA 
ACACOSCAGGTTTTrCTTGGGTC^CTAACACGGTGGTGGGTGGTGA 
CACGGATCCAACTTCCTCCCGTGAAAGGCAGAGGGGACCAGGGGATCTAAAGGTGA 

3 0 CCACCCCCAGGACTCAGCAGGTCCTGGGAGTGTCCCAGCCTCAGGTCTC 

AAGAACTCCCTGCAGATGGGACGCTGAAGCCTTGTTTTAATTAGAAGTCGAT 
GGGGTCCTGGGTAGCCTCGTTCCTGGCT/TGGCTGGAGCCACT 
GCAAGACTGACTGGCCGAAAGATGAGCAGGGGACTrAGCTGCACCTATGCAGTTGG 
CCCR.GACACTTAAAACGCAAACCIX3AATGGGGGTGGGGTGGGGTCTAAGG 
35 AAGCCAAGAGTCCGCX3ACAGGTGCCGCACCCC 

TGTGCGGGGGAAGGAGGTTGAAGCTCTGTGTCCTTATCACAGATTAGCCCAAAACCAAC 
CACGGAGAGGCCCCACCACTCrrGCGTGGGAACCAGCCGTGCGCGTGTAG 

C CG AG CACAT CTTT CGC CAAAT AATATAAAT ACACAACTT CC T AT AAT AAAT AAACT CAG CC GC TTGG AGAAAC CAC CAG CCGGCT 

4 0 GTCAGCGGCCCGGAAAGTCTCCTCCCCXjCCCCCTCCCT 

CAAGTCCCCTCCCCCTCCCCTGCACTCGGCCCTTTTGAAATTTCAAAGGCAGCCCCTCCCCGGAGT^ 
AGCAAACCCCCGGCTTCGGGAAGATGCX3GGC!AGCITTA 
CCGTGTCTCTTTGAAAGTGCCCTCCXSCCAGGGCCCGGTCGTAAAAGT^ 
ACCCTTACCCGAGTCCCCAANNNNNNNNNNNNNNNNNNNNNNl^ 
45 NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNKNNNNNNGGCGGAGGCTGCCCGGGGCGO 
GGGCCTGTCACCAATCGGCTTTGCTTTTTTTTTTTTO 
GACGCCGTGCTGGAAGCAGCTGCGGGCXiCCGGGAGGGAGGAGCT 
50 GTCCGCGATCGCCCGCGCCCGGAAGAAGAGGCCAAAGTTTGCGCGCTGAGCGGCGCTCGCTCCGCCGTA 
TCCCCTCCCCCGCCGCCCCCCGACTCCGCTGAGGCCGCGCGCCC^CC^^ 

GATGGGCATGCGCGGGGGCCCGAGCCGGGGCX3CGGAGCCGCGGGCAGCAGCAACAGCCCGTGCCCX5 
CGCTGGGACATTGAGGGCGCCACCATGCAACACCCGGGCCCX3CGCCTG 
CATCAGCTCCATGGACTTAGAGCGCCCTGGAGACGGCAAGTGCCAGCCG^ 
55 CCACCCXJCATGCCCAACCTGATGGGTCACGAGAACCAGCGCGAGGCGG 

GGCTGCC^CAGCCACCTTCGCTTCTTCCTGTGTTCGCTGTACGCGCCCATGTGC^ 
CCGGGTCATGTGCGAGCAGGCCCGGCTCAAGTGCTCGCCGATCATGGAGCAGTTaiA^ 

GCAAGCTCCCCAACAAGAACGACCCCAACTACCTGTG CATGGAGGCACCCAACAACGG CTCGG ATGAGCCCAGCCGGGGCTCTGGC 
ATGTTTCCTCCGCTCITCAGGCCCCAGAGGCCCCACAGCGCX3CAGGAGCACCCACTAAAGG 

6 0 TGACAACCCAGGCAAGTTCCTACCATGTGGAGAAGAGCGAATCTTGCGCACC^ 

CTCATCGACCCATCGCGCTTCAGGTACCCCGAACGTCCTATC^ 

CATCCGCCTCTTCGCGGGCGCGGAGAGCATCGCTTGTGACCGGGAC^ 

CGGGCTGTACCTTAGTCTTCTTGGTACITTACTACTTC^ 

6 5 CTGG CTGCTGGAAAGAAGTGGGG CCATGAGGCCATTGAAGCCAACAGCAGCTACTTTCAC CTGG CAGCTTGGGCCATCCCGGCTGT 

GAAGACTATCTTGATCTrGGTGATCCGCAGGGTGGCAjGGGGATGAGOT 
CTCTGAOCGGCTTCQTGCTGKjTCCOSCTQGCTTGCT^ 

CACATCCGGAGGGTGATGAAAACGGGTGGGGAGAAC!ACGGACAAGCTGGAGAAGCTCATGGTACG 
CTACACTGTGCCGGCCACCTGTGTGATTGCCTGCTATTTTTATGAACGCCT 

7 0 ACAAGTGTAAGATGAACAATCAGACCAAGACACCTGACTGCCT 

GTGTCCATGCTGCTGGTGGT<5GGCATCACCAGTGGGGTGTGGGTOT 
CCGGGGGCTTAAAGAGAAAAAGCCGGAGGAAACCAGCCAGTGTGGTCACC^ 
AACCTCACCTTGGGAAGTATCAACTGCCCGCCCAGCOT 
AGCCCAAATGGGCAGCTTTTCTTGGTTGGCTGGTTGTTGGTAAATCAAAG 
7 5 AT GCATGTGCAT AT ATATGCATATATTTACACATACATGAAGT CT AC CTTGAGGTT CAG AACAG CTGAATGTAAAG^GTTCTG TTC 
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AGTTTGTTGCCTCCCTGAAGGGTCGCTCATTAAGGAGGG^ 

CTCCAGAAAACTTTTGTTTAGGCCTCAAGCACACAGCTGTC 

GCTCCTTTCTAAGCTAAAGJM3CTTTCCT 

CAG CTAGG CTGCTGTATGGGTG GGAGGGAAGGTGAGGAGGAG AGGGGATAAAAG AG GTGACACGCCCCCCAGCCCAAGGTCTGCCT 
5 GCAGATCC CTTAAAAGAGAGTCACTTCCCCAG CCTT CATAGC CATGTTTCTC CCAACACAGAGC C CACG CT ACCCT CTCACCCCAC 

TGCAGATCT AACACTGGAGAATTCAAATGGTGTGC^ CAAAACA 
AATGAGAGAGAGAAAAAGTATTTTGTTGTAAATAAAAGGCAAGAAAAGA^ 

GGGTTGGGGGTTGGGGGGGCGCCTTCAG CCTGATACATTTTGTGGCTTTTTAATGGAAACCAAG CCAGTGTGTTCTACACACTGGG 
ACTGATTTGTGGAGAGGAAGGGGGAGGGGGGGAAG C CAG TGGAGAG CAC CAGAGGG CTTATTGAXTT CTCTGAATTGTTAAACAAAT 
1 0 GATTTCCATGAGTGGTCTTGAAGCACTTGGGAAGACAAACTT^ 

ATATAGGATATTTTTCATGCTCCACTATTTTATTAAAAATAAAAGACATTC^ 
GGAGCGTTCTCCATTTCTTAGCAGATTCTTGGGTGTGCX1AGGCTG 

AAAAGATCCTGGGGCTCAGAGGCCTGTGTGCTGGCTGTGGGAAAAGGTGGCGGTGACTCTGCCCCC 
AATGCCX1AGACCACACTCACCCAGGATGCAGCAGTCTTTACCTGACCTACTGTGTC 
15 CTCTGGGGCCATGGGATACATTTCAGAGCTGGATAGGTGAAAGTGGGACTAAGTOT 
TCAGAAGCTTGGCCACCTTGAAGTTTCTGGTACC 

CACT AGG CTTGACTAAGCT AAAGTT CT AG CTAG AAGG AC CCAAGG ACACAGTTCAG CTCTGAGAAATTGGG GG GGAGG GGTCAGTG 
TAGAC CT ACCCAACT CTGACCTT CATAGCTCAGAAGCACHTT AAACT ATTTG AGC C CTT CTGAAG CAG AGAT ACCAG CTTG AT CGT 
GG CTTTC TGAAGCAAGTATTGGG GAAGTTTCACAGGG C CATGGGTCCTTG AGCAGG AAGGAC AGGTG AATTGGGTAGT CTGGGG AA 
2 0 CAAGTTCTAGAAATCAATG CTGGGCAGC CGGG C CAT CGGTG AGCAT AAAACCCAGAACC CAG AATCATG GAAATAAAC CTCTAT AT 

TGACTATTGTTGTCCTTACCACCAGAAGAGACACTAAGAGTGA^^ 

AGAGAGAG AGAGAGAG AGAGAGAGAGAGG ATCACTGG AGTGG CAACTGGCCAGACCTAGGCTCAT CTG C C CC CCTG AGCTG TAGTG 
GTTAGTTGTATAGGTTTTGAAAGTTCATACAGAGCAATC 
GACGGTGGCCCCATCCCCAACAGTGACAGGGATCCTGTTTACAAATATCAAGA 
2 5 TGTGCTCCTTGGGCCATAGTGAGAACAGCCTG 

CTG CATCAGTCTGTCT CTTG CAGTTGAACT CTAGTT CTAAACCAGATGGT AG ATTTAAAGAAAAACAT CAAAG CTTAAAAG TT AGA 
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AGCTCTGTCCAGACCTCX3AAAGGGAACTGCTGTTTCCT 

GAGAGACAGGAGG ATTTCIXiAGTTCGAG GCCAG CC TGGT CTACAAAGTG ACTTCCAGGACAGC CAGGGCTAT ACAGAG AAA 
TCTCAAAAAACAAAACAAAACGAAACAAAACAAACAAACAAAA 

ACCAACQ^AACAAACAAAAAAAGG CCCGGAAAAGGTGAATGTCAAGGTCTCAAAGCACT'CAGTT^CTTGAT CTC CTACCACCAGCTG 
TTGCTCACATGTTAAGAGAGCCCG ACTTAGGG AAAC CATAGCCTCATAGAGG AGACAGCGG CTCACGTCT CCTG AGTTCCAAATAT 
TGGGGAGATATCCTGAGCCAGGGGTCCCCCCTGTGGTTGGGAGACCCAGCACAAGTGCCTGCAAGAA^ 

ATG CG AAGGG CAAAGGTAGG AAAG AG AACACCC CTT TGAG AAAG TGATAGG CAGGGT CTAGCTGGTG C CCCTT ATAT ACCC CACAC 

CAAGCACAGTGTCTTGTCCAGTCAGTACACAACACAGGGAAAGAATGG^ 

GCCTCTCrrGGACCAGTGGTTCAGGAAGTGATCTTTTGAGTAGATGGCGTCGG 

ATGTCCAGCCCTCTGACGAGGAAATTCAAGGTTCAAAGOT 

TCTCTCACTAAACTAGTCACCCTGGCTAGACTGGCCAGTAAACTCT 

GAATACACTCTTACACTCCAGTGTTGACAGGGAACTAGG 

ATGAGATGAGCACTAAGCCCAGAACAGTCCAGTGCCACAATGGAGAAACC^ 

GAGTCAAACATGCCTCGCTAATGGTCrCCAAAGAGAATG^ 

GATT CCAGAGAATAC CCTCTGATCTTGCAAGTGTAAAAGGCGGCGCAGTGGCTTCG CCCCGGTGAGGGAGACCCTAGCTCTGTGTC 
TTTGAACIACCCTCACATCACAGTCACCTCCCAGGGACAAGGGCTCTG^ 

AACAGCTGGAAGATGAGAAACGCAGACAGGCTGCAGGGGAGCCCCGGGATGACACGGGGOTGGGGGT^ 

TGCGGAAGGCCAAGCTGGGGGGAGGAGGGACTGAGGCGTTTGAAATCCAAAAAGAATGAG 

AATTGGGGACACCCCTAGGAAAATGACACGGTTGGAGC^CTGGCAGGTCCT 

CTTGCCTTTTATTAGAAGACATTTTTAATTAAAAAATGGGGATTCCC!^ 

CAAAGCCATTCTTTGCCTGGCACCTGCCTCCGTCTAGAAGGTCGTGGCT 

CTAGACTATATCTTAAAGGCTATTCTITACCTGTTTTAAGGA 

AAGTTGAGGTAGGTCAGTTGCCAGCTCAAAGTCAGCCTGGGCT^ 

TGGTATCCCG^GAACCTCTGCATTTGCCCCATCCCTACCCTCAGAGTCCC^ 

ACTGGCAACCTGCCTGCCCAGCCACCCTTAGATCCCCATACTC 

TTTCTTTGTTTGGCCTGGAGCAATGAGTCTATCATTTVGATTTTT^ 

GCTCTTCCATGTTTAGCCTTAGCCC^TCTCCCAGCTAAGCAGAGGACACTA 

TG AAGTGTT TG CT CAGTAGGTATG AGGAGAG GACTTG AGTT CAGAT CCCAGGT ACAG CCC CAAGTGTGG CTGTG CCTGTCTG T AAT 
CCTGGTTCTGGGAGGTGGAGACAGGAAGATCCTGGGGTTCXSCrGAGCAGCC^ 

GAGACTCTATCTCAAAAAGAAAG C CAGATGAAG ATG GAATTG AGGAAT ATG CT CTGTCAAC CT CTGG CTT C CATGTG CACTCAT AC 
CAGCATGAGCATCACATACATACACAAGTGTACAGACAC^ 

CTG CCTAT AT CTT AGG TG CT AAGC CAAC GC C CAAGT AAG ATAC C CAT ATAG CTT AATGAAATCACATCTTTCCATCTGGGGTATGA 
AGTGTGTCCITTTGGAACACAAGAGAGAGGCTGTGCACCTGGGAGAAAACAGG 

AAAATGTCCTGACATAAGCAATATAAAAGAGATGGCTTTATTTGGCTCACAATGCCAGGTTACTGTCCATCATTGCAGAG 



CATTCTCAAG GGAGTAGGGACAAT AGACAT TTGGGCAACTT TGGGT AACTTTAG AAC CTG CTGGG AT C CGATCTATCTATCTAT CT 

ATCGATCACTTCCATTTGATGGTAGATTGCTGGTGTGCATATGCT 

CTCAGGATAGGGAGTTCAGGTTGCATGGTAACCCTGXS^ 

AGAGCTGTCTCTCAAAGGGAGAATAGTTGTCTGCAGATGATGGCAAATTCT 

GCTTGCCACAGGCTCCAAGCAGCACCTCTGACTGCCACTGCCACCT 



75 



GGCCCAAG CG ACACACC CAGGTG AG G AATGTGCTGTCT CCAG AATCCAAAT AG AC CCACTAGG CAT TGTGC'i'TC'rrT CTTGATGG T 

GGGAGGGACCAGGTACAACAACTTATCCCTCACTTTAGAAGGAATATCTCTGCACG 

CTGAGGTAGAAGGCCCTTGCATTTTGGTTGGACTTAATTCCCA^ 

ACCT CCAGCTT ACTTGGTCCAAT CAG CAT AATGCT AT CCAATAT AGTG GATCAG TGTGATATTGGT AGAAGAGGCAGGTGGT CAGA 
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CCTrCTTCAAACTAAGTTATGACACAGGG AGTGTTACCAT ACTTTT AAGG CAA CA 
G CTG AAAG CAAACTG CTTTTGGTGGT CC^TGGGCTG AGGGAAAAAAAAAGGC^ 

GGAGTTGTGTTAATCTGCTCAACTAAGGACACCACAT CTGGT ACAG CAGC TG CAATCAGAGTCACT ATTTGATTAGTTCTGTGATA 
GCCAACTGTCIATTCTCCACGAGCCATCTGTCTTCT^ 
5 CTGTATCTTTCAAGT CCTTGATGGTGACATTGATTT CTGTGATCC CTAT AGGA CTGAGACCTGCTGAG GTGCTTGCAGTGG 

GTGGAGCAACTATGGAATGGGTGGTAGAGAAAAGTAGTCATAGCTACCAGC 
GTAATTGGCCTAAGCGCTTCTGTCTTATTTT^ 
ATTAAGCCAAACTCCATGTTTTCTTTGCGTTTACTTATTTATTC 

GTGTAAATCTACATTATCATATTTAAGTGATAAGACACTAAAAGATGAAG CATTCAAGACACTTTG CCTCCTATGCTGAGGGGTAC 
1 0 ACCTGCAGTTAG AGGTGGAGCAG CTATAT CATG CTAT ACAGAATTG TG AT CCTGTT ATCACTTTCATTTGG AAAGTT AGCAAGGTC 
AAAGGAGATGCGATTGGATGGCAAGAGGGGTCAACTTATCTTAGTTGTCAAC^ 
CATTGGACATTCCTGGGAGGAATTTTCTTTGATTAG 
ACTGAGTGCGGACAACACGGTTGG(^TTTCGATGTT<3TC 

TGGTGAGGCAGTGGCrCTCCAGGAATTCTACATGCCTTCAGTGCCTGATTGACAGCTCGCCCC^ 
1 5 TG CATGG CC CAGATG AT AT CGTGT AAGTTAAC C TAAT AAACTCCTCTTTAACACACACACACACACACACAC^ 

ACACTCATTCrrATTAGTTTCATTCCTCTAGGAAAAGACAGATGAGAGGAAG 

CGTGG CTG C TG CAAT CATTTTGTGGCCATGAGG CAAT AGG C C CAATATTAAAGCCAAAAACTCTGACAGGTACTAAT CAAG AAG CA 
GCCAGCL"l T (J l * r ATGACrTC^ 

TTTTTTGAATGTCAGCACGTTGGGTGTCCrcCTCTITAGAA 
2 0 TCCTGGATACAAATCAACCAGACC^GAACTCTTGCTGGAGTTCAGGT 

TTT AG AATTCTTGAAGCCTG CAGGTGCn*TCATT CCCTGTACAGCGAATG GTCACTG ATCAT C CAAGCAGGAGCCTGATG AGTGT AA 

AGAAAATATAAGAACAAACTTCTTTTTAAAAATTAATTGATTCATTAAT^ 

GCTGTCCTGGAACTCTATCTGTAAACCAGGCTGGCTTO 

CC CAGTT CAAGATCAAATTTCTTTTTTAGATT TATTT ATTTATTTT ATGT ATGGGAGT AC C CTG AAG CTG TCTT CAG ACACACCAG 

2 5 AAGAAGGCATCAGATCCCATTACAGATGGTTGTGAGCCACCATGT 

AGTGCTTCTTJUICCGCCGAGCCATCTCTCCAGCCACAGCAGCAAAGAATCCTGGAAG 

CACGACGCTCTGGGTTCCATCCCCAGGACCACAC^ 

GATCACAGGTTCTGTGTCAATCACAGGTACATAGAGAGTACAAGGCCAG 

GAGG AAG AAAG AT AGGGATG CTGAGTT ACAGCTAGT CTGGC CC CTCTGGAG CAAAGAGAATAT AGTTC^ 

3 0 CAACCCTCTGTGTTGTTCTGTTACTGGACAGTTTCTTAATTGTT^ 

CTTTTTTTTTTTTTITTAAGCAACCT 

GGGGTGAAAACTTCCCAGAGAGTGTCAACTTGTTGACTCTATTGCT 
GATGGGGAAAGAAX3GATGAGAAGCTGGAGGTGGGGAAGGGCTGATTAGAAGGAGCTC 
GGGAGTCATGTGGGCGTGGATCTGCCTTGTGGGTGCTTTGGCATATGAAGGACACAGC 
3 5 " GTGGGGGAGGGTTTCATACAGTGGTCGCTGCCCAGATGCTCCCGAGAAT 

CCTCTGACCTCTGCCTCCTGCCGCTTCCTGAATTCrrAAAGTCA.TCAGGACT 
. CGAGAAGTGT CT AGT AGGAC CTTTGC CTGATG CATG AAAGTATT ATC AGTCACACAAAAT C CAGAACTCAAGAAAGACTT C CTT G A 
' TATTT AATAGATTTAATATTCTCTTCTTCGTCCCAGTTTTAGACTGTCITGAGCT 
CACTTTCAACCTGCGTCCTCATGGCTATTTCCTGTGAGTTCTGAGTCTATC 



40 



MOUSE SEQUENCE - mRNA 



CCCTGGAGACGGCAAGTGCCAGCCGGTGGAGATTCCCATGTGCAAGGACATCGGCTACAAC^ 
GTCACGAGAACCAGCGCGAGGCGGCCATCCAACTGCACGAGTTCGCGCC 
45 TTCCTGTGTTCGCTGTACGCGCCCATGTGCACCGAGCAGGTCTCC^ 
GCTCAAGTGCTCGCCGATCATGGAGCAGTTCAAATTCAGGTGGC^ 

CCAACTACCTGTGCATGGAGGCACCCAACAACGGCTCGGATGAGCCCAGCCGGGGCTCTGGCATGTTTCCTCCG 
CAGAGGCCCCACAGCGCGCAGGAGCACCCACTAAAGGACGGGGGTCCGGGGCGCGCAGG 
TGTGGAGAAGAGCGAATCTTGCGCACCGCTTTGGACTCCGGGGGTGGATGTGTATTGGAGCC^ 
50 TCTGGCTGGCCATCTGGTCCGTGCTGTGCTTCTTCTCCAGCGCCTTCACCGTGCTC^ 

GAGCATCX^TTGTGACCGGGACAGTGGGCAGCTGTATGTTATCCAGGAAGGACTGGAGAGCACGGGCTGTAC 

CATGAGGCGATTGAAGCCAACAGCAGCTACTTTCACCTGGCAGCT^ 

5 5 GCGCAGGGTGGCAGGGGATGAGCTCACTGGTGTGTGTTATGTGGGCAGC^ 

CGCTGGCTTGCTACCTAGTCATCGGCACrTCCTTCATCCrGTCCGGCTTTGTGGCTT^ 
GGTGGGGAGAAC!ACGGACAAGCTGGAGAAGCTCATGGTACGCATAGGGGTCTTC^ 

GATTGCCTGCTATTTTTATGAACGCCTCAACATGGACTACTGGAAGATGCTGGCCACCCAGCACAAGT^ 
CCAAGACACCTGACTGCCTGATGACCACCTCCATCCCTGCCXSTGGAGGTCT^ 

6 0 ATCACCAGTGGGGTGTGGGTCTGGACTTCCAAGACCCTGCAGTCCTGGCAACACGTATGCAGCraGGGGCT 

GAGGAAACCAGCCAGTGTGGTCACCAGTGCAGGGATCTACAAAAAAGCCCAGCACC^ 

TGCCCGCCCAGCC TTCAGCCTG CGTGTGAAGGCAGAGCTGGCTGCAAGAACAGGACATTCX5GAGCCCAAATGGG CAGCTTTTCTTG 
GTTGGCTGGTTGTTGGTAAATCAAAGTAAAAAGAATATGTAT 

ATTTACACATACATGAAGTCTACCTTGAGGTTCAGAACAGCTGAATGTAAAGGGTTCTGTT 

6 5 GCTCATTAAGGAGGGAGCCTCTTGTGTAACTAATTTGTGGTAAAGTAGTTGATC 

TCAAGCACACAGCTGTGTATTCAGAAGGCTTTGCT 

TCCTGGGGGGGGGGGGCAGCCAGTGAGCCACCCAAGGGCCTGCTCTCCCTCCCTCACTCCCCCAGCTAGGCTG 

AGGGAAGGTGAGGAGGAGAGGGGATAAAAGAGGTGACACCCCCCCCAGCCCAAGGTCTGCCTGCAGATCCCT 

TTCCCCAGCCTTCATAGCCATGTTTCTCCCAACACAGAGCCCACGCrACCCT 

7 0 AAATGGTGTGCAATACTTTTTTTTTTTTTTTTTTACATTTTCCCCCT 

GTTXSTAAATAAAAGGCAAGAAAAGAAATCATCTA^ 
TCAGCCTGATACATTTTGTGGCTTTTTAATGGAAACCAAGCCAGTG 

AGGGGGGGAAG CCAGTGG AGAG CAC CAGAGGGC TTATTGACT CTCTGAAT TGIT AAACAAATGATTT C 

ACTTGGGAAGACAAACTTTGTCTCCAG C CAGGGAG GG CCATG CCTG CCTCTGTAT ATCTGTAATAT AGGATATTTTTCATGCTC CA 
75 CTATTTTAT TAAAAATAAAAG ACATTCTT TAGT 
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MOUSE SEQUENCE - CODING 

ATGCAACACCCGGGCCCGCGCCTGTGGCTGGTGCTGCAGGTGATGATAGGCTCGTGCAC^ 

CCCTGGAGACGGCAAGTG CCAG CCGGTGGAGATTCCCATGTG CAAGGACATCGGCTACAACACCAC CCG CATGCCCAACCTGATGG 
5 QTCACGAGAACCAGCGCGAGGCXKSCCATCCAACTGCAC^ 

TTCCTGTGTTCGCTGTACGCGCCCATGTGCACCGAGCAGGTCTCCACACCCATCCCT 

GCTCAAGTGCTCX3CCGATCATGGAGCAGTTCAAATTCAGGTGGCCGGACTC 

CCAACTACCTGTGCATGGAGGCACCCAR.CAACGGCTCGGATGAGCCCAGCCGGGGCTCT 

CAGAGGCCCCACAGCGCGCAGGAGCACCCACTAAAGGACGGGGGTCC^ 
1 0 TGTGGAGAAGAGCGAATCTTGCGCACCGCTTTGCACTCCC^ 

TCTGGCTGGCCATCTGGTCCGTGCrcTGCTTCTTCTCCAGCGCCTTCACCGTC 

TACCCCGAACGTCCTATCATCTTCCTCTCCATGTGCTACTGC^ 

GAGCATCGCTTGTGACCGGGACAGTGGGO\GC!TGTATGTTATCC^ 

TACTTTACTACTTCGGCATGGCCAGCTCTTTATGGTGGGTGGTCCTCACCCTCA 
15 CATGAGGCCATTGAAG CCAACAGCAG CT ACTTTCACCTGGCAGCTTGGGCCATCCCGG CTGTGAAGACTATCTTGATCTTGGTGAT 

GCGCAGGGTGGC^GGGGATGAGCTCACTGGTGTGTGTTATGTGGGCAGCATGGATGTCAATGCT^ 

CGCTGGCTTGCTACCTAGTCATCGGCACTTCCTC 

GATTG CCTGCTATTTTT ATGAACG C CTCAACATGGACTT ACTGG AAGATG CTGGCCACC CAGCACAAGTGTAAG ATGAACAATCAG A 
ATCACCAGTGGGGTGTGGGTCTGGACTTCCAAGACCCTGCAGTCCTGGCAACAC 

GAGG AAAC CAG CCAGTGTGGT CAC CAGTG CAGGGAT CT ACAAAAAAGC CCAGCACC CCCAAAAACCTCACCTTGGGAAGTATGAAC 
TGCCCGCCCAGCCTTCAGCCTGCGTGTGA 

25 HUMAN SEQUENCE - GENOMIC 

AGTCTCGGCGTGGCCTTCTCTCA.TATCTTACATCT 

CGGATAATTCAGGGTCACGTCCTCTCAAGATCCTTAACATGATTACTTCTGCAGAGACGTAATC 
CACAGCCACAGGTTCCCAGGATCAGGACGTGAACIAGGTCTTTO 

TCTCAGAAACAGTGGCTCTGAGGACAGCAGCTGTTAGGATTGTCAGTTCTCTGCCTGTCTCT 
3 0 CTTCATTTACTCCACAAACCCTAGG ACTC CCTAAAC CTAGCTGGGGT CTCTC CT GCAT AGAATTTCAGAATG CTT AAAAAT AAAGC 

TGTACAAACGTGTAGATATGAAAAGTCGTGGAGTGGGTGCCAGCTTTTTGCC^ 

AGGGAAAACTGTAGGATGGAGTAGAAGAATGAGTTTTCTGGCAGGGTGCAGTGGCTCACGCCT^ 

TGAGGGAGGGGCGGATC^CAAGGTCAGGAGATCGAGACCATCCTAGCCAACATGGTGAAATCCTGTCTCT 

AAAAAAAAAAAAATTAGCTGGGCTTGATGGCACACATCTGTA 
3 5 G CAG AGGTTG CAGTG AG CTGAGACCG CAC CACTG CACT CCAG CCTGGTG ACAGGGCAAG ACTCTGT CTC^ 

AAAAAAAAAAGAAGAAGAAG AAG AATGAGTTTTCCATCAAAATGG ACAG CC T^ GGT C CATG GTCAAAATGGACAGC CGGC C CAGG C 

TTGCCTGGTGGCCCTCTAAGCAGAATCCAATCCCTGCTGACCGTGGCCT 

TCTGAACCACTGTGGCTGTCTCCTAACTGTTTCTCTGCTTCACTCTTTCCCTGCCACTATGTGTCCTTGAT 
GTCCTGGGAAGACTGAAGTCAGAGCATGGCTCTCGTCTCTCCA^ 
40 AGGCCCATAGACGCCATCATGGCCCGATTTCTITCCCTCCCACTTCCTCrC 
GCCCCTATGTCCTGACTTCCAGCCATTGGCCCCATTGTTCT^ 
GGCAGACACCCCATTATCCCAGCAGGCACTTCTAGAAGTGCCCTCAG 
CCTTCTCATCCAGGCCTTCCCTGCCCATCTTGCTAAA^ 

GCCCCGCACGCCACGCACCGTGGAACAGGCTGTAGCACGCACATGTGCAGACCCCGGCCTATCGCCT 
45 CTCCATGAGGGTCAAGACATCTGTCCTTCTATCCCCTGCT 

TTAGAAGGGAG GGAGCAGAACAACTATT CTGGGACTG ACT CAGATGAGTA 

AGATCATCGACGGGCIAGAAAAAGTACCACAAACAAAATAAGAGTAAGAOT 

TAGGAGGCAGCCTAGTGGACGCGTCACCCAGGAGCAGACATCGTTCAGCC^ 

GCTCTGAAAGCTAATGACrrGTGCCTGCGTGTGTGCGCZATGACATGCGTGAGCTGTAAGGGC^ 
50 GTGTGCGTGTATATATATGTCTGTGTGTATATATATGTGTGTATGTATATGTCTGTATAATAATATC 

ATATGTATATATGTGTATTATATATACACACACACATATATATGTATACATATATAATCTATATCT 

CCAGCAGATGGGCAGACATTTGTCAGACATGGAGAGAGTGTCTTTATC^ 

GCCCTTCCCAATCTTACCTTACGAAACAAGGGTGACAGGCATTCAGGGAT^ 

5 5 GAGAAAAGCGTAATTGTGCIATTTAAACrGTTGTTTGGGATGAGGGCAGCAGAGCACACAG 

T ATCACATG AGG CTGGAG C CTGAACTATCTCTGGGAGGCTG C^GAAAGT GGG AAGGGAGTTTC^G ATC CATG CTGG AAGAAC AGG G 

AAAGGCTCCCGCAGGGCTCCGGGTGGTGTCTGCCGCTGATTC^^ 

CGGGATTTCAGGTCCTGCTTCACCTCCTTCTCATAGGGAAGAAGCTGTCCACTGCT 

TACACAAGTGTGGCAG CCCTATAGAAAAGAGAAAGACTCGTAAGT CTTGGCTTCCGGCCAGTGAAG A 

6 0 CAAATG AATTT AACAAAG AAAAGT CAACT AAC CAG AAAGTTT ACAG ACCAG TGAAT C CCACAGG G CT AG CATT ATGTG CCAAC T CG 

GTTCTCAGAGCTTACTGAGTTATCTCACTTCATTCCTACCCCGAA 

G AAACAG AAAGAAT CTCAGAGAGGTT AAGT AACTTG C CTGG GGTCACACAG CTG AGGACATGGG TAAAC CAACACACATTAATACC 

TCCCACA1TGCTITCTTCCGCCATTTCCTCCCTGTTTGTGGCTCCAAAGAAT 

CAGGAAAGTGGCAGCCTCGCCCCAGGCCAGGAGCCCCTGAACAAAGCCATAAACCAAGACT 

6 5 ACAGCAGAACGGGGTCCCATTCTGAAAGTCAGGCTCCIATCACAAGTACCCACT 

GGCGTAGGCAGCTGAGCCCCTGGGGGAGTGGAAT«3TGTCGGCTCTACCC^ 

TCCCCAG CCCC^GTTGCTCCTCTCTGAAATGGAAATCTTCTCTGGACCAGCTTC CTGGGGTAC CTATGAGGAGGCAGTGAGCTGCT 
TCCTATCAAGGCTTAAGGCAGCACCTGTCAAATGCTTAG 

ATCAGCTTGTATTACTCTGTGCCTGACACTGACATGTCCCCAAACT CCTCCCCACAAATGTCACATTCCAGTG CCTCTCCTTATTG 

7 0 TG CT CTC TCCCT CCATTT ATATTG CATTT CAACAAT AATAAC CGCT ACTCAT CACTG AG CACTTATT TGGTGCTTGGTATTGC CAT 

CAACGCirCCAAGTGTGTTATGACATCATCCTCATCGTGCCTTGGGAAGCG 
CCGAGGCAGAGCAATTAGAAGTGGCATATCTGAGGTCATGGGGCTAG 

GTTTCCTCCCCAGATCATCACTGTGGTGCGGGGCTGCTCAGGCGTTTGGTGAGTGGCTGCAGATTC^ 
AGCAGGAACGGGGTTCCTTCTCATTTCTTTGTAAGG 
7 5 TCTGTGTCATCTTTGCCTCCTCTGTCACCCATGCAGGGGCCGCAGGCCTG 
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AAGGGTCCCAGCX7TGCAGGAGGGAAGC 

AGGGGACAAGGAGCAAAGCAATGAAGACCAGGTATGAAGTCTC 

AGCTGTTTGGAATACAAACACCAAC CAAAACAACACACX5CACACACTCTTTTGTGTCGTGTTTCTGG T 
TCT CATATATT rfT(TA.AAA AC^G TC CAGTACAGAATTGCAGGTTGAAGCACTGTGAATAGG CTCTAG GTTTG CCC CAGGTTAAGG 
5 GTTAATTTGTTGAGTTTCAGAACAGGTAGGCGACGCCCTCCTGGATTTTA 

AGGCTGGCTTTATCTCTTTAAAAAAAAAATATATATATATATATACACACACA 
ACACACACATATATACACACACACATACACACACACATATA 
ACATATATACACACATATATACACACACATATATATACACACACM^ 
AGACACACATATATACACACACATATATATACACACACACACA^ 
1 0 GTTGTTGCAGTTGCAGAGATGAAAAAGAAATCTGTGTCTTTAAATGAGGAGGGG CTGCGT AAGGAGG CTTGGG CAG CAAAAAGGCT 

AATACGCTGTGGAAGAGAAGGAGTTTAATCTAATTTAGTTGGAGGGTGTCTG 

GGCGGGAAGCTGCCCCCTTGAATGGTCCCTCCTGAGCCAGGAGCAGTGGGGAGCCAGTCGACGCCA^ 

GAAAGAGGATTAAATCAATGCTCTGATTTGCAATTCACACTAGTGCACCCCGTGGA^ 

AGGGAAGTGGCCTTAAAATAGCTGCCTAAATTGTGCTGGAGCCAC 

1 5 GGGGTTCATTTCTCACCTTCCCCCAAACCTCCACATCrCACAGCCAGTGTTT^ 

T CGCTGCG ACTTG CATGGCCTTTCTC GTGGTACAAACTGTGTTTTCTTTG CT AAGT AT CACTTCAGAT CACT AAAGGT ATTTTCCC 
TGGAGAAAAAG ATGGGGG C ACGCCACGGGGAGCATG CCT ACTTGAT CAGAAAACATAATTTTTTG TG CATGTTTT ATAT CTG AT AA 
GTTTCTTC CAGAAACTTG ATACTAGATGAGGTCAGAGAAAACAAGGT CTATTC^ CC CTGCATGTGAAGGAATTATTG 

G T AAAT AG ATCAGTTTTCATTCTCAT ACITGAAGGT AGAATTGG A T AAAT ACAAACATTG ACGACACGAGAAGA 

2 0 CAAGAGAAGAAACACAAGAATCACTTACGTGGATCCTCR.GA.CTTGA^ 
GCGTTGCGACAGCCTGAACCACTTTTTATCTCAACATTCA 
AGTTTACTCTTGCAATTAGCTTGTAACATGCTAAGCTTAT 

T AAAGAG ATTAT TTGGGTTT TCT CT AAT C CCATTTCAG AAAACCT CGC^CAGTGCTT ATAATTTC CATTGACT AT T ACAT AAT AAG 
AAAAG AT AGAACTTTGATTTTT AG CAAT AT C CGTGT CAAAGTACTTTTTAT ACTTTTG ACTTTG TG AGTTT AAATGG AATG AAACTG 

2 5 GGCCTCTCTTTCAATTCCCCTCCACCTCCAAAAAATA^ 

AAAACCTCATACTTTCCCTGGTGAACGTTTGCAA 
TTAGGTGGTTGGGGGTAAAAACMATTCCITGTCrrCT 

TGCTCAAAATTTTAGAAGCCTAATAAGATTTCCATTAATTTAAACATATCACT 
TCTTTCATCCATGGTTAAATATGTAAATAAAATGTCTAAAGTC 

3 0 AGCCATAAAATTCCTCCCAATTCTCCATAACATTTGCCAGAGGGGGGTCGGTTGCCAGCCAGGGCTGA 

TGCTCCTTAGACTCACACATCTCAAATCCCACACACAGGAAAAAAAATAAAATAAAATATCT 
CCTTTTCATTTATTAGTTACAGACTTATCTTCCAGTTGCrc^ 

TTT C CATCTCAG CTGG ACAGGG CG AAGCACTT CTGCAGACATTTTT AT AAAAAACG AG AGAA^^ 
CTAAAATAATGGGCCTGAACGTGGCTGAATTAAAACAGCACAGA 

3 5 GCTCCCGG CGTTTGGCTCT CACTC TTG CACT AAAT ATTT CTAAT AGT TG CTGAGGGAG AGACACAAAAAG G AAAAGAAGG AAAATA 

GGGCCCAGGAAGGACCTTTGCCAGGAAAATTCTAGAAAAGCCAAGTGATT 

ATTACTCACTT AGACAAAAAACAACIAACAACAAAAAAAAAAACAC TAT AAT AAAAATG G CAG TCT TT CAAAAAT AAAAT CATG CAAC 

TTTTCACTTGGGAAATAACACAGTTGAAGGTGGTGTTCTAACTGCCCGGAGTTTCTACCT 

ACTTCCTCCAGCACCTCCTAGTGAGACCCAGGACACCCAGGGACAG 

4 0 GGATTCTGGGCGCTCAAATGTCTGTTGAACGGATGAATGAATGAGTGGAAAGGTGAGATTTCCC 

ATTGTAGGGCTTGTTTTAATTAAAAGCCAATACTCTCCGAGGAAGTGGCCC^ 

GTGCAGCCGCCCCCCACGCCGGGTGGAGCGCCGCCAACGCGGGAAGCCCCAGCACTGCCTCCAAAGATGTCCGCAC^ 
GCTCCCTGGAACTATCCTCAAACGGGCAGGGGGCAAGCG AGG CCGGTGGACGCGGG ACTCGCTCCT 
45 CGGGAGGTAGCGGGAGATTGCACACGCGATCCTGCGAGTTTCCGAACTTTGGAAGATCGTGACCCG 

GGCCGGCCACCTCCTAGGGGTGCTGTTT TTTTAAGGGTCAACCCAGGACG CTTACGGGAAGCACCTGGCGCATCCTTGGG AACAGTG 

GGGCTTGGCGGTGGGCGCCCACCGCGAACGCCCTGGGCGGGGGAAGGGAATGGCGGGGGGACGTCGTGTCCT 

CAGACCCGCCCCAATCCGAGGGGGGATGAGCTC^GAGGACCTGCCCAGGACGCTCCTTCTCCACTTT 

TGCGCGCCTCCGTGTCCTCGCGGGAGCTGGGGGTCCCCGGGCAGGCGCTGACG CGCTCCT CCCACGACCCACTC CATCCCACCCTA 
50 CTCCGGGTCCTCCTGCX3GGGCTGCAGCCAAGGAGGGa3CTGGCCGGGCACATCAOT 
ATGACTTCCTATAATAAATAAACTCAGCCGCCAAGAAGGGATCTCCAGCCCTGC 

GGGGCGGGGGTCACCGCGGTACAAACCAAGCCCGCACTTCCCGCGGGGTGGGAGGCGGTGTTCAAAGTCCCGGGACGCTTCCCGCC 

CCTCCCCCTTCCCGCCTTCTTAGAGGAGACCTCGGATGAGCGGCCCTAAAATCCGGATTCrCGAGCCCCT 

CGTCTCTTTGAAATTTCAAAGAAAATCCCTCCTCGGAGCCCGGAGCCCGGGGCT 

5 5 GCAGGTTTACAAGCAAC^GCCTACTTTATGGCCCAGGCTATAAAACGCAAATCCCACCTCCCTTTGAAAGC 

CTCACCATAAAAGGAAGAGAAGATGCACCCCGCGCCCCTCGCCGCGGACCACCCCCAAACCCCT 

CAAAGTTGGAGGCCAGGCAGAGAGAGGGCGAGGAGAGGCGGGCGGG ACTGCCGGGGAGG^ 

AGGCGGGCTGTCCCGGGGCrTGGCGGGCAGACGCGGGCGGGACTAGGCGCTGCGCCCCCCGCAGGCTGGGCTGCAGGCaSGGG 
60 TCGGCCCCCGCGGGGCCCGGGAGCCCGCGGCGGGGCGGCGGGGAGGAGGCTG 
GGCGGTGCGGGGCGCGCGGCCCGGGGGCGGGGCCTGTCAGCGCTCGGCT^ 

CCCGGGACGCAAACCTCGAAACAGCTGCCGGCTGGTCCCGGCCGAGGCCGGCGCAGGGAGGGAGGAGCCGCCCGGGCTGTGGGG 

GCCGO^CTGGGCCGGCCTCGGTGTGCCCGCGCCGCCAGCCCGCTCCAGACGCGCCACCTGGGCGCTCCAAGAAGAGGCCGAAGT 

TTGCCGCGGCCGTGAGTTGGAGCTCGCGCCGGGCCGCTGCGCCGGGAGCTC 

65 A CTTTGCTGCTCTCCGCCGCGGCCCCCAACTCGGCGGACGCCGGG(^CGGAGAGCCGAGCCGGGGGCGCTG^ 

CCAGGCCGGGCGGGCATGGGCGGGGG CCCGAGCAGGGGTGGAG AGCCGGGGCCAGCAGCAG CCCGTGCCCGGGAGCGGCGGCGCTG 
AGGGGCGCGGAGCTCCC03CGAGGAC^CGTCCAACGCCAGCATGCAGCGCCCGGGCCCCCGCCT<3TGGCTC 
GGCTCGTGCGCCGCCATCAGCTCCATGGACATGGAGCGCCCGGGCGACGGCAAATGCCAGCCC^TCGAGATCCCGATGTC 
CATCGGCTACAACATGACTCGTATGCCCAACCTGATGGGCCACGAGAACCAGCGCGAGGC^^ 

70 CGCTGGTGGAGTACGGCTGCCACGGCCACCTCCG CTTCTTCCTGTG CTCG CTGTACGCGCCGATGTGCACCGAGCAGGTCTCTACC 

CCCATCCCCGCCTGCCGGGTCATGTGCGAGCAGGCCCGGCTCAAGTGCT 

CTCCCTGGACTGCCGGAAACTCCCCAACAAGAACGACCCCAACTACCTGTGCATGGAGGCGCCC^ 
CCCGGGGCTCGGGCCTGTTCCCGCCGCTGTTCCGGCCGCAGCGGCCCCACA 

GGGCGCGGCGGCTGCGACAACCCGGGCAAGTTCCACCACGTGGAGAAGAGCGCGTCGTGCGCGCCGCTCTGCA^ 
75 CG'TKT A CTTGG AG C CG CG AGGACAAG CG CTT CGCAGTGGTCTGG CTG GC CAT CTG G G CGGTG CTGTG C TTCTTCTC CAGCGCCTTCA 
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CCGTGCTCACCTTCCTCATCGACCa^CCCGOT 
TCCGTGGGCTACCTCATCCGCCTCTTCGCCGGCGCCGAG^ 

CGCTCACCTGGTTCCTGGCOSCCGGCAAGAAGT^ 
5 GCCATCCCGGCGGTGAAGACCATCCTt^TCCTGGTCA 

CATGGACGTCAACGCGCTCACCGGCTTCGTGCTCATTCCCCTGGCCTG^ 
TCGTGGCCCTGTTCXACATCCGGAGGGTG^ 

CTCTTCTCKSTGCTGTACACCGTGCCGGCCACCTGTGTGATCGCCTC 

C CTGG CGGCGCAGCACAAGTGCAAAATGAACAAC CAGACTAAAACG CTGGACTG C CTGATGGC CG CCTCCAT CCCCG CCGTGGAGA 
1 0 TCTTCATGGTGAAGATCTTTATG 

CAG CAGGTGTG CAG CCGTAGGTT AAAGAAGAAGAG CCGGAGAAAACCGG C CAGCGTGATCACCAG CGGTGGGATTTACAAAAAAGC 

C CAGCATCCCCAGAAAACTCACCACGGGAAATATG AGAT C C CTGCCCAGTCGCCCAC CTG CGTGTGAACAGGGCTGGAGGGAAGGG 

CACAGGGGCGCCCGGAGCTAAGATGTGGTGCTTTTCTTGGTTGTGTTTTTCTTTC^ 

AAAGAGAAATACATAAAAAAGTGTTTACCCTGAAATTCAGGATGCTGTC 
1 5 TTTGTTTTGGTTTTCCAGCGAAGGGAAGCTCCT 

CCTCaGAAGAAAACTTTTGTTTAGAGCGCTCCGTAAATATACATCTC 

GGACAGATAACTGCITTGCAAATTCAAGAGCCTCCCCTC 

GTGCTGGGCGGCATCCCTGCAGAGGAAAGACAGGACCCX3GG 

CCGGCCTTCACCAATAGTCTCrCTGCAAGACAGAAACCTCCATCAAACCT 
2 0 T TTCTCTTTCCTTGAA/^TAAAAAGAG AAACAAGTATTTTG CT AT ATAT AAAG ACAACAAAAG AAAT CT C CTAACAAAAG AACT AA 

GAGGCCCAGCCCTCAGAAACCCTTC^GTGCTACATTTTGTGGCTTTTTAATGG 

TTTG TGGAAAGG AGGGGG G AAGAGGGAGAAGGATC AT T CAAAAG TT AC CCAAAGGGCTTATTG ACTCTTTCT ATTGTT AAACAAAT 
G ATTT C CACAAACAG ATCAGGAAG CACTAGG TTGG CAGAG ACACTTTG T CTAG TGTATTCT CTT CACAGTG CCAG GAAAGAGTGGT 
TTCTGCGTGTGTATATTTGTAATATATGATATTTTTCATGC^ 

2 5 AG T C C CTGGITCATGGTG AGGTTT CC CTTCTCTGTCTGTGGAG AGACT CTC C CTTTG CTTTiT AAGCAAATTCT 

GGGGTTGTG CTT CTAG CCAG AGAGTACTGGTT AT C TG TAT AAAATTGCAG ATCAGAGGATTCCAAG ATT G GAAG AAT CATTCACTT 
GTGAGTTTCTGAGGAACTGCAGTCAGGCAATGTGTGAAGTTGGAAGTCGTTTCTGTGACTC 

TTGT AG ACACACATCAC CAAGGAG AG CCTCACCACTAAGCGTGGACCCACTCATG AGCCTGGATTAG CCAGTTTTAATATATTCAG 
AGAGTAGAGCCAAC CGATTAGG AGAATTAAATCAT CTGT AT T CAATTGAC CTG CTAGGCTCAAATGGTAC CT AGACAGAGCTGT AT 

3 0 ACATCAGCCCAAAGTGTG CTGG CTTTG CAGTGACT CTGAACTATGAAGACATTTACAAAACAGATTATAG AC CTG G ACGGG AT AAA 

ATACCTGCTAAGGGTGGCGAATCCTTAAGAGGAAGTTAAGCAGCTTTGGAACT 

TTAGAACTGGGTAGCTGTAAAGAACTGCCCCAGGTCTGCTGGTCAAAGGCACAGCTACACT 

GGCACAGTGGAGTCCTACTGTGGTGAGGACTCTATTAATATAA^ 

AAGTTAAACAAGGAGCTCTGGAAAACTTGGAGCATGGATGACACGCTTACCAACGAGGAGGAG 
35 TTTCTGAAGAGCGTGTTCAGAACTTCAGCCTGGTAAACATAAGA 
TTGAGGGGCTACTGCACTTGGCCGTGAATATTTGCAGGGCAC^ 

AGG AAACAC CAG CT AGGT AG G CAAAGTAAAAGAATG CAAGATCC CAAAG AGCT TTCAG TG CAT T AAAAAAGAAGG CAGAAAAG CTT 
CAGAAATGCC CCCAAATAAAATTTCTTTGTTGATTGTTT^ CTGTG 
CATGTATGGGTATAGCGTTTCCTTTTTGTCAOTCTTAGGGAAAATGAAAATAA 

4 0 AGGGTAACATTtSTCCGGATGCTGCTGTGAGCGACAAG^CTAGATATAA 

AGCTTGAGACCTGCCCATGCTCAGGTGTGTAGGCTCGAAAATGTGGTCCC!AAGG^ 
TTGTTGTTAGTTATCCCTGAATGTGTCTCACTGTGGATCCATCCCAGAAATGACTGATCTTGTT^ 

CATGTGG CTG GGTTGGG CG C AGCTGG GTGAGTTT AACAGTG AG CAGTG CAAGGAG CAG CTG TAG AG CTTCCAAG CAAAATG CT C CA 
AGGTCTTCCTCTCCCTCTCTCTCTCTCTTCCTCTCrCTCTCrCTCrCT 

4 5 ACAG ACAGTGAGTTTT AGG AGAAGCATCAAGGC CTT AAGTCAG AAAGCT G GAGAGAGG 

GAGGGAGGGAGGAAAGAAGCAAGGAGAAAGAAAGACAGGCAAACATCACTAGTGAAGGAAGGAAGGAAAGAAGGAA 

ACAG GCAAACACATCACTAGTGAAGGAAGG AAAG AAGGAAG GAAGGG AGG AAAGAAAGAAGGAAGG AAG G AAGGAAAGAAAGAAGG 

AGAAAGAAAGAAAGACAGCCAAACACIATCACTAGTGAAGG7AGGAAGGAAGGAGAA 

CGAAGGAAGGAAGGAAAGAAAGAAAAAGAAAGAAAAAAAGAAAGAAAGAAAGGC^ 

5 0 AG CTCAGTACAGG GTTTC CAAAGGAATTG TTGACAG CATTAATTTGGAGATTT CTCAGCAT CTGT ACCC CAC GGC CTTAGTTTT AG 

AAAAATGGCAACCTGTTTTGCATGGCAGTGGTGGAGTCATAGAAGAAGCCCAAAGAAGTAGATTG 

TCCCTCCCTTCCTTCCTCCAGCCTCCAGCCAATC^^ 

GTGGTTTTGCAGTGGACAGGAGAGACCTGC!ATGGCAC!ATCTTAT^ 

ATGT ATTTC CACAGGGCCCCTAAATGCAT ACAAGG CAG AAG ATT CAG CCT ATGTG AGG TCATAGCAAGG GG AAG C TG CAG GGTCAT 

5 5 ATAGAAGGGAAGAGAAGTTATCAACAGAGCATCAAAACATTC 

CCCACACCTTGCAGAGGGCTTTGGACGTAGTAAGAATAACTACAATAATAGCAATGGCTGTGATAATAGT 

CCTTGAGACCACTTGCTGTGTG<nX3AACAGGCrGCTGGG ACTAT CT CAGTGAATCCTC AACGCATTCAGGGAGGTACTGTCATCAG 

CCCCACTTTACATGTGAAGACCCTGACACAGAGAGGTTAAGTAAATTCCCAGGGTCA^ 

CTGGGTGGTCTGATAACAGATCTGGAACCCCCAATTCAGTAAGCAATTGTT^ 

6 0 TACATGCATTAATTCAATCCTCTIX5ACAAACTTTGGTGGTACTGCTCTCA 

AGAAGTTGTGTCACTITACCCAAGGTCACACAGGC^ 

TTATTCCAAGCAAAAGGGGAACAAGGTTG CATGGCAAAGAGCAAAAG CtZACTTGGCCAGTGTGGGGGAG CCCAAGGGAGGAGGCCG 
TGGCAACCTGCATTTTTGC^GCAGTCGDSAGGGTGCTAGCAGTG 

6 5 AG AAC CAGG ATTAGGTTTTAT GT AGAGAG GGAGCAAGAAGG AATGG T CAGGGAG GACCACAC CATTT CTG ACT TTGGTGG CAAAGT 

GACATTCATCAAGACIAGGAGAAAACCAACAGAGGACAGGCAGTGTGATGGGAAGC^ 

CGAGGTGTCTGTAGAGAATTTGGGATGAGGTCATGGTCACAGGCGGGTGTTTGAAATGGTGAGAGTTGGAGCTGTGGTT^ 
GGAAGCAAGTGGGTCTCTCCAGGGACAGAGGAGGGAGGAGGGCAGACCTCAGGGCCA 

CTGTGACTGGGAGCAG CCCCTTAACCACTTX^GCGCTGGGTGC CCATTTTTACTATAAGAGGATTGG CAGAGGAGATTATGCAAAT 

7 0 GGC CTT AT C CAACGT CAAAAT ACGTGATCCGGAT AG CG CTAAG GGGGAATGCAG CAAGGGTCTTACAGGAGTGACTT AGGAGGATC 

CAC TC CTGCC CGTGAT AACTCAGC CTCTGTAT CTTTAACTTAAC C CTTGGGCT ACC CGAG CAAAATT AAAATCAG CT CCAAGAG CT 
CATAGTTTTAAATTCTCCACTAGCATCTTTGGGGATGCTGTT^ 

GGCCATAGATATTATCCTACA.TTCTAGATTATTTTCTCAGAAGGAGGAGAAAGAAATACAGAAAACGATAA 

ACTGG C^TCTTACACTTAAAGGATGTG CAGAT ATAAACATCG T TTAGG ACTGAACT AT ATCTTG CCTTCTCT ATGAGCTGGTTTTT 
7 5 AGTTTGACTCAACCACTTACACATTGCITCACCTTGAGCAAGATACCAGAACTTTCC 
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CATGATAATGGTACCTAT CTCACTG GACTGTCACTCACAAAT AG CT CATG CTTATTG AACMGT ATTAC^TGCTCG ATTGCTlTAA 
GTTCTCTGTGTTTATTAATTCATCTACTCCTCACAACAGCCCTAGGA^ 

ACTTGGAAGG CTGCTTGGCGCAG AG CAAATGCTCAATAAAGTGATCT ATTTTTATG w i"iTl , Tl , GTCCCTCCCCC CAAAGAAAGTT AA 
ATC CTGTAAG CT AATGAGTAGGGCT CTTTAAAG AAAATGAGTTTTTTT CTT ACAGAAACACTTTXTCCAAT ACTGTGTTCAGGTTC 
5 CACAGAAAACCCACTGATCTTTCAAGTTTAAAAGGC^ 

AACACAATAACATCACAGCTGACTTCCXAGGAAATCAGACCT 
ATATGGAAATGCAGAGAAACGGTCAGACTTCAGGGAAGCTCGCGAAACACGTGC^ 
TTATTAAGAGTGTGAAGGCCAAGCTGGAGGGAGGAGGGAAGG^ 
GCTGTCGTAATTGGGGACACTCCn'AGGAAAATGACACTGTTG^ 
1 0 AGTGCCTGTCGCTTTGTTATTCTAAGACTT^ 

CTTTGGTTTATAGGAAGGGATT CATAACAGCAACCCTTGCCACACG C CCATCTCC TTC CCAGATGCTGGGGGCTTCAGGGGTTCTG 
TAGATGGGATGGGLATGTTCCCAGGTGGTCTCTTGCAACAGATGTCCrr 

CTTAAAAAAACATCTAAAACCTAT CCTAACAATGTT TTAAGT ATTTTAGG CAG AACT ACAAAGTG ATCGACCTTG ACATG ATG AAT 
CAAC^CCCATTTTCCAAGGGGGCGATGGTGATCGGGAGCTGCAGGTGCTGAC 
15 AGCTCTCTCCCCAGCCCAGCCCTCCCCATTCCACCCCTGCCCCCAGGGCAGCAGCACCrcT^ 
ATTTTCK3CCTCTCATAAAACACTTGGATGTTGTT 

CTG AAACAGTCTTATTGAACTTGGGT CAAT ATCAC CTCG G TTT CTTTGGATTCCAGGATCACAG C AG AAGG CT CAAATTTGAACT A 
AACTCAGAAGCATTTTAGGTCAGCAAAACTTTTTTTTTCCCCAAGA 
CACACATGGCAC^JGGGTAAAGAGAACATGA 
2 0 AAGACAGAACAAAAGCAATGAGAACAAGAGAACACGGCCAATTATA^ 

TTTCAATTAAAAG AAAAAAGACAAT ACT ACAGCAAT<3TT CTGAG CTGAAT CTT AT C TCTTGAGTAAACAGAGGGCCTCTGAG CAAA 
AAACCATGCAATAGAAAGGAGGCAT CAAAAGAAGTTGGAAATGTAGTATTTT TATTTTT AACT CTTT 
ATTCTTTCCAAGACAAAAAGCTTC!ACGCATGTCCCAGGGGATGAGCCAAGAAAAAAA 
AGGGGCGTGOVTGTCACTCG^n'CTGGTCAGTG^ 

2 5 GGG AGTG AAGG AAGGTT CTAATAAAAG ATTTTG CTTCATG ATAAAAAAAAAAAATGATGAGGAAACCTGTCCTTTTCTTCTG CACA 

CAATGTGGTGTGTGAGAATCACATATGGAGCTATTGGAACCATTTTGTGACC^ 
CTGAGATGGC^IGGTAAGCAAGACAGAGCCAGCCTCGTGAGCACTCATTAAGCGACTG 
GCCAACACTGCCTGGATCCCCTGAGAGTTGCAGGTTGGGTGTTCTGTGCCT 
CAGGTGTCTCAAACCCTACTGTCCACACAAATCACT 

3 0 CCAAATTCTGCATTTGTAATAAGTTCCCAGTTGATGTCGATGCTTCTGGTGC^ 

ACCCACCAGAAGTTCAGAACGACCCCACTTACTCCTGGCTCTA 

AGTGG ACT AG CTGT C^TTAAAACAATTTCTTATGTTGAGTTAAAAT TTG CTTCCTTGTAACTATTTT CTATTATT AT CT CT ACT T 
TTG C CACATAG AGCTT CACAGAG CAAGT CT ACTTG TTTTTC CATGGG AAAGCCACTTATATATG GGAG CTTATCACAAATGCAGAA 
TTTGGGGTACCACCC^AAACCTACTGACTCAGGAGCTGCCTTT^ 

3 5 QCCCAGGACTAATCTGCTATATCAGGTATATT^ 

AATATGTGCTCCCTGGATCTTTCITTTTATTCTTAAAACTGTTG 

TGTTT AC CATGGT C TCAATT CTTGT CAAGG AAG AATGTGC CCAAGT AAAG ACAAAT CTCCAGGAAG CAT CTAACAAGG AGTGGGAT 
CATTAGACACCCAGCAAATGTTATACAAGATAATGGGAC^CT 
GATTACTGGTCACTACCCGGAACATGATTAACCCTGAAGTCAGAGAAGAGGAAAGA 
40 ATT ATTTG CAGCTGTTGTCATTG AAAAGTG ACTTAG CTGGAAAT AGAATT G ATGAG ATT TT CAT AAGGAG AC CAGG ACAAAACAT A 
G CTAGAAAAATCAATAGCATCCTGATGCACCAGCAGGAACCAGT^ CACTTATGCAATGCCTA 
CAACAAGCACAAACTATCTAGAAATTAACATAGGGGGTCTGCAAGCCTTTTGGA 

ATAATTGGCTCAAATAATTGGAAGCTCATGTCATGTGCCTGTAAATATGTCAATTATCCCCAAGTTAATTCCTAA^ 

TTG CAACTT TAAAATATGT ATTTCTGTGGAACT C CACAAGATGAT ATTAAATC CATAAAATTTG AG CAG TAGTCAAGGAAAAAAC C 

4 5 CTGGAACAGGGATAAGAAAGGACATAATCTATCAGTGAAGTACACTGTGAGTCTGCAGAAA 

ACAGACAAAAT AATGGAACAAAATAGGAAATCCAGAAAGAGT CTCAAGTGCATTT CAAAT AAAT AAG AAAGAATGG ATT ACG CAGT 
CAATACTGTTGAGGCTTAGTTTAGATGTGGGGAAAGTAATAT^ 

AT AAAACATT T AAACAT AAAAATAATACAGT AATT AG GGG AAAATGTT AT CT AAAGTTTTCATT CAAT AG AAAG AAATT ACAGGT A 
AACATTTTTTATAAACTTATCAGGAAAATGTCTTCTTAGCATGACATCAAAGCTAAAGC 

5 0 CAT AAAAATGTAAAATTCTGT AATGTAAGAAATGAG TTAATTTAAAAAAAATT G CAAAAT CTC CAT AAACAAAATT AAAAG ACAAA 

AACTTAAGTTTCTTGTCTGGGTCTATGAGTAGGCAATTCAAATTAGAAGAAATGCTAGTGACCCATAAACC 

AAGCTCACTTCTAATAATCCAAGGGTGCATGCATAAAAAAATGATTATTTGTGCCTAACAGGTT^ 

GACAGAGTCTCTCTCTCTTGCCCAGGCTGGAGTGCAGTGGCATGATCT 

TCCCCCGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCGTCCACCATCATGCCCGTCTAATTTGTTTGTA 

5 5 GGGTTTCACCATGTTGGCCAGGCTGCTTTTGAACTCCTGAG 

AGGTGTGAGCCACCGCGCCTGGCCAGCTTGGTACTTTTTTTTTTTTTTTAAAGCGGATAATAC 

AAATGGACTCCCTGTATCACAGAGTGGAACCATAAAGTAGCTCACCCTTTCT 

GCAGTATAC!ATGTCCTTTGACCCAGCAATTCCACTTCCAAATAAATGTTG^ 

AAG CTG ACT ATTCAAGC CAAT T ATG GAAAAT CCTTAAAAT AG AGT AACATGT AGTCATT AAAG TAATTACGTAGAACCGTGTCATT 

6 0 AACATGGGAAAATGGCCACACGGCACTGCTAAGCAGGCT 

CTGGTGTGTTTTTGAATTTGTCCACACCCCTATGCCAAATCAGTTG 

CAAGTTATATAATTCCTGAATAAGACAAG CCTGGGCAACACAG GGAG AC C CTGTC T CAAAAAATAAAAAG AAAG AAAAGG AAGAGA 
GGAGGGAAAGGAAGGGGAGGGGTGGGGAGGGGAGGGAAGGGAAGGGGAT^ CAGTGGCTTTCGC 
CTGTAATCCCMCACrrCTGGGAGGCTGAGTGGGCAGATAACTTGGGGT 
65 CCGTATCCACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGGGCACCTGTAATCCGAGTTA 
TTGCTTGAACCCAG 

HUMAN SEQUENCE - mRNA 

ACACGTCCAACGCCAGCATGCAGCGCCCGGGCCCCCGCCTCT 
70 ATGGACATGGAGCGCCrcGGCGAOSGCAAATGCCAGCCCATC^^ 
GCCCAACCTGATGGGCCACGAGAACCAGCGCGAGGCAGCCATCC^ 

GCCACCTCCGCTTCTTCCTGTGOTCGCTGTACGCGCCGATGTGCACCGAGCAGGTCTCTACCCCCA^ 
TGCGAGCAGGCCCGGCTCAAGTGCTCCCCGATTATGGAGCAGTTCAACTTCAAGTGGCCCGACTCCCT 
CAACAAGAACGACCCCAACTACCTGTGCATGGAGGCGCCCAA 
75 CGCTGTTCCGGCCGCAGCGGCCCCACAG 
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'c^CRAGTTCCACCACGTGGAGAAGAGCGCGT 

GCGCTTCGCAGTGGTCTGGCTGGCCATCTGGGCGGTGCTGTGCTTCTTCTCCAGCGCC 

CGGCCCGCTTCCGCTACCCCGAG CGCCCCATCATCTTCCTCrCCATGTGCTACTG CGTCTACTCCGTGGGCTACCTCATCCGCCTC 
TTCGCCGGCGCC<3AGAGCATCGCCTGCGACCGGGA 
5 GCTGGTCTTCCTGGTCCTCTACTACTTCGGCATGGCCAGCTC^ 

GCAAGAAGTGGGGCCACGAGGCCATCX3AAGCCAACAGCAGCTACTTCCACCT 
CTGATCCTGGTCATGCGCAGGGTGGCGGGGGACGAGCT 
CTTCGTGCTCATTCCCCTGQCCTGCTACCTGGTCATCGGCACGTC^ 
GGGTGATGAAGACGGGCGGCGAGAACACGGACAAG(nX5GAGAAGCTCATG 
1 0 CCGGCCACCTGTGTGATCGCCTGCTACTTTC^ 

AATGAACAACCAGACTAAAACGCTGGACTGCCTGATGGCCGCCTCCAT 
TGCTGGTGGTGGGGATCACCAGCGGGATGTGGATTTGGACCTCCAAGACTCT 

AAGAAG AAGAGCCGGAG AAAACCGG CCAGCGTGATCACCAG CGGTGGGATTT ACAAAAAAGCC CAGCATC C CCAGAAAACT CAC CA 

CGGGAAATATGAGATCCCTGCCCAGT03CCCACCTGCGTGTGAACAGGGCTGGAGGGAAGGGCA 
15 GTGGTGCTTTTCTTGGTTGTGTTTTTCTTTCTTCTTCTTCT 

TTTACCCTGAAATTCAGGATGCTGTGATACACTGAAAGGAAAAATGTAC^ 

GGAAGCTCCTCCAGTGAAGTAGCCTCTTGTGTAACTAATTTGTGGTAAAGTAGTTGA 

GAGCCCTCCGTAAATATAC^TCrGTGTATTTGAGTTGGCTTTGCTACCCA 

TCAAGAGCCTCCCCTGGGTTAACAAATGAGCCATCCCCAGGGCCCACCCCCAGGA 
2 0 GGAAAGACAGGACCCGGGGCCCGCCTCACACCCCAGTGGATTTGGAGTTGCTTAAAATAG 

TG CAAGACAGAAACCT CCATCAAACCTCACATTTGTGAACTCAAACG ATGTG CAAT ACA.TT'l"l"l~rr CTCTTTCCTTG AAAAT AAAA 

AGAG AAACAAGT ATTTTG CT ATATAT AAAGACAACAAAAGAAATCT C CTAACAAAAG AACTAAGAGGCC CAGC C CTCAGAAACCCT 

TCAGTGCTACATTTTGTGGCTTTTTAATGGAAACCAAGCCAATGTTA 

GGGAGAAGGATC^TTCAAAAGTTACCCAAAGGGCTTATTGACrrCTTTCTAT 

2 5 GCACTAGGTTGGCAGAGACACTTTGTCTAGTGTATTCTCTTCACAGTGCCAGGAA 

ATATGATATTTTTCATGCTCCACTATTTTATTAAAAATAAAATATGTTCTTTAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGCAGCGCCCGGGCCCCCXKCTGTGGCTGGTCCTGCAGGTGATGG 

3 0 GGGCGACGGCAAATGCCAGCCC^TCGAGATCCCGATGTGCAAG^ 

ACGAGAACCAGCGCGAGGCAGCCATCCAGTTGCACGAGTTCGC^ 

CTGTGCTCGCTGTACGCGCCGATGTGCACCGAGCAGGTCTCTACCCCCATCCCCGCCTGCCGGGTC^ 
CAAGTGCTCCCCGATTATGGAGCAGTTCAACTTCAAGTGGCCCGACTCCCrrc 
ACTACCTGTGCATGGAGGCGCCCAACAACGGCTCGGACGAGCCCACCCGGGG 
35 CGGCCCC^CAGCGCGCAGGAGCACCCGCTGAAGGACGGGGGCCCC^^ 

GGAGAAGAGCGCGTCGTGCGCGCCG CTCTGCACGCCCGGCGTGGACGTGTACTGGAGCCG CGAGG ACAAGCGCTTCGCAGTGGTCT 

GGCTGGCCATCTGGGCGGTGCTGTGCTTCTTCTCCAGCGCCTT 

CCCGAGCGCCCCATCATCTTCCTCTCCATGTGCTACTGCGTCTACTCCGTGGG 

GAGGCCATCGAAGCCAACAGCAGCTACTTCCACCTGGCAGCCTGGGCCA 
GAGGGTGGCGGGGGACGAGCTCACCGGGGTCTGCTACGTGGGC^ 

TGGCCTGCTACCTGGTCATCGGCACGTCCTTCATCCTCTCGGGCTTCGTGGCCCTGTTCCACATCCGGAGGGTGATGA 
GGCGAGAACACGGACAAGCTGGAGAAGCTC!ATGGTGCGTATCGGGCTCT^ 

4 5 CGCCTGCTACTTTTACGAACGCCTCAACATGCLATTACT^ 

AAACGCTGGACTGCCTGATGGCCGCCTCCATCCCCGCCGTGGAGATCTTCATGGTGAAGATCTTTATGCTGCTGGTGOT 

ACCAGCGGGATGTGGATTTGGACCTCCAAGACTCTGCAGTCCTGGCAGCAGGTGTGCA 

AAAACCGGCCAGCGTGATCACCIAGCGGTGGGATTTACAAAAAAGCCCAG 

CTGCCCAGTCGCCCACCTGCGTGTGA 

50 



2015 



WO 03/008583 



PCT/US01/51291 



Table 98 

MOUSE NOMENCLATURE 
ICSGNM Calm2 
Celera raCG13058 

5 

HUMAN NOMENCLATURE 
HGNC CALM2 
Celera hCG17033 

10 MOUSB SEQUENCE - GENOMIC 

GGGCTCTGCTGTGAGG ATGGAAGGAAAGATGAATATTTG GTGACTCCT A "X'GAGACTCTG C CC CCAGT ACGLrrTTT CTG CTTCTGAT 
GG T CTT CATTTCCACACACTAGGACATG CACGGTCACT ACCATAGCC CAGTTCTCAT ACTG AAAGTGAGGATGTGCACTGTCGGGG 
CAGTGGCAAG AAGAGTGCCTTCAAAGTGAGAGT ATTCTGACAGT ^ C CTT CCTCA 

CAAGTTCCTATGTGTCCTCTGAGCTGGCCCAACCCTAGGAAGAATTTGC 

1 5 CTGAC C CCTTCATCTGGAGTT CATTGATTCAAATGGGCGTGAATTGG GGCGAATGGGAT CAGC CC^ 

GACAGGCCACTCCCACTGTGATATGCGGCAGTTC 

CTAGATC CACACATTTT CT CT CCT TGCT CACTGGATC CTTG TGATAATG TGAATTAGGCAGAACACATCCATCCTGTAGCCATGGA 
AACCTGAGACAT CACAGTG ATTTAGTAGT AAGGCTAGGACTTGAAC CT ACCACT CTCAATTCATAACAAG GCC CTTTAAATTGT AA 
TGTGT CATCAAAATACTC CT ATCTGGGGCCC TGGGAGTTGGAAAG CAGATTCAG GG CAG AAAT AAG ACAG GAT C CCACACAGC CTG 

2 0 GGGCCCCCAATTCTTCTGCAATTGAGCCCCTCATACT 

AAGGAGGAATAACGTTTCCCCAGGATTGCCCTGAGAAATGTTTTCTTT^ 
GTAGAAAAGAAACTTTGCAAAGCCCTGCTGGGCATTTGTGCTTGCT 

GAAGACACAAGGTTCCCTTCCATGATCTCCCTTCTCCAGGGATGATCGTCCTGGCAGCCCR.GGT 

AAAGCCCT AAATGTTG CTAT GGGTGACAT CAGG AAAGACAAGTCTTGAG G CAGGTTTCAGAG CCCAAC CCAGAAACCAG AG TGTGT 
TGTGTATGAGTGTTT^GTCAGCCG 

AACTAGAGTTACAGGTGGTTGTGGGCTGCCGTGTGGGTTTCTGGGAACCAAACCTGGATCCTCTG 
GCTGTTGGACCATCTCTCCAGTCCTGGGACAATTTACTTGATTTAACTCAGTCT 

CGGGAGAAACAT CATGTTGGTGGGGACATGTGACAGAGG AAGGT CTTCAC CACT CAGTAG ACAAAGAAT AG AAAG AGACAG AAATG 

3 0 ATCAGAGCAAAGACACTTCCAAGGGCCTGTCCCCTTCTCCAGGATTTACT 

TCCAAAATAGTACCAGCAGGTGGTGACCTAGACTTAGCACTTGTGTCACAGGGCCAA^ 

GACCCACTTGCCATC^GGGTCAAAGGCAGTAAACTTGGTCATTCCA^ 

GTCTGATCAAAGACACCATTCATTCTGTCCCCACAGCCCCTCAATGATGCCT 

AGAGTCCACCATTTGTACCTTTGG CAGTCAACACAQVACCATCT CATGTCCCCT ATTGCAAGAATG CCCAGAGAGTGCATCTCTGC 

3 5 TTCCCATGAGAAGCCATCTCTGATCTGCCCAGTACAGTGAATGCCCCTCT 

CTCTTTGTTTGTGGGTTCCTGCCCTCCATACTAATCATTTCAACCCACATTTGCCACTGCCCCCT 

GGGTATCTACTTGTTGATCCCTAATGTCCTAACTCAGTGAACTGTCATACATCCTC^ 
TATTGTTAGTAATTGTCCCAACIAATACAAGGAAGGAGTATCCTTTTTATTTTATTTCT 

4 0 C!ATCCCC1AGCAAGCCTX3GGTCTCACTAGGTAGAGCAGGTTGTCCTGGAAC^ 

CAATGGAGTTAAAAGCCATACACATCTGTATGCATGGGGTATACTTTTTAAAAG^ 

ATACCACAGCCCATGCAAAGAAGTCAGAGGACTTCTTC 

TTAGTCTTGACAGCAAGCCCCTTTTCCTGCAATGCC^ 

ATAAGTTATGTGATAAATT AAGCGTTTGC CAGTG CCACACTG AG C CCAAGT CTGG AAGT CAGTC GAGT CTAATT ATTAAAGTGG GG 

4 5 CACAACCCZAGAGCAGTGAACCTACCTTCT 

CTCACACCTCCTGCCTAGACTGGGCCGTCTTTCTGATGCATGC^^ 
CACTGAGTATGGAAAGAGTCATTTCAGGACAACAGGATACrrGTCACCT 
GGGACCATTGGTAAAGAGTGCCATCCTCCAGTAAATGGAATGTTCCTC 
ACTCTCCAAACCCTCTGAGTGCTGTATGCACTGAGCAATGCAGACC^ 

5 0 CAAGAGAAGAAGGGGTGGCAAC!AATGCTTTGGGGGTAGAGTC!TACTGAGTGCC 

GCAGTAGAATTTGATTG G AGGG AAATT ACGGTGGAAAGAG CCAATCTG AG C C CAAGGTG AGTAAGTG GCTGGTTT CAG AT GTGTGA 
GTACCTCAGTCITCAGCTCTGTAATAAAGGC!ATAAAGGCTCAGCCTTTCTATGCTCACA CTGGAACACG AATGG CAGGTT 

AGCACTAATTTCCATGAACATCAGTGTGGAGATCrTTCCAGAAGTATTC^ 

5 5 CTTGCCCATC^CGGTGACAAAACAAGGGAGGCTAACAACTTAAA^ 

AATCCCAGCTTAGGGAGCCCAGGCCTGGGCCAAGAGCATTGTGAGTGTGAGGACCTGGGCTACAA 

TGAAACATGAAAAGTGAGGAGATGCAGGATATGTATTTTG 

CTGTTGCTTCTGTGGGAGGTTGGAACATCTTGGTAGGGACCIAAGTAGC^ 

AAGGGTTAT ACCC C CAAAG AG C CAACTTTTCAC C CATTGGCCT CTC CCTGT AAAGATTC CATT ACTAC CTAT CAGT AACACAG AC T 

6 0 AGGGACTAAACCTTCAACACAGGGACCTCTGGGAACACCCAAGATCCAAACTA 

ATGAATTATAAAGTCACTCTCCATAAAACAGGATTCTGAGGAA^ 

TCTAAAGGTTTGTTTACCATAAAGATATGAAAATGAAATAGGTGGTCAGGGAGATGACT 

GTT CAAAT CCC TACATACACACATACAAAACCTGGGTGGTGTGGGACACAT CTGT AATGAAATG CTGGGG AAGCAGAGG CAAGAGG 
ATCCCTGAAGCTC^CTGGCTGGCCAGCCTAGCCCACTTGGTTCACTGAGAGATCCTGTTTGCTTGCTTG^ 

6 5 TTGCTTCCTTGCTTGCTTGCTTGCTTGCTTGCTTGTCTGTCTGTCT 

CAATGATTAGGAGAACTGGCTGTTTTCCCTVGAGGAGAC^ 
CCAGTCTCAAGGAATCCAGCATCCTCTTCTGGCCTCTGTGGA 

TGGTGCACATAAAGT CAAG ACAAATAAGAGGAAAAG AGGCCG AGGAAG AAC CT CTCATG CA 

TGCACJACACGAAAAGGAACTGTGCTTGGATGAATGGTCAGGCA^ 

7 0 ATCCC CACGCT CACACCCATCAGGATACAATACGGTT CC CATGAGGTATG AAAGG CTG G AACTX3CCATCCT CATTTTGG CAGGCTT 

G CAAACTGAGACTCG AGAACAAT AACAAGCTTTGC CAAAA CATAT CCCAAAG T AGG ATGG GGTTAAGATAAACACTAGTCAGG GC 

TTCATCCCGGACTTCAGGCCACATGTGTAAGTTACTGTGAGCTCAGAACTTTGACGTCCCAGCTCCTA^^ 

TGCTCACAGGGCATAATTAGAACAGGAAGAAGCCACTGAAAATAGATCCT 

TATAT ACAAATTGTCT AAATATG CACTG AAGAAAATAT CATGTGGG CT ATAGTTCTTTG CTATGTGCAAACATCTTAGG ATAAATA 
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ATCAAGAATTGACTCATCTACAGATCATTTT^ 

CAATCATCCCTTCATCGGGTATCCCAAGGGCTCTTTCT^ 

CTCCCCTCTGTGTGCTTCCrrCGGTACTCrG 

AAGGACCCACAGCTTCTGAAGGTATCTCACTAAGAGGATGACATC^ 
5 CTGATATCCTTAGCTTGATTCATCCCTCAGGGCTTGGGGCAATGACTTT^ 
TGCCTAGCCAGCCTGAGGGAGGCTCAGGGTCACrrATCATTAT 
CCGGACTGACCTTGAACTCACTACATAATGAGGATGATATTC 
CTCTGCTTATTCAGTGCTGGGGGTGGAACCCCAGGGCrTAG 

AGCTT AAACACTTTTCAAAAAGGCACCAG CGCTCCTGT CT C 
10 " TTGTTGCAGGGGCAACCTGTCCTGAATCCCCAGTATTCTCrGAAAACT 

CCTTTGGGGCTGGAGAGAGAGCTCCGTGGATGAGAGAGTTGGCT 

ATGAAAAACCATGGGTGGCCATGTGTGCTCTTGTAACCCCAGTACTGTGGAAAGTG 

CACTAGCTITCAGACTCAGTGAGAGATCCTGACACAAGGGAATCAGATGCTGAGTTGTAGAA 

TCTCCATGTGTGCACAGGCACACACACATGTGCACA^ 
15 ACACACACACTGTCC^TTTGTGCXrrCCTGAAACTAGTTCTGCACACT 

AGTTAGCGTTGGGGTGGGTTGTGTGGGCACAATGTAACAGCCACCCCTAAGGAAC 
TGTGTTGATGGTGGCTGATTTCGCACAGTTAGATTATTCrTTAGTCCAT 
GACCACTGATCTCTGACTTACAGACTCAGAAAAGCT 
2 0 CAGTGAAGTATTGTTTCTGGAAC^TTTCTTTACTACCCTTCGCAA 
ACACATGCTATGATCX3GCTTGGOTACCTGGGGACA 

TGAGAAGCTCAAAGTGAATGATTCCAGTACTGGAAGCCATCGX3GATGTCCC CTTTGTGCCCGC CCTTCCCGATGTGTCCTCCAAGG 
AAAGGAGAAGAAACT AGAATAATTGGTTG CTGAAGTCAAACATGGTG G TACACACCTGT AATCC CAACACG C CAGAG TTAGAGAT G 
GAGGGATGAT AACTAG C CTGGACTATACAGTGAGT ATGATTCT AG C CTGGG CT ACATG AG T CAT TGT CTCAAT ACC CT CTTTCTAA 

2 5 TATAGGTTCTTGATACTGTGAGCTATGAAAGTGCTTrTCTGTGTGTGG 

CAAGGCCTTAAGTCGATGACCAGGTGTCTTTCCT 

TCTCTCTCTGCACTTAGCACAACTAGCAGCTTGCCTAGGGAATGCCTTGT 

ACCTGCCCAGTTTTTATGTGGTGACTAAGGATTT^ 

CTCCTGCCCCTGGAATGTTTTCAAGGCTCTTTAAAGAGCT^ 

3 0 ATGATTTAAGCCCCAG<ZAGCAAGCAGACCCAAAG CAAGCCTGTGGTGGGTGAGAGAG CCTTCACACTATGAGCCTTGC CCCCGCCG 

CAACT G CTGCTGAGCCAGGTTC CAAGGGGGCTCTCCCAG AAGT C CTCAGTG CACAGGGAACAGTGG ACATCTGGAAGCGGTT AGT T 
TCAGTCCAGCCTGGAGGCCGGAGGATGGACTAAATGACCCCTGCAAGT 
TTTCCAGCAGGCTGGCriTTCTCTAGGGAAAAATGAATCCAGGGCTCTAG 
TGTGGAATCTAAGTTCCATGAGACAATAACTCT 
35 TCTAGCAACACGTGCTCCTCAGAGCACCACATGCATGGGAGACAGGGACCAGGAGTCCT^ 
CACCCCAATGTCCCTACTTGGTGCCCAAGTGACCTCTTTACCTGGATGCT 
CCTGATTCTTCCTACGCTGAGATTTTTC 

AAACTTCACCCCCCACCTCTGCATAGACCCAACC CCAGACATGTGGG CTGACAAGAATGTTTCTGATGCCTCTGAGTCTGAGGATC 
ATTATGGCGATAGCITCTTTGAACCCCTTTTTGGACACCTGTTTTACTAGTTGCT 

4 0 CCCCTAAGAATAGGAGCTTCAGACAGATAGAATCCAAG 

TGGGGTCCTGTTCAGGACAGCCAGTGCACTAGGCTC^T^ 
CATGCCTCAGCCCTTCTTTGAAAAAGCTGTTTATA^ 

ATCTTCCAGGTTCAACCATTGCTGACATTTGAGATATGTGTCCATCCCTACTCCCCAGCCCCT 
AATTCCTGACATTGTGTGTTATTCTGCCCCTAAAT^^ 

4 5 AAAAATAACTATGAATCCAAATTCCTAAGGCAAATGACTT^ 

AAACACGCGTCTGCATTTGGTTGGTGCCCATAAAAAGATTCAAAAATCCACACACCGCCTTTGG 
TTGGCCTCTCTCTTTCCTACTGGTAGTTAAC^ 

GAGGACCTGGGTTTGGTTCTTAGCCAAGTGGAATAGCTAAAAACCACCTGTAATTCCA 
CTCACACAAGATACrCTTGCATGCTTGTGTATACACACACACACACAC^ 

5 0 CAGTAATGTCTCAAATTCTAGGT^ 

GAAGGGGGAACCTCTACTGAGAAATGCCCCCACTAGTTAGGCCTGTGAT^ 

GCTCGCTATGATTGGTGCC^TCCTGGGTAGTGTAAGAAAGCAGGCTGAAGTGGAGCGAGCGAGTT 
TCAAAAACCCGTAGCCCTTGGAGTATGGCTGAACAACT 

CAAGGATGGGGATGGGACAATAACAACAAAGGAGCTGGAGACAGTGATGAGGTCCCTGGGGCAGAACCC 

5 5 AGGACATGATCAATGAAGTAGATGCGGATGGTAATGGCACAATTGATTC 

GACACAGACAGTGAAGAAGAAATTAGAGAAGCATTCCGTGTGTTTGATA 

C CATGTG ATGACAAAC CTTGGAGAGAAG TTAACAG ATGAAG AGGT CGATGAGATGATCAGGGAAG CAGACATTGACGGGGACGGTC 
AGGTGAACTATGAAGAGTTTGTACAAATAATAACAGTG AAGT CAAGAC ACT GTACAGAATGT 
TTATTTGCCTTTTCTTTGTTTGTAACTTATOT^ 

6 0 CTTCATTCC TCCATGTTTTCTTC C CTTAT CTTACTGTCATTGTCCTT AAACCTT ATTTT AGAAAATTGAT CAAGTAAC ATGTTG CA 

TGTGGCTTACrcTGGATATATCrrAACTTAGATGGAGTTGGTCCAATGAGGGAACATCTGGGTTA 
AGGAACTGTCAGCATGTTGTTGTTGAAGTGTGGAGCTG 

CACTATTGTAAAACGG GTTTTT ATCCAGGTACTTGGACAC TAATTTT1TTTGT ACTGCTGGTATTG T ACCAGGAACATITl'Ci' TTT 
ATTGTTACTTGCTCTTTAAACTTTGTTTAGCCACTTAAAGAAAACCT 
65 ATATTTGTTTTCCANNNNNNNNNNNN 

GGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAAGAAAGAAAGAAAGAAAGAAAGA 
AAAGAAAGAAAGAAAGCAGGCTGAGTAAGCCATGAGAGACAAGTGTCACTCTTCCATGGCCT 
TCCTACCTTGGGCTCCTGCCCTGATTTCCATTC^^ 
TCCCTGAATTGCTTTTGGCCATTATATTAT 

7 0 TGTGGCTTAGGTGGTTGTTGTTGTTGTTGTTGTTGTTGTTC 

T CAATTAAAAAGGCTTCCATCAG ATTCATCTGGAAG CCTGTAGAGTG TTTTCAT AATGAATGATTGATGTG AGGGGACCAAGCCCA 
CTGTGGGTGGAGCCATTCCCTAAGCAGGTGGTCCCAGGT^ 
TTTAGGTCCTGTCTCTAGGTTCCTGTCTTGAGTC^ 
TTCTCTGGAGTTGCTTTTGATCATGGTGGTCATCTCAGC 
7 5 TGTGCTTCCTATTGAGATGCAGTGGAGGCAGAGGAAGGTGGCT 



2017 



WO 03/008583 



PCT/US01/51291 



AAAGTTGAGTTCAGGCAATGAAGAAACCATTCGGAAAAAAAW 

AGGCCCAGGCATGTGTCAGGGGTTCCCCTGCATGGAGGCCTA^^ 
GCCTTGAAGATCCCAAGATGTTGGAGACGCC 
5 AGAGAGGGAAGTGTIGCTGCAGTCAACAAAGCCGAJ^GGAG^ 
TTTTTAGTCTTGCTTTGGCCAGCATTTCCT 

TGTATGTTGGAAG CATGTGATCTGCTCTTTGATTTTGACTTT AT AGATGT^ 
GCTTCGAGGTTTGCACTTTCAATCATGTTGAGAT^ 

TGAT ATGGCTCTAAGTC CATGGGGGGCCAGGGAGTGGG ACGTGGTGGTTTC CTTATGTTTAAATG 
10 TTTAGTCACCAGGGAGTGGCACTATTTGGTCT^ 

CrCTCACCAGGTCTGCCTGTGCATTATCATGCT 

CACATCGGTATGAAGAGTTAAAGTAAATATCAGTGTGGACCACCTCCTAGTTCATAAAATAAA^ 

1 5 CCTCCCM 

AGTGCAGTGGGGCAGTGAGCAGAAGAGCAGGGGATGGCACACCCCTGTGAGCC 

TCCAAAAATACATTGAGCTGAAGGAAGATGGAAATACCATGCATTTCAAG^ 

GTGCTACAGGCTTACCCTGGAAAGAGGAAAAGGTGTGGGTCCCAAGCGCACACTCCCrGT 

T CAGTAGCAGTGGACT ATGG AGGAAAAGAGTCTCAGAAGAT CAGTGGAG CACTGGCCCGAGGCATACTG C C CTGGTGGGATTGCCC 
2 0 GAGGGAAAAGGAATTAAGGCCCAGGTGACCAACTCAGGATACACAGGGATGTTGCT 

AACCAAGAAACACAAATCACCACX3GTC CACTCAGTGTGAAGCAG AT AAC (^GAACACAC TGTGATGAATAGAATTG AAGTCAGGGA 

CTGGGGAGTGGGCTCAACAGTGAAGAACACTTGCTGCTCTAATACAGGACCrrcGATTCGATTC 

AACTGTCTGTAACTCTAGCTGCAGGGGAACGGCX3CCCTCTTCTGGCTGC 

GGCAGGCATGCACACAACACAT AAAAAT AAAAT CTTGCTGGCACAC CTTT AATTCCAG CATTCAGGC AG AGGCAGG ACTTTGATTT 

2 5 TGAGGCCAGCCTGGTCTACAGAGTAAGCTCCAGGGCTGCCAGGGTGACACAGAGAAACCCT 

AAACAAACAAACATCTTTTTCAAAAAGAGAATTAAGTC T ATCACATCT ATTTATTT ACT T AATGACTTATGTT C 
TCCCCTTCCACCACGTGGGCTTAAAGGCGAAACTCAG^ 

CACTAAATTCATAATTTTAACACCTTCAAAAGACAAATCTTTAATCCCAGGAACTGAT 
GAGAAGAATTAATCCTAGTTCTACTCAAATCAGCAAAAGATAAAACACAGG 

3 0 G AAT CT CAG T AAAAT AC CAG CAAGT ACAACTCAG CAAT AGGAAG AACTGCATGCACTGACAACTG AGGAAGCAAGGCTGTTATG^ 

GTAAAAAAAAAAAAAAAAAAAAAAACCCT ATTCAGAAG GGAAGAAG AAGAAAAAAAAAG AATTCCAAGAG CATTTT AAT TTGTTT A 
AAAGATTTATT'TACTTTATGTGTGAGTGCTCTGTCTGCATGCATACCTGTGTGCCAGAAGTGAGC 
CTGTGCGCAAACATGTGGTTGCTGGGAATTGAACTTAGGACTTTrc 
AGGCACCI^AGGTTTGGGTTTTTTGTTTTGTTTTGTT^ 

3 5 TTTTTTGTTTTGTTGTTGTTGTTGTTGTTGTTAAAACTTCAA 

GTAAAGAACGAATACACAATCCATGCAGTTACTGCTGTCTGTAGAGGTAAACGC^ 

AGG CAAG AATGTCTGCTTTCTGCAG CACTTAATT AATGAAATG CCAAAGAAACAAGG CAGGAAAATG AAAT AAAAGAG CACAG AAT 
GCACACACTGACACACACACACfCACACTCAC^CAC^ 

ACAAAGACTCAATGTAGAAACTTATGCCAGAACTTCAGAGATGGCrrCAGTGGGAAAG 
40 TTCAAGTCCCAGTACCCA.CATTGTGTGTCACACCATCCACATCT 

T ACCAAGC ACACATGTGGAG CACTCAT ATATGAAAGAAAT CAAT CT AG AAAT AAT CTTAAGT AAAT AAAT AAAG CTAAAAAGAAAG 
TATGTCAGTGTTCCCTACATTGCTAGACITTAAGCAACACAGT^ 

G CTCCAAAGAGGATTATGTTAG CTAAAATAATTTTGAAAAAGG AATATGT ACGTGGGGGGAAATCTrGCCTACGTGATTTTAAGATG 
TATAGCTCCGGAGTTTGGATGCAGTTCACTTGGTAGAGT^ 

4 5 CAAGGCTGCATCTATGTAGCAAGGTCTAGGCCAGCCAGGTCTATACATGTGAGTCCTGTCT^ 

AATGAGTGCAGGAAAAGACATGAGCAGCAGCATTC^TTCGGGGTGTGTGAACAGAGCCATACGCTCTTGG 
ATGTGAGATTGTGCTGTCACTCTGGAAGACTGGGGACTTCTCTGGC^ 
AAAGGAATGAAAACAGACATTGGCATATGAACCAACATATGAA(ZATGTCT 
AACAACTCAGCAATCGGAGAACAGTTGTGAATGTAGAAAGGC!AGA 
50 ATATCAATAGTGAAGGTCT^CTCAGCCACGCCCCrcCCT 

TCATCTGATGTGAGGCCCAAGGAGTCTGTATGATAAAATCAATTAAGTATACACAATOT 
GGTAGAGATG<IA(^C^TGTGCACATGT<3TATGAGGAGCATGT 
CCTCACTCTTTGAGACAGAGAGTTTCTGGTTGAGACGACTGTCTAGGAG 
GTCTGCTCTGGTCITCTAACACACAACITTACTGAGTCrA^ 

5 5 CTGAGCCATCTGTCAAGTAATTATTATTCTTTAGACGGGTTTC!ACTGTGTAACC 

TCAGGCTCCTAAATATTGGAATCCCACGCATACATTACATCTGGCCTCATCATGTCTTATAATGTG 
AAATAATTTAACCAAGATTTTTATTGAATATA 

TACATGCATGTCTATGCATTGTGTGCATGCAGTGCCTGTGGAGGCCAGAAGAGGGCATCAGATTCTCCTGAAACT 
GAGTTGTGAG CCACCATGTGGGTGCTGGGACTCAAAG CCAGGTTCTCTTGGGAGAACTG ATGGTGCTCTTAACTACT 
60 CTCCAGCACCAGTGTATCAGTTTTCMTGAATATGTTTGTC 

GGAGACTATGCTGTGAAACITCGTTCACATAACCX3GGTTCACC 

TT C CAACTT C CATG CCAAAAAT AACATGCGG CC CTGT CATCGGG AATT ATTTTT AAT AGTTTAT CAG CAG ACAATT CCT CTG AGTC 

TCCTCTTATATAGTTGAAGGCAAAGGAGTGGAGAACATCTTATCTGCAAAGAGCTGCCG 

TGGGAAAATCTACCCCAGGTCCTTTATCAGGCCTCATGGCTATGGCT 

6 " TCGGCACACTC^CTTTGCA CAG CTCCCTGGGTCCCTTCCA 

AAACCCTTCAGTCTGCTCTATGAGGTAATAAGTGGACAGACATCCTGAGCTTTAA 

AGACTGAATGATGATAGATACATTGATTAATAGATACTAGATGGTTGATTGAACAATAATGGATAATATATG^ 
TGATTGTTGATGTAGATGACATATAGAGCAGTTGGTTGATGATAGATCAGATATACATGATTCATGTTA 
7 0 GGATTGTTAGATGATGCACAGATAAATGAAATATGCATGATAGATTGATAGATTATAAATTGATTGAGATAGG 

ATATTGATAGAAGATAGAAAGGTGATAGGTAGATAGATGATTGATAGATAGATACGTAGATAGATATAGATACATGCATA 

CAGGTACATAGATATAGATAGATACATAGATGCATAGGTACATAGATACATACATAAATAGATAC^ 

TATAAATATATTCATACATACATAGACIAGATAGGTACA 

GATGATTGACGGATGAATAAGTAGATATTGAATAGATAGATAGATAGATAGATAGATAGATM 
7 5 AGGTGGTGAGTAGACAGTTGATGGCTGGACAGCAGATAGACAGTGGGCAGAGACAGACAGAGAACAG 
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CATTATTTTGATGGACCACCCTACCTACA^ 

AATCACTGGGAGAAAGAGAGTCACTAACCAAGTGGTATCAGGTTGTAAA 

TCGTTCAGCCTTGTGGTGACAGAAGTCrTGGAGTCTC^ 

GCACACCACTGGGTCAGGGAGGGGAGGGTGCATGGCTTTGTTC^ 
5 TGCTTAGAGGCACGCATGGCAGTCGGTTAGCTAAGGAT 

TCAAGATTGCCCTGTCCCTTCCAGATTCTGCGCA^ 

CCAAOTTGGTGTCACAGCCA&TGGGTCTCCCTG 

AGGACCCCCCAGAAAGGGAAAGAGCCACATTGAAGCCTOUIAC^ 

CTCTGGGAAGGAAAGGGGCAAGGCAGATTGAGGGGATGAGGGGACCGTGGA^ 
1 0 TGTTT CAGAG ATGGAGAG AAAGATACAACCG AGATTTGATGGAATGA CTT CTGGAAAAGTCCACACTTG 

TCTCCAGAT ACACX^GAAT TTCCTGT ATCCTTGCCT CTC CCCTGAATAG CTAATTTCAGAACAG AGAAAGAAAG AT 

CTGTCCTTTTCTTGGCACCTTCAGTTCCTGAAACAGTGGAAGT 

TTATACTGACCAAAGGTACATTCCAGGTCTAAGGGCCTGTC 

GGGCCAGGAGAAAGGAATAGATAGGGCTTATCTTCATCTGTTCCCT 
15 GGGAGTACACTGATAGGCTGGACCAGTCTGCCAGTAGATGGCGGCCTACATAGTAACT 

GCTGGCTAGCATATACAAACACAGAGGGGACTACAGGCACTGCGTAGACAAGAGAACTCGGGACGGGG 

GGCCCTACCTGCTCTGAACCAGCTCCCATGTGGACTCATCCTTCrrcGGTCT 

CAGCCAATGGCAGATGGGACAGAAAACTGAGATGAGTCATTATCA 

CAGAG CCTGAAAGAGT AGACTG AG ATTTGTGAGCGCT AAGATAAAG ATGAGCAAAG CTTTGGCAGC TCTT AGGT ATCTG AGGGC CA 

2 0 CCGTCCT CT ACAAAGCAACGAG AGG CACGGCGGATT AGG AT AG ACTGGTTG C ATCCAAACACTACCTTGCTG C CTCAAAGGCTTAT 

TGGACACCACAGAAAGACCTCTGCTGGAGGCAGAAGTCACAGGACT 
GGGGAC^GTGACTAAAGGACTGAGGAACTGAGGCAACTTGGCCCGGGTCTAGTC 
TTCAAAGGACATCAGCCTTAGGAAGGGCTCCCAGGGTTGCTTTGAGGAGTGCGGGA 
ATTGATGACGAGGCAGCTGCTCTACTTGGCCTCCTGTTGCTG 

25 

MOUSE SEQUENCE - mRNA 

AATTAATTTAATTACATTTTACCTTTNTACCGAGTCCGAGTGGAGCGAGCGAGTCGAGCGGTTGT 

GTAG CCCTTG CAGCATGG CTG ACCAACTGACTGAAGAGCAGATTG CAG AGTTCAAAGAAGCTTT CT CACT ATTT GAT AAGG ACGGG 
GATGGGACAATAACAACAAAGGAGCTGGGGACAGTGATGAGGTCCCTGGGGCAGAATCCCACAGA^ 

3 0 A CAATGAAGTAGATGCGGATGGTAATGGCACAATTTGATTCCCTGAATTTCTG 

AGTG AAGAAG AG ATTAG AGAAG CATTC CGTGTGTTTG AT AAGGATG GGAATGGCTAT ATO CAGAG CTTCG CCATGTGAT 

GACAAACCTTGGCGATAAGTTAACAGATGAAGAGGTCGATGA^ 
ACGAAGAGTTTGTACAAATGATGACAGCGAAGTGAAGAC1ATTGTA 
rr rr C TTTGTTTGTAACTTATCTGTAAAAGTTCCCCCCCCCTACroTCAAAA 

3 5 CCIATGTTTTCTTCCCTTATCrAACTGTCATTGTCCTTAAACCTTATTTTAGAA 

TCTGGCTATATCTAAGCCCGTCTGCACATCTAAACTTAGATGG 

TAAAAGTTGCACTATTGCAAAACGGGTGTTTTATCCAGGTACTC 
TTTCTTTTATTGTTACTTGCTTTTTAAACTTTGTTTAGCCACTTA 
40 CAAGTTGTATATCTGTTTTCCAATAAAAAAAATTACAATTTACCC 

MOUSE SEQUENCE - CODING 

ATGGCTGACCAAC TGACTGAAGAG CAGATTGCAGAGTT CAAAGAAG C TTT CTCACTATT TGACAAGG ACGGGGATGGG AC AAT AAC 
AACAAAGGAGCTGGGGACAGTGATGAGGTCCCTGGGGCAGAACCCCACAGAAGC^ 

4 5 CGGATGGT AATGGCACAATTGATTT CC CTG AATTT CTG ACAATG ATG GCAAGAAAAATGAAAGACAC AG AC AGT GAAGAAGAGATT 

AGAGAAGCATTCCGTGTGTTTGATAAGGATGGCAATGGCTATATTAGTGCAGCAGAGCTTC^ 

GAAGTTAACAGATGAAGAGGTTGATGAGATGATCAGGGAAGCAGACATTGACGG 

AAATGATGACAGCGAAGTGA 

50 HUMAN SEQUENCE - GENOMIC 

\ ' ) TGGCTGGATTCCTGACCCAGCTCCGTTTTTTCCCATAATCTTGGAG^AAG 

CTGAACCACAGAACAAGAAATTGATTTCAGTACTATTCTCTCAAG 

ATTCAATAC^TAATTCAATGCCTTAGGATATTTAGGATTTATTT 

G CTACTG GGGG CCAGACACAGTG GTTT ATG CC TCT AATC C CAG AAGTTTGGGAG GCTGAGG AGAG AGGAC CTCTT G AAG CCAG G AG 

5 5 TTTAAGACCATCCTGGGAAACAAAGCAAGATTCTGTCTCT^ 

TGGGCAGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACATGGTGAAACCCTGTCTCT 

CGGGTGTGGTGGTGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATA^ 

GTGAGCTAAGATTGTGCCACTGCACTCC^AGCCTGGGTCACA 

TGGGCGCAGCGGTTTGTGCCTGC^GTTCCAGCrrACTTAGGAAGCTGAGGCGGGAGGATAACTTC 

6 0 AAGCTATGATTAC^CCACTGTACTCCAGCCTGGGTGACGATGAGACCCrGTCTTAAAAAAAAA 

TGCCCTTCACCTTGGTCATTAGCACAGATGGCTGGAGGAGGGAAGCTATGGGA^ 

CTGACCTGCAAATTGAGGAAGAGGTGTGGAGAGGAAGTACATTCTCACTTTCCCAA 

TACCAGTCAGGACAGGAAAATTCAAAGACAACCACCTTCTTGGCCT 

CAAAGT CAG G CAGCACTGT ATGTCATG TTGTGG AAG ACAAAT GCT CT AT AAAT AAATTG ACTTG AAACAGT CTCAAATACACAT AA 

6 5 ACCACAATGGCTTTAAATGTAACTCTTGACTTTTTTCTCATTTTCTT 

GAGGCTTTACGGAAATCCAACCTAAGAAACCTGGGAGGACTTTTATGTATCT 

TTTAAAAGCT CCTGGAGG CTTT AAAC TC CAG AGATGGAT ACATACAT CTCTG AGGA T ATTT CAGACT TAACGATGCAC 
TTGTT ATATGCTAG ACATCCATCTTT TT AAAAATT CATTTT C T AGAG G AGGTGATTATT CATTTTG TCATTCAATTTC CCAT CATT 
AAAAATTT CATGCAGT AAAATCACTG CAT ACTAATGG GGTACTGTGT AAAGAGAGT ATTTTT AGTG ACTAAGT^ 

7 0 CATTGATGGCTCTTCCATCTGAGGATTTGAGGAGTAATCAGAGACAA^ 

GAATG CAGAG T AATG AAACGGAGG C CTTAGGGCTC CGTTTT CAGCTCAG ATTCTGAAG CAGAGGTGTCCG CTTTT AAGTAACATT C 

TCTCTCGATGGTATTCCATTCACAGGATAAAGTTAATTAAATCTCCTTTCCCCCCAC 

AATAGAAGCACAGGGTCCTGAGTTTGCACTGACTGG^ 

CATTTAATGCCTGGTTTGCATGACTGAAAGAAAGAGGACCCACCTCTGGCCTGGCAAAGAAAAGATTTTTG 
7 5 GTCT AACTTGG CTAAAGGAGGTTAAATTCCACTGAAG CCTTCCAAGTG^ATGCAGGT AGCTGTTAGCATC!AGAAATGGTCTTGCTG 
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GG CAGGGCAATGTTCTCTAGGGAGCTGTCTT CAGT AGGCAAGATTT CACCTTGGCTTG CAWUUIATGCTGTTCACTG^TTTTAAG 
GGTCTACATGCTTGTGCTTTTTACTGGAAAATTCTCAGACCTTC 
GGATTCTACCAACAATCTTCI^AAGGAACTGCCTTTTGAGATTTAAAC 
ACGAGGCTGGCTGTCTACTTT03GCTTTAGTAATGCCTATCCCCT 
5 CCTTTGCTGTTACTTACTTCACCCAGAACTTGCTCCC TT C TT TCCT 

T AGGAGTTACCTTCTCTAGGAACCCTTTTTGGAAC CT CITTATAG AGG ACT AAGGTC CTCACCT CTGCAT CTCGTAAGCCCTTCTAT 
AGTT ACAATTAT AATCAAACCGAGTTT ACACAGTAGGTGTT CACT 
TTTTGGATGGGACCAAGCAGTTAGTTTTTATGTGGTTTTTAA 

TTT CCTATGAAGGGGAAGGCATATT ATAGTGAC CAAT TGTTGGAGTC CT AC CCTGTGC CAGGTACTTTAACACITTGTTAATTCT A 
1 0 ATCTTTGCATAATTCTAATTTCCT 

ACTCACCCAAGGTCTTGCACCTGAGAGAGGGACCCAACTAACT 
AACAAGGGGCCATTTCGAGATACTAATCCCACTGAAGCTCG<^ 

TTGGCAAAGAACG CGAAG AG GAAGTAT AAACGAAATACCCTTTGAGATTAGCCTGT CAG CTG AACAG CT ACATTTG CC CGTGTC CA 
GATTTACATGGGTTTTGTGAGGGCGTGAAATGAAGAACGTGCGGCTTC 
1 5 GGTACGCX5TTTATQ3GAATCTTTCftGCTTC^^ 

GGTGAGTGTG<3CAGGGAAGGATACGTGGAAGAGGAAGGTTCT 
CCAGGACAGCGCGGCCCGAGCAGCAACIGGCAGAAGC^ 

CAAACCAG CCAATCAGAGGCAGG CTCAACt^CCAACGGGGGACGCATCGCCGCCAC^ 
CTCAGGGGCCGCTCCACCGGCCTTTTAACGGCTCGCTGGAAATG^ 
2 0 TTX3CCTCATCGACCAAATAACTGTX3AAGAAAG 

TCCGTTATCTGGATGOSGCGGAGGGATCTCGCGGAGGGAGGTGTTTATGAGGCGCTGGG^ 
GAGAGAGCGAGCTGAGTXXjTTGTGTGGTCGCGTCTCGGAAACCGGTAGCGC^^ 
GCTGGGCCTGTGGGGGGAAGAGACTGGAGCGAACrGACTAAACAAAGGAAACTCGCCTC 
CCAGAGGGATGGCGCGAAAGAGGCCAGGGATGCCGGCGACGGT^ 

2 5 TTCCTCTAGTGAGTTGAGGGCCGGACX3CACAGTTCCCGAGGTGGTCAAGCTCCT 

ATG AGGG ACAGC C CAGTG CACGGGAAGGAG AG AAAG TGAAGG AAGGG AG AG AG GG AG C GT AG AGAG C CTT CTGTCGGGCGT CTGGA 
AAGGTCAAGCAGGGGGCGAATTCAATCCACGCGACCG^ 

TGCGCCCAGG<MGCGGGAGGAGCGCGCGGGGCCCTGGGGCGAGCGTCGGCGTCGCAGTTAGCCCGCGTCCTCCG 
CAGGGGGCGCGCGG^CGGGC<3GGCGGCGCGCTGTX3TGGGCC<3 
30 AGCGCCTCGCGGGACCCGGAGCCGCCGCCCCGGTCGGCCCCGCTCGAGCTCCAGCT^ 

TCTCCTTGGCGCTGCGGGGCCATTGGCGGGGATGGGCGGGGGAGGGGGGCCraSACGGTT^ 
CTTCCTTCrrCCATCTG<3GCCCrGGGGCGTCTrCCACCACCCAGGC 
TGAAAGAAATTAGTCTTACCCTTGAAGTCAGGGGCGCGTCCTC^ 
GTTCTACAAGGCGTTTCITGTTCACCGCCGGAGGGAAGCAGGCTCCGGA^ 

3 5 jyrGGATGCCTTTGAAGAGCCCCTGTCCCTATTCTATGCT 

AACATTGCTCCCCTTCTGCTGCTTTGAAAACGACCCCTAAATTCCGTGTAGAAGT^ 
TATGATGTTTGTCTGTCAAATACTGTGATGGAAGAGTGTATGCGGGGGAGGAGCAGG 
CAGACTGGGAGATCATGTTCTTTCCTGAAAAAAAAAA^ 
GTGGTTCCTCCTGCCAAGTGGTTACIAGCGATGGGTGTGTC^ 

4 0 ACTGAAACCTCCAGCCTGTATGGGTGTGGGGAGAGGATTTCACCT 

CATCTTGATGCTTCAAATGTTATTCTCAATTTGAATGTGCI^GTT 
AAACATTACAAGTTGTGGAGTAAGCTTTTTTTTT^ 

TAATTCGAATAGATTTTTTCTTGGTAGAGTTGGTATCTACGCATTTGTGGT 
TGCATCACTGGTGCTTTGGGGGTTTTCTGTTAAATTGACAGCTCTTACTATAAGAGACAAAG 

4 5 ATTGACAG AT AT CATTATTTGTGACAGATACAGTAAAT TTG AAAATTG CCTT CACAT AACTTT ATAATT AG AGG AC TGAGATTT TA 

CATTAGGAATCCAATTAAAAAAAATTAGCCGGACATAATC^ 
TCATCTTGGGCCCGGGAGTTGGAGACTGTAGTGAGCTATGATC^ 

CAAAAAAT AAAAAAAATTAGGAC C CCAAGTT AAAGTATAAAAAATTTTATAC G G AAGT AAAGTAAG ATTG CTCATGTAACTCTTG A 
GTTTACATGTAATCAACATATGCTC^TTGAAAACGGGATTGCTTCAA^ 

5 0 GTAAAAAGGTAGAAAATTTTTGTCACTTGAGTCTA 

CAAAGGATTTCGCTCTTTTAAAATATAGAAAGCTOTAGCTTCAGCTAGAATT^ 

CCATTTTGTTCCAGTGATCTTTTGGTGAGAGATG CTATGT AAG T ACTATTCTT CAG AATT AGGTGTCTTTT T AC CCTAATGAAAT A 
ATTT AGATTGCTTTTGAT ACAGGT AAAACAAATATCCTGG CTT C CATAATTGT AGAAAAAACTTCAT AT AGG AATC CTTGTTGT AT 
CAAAGTAG CACCTG ATGGGAATGAACAGACAGGAATGGATGAAGG AT AGCAGTTTG CGTTC C^ GGCTCACACA 

5 5 TTTATTCAGATAAGAACACCACCTTTCACTAGATAAACTCCAACAGTAT^ 

ATAGGTACATAATGTATTCACTTCAGGTCACTAATGTAAT^ 

AAATGAACATTCTAGTACAGGAGGTCTAGGGAGGAACCTGAGAGTATACTAATGCCrAGGAACTTTCT 

TGG GAAGAATT ATGT CAAT AG CT ACAG AAAT AAGGGAGTAAG AACAAGTCATCTCTC T AGTGAAT TCTTCT TC^ 

AAACATACATGTTAATGAGCTTGAGTTTTCCC^ 

6 0 AAGAACCAAATTTATTCGCCTTTGTAGCAGTTGGGAAAGTTAGTGGTAGGAAGTCTTCT 

ATTGCTGGTAAAAGTTTATTCTAAAACCTGAACTCTGGA 
TTCAGGTTATGTGAGTAATTTGGTCTCTTGGGTGCTTA^ 
GCTTGGAGGTAGGGAGTTAGACAAGATGGTATGAGGTCCCTAAATTTT 

65 CCCATGTAGAGAGAACGAACCTGTCTCCCITCCTGTACAGAGTACGGGATCCTTCAACTTTCAC^ 
CATTTAGCTGAACTTTTTTTTAGCCTTAAAGTGATGTCCGCT 

TTTTCTCAGC TT AAAGAAAGTTAACATTG AAT AGAGT AAGCT GGCTT AAT AAATG C CATG CATGT CT ACAT TCGGT 

GTGGAAATTGAGCTAGTCAGGTTGATATTTAACATTGTAGGTTCTTTGTTAAT^ 

AGGTTAGCTTTGTACATAGCATCTCACTTTGCACAACAACCCTGCAAGGT 

7 0 TGAGAGGAGGAGTACCACGTCCAAGGTCACACAGCTATTAAATGGCAGGGCTGGGA 

TCCCCCCACGCCACGCATGCCAGGTTGCCCTTCCTTTCIA^ 

TT CT ATTAAT ACAAAGT AAAT AACTCAGATTTACTGG ATTTTAAA CTGACTGACTTCACAC CAA 

AT ATTTGTTGG CGGAG G ATTTGGACTTT AGGGAT AAAAGT GG ATACATTTTTT ATT TTACAAACTCTGT ATTT G AACTT AATTATT 
GGCTCTTCAATTTTACGTTACCAGCTTTTTTTTT^ 
7 5 AGGGAAAATAAACTACTTTCTGGATTCCTTCTTGAATTTTCTGATGTGCCCTAGAGAAAATGT 
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TT C CAGGGGTGTGTTCATTTAAAAAGAATGAAG CCAGGCAATGTTT^ CITTT ACCTATAAAT AAATGAATGGATTAATCAT 
TGTATACTTGACTCCCATGTTGGTAG(3GATTTTAGATAG 

TCATGGATGTATATACTAATAAGCAGTGAAAGAAAGTGCATGTTCAAAGAATACA^ 

TAT AT ATTACGGTG CTCTGAATTAAAAGCTAAAAGTTACTGGGTATGT CTGACACCTTAG TGCTTT ATCTTTGTTCTACTAATTTT 
5 CTGTGCCCCAATCCCACTTAACCCrAGCCTCATTCCTTATCTGTAAGATAGGGGATAATA 
AATAAGGATAAAATTTATAATGGGTTTTAGCAAATGGCy^^ 

AGCCTTGGGCI^ACTTTGGGATITTAAAAAC CAAGAGAAAATGG ATGGCTGAACTTTCAAACATT TGGTAAATATTAT AGT ATTGT 

AGTTOVGAGCTCTGGATTCTTTGCACT 

ATTTCAAAACAATGTGTTTAACCATTTGT^ 
1 0 GATTTATATCTGGAATAGAAGTGAGCTTAAGTTTAGCTATTCT 

TATTTTCAAAAATATTCriXSAAGTATTTGTATATCr^ 

GTGCATTTTCTATCCAGATGTGTTTGTTGCTGGAACTAAATGAAACA 

TAACTGCTAATCTACATACTCrCAAGTaVCrAACCTTCCT 

AGGTGAGAAATACTCAGCTAGATTGTACCCATTAGATTCTTATTATAAAT 
1 5 ATAATTGACTTTGGTATATGTCATGGATACTAACATATGGGTATAATATAATACAGTAATTCAACCT 

TAAACITGCCGATGGAAGATGTGCAGGGCTAATTGTAGTTAAAGACATGTATTTTAGT 

GTGT AGTGGTGGT CTTTG'rrrrG'r TTTCTGAAGACTACCTT AG T AT AAAACAGGAATT CTGGGCT AG CTAACACGCT AGAAAAGGA 
GGCGAAGAAGTAACTAAGGCAACTTGAAATGGAGAAT^ 

AAC CTAAGAAT TGCAT AT AT AATCAGT CAGGGAAT AAC CATTT CTCAAGTG AAAAAATTT AAGAAGTTGGCATT CT ATT CT ATAAA 
2 0 TGTCACTGT AG CAATGTGGATTTTT CT CTTAAAAAACAGTGACAGCTGTTG TT AGTTT 
GCTATGGAAGT ATGCAAAATT ATTACAT AAAACCTGATGT CATGTT CTCTTTATG CCATGC CAT AGAACATTAAT ACTTTTGATTT 
ATCCTTGT ACATTGAGATGGCTGG TT AATGT CAATGTTGGGGAAACT TTTTTTTC CC CCCTTG CCGGAGTTG 'lTTT'i"i"rrrT TTTA 
TTGTAGTGTTG AAATGCITTGCAAACT ATTCCITTC TTTTATT AGCATGAG CACCAAG CAGTTTAT TTG CCA 
CAGTTTTCACAGTAAGCAGGAAGGGTTTGATAAGAGCTCAAAAGCACT 

2 5 CAGTTGGAACTATATAGCCCATTTAAAGCAAGCrCCCCACT 

CACCTCAAATACTATTTGATAGGACTGTAGGTAGAAAGCTATTAGTATTAAAGAC 
TGTAATTTGACTTAAACTTAGTTTTTACTTTCTGAGGT^ 

GGG AATAATTTTTTTTTTGTTG AGTTACACTGTTTTAAAGGTAT CACCTATTTGATCTCATGTCT CCTCAAAAAAAATAGACTGGG 
CCTGATGGCTC<5TAATCTTATCACTTTGGGAGGCTGAGGCAG 

3 0 TAGTGAGACCCCATCTCTACIAAAGTAAATGTATCGTCAAAATGTTTTTCT 

CAAATTGTCTCAGGCTGTCTAATACTTTCACTCTTAAACTCTGACCT^ 

T AC CAACTGTATG CAGT ACAGCATGTTCACTTTAGGGT AG AAAACG TTACTCAATGTG CCATTGCATAGTTGATGGTCTG CT CT CT 
TAAACAACTCTTTGAAAAGAAAGTTTGTTG<3CCGGGCGCAGTGGCT CACGCCTGTAATTC CAGCACTTTGGGAGGCCGAGGCGGAT 
CACITGAGGTCAGGAGTTCATGACC^GCCTGGCCAACGTGGTGAAACCCCA 

3 5 TGGCGTGCACCTGTAATCCCAGCCACTCAGGAGGCTGATGCAGGAGAATCACTTGAACCT 

GATCATGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGACT^ 
TCCCTCTTCAGTGATTGATAGCATTTTTATTTTTTAATTTATTTTGAGAC^ 
CTCAACTTCTGTCTCCCGGGTTCAAGCTGTTCTGCTTCAGTCTCCCGAGTAGCTGGGATTAG 
TGTATTTTTAGTAGAGATGGGGTTTCTCCATGTTGGTCA^ 

4 0 CCTAAAGTGCTGGGATTACAGGTGTGAATCACCATGCCTGGCCTGGTAGAATTCTTATAAAGA^ 

TGACATTCAGGACCTGTTCTCTTTATTGTCTTTAAGTTAAATCCrATCTTTTCTTG 
GAGTAGAGGGAAAAAAATCTGTTCTTAGGTTCrrCTTCTTATCCTGCCTTTTCCT 

AC CT CCAGGT T TCAG TT AAG AT TT AG GG CAG AGGAACCAAAACITCT ATGCTACTAAAAT AAAAAT CATGTG CGGTCAGGGACCTT 
TTTGTTCTACTTTGTAGGCCCG GTAC CTG C CACAT AGTAAT CATTG AAT ATTC CTTGAATAAATTTT ATT CT CAATG AGTGTGT AA 

4 5 CAC AAAAAAGTTCTAG AGCACTACTAT C CTAAG ATTTCTG C CTACT C T C CAAGTC C CTTC CTCCCAAATTTGGT ATATTT CAGTGT 

TTACCTTTGAGATGAAGGCAATTTTTTAATCTGGTAAATTGTAATTGTTGGTGGCT 
CTTTGGGAGGCTGAGG<IAGTGGGATTGCTTGAGCCCAGGAGTTCAAGACCA 
AAGAAAAAAAAATTTTTTTTTTTTTTTTTTTAGTTGTAAGTGCTGGTGACCGCT 
TTTTTTTTTTTTTTTTTTT^ 

5 0 TCACTGCAACCITTGCTGTCCAACTTCAAGCAATTCT 

CCCXSGCTAGTTTTTGTATTTTTAGGAGAGATGGGGTTTCACCATGT^ 

CTTACCTTGACCTCCCAAAGTGTTGGGATTACAGGCATGAG 

AGCATTCCAGTCTAGTATGACATCACTAACAATTTCA 

TTGG AAGACTG ATAG GAAAAATGTACAGT ATTTTTTCCTT AAG AAGTCTGT AC CT AAAC TTTCATTAA^ 

5 5 TG TTTG AGAAAAATATT AATAT TACT CAATGTAT AG CAACAACTGCATG AAGCGC CAAAAGAGTTCCAGT ACATGATCTTTAAGGG 

ATTATTAAGTTCAGAATTTTGTGAAGAAATAGAAGTTAATAAACAGTCA 
TTAATAAAATTTTTAAAACATGTTATTTTAATAAAATTATATGTTTAGGTAATAAACT 
TGAACAAATAAAACTTGAGTAATAGATTTTTCTGAAAGTTTCAGTTTG^ 
TCAATTATGTAGAAGATAGCTGTTCTGTGGCATTCAGGACCTGT^ 

6 0 T CAGGTCATT C CTGGGAGTTACAGTTT AGAATTAT ATGCTT CTATACAT AAAG ATTACG CTAGTT CTG AAGTACAGGAAAACTGT A 

AAGGCAGGTGrAATTAAAATTTTACTTTGGGGTTTATTATTTTGAGTTTTTACAGGTATTT^ 

T AAAAATTTGGAATAAAC CTTGACCTTTAAAAGAAATGTTATTTGTTCCTTTCTTG AATG AA TAG CAT AG ATA 

CGGTTTTATTTTCTGCATTTTACTTATTCTAGCAAACATTTTAAGTGTTATTAGAT^ 

TTACACTATGCATCTTCCAGTATTTAGCTTTTTGTTAAATCAT 

6 5 T GCAC CT AATGG TGGTGTTAAAAGATACCCATAGCAG AGAAT GTAATATAAACAAATAC CTCAAAGT CC TTT AATTTTCTTGTAAC 

CT AAATT AGGTG AAAGGTGCAATGTGTCTGAACT C CAAT ATATGT AT TT C T AG AT ATTT CTACATT CTGTTTAAAAACAGTAAAAT 
TGAAATGTCTTTATCAATTGTGCTATTTGTAGGTGGAGACT 

T ACTATCTT CAT AAAATCAATTAGTGT GGT ATGAACCTT CTG ATTGGT ATT AG A C CT CT AT AAAC CAGT AAT ATTTAAAT 

AAGAAAGAATGCAGTTAGGAGGCAATTGTAATAGCCCTGGCAAAGGACGATGGAGCCTTGAACCAA 

7 0 AGGAAAAATC^GGCAGTATTTAAATGATAGAATTCGAGACCTGTTGACAGATTTCAGAC 

TTTTGCTAGTACCTTTAGC^TTGCCTGGGAGTATAGAATTCATTTTCATATGTTTTGCTTG 

GTAGAC^GCCAGAAATAAAGACATTTATTC^GCGAGATGGGTAGACTAAAGATATCTTCGTTGTAAGTGGGTATAGA 
CAAGGAGTGGAAAAAAAGGATAAAATTGGTCACATCTAGACATAAGAG 
AAATCAAAGTACTGTTACGAAGGCCAAGAAAAATTTTGAGAGTTTTTCCATGC^ 
7 5 TTATCTTGACATTTATTAAGTATGTATTTGTTGGACATAGTTCTAAG 
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GTGAAT ACCC CTTGTGGG CATGGTXTACTGTGTCACCCCACTT ACTGGGCAC^TTT ATGT CATTTCACAGAAATGAGGAAG CTGAG 
GCAGAAAAGGTTTAGTAGCTTGTCCATATTGCATT 

AAGTCTCATTTTAGTTTCTATG CCATATGG CTTCACATTGAAG CATGCTTGATGGATTGGGTGACTGACTAGTAGTCA^ 
CTCAGCATTTTCAGGTGATGAGTTGGAGCTGGATGTCATTTTA 
5 GTGCCTTCCAAAATTTGGGGAGGGATAAGGGCAAGCATTTAAAATATAACT 
AGATATTTATGTGGTAATGGGCAAATTAGTGGAAGAATAGTTGA^ 
GCGAATTGGAGTTAGGGATAAGCAGGC^GGGAATAAATTGGCCOT 
ATAAATTTTCAAATGTATGGCTATGTTGGGAAAGCITAATGGCCCAATGAACTTGGA 

CTAATATTTT CIGGG AAATGACITACTTGGAAAATACTAG CTTTATTATTT ACTGATTGGT ATAACTTTGCAGT ATCAAAGTTAGT 
10 CTGTTTT AGTTTCrri' CTJLTCT'iT T TTTTTT T1TGAGACAAGT CTCACT CIGTTG CC CAAGCTGG AGTG CAGTGGTGTGATCT CGG 
CTCACTGCAACXrrCCGCCTCTCGGGTTCAAGCAGTTCnX3CTGCT 
ACCCAGCTTGTGTGTGTGTTTGTGTTTTAAT 

GTTTTAGTAGAGACGGGGGACTACAGGCACACGCCGCCACACCCAGCTTGTGTGTGTGTG 
TTTTTCTTTTTTTTTTTTTTTTTTGAG^ 
15 AGCTCTGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCT 
ATTTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCGTGTTAGC 

AACTCACCCTGAGCTCAGGCAGTCTGCCCACCTTGGCCTCCCAAAGTGCT 
TTAGTTTATCTGTAAATGGTGAATGAAGCCAGGTAGTCIGG 
2 0 AGAAGGAACAT ATAG C CGGG CACAGTG GCTCATGCCCTGT AATCC CAG CACTTTGGGAGGCAG AGACAGG CGGATCACCTGAGGTT 
GGGAGTTCGAGGCCAGCCTGACCAACGTGGAGAAACCCCATCTCTACT^ 
GTAATTCCAGCTACTCGGGAGGCTGAGGCAGGAGAATTGCTTG^ 
GCACTCCAGCCTGGACAACAAGAGCGAAAGTCCATCTCAAAAAAA 
TGAGGAATAATCAGTTTCACTATATAAGGTTGGGGTTTTAATATTCCAAAG 

2 5 ATTTAG CTAAAACTGGACATT T ATATCTTGATCAGC CCATTGTTATAG ACT CTTT ATAT AGTCCCTTCTCCTGCCTT ACT AGAAAG 

GGAAAGTAGCCTGTTTTAATTTAGTAGGTGATTGATTTAGACATAGTGGCTAACTCT 
CTCTGTAGAAATACGTATTTGGGTACCAGTAGTCTTATTTTTGTTTTGTGAGT^ 

ACAGAG CAG C TT CAGTG ACAG AGG AGC CAGTCATTTGGGG ATTTGGTTCTTTTC CTCTGT AAAC CTAGG AT AATTTTACTTAGGTC 
AGCATGAAAGTACAGTGG T ATGATTGGTCAAAGATAT ATTTG AAAT ATTAT CATAAGGT ATCAT AAAGATT GGTT AAT CTT TAGGA 

3 0 GTCAAAGT CACATTT CAATTG CTAC CT CCATTGGTATTTT TCCG TTAAAAGTTTTTGGGT'rrri' CTGAG ACGTGGGGT CTTACAGT 

TTGGCCAGGCTGAAGTGCAGTGGCTGTTCACTGGCACGATCCCACC^ 
ACATTTTATTATTAATCTGTGTATTTGGATTGTGGTCITr 
GGTGATGGAACTATAACAACAAAGGAATTGGGAACTGTAATGAGATCTCT^ 
GATTAATGAAGTAGATGCTGATGGTAAGTCTTCAATTTTGTGGGGCTC 

3 5 AACCTTTGCTGTTTGCTTATATGTTTCCACTAAAGCTTTGATGTTTC 

GAATTTCTGACAATGATGGCAAGAAAAATGAAAGACACAGACAGTGAAG 

ACGTGAAGGACTGGATATGTTAAATTTTGAGGATGAAATGAAGATACCAGGACTC^ 

TTTTTTTTTTTTTTTTTTTTTTTTTTTTGATGATTTAC<ZAACATATTTT 

ACTTT CTAATAT ATGT ATTGCGTTCAT CTTGT CAAAGGMGTGTCACT TG AC TTTGGGACT AGG AAATAATGATTTCAATGTAGGA 

4 0 CATTTTGCTTGGCTCTTAATGACATTTGTAGGTCAACTGTGTAGATTTGTATGCATTTCTAAAGTA 

ATTTAGTAGTAAACTTAATAAATTTCCATTCCAACTCTCAAGTC^^ 

GGCAATGGCTATATTAGTGCTGCAGAACTTCGCCATGTGATGACAAACCTTGGAG^ 

GATCAGGGAAGCAGATATTGATGGTGATGGTCAAGTAAACTATGAAGGTAAATGTTTCAGTCT 

TTTT AAGATAAGAGGCTCTTTGTTAGGG AACTGATCGAAATTAGTGACCCTTCC CCCCAAATTAGGTTG CTAAAACAGTTACTTAC 

4 5 ATTTAACCTTCTTAAAATTCAGAGTGTTGTCTGCAGACTAGGTAACATTGCTCTG^ 

TCAGGTT CTATCCCTACACCTGGAATCAGC ATTTTAATAGGATTCTTGGGTGAT TGTTCAGT 

GTTTGTGTGATTGTGGAACAATGTGGAAAGCAAACTTATTCCAAGTTTCCT 

GTATTGTCTTTTTTCTTTAATTGGCTTTAGTCTGAAACT 

TGG CCATAT CAGGAAG CAAG GCAT ATGAAACTTGAAGG ATTTTG AGT AAAACTCTGTATTTTT 

5 0 GAAAAGAG CATAAAATGTAATGTT AT AAAAATTAAACCTAGCC CAT AG CATTGAATTTACAAT ATTGTCTTGT AAT CCAG CAG ACT 

TAAAATTCAAGTCTAAACTAAGAGCCAAGAATGCTCTTTTTTATAT^ 
CTGAATAGTTATGGTTAATGAAGTTCCAGCCTGGGCAATAAACTAGCT^ 
AATGGAAAGAGTACTCAGTAATTGAGTAAATGTGATTTTTTTT TTTTTT^ 
ACTTCfTTCTTTTTCAGAGTTTGTACAAATGATGACAGCAAAG 

5 5 TTTATTTGCCTTTTCTTTGTTTGTAACTTATCTGTAAAAGGTTTCTCCCTACT 

TTCATTCCTCCATGTTTTCriTCCCTTATCTTACTGTC!ATTGTCCT 

GTGGCTTACTCTGGATATATCTAAGCCCTTCTGCACATCTAAA 

TTTTTTAAAGTAGTTTTCTTTAGGAACTGTCAGCATGTTGTTC 

AO^TATGTACTTAAAAGTTGCACTATTGOVAAACGGGTGTATTATCCAGGTACTCGTACA 

6 0 TACCAGAAACATTTTCTTTTATTGTTACTTGCTTTTTAAACTTTGTTT^ 

AATCCATTCCAAGTTGTATATTTGTTTTCCAATAAAAAAATTACAATTTA 
TGAAGTCTAATAATTTTGAAAATAAAATATGGCATTGGTTTCTGCTTGGTATAGTOTAGAT 

ATGG AGATGGGTGG CAGAAATAG TGAAGGTGGGAAAGGAAGG AGAAAAGGATTG GAAAAGTG AGAAAAGAGGG AGAGAAGCACA 
AAATAAGAGGAACAAAATAAGGGGTGTAACTAAGGAGAGATTGAAGAGGAAGGCCGAAAAGATC 
65 GTGTAGTCACCTTTTGGGGGCCACCATCATGC^ 

ATCCAGGATACCTGTGAGTTGCCTCTGCCCACAGAACAAGATGGTGGCAGTC^^ 
CTTTATCCAGATTGTATTTGACAACTTCAAAA 

AAGTAGGTATTTACTGTTTCCTACCTTTATAGTTTTCAACTTTGGCATTTGTCCATGT 
ATGTTTGCATGTTCAAAAAAAAACCAAATCTAGTTGGGGTGAGCATCCT 

7 0 GTG CTTTTTAATTCT AT AAGATG AG AATTTT CAG CTTGCC CATCTG CAACATTTTG CAAAAAAAGATGTCTTT AGG AG TAAATTGG 

CAATGACTACACCTTGTTTTAGTTTTTGCTGGCTTCTATCATTGCrrTTCTCTATAA 

TTTTTATTTT TTTAAAGT ACCTTTTAAATGC TTAATGAAAG ATTCCATAG CAAACCAGAGC CAAG AT TAAAACATTGAGTGTAATT 
GTTACCrGGTTATTATAGCTACAAGGTCTTTAATTTCCCAGGGGTAATTCAT^ 
ATACTACTATTAGTATGTTTTAGAGCAAGGGAAGTTACAGGCATTGATA 
7 5 TAGGAAGATTTTAGATGTGCACTCTACTCCCCTTAATT 
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TGTAAAGACAGGGACTCATGTTC^ 

GGATTATGAGTGGGGCTTTTTTAAAAGTTAAGTC^ATAA 

TCCATTAATTGTGCrATTAAACCAGTTATGACCCTOAAAAATGAGGGACT^ 

GAGGTTTATATAAATTGATATCAAGGTAXAATTA^ 

TTGGTCATTTCTATTGTTGAAATCTTTACAAAAQTCACGAAGT CTTGGCATTTTGATTTT ACAGGAAACGAG TAG AATGATTGTAT 

AATTTAACTGTGGTCATAATGAGCTGGGTTCAAATCTGTTCCAATGCTT 

ATCACCTTCAAGTT* fl X 5TT T TT GTTTTTO 

CAATCTCGGCTCACTGCAGCTTCTGCCTrc 

CACCACGCCCAGTTAATTTTTTGTATTTTTAGAAGA 

GATCCGCCCACCTCGGCCTCCCAAAGTACTGAG^ 

ATTAGTGTTATATTAATAGGATTTTGGTAATTGTACCAACCATAATACTGTCTG^ 
ACAACTATTGCAGTTAAATATCACCCTTCGTGT^ 

TGTAACAGAGGCACTGTTGAGCTCAATTATAGATATGTATTCTCAAGAGAAATG 

GGAAATAAATCTCAGTGATACTATATAAGATAATGTTTTCTTAAAGGAAC^ 

ATAGTCCTTGAAATCATTGACCCATGCTGTTTCTATTTTTGAGA 

TCACACCTTGTATGGGTTTCTGGACTTCTCTGTACCTTTCATATTCT^ 

GTTTTACAGGCTCAGCCAAAATAAGAAAATGAAATGAGGCACAGGGAATGGACT 

GTATTGCCTCTTCACTTTCTAATTTCAGT 

GAATCCCAGTATTTTAAGAAACTAGATGAAGTCCTCTAGTCTCTTGCTCCTCAA 
TGGAAATGCAGAATCTTAGGTCCTGCCCCAGAATTGGAATCAATCTGTTT 

AATTTTTG AAAAGCACTG CTGT AATCTAGT CTTGGATTTACTTAGAGGGGAAAACTAGTCAGTG GTTTATT CAACAAATATTTTAG 
T CCTTGTGTT CAAAGGGG T CAATTAAAAAC CGTGTTTTGGGTGTTTAAAATT AGAG GAG AAACG AT TTGTCATAAAATG AATCTT C 
CAACAATGATAGGCACAGGGTCCTTTGGGACTTAGTGGATTAGGCTTAGAGGTCTACT 

GGTGGTCAAAGAAAAG AATAG CATTTCATG CAGAAGAGCT AGCAT ATGTGGAGACAAAT AGAAATGTGAAGATGGCATGTT^ 

TTTTTGCAAGCATGATACAGGAGAGCATGCCATCCTAATAAGAGAAGTGGCC^ 

TGGGGGACCGAGGTGGGTGGATCATGAGGTCAGGAGTTCGAGACCAGCCTGAC 

CAAAAATTAGCTGGCATGGTGGCGGGCGGCCGAGGTTGTAGTGAGATC^ 

TCGTTTCAAAAAAAAAGAGAGAAGCGAGAGACTGGATTCTAAA 

CTGCACTGTGCCAGACATGGTTGTGAACAGTTCGAGTTAATCTTTTTTATAA^ 

TCTCT AT AG ACAAGGAAAT AAGCACAGAAT AAGTCATTAT ATG CAAC CAGCG T CCAAGTCC TGGT ACT CAAGGGAGG AG AG TG ATT 

CAAGAATGAGGGAATAATCAATGCAGCAATGAAAATTGAAGGCCAAGTTTCATAAGAGGTGAAA^ 

GGATCCCTGGTGAGTTCAC4GTGCAATTTTAATTAAAAGCTC 

GAAAAG AGTCCATATTACTCTTGAAAGTTTGT AAAG^G AAGCAG AAACTT CATGGTAT AGG GCAGTGG AG CT CTAGAGTCAGG T CT 



AAAATGAAACTAATACCTACTTTGTAGGGTTGGTATGAAGATTAAATGAGACTAATGTGTGTGAAGTCCCT 

CTGT ATAAAATGAAATTAAGGTTGTGG AGGGTTTGT TTTT AAAGGTG AG AATG TT CC T AGGTTAAG AAG AAGCT ACAG C CT AGAGA 
GGCTGACAATAAAAAGGGTCTATACTGAGAAGAATGGGGTGGTCTGTGGTTTCAC^^ 

CT C CACT CTG CAAGT CAG AAGCAAG AG AAGT CAG CC C CAGAG TTTCT ATTTGG AGG AG CTT AGAAACAATCTT AG AAAGTGGGTGC 

TAAGGCAGATTTCTCTTAAGTATGCCTACCAAGCACACTGCCTCCCTGTCTGGGGTGTGC 

AATGGAGATTGTGGGTACGCTCAAGCCCTTAACAAGCCTCCCTTTTCGCAGATC 

CCTCTAGATTTTCTTCTCACAAGCrrrTCIATAACCGTCT 

TGCXSTGCCCGCTTCCCTTCCCTGCAGCACTAAAGAOUACCT 

CACGGGCCACACCCGCCTCCTAGCAACCGGAGCTCAG 

ACCGCTTCCACCTCAATAAGGGAAGACCTGGTGGGTGGAGAATCATTCATCACAGCTT 
TGGCTCCTGCTCCCCTCACC3^CCCACrA^ 

CTACCCTTCTCTCCCTGCCOTCACGCACGTCTCC^CCTGCGGACCTTCCTTCCTG<3T 
CTTTTCCCCrTGCTTCrTCCCTTTTCCTTCTCOT 

ACAAOTGCTTTAATATGGTGATTGGATTTCTTTCCCTTTCCGCTCCATCTGCCG^ 

AGACACGTTTTGCTCCGCCTCCTACTTTAATATTTAT^ 

CTCTCACCCAGGCAGGAGTG 

CTCCCGCGTAGCTGGGATTACAGGCATGCACCGCCATGGCTGGCCTTTGTTAC^ 
CCTAATCTTTTATCTTTCCCCTTAGAGTGAATTTAAAGTGGTAGATA^ 



G G AGG CCTGT CTTGAC CTGC CTG AGAAAGTT GTATTT CTG TG ACT AT AAAAGTG CAACAGAGAAG ATT CAACATAAAT CTCG CG AA 

G AAGT AT CAAAGAAG AAG CT CAGCTTTAGCAGTGTCATGAGAAAATGTTTTT T AAAG AAATT AACAT ATTT CAT AATAAAAT AAAG 

C C CAGTGGTACT CAT AAGTACAGT AGATTT AAGT TACTAG ACAGGAAG AATGAATT AT AT CAGTTTT ACTG ATT ATTACTTTTAAT 

G CAGTCTTG CTT GATG ACCTATATTTTAT AGTAATAAAGG CATTATAAATGAAAGG AG AACT CAACATTTTAATGATTTTAT CT AG 

GACAAACAGCAATATACTAAACGGTTGTGTTCTGTGGACAATG 

GTACTAAATATGAAGACAAAAGCTTCCTTGGAAACTGAGGAATACCAAAA 

AATTAAAGCTACCATTTATTAAGCACTGATTC 

GGCTTCTTGCTTTGTCGCC(^GGCTGGAGTTGCAGTCATGTGAT^ 

TCGTGCCT CAG CCTCATGAGTAGGGATT ACAG CCTTCTGAGGAGGG CTAATGTTTTTGT 

ATTTTTAGTACAGATAGGGTTTCAACATGTTGGCCAGGCTGGTCTTAAACTCCT 

AAAGTGCTGAGATTACAGGCGTGAGCCACCACATCTG 

TACTACTGTGAGGAAGCACGAGGGTTGGGTAACTCACCCAGTGCAC 

GTCTGGCTTCAAGCCTAACTCTTTTCTTCCTAAGGAC^ 

GGTGTTTATTACTOTAATTTTATCAAAGGGAATTTAGTTATTTGAAAGGACT^ 

G CAAATTT ATTATT ATGT AGCTTTT ATTT T AATG AAG ACAT AT CAGTTGGGT ATGG GTGT GAATTT CTTCAAG CAAATGTTT TATA 
AATTATATTTAATTTOSAGGGATC^CCTCTCAAATG^ 

CTTTTGAAAG AGAAGG ATGGTGGAGAATAG CATT AACAGTATGTACATTGTT CAGT CTTTCCTATTCACATT ATTC CT ATTCCATT 
AAT ATTTT AAT CAATT ATAGCTTTT AAGACT AACA.CTAAAT TG T AAGAAAACAGTGATT GTTCAGATCTGC^ CT CATGT CA 

TACT TAAAATT TTAGTGTCAG CAT ATGAATAAGAAACTAACTTTTAA CAAT AG AAG CTTGGG AAAGTTAAACAATTC 
T AGAAATCT AG GTG CTTT ACAG AATAT CTAGAT AAGG CTAACCT ATG CT CTTATGATCTGGGTGAGAT TG CAAAACAAAAAAAGCA 
TACATTCTGGAAAGGTGTTTTGGTTTCAGTTTTTATCTTATATGGTAAAGACCTTCTGCAAATGGT^ 
GAAAGCAAAATGGTAATATATGGAAGAAAAATGACCAGAAGATCCCCCT 





2023 



WO 03/008583 



PCTYUS01/51291 



CAATGCCCTTCAATAATAGAGGGACTTCATGGGTCAAAGCCrTCTTGGAAT^ 

TCAGTGGAAAGATGG CCACAAAAGCAGCAGTGACTAAACGTT CAATT CTGG CCATTCAGC CACACTGGTTGTAATTTG AAAGTAC C 
AAACCTCTTATTGGACTAACT AGG C CAT ATGCTCTG CCAGTTT ATCCTGGTGTCATGTCATT ATCCTTC^^ CTTCCCA 
CTAGAGAGAAACATAGTATTGGCTAAGCAGATGTAGGATAGCTTCT 
5 CTGAAATTCATACTCAAGCTTGAATCCCAAAGTT CGG CCGGGCCTGGTGG C 

T CACGCCTGTAATCC CAGCA.CTITTGGGATGCCAAGGCX5GGCGGATCACAAGGTCAGG A CATC CTGG CTAACACAGTG 

AAACCCTGTCTCTACTAAAAATACAAAAAAAAAAAAACAAA^ 
CCAGCTACTTGGGAGGCTGAGGCAOTAGAATGGCATGAACCTG 
. CAGCCTGGGTGAC^UIAGTGGGACT CrGTCTCAAAAAAAAAAWUU^TTAAATTT CAACTCTAACCCX3TTTG CTATAATTCAG CAAT 
1 0 TATTTCAGAAAGGAAATAATCATX3ATTTTGAAATACTCTTCT 

AAACAAAGAGAGATGTTACTTGGGAGCTTCCCATCCAAGAATGGTGTTTTA 

TGTCAATTTTCCACAGGATACTCCr ATACCTGGCACTTAC CACTTGAAAACTTTT ATTG AAG AATC CCTAT T AAAT CCAGTG ATAG 
CAACCTACAATTTTAAAAACGAAGGAAGGAAAAAG 
ATCCCTCCTGACTTCCTGGACCTGTTAAAGAAGCAAGTGGCTACT^ 
1 5 AGTTGACAAAGATCAGGTAAAGCGCTACTGG(IAGATTGCTT^ 

CGTAGAGTATATCTTCCCCCAGGAGAAATGGAAAACCTTTAAGATGTCT 

GQUUVTGTTTCCAGTAAGATTTTTCCCTATAACAT 

CTCTTGGTGGGATTCCX;CTAGCCTGGAGAGC!ACCTGGGATC!TTATCT 

TGG TGAACC CT CTT CTTTTAAAG AGAAAACC CTG C CT CT CGGGGAG T ATCCTT ATTAGAG CATAAAT AT AACT ACATACCACAGGG 
20 AAG (TTTTTAAAAACTTTTCrTATAG AAATGACC CAT CTCTACAAAAAATAAAGAAAAATAG CCGGGTGT AG T AGTGCATGCCTGTG 

GTCCCAAC!TACTCIt5GAAGCTGAGGTGGGAGGATCACTTGAGGCCAGGTGTTGTAG^ 

CTCTTT AAAAAAAAAAAAAAATAG CTT ACTTGGGAAGC CAGGG CAGGAGG ATCCCTTGAGC C TGGGAGTTTG AGGCTGCAGTGAGCT 
ATGATTAT ATGACTGTACC C CAG CCTGGGTGACAGAGTGACATGCTGTCTCTAAAAATATTTTTTTTAATT 
AGAAGTGAGCATGTGGTCCCAGGAGGGTCAGCTTCTACTCTCAGCrrCCACCAGT^ 

25 

HUMAN SEQUENCE - mRNA 

AGTCCXyVOTGGAGAGAGCGAGCZTGAGTGCTTGTGTGGTCGCGTCTC 
GAAGAGCAGATTGCAGAATTCAAAGAAGCTTTTTCACTATTTG^ 
TGTAATGAGATCTCTTGGGCAGAATCCCACAGAAGCAGAGTTACAGG 
3 0 TTGACTTCCCTGAATTTCTGACAATGATGGCAAGAAAAATGAAAGACAC^ 
TTTGATAAGGATGGCAATGGCTATATTAGTGCTGC^ 

AGTTGATGAAATGATCAGGGAAGCAGATATTGATGGTGATGGTCAAGTAAACTATGAAGAGTTTG 

GAAGACCTTGTACAGAATGTGTTAAATTTCTTGTACAAAATTGTT^^ 

TCTCCCTACTGTCAAAAAAATATGCATGTATAGTAATTAGGACTTCATTCCrCCATGTTT^ 

3 5 CTAAAACCTTATTTTAGAAAATTGATCAAGTAACATC 

CTTAGATGG AGTTGGT CAAATGAGGG AACATCTGGGTTATG C CTTTTTT AAAGT AGTTTT CTTTAGGAACTGT CAGCATGTTGTTG 
TTGAAGTGTGGAGTTCTAACTCTGCGTGGACTATGGACAGTCA^ 

TATCCAGGTACTCGTACACTATTTTTT'TGTACTGCTGGTCCTGTACCAGAAACATTTTCT^ 
TGTTTAGCCACTTAAAATCTGCTTATGGCACAA 
40 CAATTTACCC 

HUMAN SEQUENCE - CODING 

ATGGCTGACCMCTGACTGAAGAGCAGATTGCAGAATTCAAAGAAGCTTTTTCACTATTTGA 

AACAAAGGAATTGGG AACTGT AATGAG ATCT CTTGGGCAGAAT C C CACAG AAG CAG AGTTACAGGACAT GATTAATGAAGT AGATG 

4 5 CTGATGGTAATGGCAOtf^TTGACTTCCCTGAATTC 

AGAGAAG CATTCCGTGTGTTTGATAAGGATGGCAATGGCT ATATTAGTGCTG C^GAACrTCGCCATGTGATGACAAACCTTGGAGA 

GAAGTTAACAGATGAAGAAGTTGATGAAATGATCAGGGA 

AAATGATGACAGCAAAGTGA 
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Table 99 

MOUSE NOMENCLATURE 
ICSGNM Ncf4 
Celera mCG13347 

5 

HUMAN NOMENCLATURE 
HGNC NCF4 
Celera hCG40038 

10 MOUSE SEQUENCE - GENOMIC 

TCCTAATGATGGGAGG AG CAG CTATAGGCTGGGAACACAGCTCCAGGAGG CTGGAGTGGATG CAGCAGGAAGAGGGGAGGTGGTCC 
AGCAGTCATTG CCATGGTGTGGCC CTAACAAGGGACAGCTTTTCC CTGAAGACC CAGGGTGGAACCCCT 
CTCAGTCCCCCAGGGGGTGCTGTTGGGGCITGGAAGGTCAGCCTCCTGGCCCTCT^ 
TGTCTCTGTCTGTCTGTCTCrCTGTCTCTGTCTCrCTGCCT 

1 5 TCT CT CT CTCCCCTACACACTTGTG G ACCCT GTTTGAACACT AACG ATTTCT ATAT CAGTGTGACTTGTG ACTT GCTCTAAACATG 

CTGT<2TTGCATCTCAGTCCATAGCAGGAAAGGGCAGCT 
GGCAAGAGGTGTCCTCACGGAGCACAAACCTTCCCAGTGCT 
CTGGAGTGAGTACAGTTAAATACAGAGCAAAGGCTGCACCTGGGAAACTG 
CCTGCCATGACGACAGAACAGCAGCCGCAAAGCCCCAAGTGTTTACTATGTAGCCTTTTCTGA 

2 0 TCATTGGGCAGAGAGCGCTGTGGTGATACTCTGTTCCACACTGACAGTCrGG 

GTGGGAGGAGAGGAGGGAGATGGGAGATGGAGGGTGTGCTATCTTCCCTTAGCAAATACGAGAGTAGCCCCG^ 

GG CACTG GTCAGCATATGGTACT CTGTCCCATGCAG CATTG AGG CAGGTTGGTGATGT ACTTTAAGTGTGGTTTGT CAC CCAAGGA 

TCCCTGTGTTGGAACATAAGAGGAGGTGGCAGAACATTTTTGAGGCGATCCAGTGTGGGGCCTCTGA 

CTCAACATGTAATGAGTTTATAO^ACCTGGGTATATAGATGCATAAG 

2 5 TTATTTTAAAACAAGTCTTTGATATATATATATAT 

CAGTG CT ACT CAGGAAATTCTTGTT ATTTTTAATAAGTAATT AAAT CTTGGTT AAATAGATAGTCCTG CAG GACATCG AAAGT CTT 

CAATGGTTTTTCGAGTTGATATTTTGAATTATAACCATCAACIAG 

CAGTGTGCTCAGAACCATTTCAAAAACTAGTTGGAGACTCTGTGCT 

CAAAACTTGAGGCAGCAACTTTTAACAGTCAAA 

3 0 ATATTTACTGACAATAAATTTTATTTTCAAACAGTTCTTTGATGTGTGTOT 

AAGGAC TAAACT AATGCAGAGATAC CC CAGTCTGATGCTT ATGT AT TGAGAAATGATGG AAAGAAATG TTACAG TTCTGTT ATGT T 
TCCATGAGAGAAGAAAACCGCTTATGCATCTCATAGTGTATCATCGCCTGGAGTCCAA^ 
GTCAGCATGGTGAAATGTGATGTCACAACCTTGGCAAAGTGCACATGCG^ 
TTGCCAGAAACACCAGGGCCATTACGCTTTTGATTTCGA 

3 5 TGTTTGTGTGTGGTTTCXACAGTTATGTGACCGCATGCAGAGGGTG 

CTAGTCTACACAGTGAGTTCCAGGATAGCCAGGGCTATACAGAGA 
CTGAGGGCCCCACCCTAGGAAGGAATTAAGATGGTTCTAATGGGACC 

4 0 TGCAG CCAGGGAGTGCCTCAATAGAGCCAGCATAATG CTATTTGGATTTTCAGCTTCCCAAACCAT7VAGTTAAAAAAAAGCTACGT 

TTCTTTACCMCTCACCAGTCTCTGGTATTTCATTCCAGTAGGGGGATAACAGACTAACT 
ACCCATATAAGCTACTCAAATCAGTGCATTCCTTGCTACCCTCCCCAATGTTCT 

GGGCGC CTTAAAAT AT AAT ATGT CTTAAACAATGGTTGGGCCACTTTT CTTAACCTC CZU^TGGCTG ATGG ACTTTTT TGACAATCT 
CAACTCATCCCTTACTAAAACCTATTCCATCTTGGCTCCCCCAG 
45 TATGCTCTCTCrrTCTGCACrGCTTAGGCCTAGTGTCTCCTTCTTCCTTCCCAGCATGGCAGCTATCCT 
TTGTCCAGGAATTCTAAAAGTCCCCCTCTGTCTTCCTGCCCAGCCATTGGCT 
GGGGCAGGGATCCTCAGCATCTTAACGTGTGGACAAGTGG 

TAT CAAAC CACAACATT ACTTTTAG CTGAGT GGCCATG AGTCG CCTTGCTAC CC CCAG G CTTTTTAAGTAAAGGAGATGAGAAGAG 
TGGAGTATACCAGGGCAGGC!AGTCTCTGCCACACTAGTCAAGAACTCAGGCAAAGTGTGT 
50 GCGATATATATATATATTTTTTTCTCCATTCT 

TTATAGACAAGAAAACCAGAGAACAGGAGCCAGGTTT^ 

TCCCAGACTCTGCAGGGCAGGAGCCCAGTCrCAGAGCCAGCGAGATGTGGTGCCTGG 

GCCAGGCCCAAATGGGGGTATCCCTGGACTCCTGAAGGGTCAGCTCAGTG CCAGAAGGTGACAGCT C CTATCT 

AGGGTTTTGGGTTI^GTTTTTTTGACTC CTTGTCTTTTCCTGACTCCCTTTGAAACAG CAAGGCTGCCCTCCACACCCCTGTGCAT 

5 5 GGCACTGCACTGGAAATCAGAGAAACTAGATACTAAAGGTCGGTTTCCCAOT 

TGTGTGTGTAAGACAATAGCTCCAGTGCATGGGTGTGTATGGCTGTATC 

ATGTGTGCATGCATCTGTCCAGGCGTGCTGTACACAGGGTTCATTAGTGTACCTTAACAGCGTTCTGAGAGGCT 
ACTGAGTTTGTCTT CGT CTACTGTTACCTGC TT CAGAG TGGAG CAT ATGAACGCTTGTGTCG CGGATGATTGATTGATT GAT TGAT 
TGATTGAATGGCATATGGCIACAGTTAACA^CAGCATGAGTTCT 
60 CTGAGTTATTTTAGCCTCCTAACCTCGTAGCG CCTCAGTCTC CACATCT CCAGAG CAGGC AAG AG ACC CTGATC CACCCTATCTAT 

GAACCTCAGTGTGGGGGTGAAGGAAGAGTAATCAAAGCCGCCCCATTTCAAGCTGGCCC^ 
AGCTCCTCCCCCACAGTTCATGGGCCGTGTGGTAGCACAAACTCTTCTCAGTGCGCA 
GACATCCAAGAAGAGCTCAGACCTTAACTTACAATCCAGGGGGC^ 

CCCGACTACTGAGTCATAGTG CCCCCTGGTGTTCATTGACATAAAAATG CAACAAATGGGGAG CTGTTCTAGACGAGACTAAATAT 

6 5 GGAGGTG AGGGGATTTCAAT ATGATAAATTGT ATCAT AG ACAG GAG CAAGG CAAATGGGAGG CAG ATGTG CTGG GG AG AGAGAGAA 

AGGAGGGGTCTAAGTCATACrAAGGAAATTATCATGAATAATGTTAACATCTATAATTCCC 
GGACTAAGCTTTCATTTCCTTCATCAACACAAGTGCTACAAGGTGTGTTTGGAG 
GACTAGATGACTGCTTTTTACATTTTTTTATTGTTTTGTGTGTGTGTGT^ 
TGCCAACAGGATGCACGTGGCGTTCAGAGGACAACATTTAGGAATCAA 

7 0 CAAGTGGGCAGGCTTGGCTAC^AACACCTTTAGTCACCGAGCCATCT 

TATTATCTCAATTTAGGAAACTGGATGCACAAACTCGTGGCCAAGAAGGCGG 

CCAGCTCTGCCTCCGGGAGGGTGGGGCCTTACCTTCATGGGATTTGCCTTTTCAGCCTCCCCCCAGCCT 

CCTGGG ATCGGC CAG CG ATGGCT ATTT ATGGCTCAGGG AGG CG T C CAGTGAAACTCC CCTCAT AAAT T ATGAAGGGAGACACATGT 
GGGTCAGGACAGGAGAACTTGGTGCATGAGGGAACCCCGGGACTCTTGCTCTCT 
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AGCAGCTGCTGCiySATGGCAGAAGte 

ACTGCTGACAGCTCTCTGCCTCGTCCOrrAAATCGGGCTGTCTGCAT^^ 
GTGCTGGATGTTTTGTGCCCTGCCCATTCTCTCTCTCTCTCTCT 
ATTTTCTGAGCATGCTCAGTCTTGCTCTGGGGG 
5 CCr GG TTTGAOTGTGTG UT Urri r^ 
ACTCCCTCTAGOUITTCATTCCTCC 
TGGAAGAAGCCTGACAGAAGTCAGGGGTroAGAACTT^ 
TGGCTATCTGTCKnTTGAGACAGGGCTTGCCGTGTA^ 
GGAAGATTGCACCAGOSGCTGCTiraTATG 
1 0 CTCCTCrerrCCCrcGGACAGGTGGCTO 

CCCACTGCTGACAGGGTGTCTGGCTTCCAATTACACTGCCTCAGAGCT 

CCAGAGAAATAACCACATTCAGC^GAACCGCAAGCAGGGACXSGAAG^ 

AGGTGCCTTCTAGGACACCCrTGGCCCCTC 

GGGGGGGGGCTGAGTGTGGGAGTGTGGGAAGGGGGGGACTGAGGGGACCTGGTCAGGTGGCCTAGAGGGCT 
15 CACACACACACACJVCACACACACACACAACACACACAC^ 
AATAAAGGAGAAATGGTTTAGAATATGTTGGCT^ 

TTTCTGTATCTCTCTGTGTGCATGATGTGTGTATATATGTGTGTATGTGTGTGTTTGTOT 
GTATGTATGCTTGTGTGTACCTGTGTGTGTTTATGTATATGTGTCTGTGAGTGTATGTGTGTGTATCT 
2 0 CTCCTGTGTGCATGTTTGTGTATGTGTGTCTGTGTTCTGCATATGCATGTATGTGTGTGTGTA^ 
GTCTACTACTGTGTATATGTGTGTGTGTGTACATGCGCCCCCACCAAGACT 
NKNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 

2 5 NNNNNNNNIWNNNNNNNNNNNTATTC 

GGCATACCTGCCCACACACAGTTGTACTCGTCTCTGCACAAACTGGCCTTGAACTCCTGGTCCTCCTGCCTCCA 
TGGGATTACATGTGTTCTTACCCTTGTTTTATTTGGTGCTGGGACTGAGCCAGG^ 
CCAGTCCCAAACCAGCACATTTTCAAGCCAGGACAGGCIATGCT 
AAGCAATGTCACCATTTCCTGTTTCATGACCACCTGTTGACCAACATTCT 

GAGCCTCTOAATCATGTCTAGCTCCTCAGGTGTGAATTCITCTGAAGTCCAAGGCGC 

TCCGAACTGCTTCCTCAG CACAGTTCAGAGCCAGCTITTCTCTGGATTTCACACCAGTGGGGGCTTCAGGCT CGAGGCTCTGACCC 

TGTGCATTAGCTCCCTCTGTCTACCCATGGCTGGAGAACTTCCTTCGCTTCCTCGGGCACCC^ 

TTGGTGTTTGCTGTGATATCCAGCTTCACATGAGAGCTGAAGGTC 

3 5 TCTGGCCAGTCaCTGGAGCCTCATCTTTAGTA^ 

GGGTACAATTCCTGCTCGTCTTAGACCTCTCAGATCACCTTTCCTTGTCTTGAGCAGGGATTTGTATGCT^ 

GTGAACTGCGGTACTTCTGTGTGCTGGGAGTAGGTCAGACCGT CCATT CTACGTTCCTCTACCACCCTGATCTATCTGTGCAGTGC 
AAAGGCCTTGATGAACATGCAGTCACTGGTGGAGTGTCCCTCTCCCACTTT^ 

CTGTCTCCCCAGGAGCCCATGCCCTGCCTGGCCACGGGGAGGGCAGTTCTCACTGGGCAGACTCTATCCTGGGAGCTCTA 

4 0 TCCTCTCTGGGCTGGGTCAGGGCCAGAAGATCCCTTCAGGGTGACAGGATGG 

ATTCTATATT CTTGGAGCCAGAT CCTGAGGAG T CH^GACTG CTTCCAAT CCAG CCCCATTT AAGGTG AG AAACAT AGGCG GTTCC 
AGATTCCTCAGGGGGAAGGCCCTGTCCCCACAGCCACGTTCCCAAGTTGGGGAGATAGGAGCCT 

GTAGGCTTTGGAGAGCTCCCTACAGAAACAAGCATTAGAAGGAGCCGACGAAGAACTTCCTCCTTCTGCT 

4 5 CCGCAGTGTAGTGCTGGCCTGGAAGAAGCCTAACACACAAACACACATTTGGAACACACTGGGGTAGGGAAC 

CTTGCCGCAGGAAAAGCCCAGAGAGTGGCAGGGAGGGTCTGGTGAGGAC^ 

CTACAGGAGATAGGAGAGGAGCGTTGGGAATGTGGTGGCAATGAGGGAAGAGAGATTGGGAGAGCAGGC^GCCCACGGGGCCT^ 
GGATTCTGAAGCCAGCAGCGGGGGCTTGGGGCTTTTCTGAGAGCTGAGTAGGGAGGTGGA 

TGTGG ATGGAAGCTCTGGGAGAG CTG AATGTGGTAT CT CAGAGC CC AG AG CCC CAGATGGGG CATAAAACAATG ACTTC CT AGTAT 

5 0 AAGAGGGAGGGGGACCAG^AG^^CTTCCTGGIAAAAAGCACTATCTCCACCAGACCTTAGGAATGGC 

AGGGATTCAGGAAGAAGCCTGAGGAGGAAGTCCACAAGTGTCCTITGATGGACTACC 
GCTCMCACCAGAGACAGGCACTGAGGAAATCCCCAAAGCCAGTGGCCrCAC 

CTGCTGTGCAGCTGGGAAGTGGCTGAGAAGACGTGTGAGGGCCAGGCAGCTCAAAGCCTGAAGGAGGTACTTCAAGGAGGGTCCAG 
AGAAGCTGGGGAGGGCGGGACCTCCTCAAAGGCTGCCCCCGGGGACAC^ 

5 5 AGTGAGAGGTGAACTCX3GCCTGGATCTGGTTGGGCAAGGCCCCCATCCATCTGCTTTT 

CAGCAGCTGCGATCAGAGAGGTGAGAGTGTG CCTCTATTCCCTG CCAGGCTTTCTCAATGATCCCAATAGGCCTGGGGGGTGGGTA 
TGGGGAGGTAAGAGGCTGTCCCTCCTGTCACCTGATATCAGTGGATCAGCACAGAGTCAAATT^ 

G CAG GAG CT G CATGCTC CAGCAG ATGTTTCT AGAACCTTG ACTTAGT AGGTCGGCAGTG T TATG CACCAGTGAGAAG C CT AAGGCT 
CAGAATGGGGAGTGGCTTGGATGTGGCCAAAGGCCACAG 

6 0 GCTTGGGTTCrTGGGTTCCTGGTTCCACACTGCAATCA^ 

TCCTATCTTTAGGTTCAGATAGCCCAGGCTCTG CCGTCTGACCCCTCACTCCACTTCCTTGTCCTGAACAGGTCTTGGATG CTTGG 
CACAATGTTGGGGGCCGGGTCTCTGCTCTGCCTGCTCTGAGGGTGTTTTA^ 

ACGGCCCTGGG ACACACTGAGAGCTCAATACATCTGCTAAGAG CTCAAAAATGACCCAGATCTATCTTGG ACAATG CTGACACTAT 
ACAGGTAGGGACyiAGGAGACCTGATGGCCCCAAGCCCGG CCAC CTCATTCAGTCAGCCCTTG CTCAGTGTCTCTTATGGAGACAGC 

CATGTGAACTGCTTGAGAGTCACCAGAACAGGTCTGACTGA 
GATGGGATTCCTGTAGOTCAAAGGTCAGAGGAGGTTTGTCTGTGTGGTAACJA 

CTTCCATAGCTATAGAGAGTGTTACTGGGGTGCTAAGTAAGGCTGCATAATATGACCCCAGCCTTTCTCACCCCCA^ 

70 GTTCAGATGGTGCCTGTAAGGACTATCAGGGCTCAGTGTTTCCTCCGCCCACT 
AGGGGGGCGGCTCTTCTCCTCTGCCCCTGATAGGAC^ 
TTGAGAACCAGTCCGGCAGAGATACAGACGAGGAGATATCATATATAGTGTCT 
GCCACATGTCATTCCAGCTCCGACTCTGATTGAG^ 

ACCTCCATGTGTAGCTCCCGTGTGCACAGAGCCCXSGTGGTATCCACCC^^CIAAAACCC^ 

7 5 TCCCAGTCCCCTTCCCTCTCCCCACACCACAGCCTCCTTGG 
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GCTCAGGACCTTCTArMCGCCTGGCGCCGCTOTGGGTGCTTAGGGGATGCAGTC 
GTCTCCATTTTCAGAGAGGGGTGGGAGGAGCCGGAGGATGGGGAGAGGCCC^ 

CTACCCCAGAGAAAGGTTGCTGTTCTTCCACGAGGCAGG 
5 NNNNNNNNNCTCCCTGGTCCCATAGCCGGCTCTC^ 
TAACX3TCCCCACCCCCACCCCACCAACrTC3TCTTCTC 

CCTTGGCXICGTGATGAAACAGAGGGATT ACiT LTTGGGTATTCAATTTCCCCAC TCCT CAGAG CT ATGGAGTACAAGAAGCAGAG A 
GCAGGACCCCGTGATG CTAAGAAGTGTCTGTTCrGTGCTCCAGGCTG CAGAATGCGGGGAACAGG C C CAGTG CTGGGCTTCATC CA 
AGGTCTCTCAGCTTCITTGTCTCXIAATCCCACTCACCTC^ 
1 0 CCAGGGTGGCTCTATCTCCAAGGAGTAGGTGCCCAGTTCCTGGTC 

GAGGATGTCCTCATGGGACATGTTGAAAAGCGAGTGGGAAGGCCCCAGTGACC^ 
CTGGTCTGCTGACTTCCCCAGTCACTGCCTGGTGACTTC^ 

GAGAACCAGCATGGACTCTGTGCAGGGTGTGTGGGCTCCTAGGGATCCCATACTT^ 
1 5 TCAGGCACTGCTAGGAGAACCCAATCACrrCTTCAGCTCCTCCTCCTCCTCACCCT 
CTTCTCCTTCCTCTATCTCGCCCTCCTTCCCCCrTCCrTCr 
TCTCATCTTCATTGGAATTCTATTCTCCTCTTCCTCCTCTCTTCCTCCT 
CTCCTCTTCCTCCTCCTCCTCCTCGACCTCCTCCTCTATCTCTGCCTCCTCTTTTGAAAC^ 
AGCTAGAACTCCTCATTCC^TCCTGACTCCCrCTCCTCAGCCTCTTCA 

2 0 ACACCCACTTCCTGCTTCCTGGTCTGTACACAAGTGAC^ 

CTTGCCCTTCTC^CCCrCTCCAGTTCACTTAA 
AACCACACGTCMCCCCACTCTGCCATCTCCACACC^ 
GAGGCCTGAGCAGCCCTGCCTCTTCCTGTGCTGGTCAGAAACAGGCATCA 
TGCTATACCTTGGTCATGTATCCCTTGGAAATGCATTAGACTGCAGGGAAAGGCTG 
• 25 GTT CTTTTTCACAGTAAAAGTC CAGACTGCAACT CG T T CCATAGCTCAGGGGGCT CAAAG C CAG CC C TG CAGAATTTTTCATT CTG 

CAT CATGTT TTTG C CT CATGGGTGCAACTGTGGCTGTGGATCCAGATGC C ACCCCAACTCCATCTCATTG AC CAG AGCTAGTCACA 
TGGTTGTGT TG AGCCT CAAGGG CAGGTGGGAATAGGAAT AGG AG CAATGTTTGGT CCAGGC CAATCATGCC TCT C CAAG CTAGACA 
CACTTAAC^TTGAGCATGGCTGGCACTTGAATGGCOIGGAAA 
TATGGAATGCTACTCTAGCATTGTGCTGGGGTTCTCTAT 
30 ATAGAGCTGTCTC^TTAAGTCTCTCCCCCTCCTTCATGGCTGGCCACCTGTGGTGACCTCCTTG 

AAGAC CCT CATATGAACGTATCCAC CTT AAG ACC CACCAGG CTCTCAGCAAATCT AAACT CTT C CT CAGG GCTC AG AACCTT AG CA 

CGAGCCATTTCCACTACCCGGAAGAACCTTCCTAGATTCCAGTC 

AAGCTTTTCCTTACTTGGGCyvrAGTCIAACZAAACCGATCT 

CTATGACAAAATACTGGAGTCAATTAATATATAAAGTAAAAAGGTT^ 

3 5 • GGCJU^CCCCTTTGCCTGAGTCTCTAGAGATGGCAGTGAAGTGA^ 

CTXSGAGAAGAACATGGTCCCOTAGTCCCCrrTCAAGGCAATCCCTGGGGA 

CACAACACTCCCCAGTGGTGATGTCCATTAACTGTGGGTTG^ 

TCATCCTGTGACACAGAGCCrrGGCACIAAGGCT 

4 0 TCAGTGAGGTGCCAGGCATGGACAGTGTTCAGAATGGGGTC 

AACCGATCCCCTCCCTCTGGCCTCCTACAGCGACTTTGAGCAGCTTCCAGACGATGTGGCCGTCT 
AGGAGAAGAGAGGCTTCACCAGCCACTTTGTAAGAGTCAGCAATCCCTGTTCCCTTG^ 
AGTGGTTTCTCTGGACTGGGTGGTTGCTGATATTTGCTGGATAGAATTCAGCGGTGCTTCT 
CACACTGAAGGCITACCCCACTCAGTACACTGC^ 
45 ATCTCCCAGTGTCTCTACCTCCCAGCCCCTCCCCAGCCCCCCAACCCTTTCCAC^ 

CCTTATAAAGACACCCAGCCTCCCGGTATATCCCCCCCACCCCGTTCTAGCTCCCCTCCCCCAGTCTCTCTCCT 
GTCATCGAGGTCAAAACAAAAGGAGGGTCCAAGTATCTCATCTACCGCCGCTATCGCCAGTTCTACGCCCTG 
GGAGCGGTTTGGGCCAGAGAGCAAGAACAGCCCTTTCACCTGCAA^ 
CGGAGCTGGCCCTGACCCTGGCCTCACCTTCCAGCTGAGCAGC^ 

AGTTTCTGCATCATAACCCTTGACCTACCTCTGGGGACACAGT^ 
TGGGAGCACTCAGCACCTGGGTTCTGGTCJTT^ 

TCAACAGATGACTGCAGCTCTCCCTCTCTGGCCCTTGGGGAAGCAGGGAGC 

CTCCTTCCCJVTATGCCCCrCGTCCCAAATCTCCC^TGCC^C^ 
55 TCTATCATATCACATC!TTCCCTAAGCCTCAGAGGCCTGCCCZATCTTCCCTTCTTCCA 

AAACACATTCTCTTCATCCATCTCCAAAAAG^ACACTG 

TGGTGAGGAGGAGACACCACTCTCCATCTCCCCCCCCTTTCCTACACCCT 

CCCCCTC!TGGAGCATGTGAAGAGGTCCCCCATCAC!ACCCTGTCTGATGAGCGT^ 

GAGCCTCCTATCCTGAGGGCrCCCCCrTGTGCCCAGCCTCrTCCCCCTVTTA 
6 0 CTTTCTTGCTGCAGGCTCTGCTCACCriTGG^ 

CCrrGTGACTTTGTCTCGTCTTGGTGTGTGGCTCCTGGGTAGCCCCAACCrTTATATACCCCCCTCCTGGC^ 

TTTCCTGCCCTATTGCACAGGGGCCTCTGGGTGGCAGCTCCTTTAAGTGTCTGCCAAAGG 

CCTTCATGCACTCCGCTCCGCGTTAGAGGCATGGCTCATTCATCCACAAAGCA^ 

GGAGCTCATGGTTTCTAACTATGGTAGCCTGCTGAAAGGAAACTTACAGAGCCCT 
6 5 GGGGAGCCC TATCTAATGG GATGCTCAAAAGCT C CTATGGTG ACCTGAGTGTT AT C CAAG CCTG AAG CTGGGAGATCACTGTG AT C 

CIAGGQIGGCAAGAGGTCCTGAGTTCAGAGCGTGCAT 

GTGTGCATTTGTGCATATACATATACATATGTGTATGTATATGTATGT^ 

ATAGGCCTTACATGTGTATTTGCATATATGTGTATGTCGGTATA 

CAGGTGTATOTGTGCATATGTGTGTGCACATATGTGAGCATGTATATATAAGTTCTTGTGTATC 
70 TGCATATGAGTGTGTATTCATGTGTCTATGTGAGTGAGGGTGTATGTGC^ 

AGAGTGTGTCTGTGCTGTGATGAGCATGTATGTGTGTTCATGCATACATATGAGTATGTGTGTACCTC 

GGAGCAGGCATGACIATTGAGAGC!TCCTTCTGCCTTCTCCTGATGGTATGATG 

TGGTAC(^TCATAGAAGGCAGGAGGAACCCTGAGCTGG 

GAGCTCTGGGCTGGGGAGGAGCATTGGGGAGAAGCACTGATTGGTGGGGC^^ 
75 GCAACTTTTCTCTTTTC 
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GAAGGTACCTGCTGCCACATCTCCTTGGGGTTTGAGTGGGCAG 

AGGCCTCCATGATCCCATCTrTCAATCCTGAAAGGCCrrGGC 

CAAAAGTCACGGTTGACAAATCCTTGAGTGCACCAAG^ 

AG AGCACCTG CGG CCCAGGGGAG ACC CTTACAAGGTAGTGAGCACCCTGTCAACAGTAG CATCCAAGC CATGGTCTG CTGGAAAGC 

5 CITCCTGGCTGTCCAGAGGTCCTTTACAGCIX5GAGTGCTCT 
CAGTCCTCTCTCAGGAAGTCTCAGCTTC 
GCTGGGCTCTATTGAGAAAATGAAGGGAAGGGGGTTTTA^ 
GAAATCTGGATGCTTCTGGAACTGTAGCTCCICTACAGTGG 
GCCATGCCAGTCCCCACTGCTCCATGTCCTCACGCCTTA^ 

1 0 a^CCCTCCCCCTTTCCTCACCCTGTCCCAAT 

TGATTCATTGGATAGAGTGCTTGCCrAGCATTTGAGAAGTCCTGGG 

CACCTATAATC CCAACACTT AGGAGGTAGAGG CAAGATGTTCCAGTCATT CTTGGTT ACAT AGCAAGTT TGAGACCAGTCTGGG CT 
ACATGAGCCCTGATATGAGCACCTCCGGTGTCTCTGGATGCCCCAG 

1 5 CTTAAGCCAGCATCCCTCCCCTCCCCACTACC 

GACTAGTAACGTTTTTCTTTTTGTTTTTTGTTATC!AGCTGAGAATGCAT 

CTGTCTAGATCTTTTCGTCGACTGCCCACCrrcTCCCTCCTCACA 

GACGTCAGGATCTTCTTCTATCAGTCTGCATACGATGC^ 

AATGTAAGTGGCCATCCCTGCACTTCTGATCCCGCGACGGAGTCCAATCTCTCTAGCCTCCAGGAGTC 
2 0 AAAGACTCCAGTGGTCCCTTCTCACCCTGTTCCrCTTAGAAAAATGAACCAGGCAGGCCTTGAGGC^ 
CTOSTTTCCGTTCGAGCCACTTCTTTAAGAAGCCAAGGAAAAGCAGGC^ 
AGGGCAGGGTCTGAATTCCGGCTTTGCAGGAGAGTTTGGCTCTG 
03TGATCCCCATTCCTTCTTCAAGCTCTCCCTTTGGGTCIT 
TGAAGGGCTAGCGGCAAGAGCAACCAACCATAACTGGTGGCTC 

2 5 TGATACCXZCATTTTCTTTCCATAGCAAGGGTGTGTCTCCACAAGGGGCCATC^ 

CCACACACAGGCTAGACCTCATCCGCCGCAACTGTGG 

AGCATGAAATGCCATGGCAAAGAACACTGGGATGGATGCAGGAGGGCT 

CAAGCTAGGCACAGAGTTTGGATTCCCCCCTTCCAAA 

GAGTAGGGTTTTAATCGAAGTCCTCTGCAAACAGAGGGGCTGTATCAGCACCCTTCCAAACCCT 

3 0 GGACACCACAGTGGTCAAGCCCTTCAAAGACTCAGCGTTCTCTCTGTCTCAGCAATGCTC 

TCACTGGGAACAGCAAATTGGAGCTAAGTTTCAAAGCTGGAGATGTGATCTTC 

GTAAGCTGGGGAGAGGGGTTCAGAATCAGGAAGACTAACTAGAGGCAAGAAGGAGGGTGATGGC 

GAAGTGGAATTGGAGGTGAGGGTGGAGGTCAGATGGAGATGAGATTGGA^ 

TGGAGAGAGGTTGGATGTAAAGTTAGTGATGAGAATGGAGGTGAA 
3 5 AGTTAATGATGGGAACAAGGCTGGGGTTGATGATGGGGTAAGGGTGGGGGATGGTGGTAAGGATGGGGGTGAGGATGGAGGTGA 

ATGGGGTGAGGATGGAGGTGAGGATGGGGGTGAGGATGGAGGTGAGGATGGAGGTGAGGA 

NNIWNNNNNNNNNNNNNNNNNNNNNNI^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNTWNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

CTNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNM 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NtJNNNNNNNNNNNNNNN 
55 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

GACCTGTGGGGTGTCCTGTTTTTGATCCAGATGTGTTGGTGAAGCAGAACATTCATGCCACAGC^ 
GAAGTTGACCCTCTCCTGTCAGGCrrGGTTGTGGAAGTGAAACTAGGGTTGTCA 
6 0 TTTCACAGCCTCCTCCCCCCACCCCTGCCATTCTCTTTAATTTCTTGA^ 

GTGGGGTGGGGTCGTGGAGACAAATAATTCCTGGGAAAGGGCAGGCTATTTAGGGAAGGAAAGGCAGTGAGGGTGTT 
CCTGGTCTT CCCACTAGGTGTCACCAGTG CACCTGGTCACAG CAATCACATAGGTTTCTCCAGGTTTGCTTTGTGGGGGTCAGACT 
TTCTCC^CTGAGAGAAGTGAAGGCTCATATGTAATAATG 

CGTGCCTGAGGCAGCTCGGTAGAGGAGCTCTCACCTAGTGTGCAAGGCCCTGGGT CCCACT CAC CATACTGTGAGACAACAG AAG A 

6 5 TCTAATGCATTTCTAGCTGTCCTTGTGTACCTCAGGAGGTT 

CTTCAGTGGTTCCCGTCAGGGACyVAGGGAGAGGCGGATGTCCTCCAGGCCTTGGCCCCAGAGACT 
CATGTGAGGGGACTGGGAAGTCTTGAATGTGACCCTGAAGCTGGGCCCCCTTGGGTGTTTC 

CAGGGTTCCCGGTATCCTGAGGCTGGGCAGGAGTGGGAGTGGCAGGTGTAGGGATATAACACTTTGAGCTTCTG 

7 0 CTGGGTCTCAGAAGCCATGTCCACCCGAGCAGTGACTGTTGTGACGACCACTGTCATAAGAGGAAC 

TCTGGAAAGTCTTACTGTCTAGTGTGTCGTCACTGGCAGACCTCAGGCTGCAGGAA 
CCTCAGTCTGCGAGGCAGCGGGAAGAGGATGTGATTCCTGGGGCTGCGGAGATCIAATC^ 
CCAGAGCCCACACACTAAAAAGTCAGCCATGGTGGCTTGTC 
GGACTTGCTGTCCAGCCGCAAACCAGCTACAAATNNNNNNNNNNNN^ 
75 NNNNjnjNNNNNNNNNNNNNNNN^ 
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NNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
GACtTGGAGTGAC^TCTCAAGTAATCCrCTAGA^ 
1 0 ACACATGCT CACATGCACATCTG CATATG CATTCGCACACATAAGTGCATGTAIX3T ACTTACACACATACAAATA 

3TTTTAAGTGACCACATGAG 
CCCAAAACCTGTGGAGG 
AGCATGAAGCACTTTCAGGTCTCTCTGCAGTCCCCAATGTCTGCOT 
CAGAGGGCTGCTCATGTGTCCCTGTCX3TTCTGTG 
1 5 AGATCGGGC^TAGTGGTGCTCGAGAACCTCTCCTGAAGGAAAGAATGGATGAGG 
AAGGGAGAGGAGAGACAGGTGAGGGAGGGAGCTTCCCAGATA^ 
GCAGCCTGGTGAATCTCCTCACTTTGCCTCTCTGTGTTAGTTGGCTCTC 
AAGAAGAAAGGTTTGCTATGGCTCACAATGCTGGAAGCTCCAATTCCTGCGCTGAG^ 
TAAAATTCTTGCATTGTGGTTAGAAAGAAAAATAAATAAGTAAAATAGATTAArTAATTAAT^ 
2 0 CCCACTACTTGATTAAGGGAGCCCTCTCCCTAGCAAGAAAGGACATTTCTATTAG 
TG GGG ACCAAGCCTTTG ACACCTGATCTTTGG 

CCTGGGTCTCACCAGCCTCITCTrTCACTTCC!AGGGCACCTCCCAGGG 
TTAAGGACTTTCCCGAGGACGAGGACACCACCAACTGGCTACGA 

2 5 CCCTCCCCTGCAGCTGCTGCTGTGGCTCCCAGATAAACCATAGTGCTTTAGC^ 

TCCACCCTGATGCTTGTCTCTTCTACTGTCCCCCTTAGGGACATTGCGGTGGAGGAGGACCTCAG 
CCTGCTAGCGCTCATGAGGTGAGAGGTTGGGAAGGGAGGGAGACTTTGGAGGGCA^ 
GTCrrCTGCAGTACACGGTGCCTTAGAGAGGAGCCAAGCAGGCTGCAGAGGGCAAGGGCTCAGTC 
GATCTGCTATCAGAACATCTCTCTAGCGAAAAGCCTTC^ 

3 0 TTCCAGGCCATTGCTGAAAAGTTTACTCATAGCACTGTCCCATCTCTCACX3ATTC 

GCACTCCTATCGTTAGCTGATGTGTGCTCAGATTOTGTCAGCACATGCACAAACATGTGT 

ACACA.CA.CACAAAAAACAAACACACGAACCAATACA 

ATACACAAACACACAAAGATATAACACACAA 

CACAAAT AT ACGTAAATAC CTAAAACACAGACATACACATAAAAG AAG CACACACACTCACTCACT CACAATGTCCC CCTGGCCAT 

3 5 AGCTACCATTCTCAAGACCTCCCGATGGCTTAGCCCCCATA 

TACCGTCTC^TTATCAAGAAGTGGGATTATGGGATGGAAGTGTATGCAAGAG^ 
AGATOTGTTCCCTCCAGCCATGATGAAGCATGGCCCTCCTCCCAAAGTTGTCACTGGT 

TTTCTCTTAGGAAACTCCTAAACCTTCCACCCACTTATACACAAGAAGCTTCAGGCTGTACCATTATTGGATTACC 
TTCCTTTGAGGTCTCTGTTTCCCCATCTGCAGAACAGGAGTAAGGCTGTTGGATGATAGGCCCCGGTACCACTTGGATCCACAGAC 

4 0 TCAAAGGCCCTGGGAGAAGCCAGGATCCATAGACTAATCCTGGTGTGTTGCCCTAGGCGTGAGTTCCAGAGAGAAGACAT 

TAGCTACCAGGATGCTGAGGGGGACTTGGTTAG^CTC 

CTTCCCAGAAGCGCCTCTTCCCCTGGAAGCTGCATGTCACAC^GAAGGACAAC 

GGGACCCTGTGGCAAGGACGGGGTGCTCAGCAAAGGGCCTTTACT 

GCAAACCTCCTGGCCCAGAGTGCCTACTGGACAGGATTCTATTAAGTTTTTGACACT 

4 5 TCTCTAAGGGCCACAGTTGGGAAATGGGATTTTGTGTAGGGCCAGGGA 

CGACGTTATGGGGAACATAGTGAACCCACATTTAATAGGACATTAGAACCATCCGCTAAAAGTCTCT 
AAAAGAGTCACTGTCCATCTGCAGAGTTTGGGAACTGTTGGGTCTATGGTATTACAT 
GTACAAATCCCGCCCCAGCCACTTTGCTTTGGGTAGGTGCAGTCAAGTAGATGGAGTC 
TACAATGGGTAGCCTGCCCCAATCCCCTGCCCTTCCTAGCCCCAGGCTGGCCTGTGGAGATTCAAGGC 

5 0 AGCCTGAATCCCAGAGCTGCCATGAGAGTACCACAAAATGGA 

AAGTCCAAAACTAAGGAGCCCTCTCCCCACAAAA 

CTGTCGGAGCTTCATTCTGATTTCACAGGAGTTCCCTCTTGTGTGTCCCCCCTCTG 
AGTCCACAGCAATGGCCCTGATCTTAACTTGACCACCCATC^ 

GACATTTTTTAACCTGGACATATC CACT CAAC CTGTAATAATCACTTCATTCATGCTGTTCCCACTAAAAGGGCAAGCACTCTGCT 

5 5 G T AGG CCAG AG GTATGGGAAATG CAG CTTT AGTAAAGAT CACGGTTTC CATG ACAACAGGGAGAACCAGAGT CCAT CAG AAGACAG 

TCCCCCTCTGGGATTTACCTGTAGAAAAGACTCCAGGAGAGGCAAGCTGACTTCACTGACATCACCCAACTC^ 

TGG CGATG AAG AAA^TGGAATl^ AAGGATGGGGAAATCACAT TGGTTTT TCTT G AAG AG AAAG CTGGG AACATT AC CAAAAATAA 

GGGACCCTGGAACAAGAGGCCAGTGGGCAAAGCACTTGTCACACAAGGGGAT^ 

GCAGT CTTGGGCT CTGT AAC CTCAGCACTCATGAGGCAAT ATGAAAAG AGGAGG C CAGAGATAGC CAGAAACTCAT AGGC TAGCCT 

6 0 GGCACGCATTGAAGTGACAACTAGAGACCCTGTCTCGAATGATGTAGAAGGTGAGGAGACCTAAGGTTG 

ATAGCACCCTGGCATGCACAC^CACAAAAGACTTCAAATAACAGAAGCTGTAATGAGCGATAATGCCGAAGGGGTTG 
AGACACACAAGGTCAGAAGCATAGCGGTGAGTCC^GGGAGGOT 

CATC CACT CAGG AGAGGTTAGAACAGGAAGTG CTTCCGACTTGAGATAATTCCCT AAACCACC CACAG CTC CCTGGTAGAACA 
AGGGAGTCTAGCCAGCTCAAACATGGCGAGCATGGCTCTGGCAGGCCTT 
65 GTTGGATTACGTCCCTAAAGCTAGCCACCAGGCTCTACT^ 

G CTAT CAATGT ATTG AAGT CAAT CAAACAAAAGCC CC CTTTGG CT CACCTAAGT AACATG CC CAATTAAAATTAAAT ATCCCATCC 

TGACATGGGATTTCCCCTTGTACCTTTATAAACCGCCACTTTCCTATGCACCACGTCTGTCTCTTC^ 

NNNGTGTAGTGGTGCATGTCCATAATCCCAGCAOTCCAG^GGTGGGAAGGA 

TTCAAGACCAGCTTGGACTCTCTGAGA^ 

TCTCTTTTGTGCTGAGAACTTGGTCTTAGGGGTCTTGAG 
GCTGATGCTGACTAAGCCGCAAGCTACTCTGACT/TGAA 
TGGGCTGGAAGCCTGCTGCACAGTACACACTCAAGGCT^ 
GCAAACTTACCAACATTCTGAATGGGCIAGTC 

7 5 AGAGGCCTAAACCCAACTGCTCACCAATAAACAG CATATG CAAAC 
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AAAAAAAAATCACAGTAGAGATGTTAGATGTTAGA 

AACTAATGAATGACCAAACAAACAAGAACAATATTAATCTTTAAGGG CAGTAT CTACAGTTCCTATAATTTGTTATTTAAA 

ATGTTCAGTATTCAGCAAAAATCATGAGAAATTAGAAGAAA 
AGACTlCTCACCCCrrCCGAAGGGAGAGGCTGATTAGCATTTCACAGACAAC^ 
5 ACCAATCAGTTTAAAGGCrcCACTGTTCC^ 
AAACCCACTGACCAGTGCTGGGGGTCACTX3C 

GCTTTTTGCAT CGATC CCAGCTCTG CATGGTTCACTCRjGGGGGTCCCTGGTAAGCT AAAG CTCC CCAGAGTCTTACAATAAAC CAT 

TC CT AG AGGGGTGAAGGGAAGTAGGTCATAGAACTG C CTTTGAGGGTGGACT AGATG CTGGACTCAGCAAG CAAGGACCT 

GGATGGTTTAAGTAAGCTCAACCCATGCrrCAAGGAATTAAAGAAGGTATAGGGGGCTGATGAGATGGT^ 
1 0 GACTGCTCTTCCAAAG<3TCCTGAGTTCAAATCCCAGCAACCACATAG 

TCTTCTGGTGCATCTGAAGTCAGCTACAGTGTACTTATGTATAATAATAATA^ 

ATGAGTGTTCCATAGCAACTCCTACTACTGAATGAGGGTGAGAACAAGAAT^ 

GATATTTTCTCTATTTACATTTCAAATGTTATCCCCTTTC 

CCTCTATGAGNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNN^^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNK^ 
25 NNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNIJNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNIWNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNTO 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

jsjNNNNNNNNNNNNNNNNN 
35 NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNIWNNNNNNNNN^ 

NJ^JNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNISH^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNl^^ 

NNNNNNNHNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNt^^ 

NNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNN^ 

GTGTGTGTGTGTGTATATATATGTGTGTGOVTGTGCATATATGTATATATGTGTATGTATCCATATATGTATATATGTGTGTGTGA 
60 CATATGTGCACATGCATQTACAGATG 

CTTTTACTTAATGTCTCTCTCTCTCTCTCT 

CCCTCTCTCCCTTCCCCCTCTCTCCCTCTC^ 

ATACTAGTGTTTACACCCCGCAAGGCTCT 

ATCCTCTGAGCCATCTCGACAACCACTAAGGTAGGGCCTTAAGATAATGTCCTGTGACCT 
65 TTTATTTGTGTGTATTATGAGTGTACTGCATGAGTTTATTGCACCATATATGTATGC^TGTCTGCAG 

CATCTCCCGGTACTGGAATTGCATGTGGTTATGAACTGTGTGATACAGGATCCTGAGAACAAACT 

TGCTTCCAACTGTTGAATCAGCTCTGCAGCCCAGCAATCATC^ 

GGAAGGAAGACACACACACACAC^CAOTGTCACTCAGACATTCAACAT 

ATGCtSCTCATAAGGTTTGAGTGATGGAAGTGTGACTGTCGTTACT 
7 0 TAGGGGCAAAGTATTTCATGAC^CCATGGAGATTTGCTGGGTGGGTGATTATATTGATGCT 

GCAGCAAAATAATCTAAAAAATAATCTTAAACAGTATGCCATCTGCAGCACAACAATATG 

G ATGTTCAAAACACACACCACACAAACACACACACAAG CAACAG AAGAAG AT AAGCATG CCACACACAC CCACACAC C CATACACA 
CACACACACACACACCAAAAGATAAAC^CACACACCCCCAAAAGATAAAC^ 
AGCACACACACACAC^CACACACACACACACACACACAC^ 
7 5 GCCCAGCATTTGACAGTAGAGGCAGGGGGGTCACT 
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ATGACACTCTG TCTCAAAAAGG AAAACAACGAT CAAGCAAAAAACAGAG AATTAAGAGAAG CTGAAATTAG AGGAAAGCAAGCC CG 

TGTCCATGAACAGGAAG CCAGTAGTT ATGACAGTTCCCCCAAAT AGTT CATAGATTGTGATGAGGTT TTGAAACTTCCTGCTCTGA 

TTAGCTTTTGATAACTTGCCAGAAACTAGAGTCACCTGTGAAAAG^ 

ATAGGCAAGTCTAGTGGGGAATTTTCTCCATCAATAATTGATGTGAAAGGACC 

TGTGTATAAGAAAGCAAACTAAGAAAACTCTGG^ 

CCAGGTTCCTATTGTGAGTTTCTGCCCTAACTTACA^ 

TTGTACTAAGATGGTTTGGGTCAGTGTTTTATTACATTACCAG^ 

GTTGTGACGGACCTACCCGTGTGGTTTGTATTTCTTGGAACATC^ 



1 0 G AGGGTTTG CATTTCTTAAAGAGAC TAG AACTTTTG AAGTATTTGGTTTTTT AAG ACT^ CTT 
TATCATGAT ATTAT CACAAGATC CTGGGA 

MOUSE SBQDENCE - mRNA 

G AG AAGACGTGTGAGGGC CAGG CAG CTCAAAGCCTGAAGGAGGTACTTCAAGG AGGGTCCAG AGAAG CTGGGCAGGG CGGGACCT C 
1 5 CTCAAAGGCTGCCCCCGGGGACACCCCAAGCTTCAGGAAGCTCTCAGATC 

TCTGGTTGGGCAAGGCCCCCATCCATCTGCTTTTCTGAC^ 

TTTGAGCAGCTTCCAGACGATGTGGCCGTCT 

TGTCATCGAGGTCAAAACAAAAGGAGGGTCCAAGTATCTCZATCT 

AGGAGCGGTTTGC^JCCAGAGAGCAAGAACAGCCCTTTCACCT 
20 CAAGAGATCGCTGAGACTCGGATCCCGGCCCTCAATGCCTAC^TGAA 

CGACGTCAGGATCTTCTTCTATCAGTCTGCATACGATGCTGAGCAGGTGCC^ 

AAATCAAGGGTGTGTCTCCACAAGGGGCCATCATGGATCGCATGGAA 

AGCAAATTGGAGCTAAGTTTCAAAGCTGGAGATGTGATCTT^ 

GGGAGCCACAGGCATCTTCCCAGGGTCCTrCGTGAAGATCCT^ 

2 5 GCTACTTCTATG AAGACACAGGGAAAACCAT CAAGG ACATTG CGGTGG AGGAGG ACCTCAG CAG CACGCC CCTGTT CAAAG ACCTG 

CTAGCGCTCATGAGGCGTGAGTTCCAGAGAGAAGACATCGCCCT^ 

AGAAGGACAACTACAGTGTCTACAACACTGTCCCCTGAGTCAC^ 
ACTTTCCAAGAACACGGAGACCTTCAGGGAGACTGATGTAAGATGCAAACCT 

3 0 TAAGTTTTTGACACrCTACCTTTTGGATTCAATAAACCATTCTGTCTCT 



MOOSE SEQUENCE - CODING 

ATGGCCCTGGCC CAGCAGCTGCG ATCAGAGAG CGACTTTG AG CAGCTT CCAGACGATG TGG C CGTCTCAG C CAACAT CG CTGACAT 
CGAGGAGAAGAGAGGCTTCACCAGCCACrTTGTTTTTGTCATC 

3 5 GCTATCGCCAGTTCTACGCCCTGCAGAGCZAAGCTGGAGGAGCGGTTTC 

CCCACATTGCCAGCCAAAGTCTACATGGGCGCAAAACAA^ 
CCTCCTGAGCCTGCCGGTCTGCGTGCTGATGGACCCCGACXSTC^ 
CCCAGGCACTCCGCAGGCTCCGACCGCGCACGCGCAAAATCAAGGGTC 
CCAAGAGCAGAGGCCTTGTTTGACTTC!ACTGGGAACAGCAAATTGGAGCTAAGTTTCAAA 

4 0 TAAGATCAACAAAGACTGGCTGGAGGGCACCTCCCAGGGAGCCACAGGCATCTTCCG^^ 

TTCCCGAGGACX3AGGACACCACCAACnXK3CrACGATGCTACTTCTATG 
GAGGACCTCAGCAGCACGCCCCTGTTCAAAGACCTGCTAGCGCTC^ 
CCAGGATGCTGAGGGGGACTTGGTTAGGCTGCTGTCAGAT 
AGAAGCGCCTCTTCCCCTGGAAGCTCCATGTCACACAGAAGGA(2^ 

45 

HUMAN SEQUENCE - GENOMIC. 
ATGAGGAGCGAGAGATGCTTTTATTTCZAATCACCTG^ 

ACAAACCCTTATTGGTCCTGTTTTGGACACTGGTGATTCATAGATGAATGGGACATG^ 

5 0 GGAATCTTCAGCAGGAAGAGGTGAGAAGTGAAGACGCTGACAGAGGCT 

CCCAGCCTTCCAGGGAAGTCTTTGACCAGTTTTGAATCTTAGGTG^ 

AG AG ATGG GAGG AAGACGGGGAG AATG CAT C AGG AGAGGTCAAGTTG ATGT ACAGG ACAATG AGCAGGC AG AGGAAG GGAGG AAGG 
CATGTCCTGGGGCCCX3GGCTGGAGGGAGGGTGCGGGTGGGTGAGGGGCTTCAGAGTGCTC 

GTGGAAAGGGGAGGTTGCCATTCCTTTGTAGTTAATATTTGGAGG CACGACTGATTTTTGAATAT AGTTT CATGCAGATGCTGGTC 

5 5 AGCCTGGAGCCACACTTTTCTGCTGGCTGTAGATTTrTCCCCTTTO 

GATAAACTGCAGCTAGAGGAGCCCTCCTTTCCTGGGGCIACrTTCA 

CCCT(2AGGGAGGAGGCAGCCCTC^GGCAGCTGCTCTG 

CTGTGAACTGTCGCAAGGTAGTCTCTTTGCTGTTAACTCT 

ATTGGAAGATAGTCATGCACATTCAACTATTTATTGTTTATGG CTGCTTTTGTG CTGTGACAGCAGAATTCAGCAGTTGTGACAGA 

6 0 GACCAACTCCCTGCAAAGCCTAAACTATTTACTATCrrGGCGTTTTTTAGAAATTATTT 

CAATCCCAGCACTTTAGGAGGCTGAAGCAGGTGGATCACTTGAGGCCAGAA^ 
TCTCCACTAAAAATACAAAAATTAGCGGGGCGTGGTGGCAGATGTCTGTAATCCTAGC!TACTC^ 

CTTGAATCCGGGAGGCAGAGGTTG CAGTGAGCTGAGATTGCACCACTGCACTCCAGCCTAGGTGACAGAG CGAGACTCCAACTTAA 



6 5 CTGT AACAAGCAT CAAAG AATCAG C TG CTAAAACACAAT AGAAATTTCTTTCTC CCTGAAAAATCTGAACAGAAG CAGGGTCAG AG 

ATCCAAGCTGTTTTGTTGCTTAACTGTTCTCCATAGCTG 

TTCTGTCTTTCACGACACAGCCTGATAGCATGTCCATGGCCTCCCCTCCCATTCT 

GAGCTGCAAGGTAGGCTGGGAAATGTTCTCTTTAGCTGGGTGGCCA^ 

AAAGGACAGAGTGGATCGTAGGGGATATTAGCAGTCTCTGCTACACCTTATCAAGAAG 

7 0 TTTGG CATT C C CT GCAGAGT ATTAGAAAAAAATTTGTAACATG CT CAAATGAAGT AGGTATCAAT AACGGAT AT AAAATAC CTCCT 

TCTTCCACTTCCTCCCTCAAATATTTATTAAGCCCTCATTC 

GCTCX3GAGTGCTTGAGGGAGATGTCTGAGAGTTCATAGCCAAAGAGTGAGAGC!TCTTTCTGCCT 
CACCCTGGGGCTGGGAGCTGGGCCTTGCAGCCTGCTAGGCTGGCTAGGTTT 

GAG AATT CT ATAAAG GAG ACTG CTTGAACAGATTCTGTTAG AC TGAAAAG GCAGAAGG AG ACACCGAGGT CACCAGAGAT AGAAAC 
7 5 TTCAGGAAGCAGCCATCACCCCTGCGGCTGTGGGAAO^AAGGAAG 
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CTTAGACCTG CGGGGAGGA 
AAACTAAAACCTAGAATCAACTGCTCCTCGGC^ 



AACAAACAGGGTTTTGAGTCCCAGCTCGGGCACCACA^ 
5 ACCAGCCCCAAGGCTATGACTCATTTAGACCCCCAGTCTGTGC^ 
CTGGAGCCTCTGTCTTCTGATACATGGGTCAGGGGGAATGC^ 
grrrGTTT GTT T GTTT G T T TGT T TGTTTO 
TCAGCTTGCTGAAATCTCTCXTrGGGTTCAAGGGAT^ 
GCGTGGCTAATTTTTGTATTTTTAATAGAGATGGGGCTTCACCATC 
1 0 ACCCACXTTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGC 

ATGCATCTACCATATGACCCAGCAACTCTCTCTTAGATATTTACCCAAGAGA 
TGATGTTTGTAGCCCCTTTGTTrTAACAGCCCCAAACTGGAA 

CAACTTATGTTATATCCATACAATAAAATACTACTTGACAATAAAAAGAATGAATAC 
AAAAGCX3TTATACTATGTGAAAAAAAACAGGCACAAAAGATCACATACTCT 
1 5 CTGTGATAGAAAGTGGATCAGTCATTGCCAGGGCCT 

GAT AATGGAAATGTCCTT ATATAATGATTGTGG TG CTGGTTACACAACTG T T CAGTTGGTAAG 

TTTTACTGTGTGTGAATTGCACCrTGATAAAGCTGATTTTTTAAAAATTGAA 

TGTTTCT AATGCCACCCT CTTG CAAGCCCAT ATC CAAGTAGTGG ACCAAGGCAT CTT ATAAAAAC ATAAATTCAATCATAT CACT C 
CTTGTTTAAAACACTTGAATGGTTTCTCTTTCCT 

2 0 TCACTGACCCTGACATTTGGAGGG CCT AGAGCAATGGG ACAGAGG CC CATGT AC CACGAT TC TAAATATGTACAGTT AGG AATCAA 

GCCAATCAACTGCTAAAAAGTAAGTTCTATCCTTCTTTGATAAATGTACCTTCATGATAA^ 
GTTTGGAGTTCTGTCCCAGAATGTAAGGTTAACGAAAGGCAGACGCrAGACTATA 
CCATCTGAAACCTCCAGGCTACCTGCCAGGCCGCTC 
GAATCAGAGAAACTGGATGTCAAACATGTATCTCTTGGTTTTCCCCAA 
25 TGGGTATGCATGCACGTGTATATTTGCATCTGTGTGCATGCATGTATGTGCATTTCTATAT 

CAG GTGT ACATGTACATGTGT AT ATGT ATGTGT CT CTGTGT ACAT GC CT ATGTGTG CATG TGT AT ATGTACATGTGT ATGTGTATA 

TGCGTCTGCATGTGTATATGTGCCTGTGTTGTGCATC 
AGTTGTGCTGAATTCATCTCAATATCCTTTCTACCrGGTG 

3 0 TGAGGGCCAGTACAGCAGTTATTAAGTGTGGGTCTTGAAGCCAGACTGATC 

AGCCGCTTTGGTTTATTTTCCTAATCCCGTAAAGCCTGGGTTTCCTTATCTCCCGGTGGGGA 

GTTGGTGTGAGGATTAGGTAACAACAATCAGAGCCAACATGGACCAAGCCAGGCTGTGG^CCAGG 

AATGCTAGCTCTCGTGCTCCTCCTGGCAATTTCACGGTGTAGGGGTGGT™ 

T ATTT ATG ATG GTGG AGAGGAG ATTCAGGATTCTT AG C CC CTGGG GGAAGCTTC C CCTG AGTG CAGCT CAGATT CTGGCTGATAGG 

3 5 GOTGTCTGGAGAGGTTGTCTGAGCCAGCAGGCTGCGCGCrcrGCCrGGCC^GCT 

TAGTGCGGCTTATAGCATTCAACCTCCTCTAGGT 
GTTCCCGTCAACATTTAGATTAAGCGCAGAGCCGGTTCCGCC^ 

ACAGATTGCGGGAAAAAGTTAGAATGTAACGTTAGTTGTATGACAGAGAGGACCAAGCT 
GGGACCAAGCCACAGCAAGGAATGATCGTCAATAACATTAACGTCTAGAATTTA 

4 0 AAACATCTGGTATTCATTCCCTTAGTTAACACACCTCT 

GCACACAGTGGGTGTCTGGGACATAGTAGGTTCTCAACA^ 

TAATAATGTAGAGGCAAATATTATCTCCATTTAGAAAACTGAACAATGGTCGGAGAGTT^ 

GGCACACGAGGTTTAGACC(ZAGCTCAGCCGTGTGCCAGGGCTTCTCAGAGAAGAGAAGACCCT 

TAATTGGCCACGTCTAC^CCCTCCCTGCCTGGGTGGCATGCCAGGGATGGCT^ 

4 5 CCCTCATAAATTATAAAGGGAGACACATGTGGX3TCAAGATTGGGGAATTCAGT 

AATCAGGGATTTTCCTTCrCTTACCCrGGAGAGTGGCTGATGTAGGCAGTGGGGATTTGCT 
GTATCTCCTGGTCCTGTAGAAGAGGCT^ 

TGGCTGCTCAAAGGAGAGTTTCCTCATGTCCAAAATGGGGGCTATGGTTTATCT 
GAAAAGTGCTGTGCACTTTTGTGATTCGCACATCCAGTCCCCATTCAG 

5 0 ACTTTTTCCTGGAGATGTGATTATCAATGGGCATTAGTAAG 

GTTTTTTGAGTT CT TGAGAACATATATA'ri" rrrrGGT ATT C CATG GTCTTT ATC^TTGTCAGGCTGTCACTGAACTGTCGCTTGCA 
TTTGGCTGCCCCTGGTAATTTTCTTCCTTCCCCTAACTCACTGCTTCAA 

TTATATTTCTTCTGTGAATAAAATATGCCAGTGTCAATGCATAGACTTTGAGCTATCAATTCTAACCAT^ 

TTTTTGGTTTAT TCACAGG C TGTGGGATGTGGAGGTT AT CAAAAATC AG^T CTCC AT AAGTGAG AGGT CACTTCAGGCAITTCTTT 

5 5 G<!ACTTATCCAAAAAAGATTCCATCCATACTTCTTTAATGGAACGTGTGA 

CATTCTCCCCCTGCAGCAGGAGCAGCAAGAGCCCGGTTCTGGAGTGGGAGT^ 

AGCCCATTGCTGACAGAGTGCCTGCCCTCAAATTACACTGTAT CAGAG CTCGGGCAT AAAATTGG CAG CAGCGTTCTGTGTATGAG 
GAGACAGAGAAATACCTGTATTCAGCAGAACCACX^C^GGGACAGAAGAAAAGGAG 

CCGTCGCGGG CTCTTCCCTGGACATCTTTGTCCCCCAGGTGCAGAGGTGGCTTCC CAGTGGCTG AGTCAGGGCCACAGGAGGTTTC 

6 0 TTGGGGGGTTGGGGGGGCCGGGGTTTGGAGGGCAAGCCATGAAAATACCTGGGAAGGTAAACTTGAGGGCT^ 

ACACACATACAGATATACACACTGAAAACGATT CCATGTGT CT AGG CACTTTTATACAGTTTT CT A 

TTAATTCCTACTACAACTCCATTAGGCAAGACTGTTTCACAGATTGGTGAAGAAGAAAGGTTCA^ 

G AGTTCACCTATTATAAATGTACTTGTGTTTATAC TCATTTGT CAC CGACTG CAACC TAAACAT CT AAACACATTCTCTCACAG C C 
ACAATGCCACCACTATACACATTATATGGTCTTTTATAC^CACAACACAAATG 

6 5 TAT ACACAAATATGTG TTT C CAAC TATAGGCATATACT ACACACACAG TTT ACACAATCAC CT ACAAATTCT GT ATCATG CTTG CT 

TGCTAAAATTGTTTTGCCACTTCCTGTCCTTCCTCACCAACTGTAGGCCAATGTCCTCCT 
GCACCTGTTGGTACTTACAGACACCTGCACTAATAACCCTG 
CCAGCAACTCATCAGACTTATTGTTTGTATTATTTCT 
TCACAGCTCACX5ATTTGTAGTTTTGGTCTGTGAGTTGACAGAGTAGGCCCT 

7 0 GCCTCAGCTTAGGATTTCTGCACCACGTGGGTAGCTTATAT^ 

TACACACGTCCCTTGTTCCACCCATGGACTGTGAGAGGACCTCAGCT^ 

CCCATAATCAGTTCTGACACCCACTTTACACTCAGAACTCAGATTTGACT 

CCCAGATTTGGCATTAAAATCTCAGCCCAACAATGCATATCCAATCT 

7 5 TAGTTTAAAACAACTTTCTGCTCTGCATGTATGAATACGAGTGTGTGAGCGTGTGTAGCT 
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GCCAATGACTCCTGTGGACCTCTTCTCAGAGAAGTCCCTTCAGAATCAA^ 
CAGCGTTTCATTCATTCACTGAATATTGGTTGAGCACT^ 
ACAGATAAACAAACAAATGTGAGATGACAATAGTTATGAGTGCTCTGC^^ 
AGGGCCTTAATTTAGGGTGTGTGGTCAGGGAAGGCCTCACTAAAGATG 
5 CTTTTTCCAGCAGAGGGAACACCATGAGCATAGGCCT^ 

AAAAAGGGAGCTCAGGAGCAGAGGCTGGGGAAGGCTGCAGGGCCT 
ATCCTGAGCACAGGGTGATGGGGCAGGAGAGTTTAAGC 

GCTGG GCATGGAGATCTAAGG ATGAGG AAAGG ATATC CCAGAGTGC CAGGGCKXZG 
GGCCAGAGGAACCAGGAGGGCTTCCTGGAGGAGGTGGCATTTACTCTGGGC^ 
10 AGGCCTGGGGAAGAGTTTGGGCAAAGGCTTGGCAGTAA 
GGTCACAAGACACCXTTGATGACTGGGACCCCATCT 

ACATCAAGGAAGTGGATTCCTGCAAACCrGGCACCCGAGGGTCCACTTCCTCAATTTCTG 
AGTGTGGGCCACAGGCTAGGCCCAGAGAGCCrc 

CAGCTCCTGGGGACTCCCAGGACCACAGGCTGAGACGAGACGCAGGGTGGCTGGAGGAAG^ 
1 5 GCTGGGCGAGACTCTCCACCTGCTCCCTGGGACCATCGCCCACCATGGCTGTGGCCCAG 

GGGTGTGGCCGCCCCCGGGCCTTCTCACTGCTCCTCATAGGCCTTAGGGACAAAG^ 

AGAGGCTGGGGTTGAGTCAGAACCCAAATACTCAGAAATGCAACCTCTCAGAACTGCTTTAA 

ACCCTCAGACACAAATGAGAACACTGAGGCTCCAAGAGTGGGAGTGACTTGC 

GAGAGGAATGCAGGTGTCCTGGGTCCCAGGCTGGGGCCCTTTGCTTC^^ 
2 0 CAGAATTCACAGAATCIGGCTTGTTCTTCC^ 

CTACCTCATTG CTGTCTGACCCTTTGC C CTATTTC CCTTTT CTGCCTGGTTCT TAT ATT ACAG GAATAATATT AT ATG CT CATTT A 

TTATTTTTTCTTGAACGTTGGCTCCCCCATGAGAAGCAAGCTCCACGAGGCT 

GTAGCTCAT CC CTATTGAG CACCAGGAC CAAGG GCT CAAT AAATGTTG CTGAATG AATG AAT AAATGAATGAATG AATGGGTGGAT 
CTTTGAG ACGGAAAAGGT CTT AGATT CTTCTAGTCCAACCTTGC CATTG TACAGATTGG AAATCTGAGG CC CAGGG AGGATAAGAG 

2 5 ACTTACTCAGGGTCACAGAGAGTGACTGGCAAGGTTGGGGCTTGGAGCACGGCCT 

ATTTTCAGAATTTTTCCTCAGTAGCCAGGTCTGAGCTCT^ 
GGATTGGAACAGCAGGTGACCrCTGTTACCTACCTGGAG 
GCGGGGAACAGAAGGCCCATGTTGGGGCCTGGGGCTGGAAGTAGAACCT 
CCTACAGGGGAGTCCACGTGGCC^CCTGAGCCTCAGAGCGGGGTGCAGGAGCT 

3 0 CCCTCACATGCTCCTGGGGGACCCAGCCGCACGCAAACTCGAATCTTCCGGAAGCAGCAGCTGT^ 

TTTCTGGGGTACAGGGTTCAGGGGATGACACGGAGAGCTGACCAGCAGGACTGTGCACTCCCTCCCCCA 
GCTGCCITAGCCCrCRGAGGGGGAGGGACrGGTTTTCATGAGGCCTTG 

AAGGTCTTATCTGAAAGCGAGCCTGGATTGTTCTGCTTAAGGTCAAAGCAGAGAGTACAAGGAGAGGGCGCAGGT^ 
GGACTTTCTAGAACCCCAAGGTGAGACATCCAAAGATGGGGAGGGCTGCCTTAGGGG 

3 5 TGAGCACCTACTATGTGCCAGGTGCCAGAAAACCrrGCCCTGGCTGCCCTCCAAGGGTTCTTCCAACCT 

GCTCCAAAACCTCCCCCAGGCCAGGCTCCAAA 

CTGGGTGAGGTCCCCAAGGGAGCGGATATTCCAGGGGACTTCCC^TTGGGTC 

GCCATTCTCATGCTGCCCATCAAAAAACTCTATAATTGCAGACGACGGTAGTGGCCGTGTATTGAGCAGTACT 

4 0 TGGGACTGCATGACCACCCGGAGACATAAGTACTTCTGCCTGTCCCCATTTTACAGAGGA 

TCTTCCCAAGGGAGCTTGATCTTCTTGTCCCCAGTGCTGCGCCTATGCCTrc 

CCTCAGCCCCCACCACCCCAGGGAAAACTCGAGGTCCCTCTGCGAGGCATCAAGGCACCTCGTGAGCGGGCCCTGCCrGTOT 
GACTGGTCTCTCATCACGTTCCACCTCCCGCTCTGTGGTCCAGCCATGCT(^GCAACCTGCGGTTCCCTGAACTGAGCAGCT 
CCTTTGCACTTGCCGCTCCCCCTGCCTGAAATGCCTTTGCTGCCGTCCrrGCAACCAAC 
45 GGTGCTGCTGCTTCTGGGAAGCCCTCCTTGACCACCCTCAGCCCAAGGATG 

TCCTGCTCCCAATCTCTCCTGTCTGCTGCATGATTGATTATTTGAGGGAATATAATAATGGCCAGGCTCCTGTTOT 
GCCACX3TGCTTTTCCTTCCCAGGACTATGCCCATTTGAATTATGTCATTATT 

GG AG CTG TG AG ATCTGAGAAGC C GAT ATGTCTT AATGGG CTT AGAACAG TGTCTGGC C CACAGG CAG TGCCACAT ACAGGTTCGCT 
AAAATTATAATTATTTTAGACGGAGACACAGGCTGAGAAGGGCTGGGAATTTTTTAAGTTCA 

5 0 GAGTGCAGGGCAGTATGGCAGTGTTTGGTGACAAATT^ 

TACTCTG G CT AATTTTACT GTGC CT CAGTCTC CCGATCTGTACAAAGGTGACACTG TCTT AG AAGAG CACCTGGCACC CT CAGGAA 

C CCACCCC CACCTT C CCAC CTG ATCGTACTGCAG AGG AAATT AC CT AGG ACAAG CACAGC CT CAG GGGAG AAAGTGGAGGAG G AAG 
CTTCTCCCAAGGGTATCZACCAC!AGGGGCATGTGTGGAAAGTGAGTG^ 
55 CCnXSAGTCACCTCCATGTGGCAGCCCCCTGAATTCTGATGCAGCCTCAGAT^ 
CTGTGTGTAGACTCCGGAGCTCCCAGCCAGCTCTGGGGCCTTGGACT 

ACCCCCGCCACTGTTTCCCCTGTTTCCCAATTGCCGCACCAGAAGAATCATAGTTGCTTCTTGTT^ 
CTGGACTCAGTGACCACTCAGAGACTCCTGGCTGTGCCrTCTC^ 

TCAGACCCCTCCTCTGTTCCCTGATAGGTCTTCACTGAGTCAAGAGCGCGGGTCTTAGCCACAAGC 

6 0 TGCCTCACCACTTTTAGTCGAGACCTCGAGACCCCCATGCTGAGATCCTGAGCTATAAAGCAGGCCTCTCAAATTA 

ACATCCAAACAGTGCCCAACCTCTCCTGATCATGACTTAGGAACAACAAGAAATTGGTAATG 
GCAGGGAATGGAGGAACTGACCATGGCTTAAACAAACAGGGGATTATTTGGTCACCTAAC^ 
TAGGTGGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATG^ 

NNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
70 nnnnNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TGTTTGGCTCAGACCCGTCATGATCTATCACCTTGGGGCTAGGCACATGGCCAC 

AGAAGCAGAGGATACAGTAAATGCTCACAGTGTCCACCAA 

TCACATGCACACAATCTCTTAATCAGACCACTTCAGGACCTCCCAGTG 

7 5 CCACACACCTGCCTGAGCT'GGCCCCTAAAGCCAAAATTGTATTTATGTTTTACTTT 
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TGGAGTCTTGCTCTGTCCCCCAGGOT 

TCCTGCCTTAG C CTCCCAAGTAG CTGGG ATGATGGGTGT CATrTTTlTAT ATTGC CTGCCTCAGCCT C CCAAAGTTCTGGGATTAC 

AGGCATGAGCCACCATTTTACTTT cttc caaagtt atgttttactittcttctgaagc c ct acatgact ccccattgcctacataagt 

TCCCTACTTGGCCCTTGGGACCCTCAGACTGGCCTCACCltKrCCACT 
5 GCCCACT C CGCAGGTCATG CTGCT CCCACTGCCTGGAAGACCXTrCCC CAATTTCTACCTGG CAG ATG CCTATTCTACCCT CTAGAC 

CTGCTCACATGTCATCTCCTTTTTGAGGATTTCCCCAACCTG 
TCCAGGGCACATGTTGACTTTGTTTACATCTGTGACCCAG 

AGGTGAATCTGTGCAATTGACAGCAATGTGTTAGACT^ CTT CCCATTCCT ACACTCA 

ATATGGCCACCAGCTTCCCATTCCTGCACTCAGTAT^ 
1 0 TCTCCGTATTCCCAGGGTCAGTGGCTGGACCTGGCCTGGGAAGGGGTC 

TTGCACTCTACCT CATACCCGGTGGG CCCACAAAACACATCAGGGTGAT AAGCAGGATCTCTTTT C CCCTC CTTCG CACAGTG ACT 

TTGAACAG CTT CCGGATGATGTTG CCATCTCXK5CCAACATTGCTG 

CAGACTCCTAGTCCTTCACCCAACAAC^ 

TCTCTGATCTCTTAACTTCTATCTCAGTGGTTCTCAACAGGAC^ 
1 5 GCAT CTTGGGG CAGGTCTACTCCACAT CTAGTG GGTAG AATCCAGG GTG CTG CT AAACAC CC CACTG TG CACATGACAG CCCTGTC 

CCCAGTAAAGAACTGTCCGGCCCCAGCATAATACTAAGA 

GAGTCGGGGGTGAGGGACTTTAGTGTTGCCCCCACAACACCCCTGTCATA 

GCACAGCCAGGGGCAGGTTCACTCTGCACCATGACCTCCAAGX3 

TTCCAAGGGCTCTTGCCTCTAACACTAGGTCTCA 
20 ' AAGCTCGTCTGCTTCTGGGGCGCCAGCCTTTTGACT^ 

TCTTCCTCCTCCTCCTCCrCCCCTTTTCCCTTCCACCT(TTTTTTGGCT 

CCTCCTTAGGTTTTCGTCATCGAGGTGAAGACA^ 

GCAGAGCAAGCTGGAGGAGC2GCTTCGGGCCAGACAGCAAGAGCAGTGCCCTGGCCT 
CACTCCCGTCCTGCTGCTGCAGAGCTGCT 

2 5 TGAAAACITGTTCATGTAGTTTATAGCCCCC^CTCCCGG 

CCATGGCTTCCACCCCAAGCACTCTAGCCTCACAGCCCCGGAGCTCTGATGACCATCGTGG^ 
AGGAGGCCACAGTCTG<5AGCACTTCAGCACCTGGC 

CTCACCTCCCTGCCCACCCrC^CTTCCTCC(^CX3AGACCAGCCCTCTGAOTTCATGG 

3 0 TCCTCCCC!AGCTGGATCCTAACX3CGCACCTTGGACAGCGCC!AAGCCCTTAGC 

CACGAATGGGTTACACGGGGGTGGTCTCAGCG&CGCCCTTCG 
GCCTCAGGGACCCTTCCCCAAGCTCCAACCCCATCTCCAGTTTGTT 
GACCCCGCTCCTTCTC^TCCAGCCCCACa\AGGAAC^ 
AGGCTCCATCATTTTCCCTCCTCTCTCTTTTGGGGCT 

3 5 TGATATACCAACTCCCTCTGCTAAGCGTCrrCCACAGCCrCAACCCTCTTCTGT 

TTACAAGCCTCCCTGAGCCTCCGTGCCCCCAGGGCCCCCAAAGnrTCTCT^ 

ACCTTGGCATGGCCATCCTCTCCTTAACTrCCTCAGCTTG 

ATGACCCATCAATTCTGGCrTTCCATGGCCAGTGCCAAGCCrrCATCTTC^ 

4 0 TGCTCCTTGGAAGGTGGGGATCCTCTGAGCAGACATCCTCT 

TGACCC&TGTCrGGGACTTAATTGTCACGTTTT^ 

CATCACCTCTGTCCATCTCCTGGACACCAGGAGCCAGGCAGGTTTCAG^ 

(ZACCAGTGCGCCCACATGAAGGCACCTGGAGGGGACATTTGTCTCITGGTAGCCC 

CTTG AG CTTGAGG AATTGTTCATTCATCCAC TTAACAAATG CTC ATT CAT CTGGGCATG ATT CTAGAC CCATGGTTCTTAACTTTG 

4 5 GTGGTATGTGGACGTGCCTTGAGAGATGGAAAGAATCCTGGTTCCTGAGTCCTGCCCCCGGAGATTCrGATTTAA 

CAAGTG CTTACACAGTGAG ATATTTACAAGCTCCTTGGGTAG CTTGAATGTGCAG CCGGTGTTGCCACTGTCCTGGGTGATGGGGGT 
ACAGCTGTGAGCCAAGCAGGTGAAAATCCTGCTCrrCTTAGAACTAAGAGTCG 

TGCG CTTGG ATG AACAGG C AGT AAAT AAAAAAGTAAGACAAG AAAAATAT CAG CTGGT G ATAGCACAACTGAG AGGATGAAAGTTG 
GATGGTGC CIAGAGTGGCCACTTCAGTGCTGTGAAAAGATAACAGGK5AAGAATGCTGAGCCAT CGGGAAGGGGCTCTGG CCAGGGTT 

5 0 CCTGGCCTGTAGGGGCAGCCCTGAGCCCCGGGGCCATGTTGGGCCAGGCTCCTAGGACAG 

TACGTGGGTGTGAAACAGGAGATCGCCGAGATGCGGATACCTGCCCTCAACGCCTACATGAAGGTACCAGTGGGCCT 
GGCACGTGGAAGGGCATGCAGTATTGGTGGGGGAGCCCACGTACCATCCCT 
CTCTGATGGCCCCATCT/TCAATCCCTAAAGGCCTGGTCTGGGGA^ 
GAGCAACXSGAGAAGGTAAAAGAGCCACACTAGGAATTGACAAGTCTTC^ 
5 5 AGGAGTT AAAAT ACAAAGGTGAATT CCAT C C AGTGTC TG CC C C CAAGGT ACCCACTGT CC TATGGGAATG C CAACAGGTGG ATCAT 
AAGATCACACGGAGACATATGTGTGCCTCCATCATGGGGGTGTGGCCAAGCAGGCQ3 
ACrTAGAGTAGGCCTCCTCTGGGACAACTCACTCCCTTACGTGAACATG 

G CCCTTGTGTATTCTCCG ACTTGCAAACTCTGCATCTCT ATGAGATGGGAG CGTAGAGGAAGGAG CATCTTAAAGGCAGAGGCCAA 
GGTGGTCAGGAAAGGACTTCCACCAGCCACAGCCATCCAGGAAA 

5 0 TAGAGACCTGAT AGTGACCT CAGGAAGCTGGCAAGCGAAGCCCTCCTCCCAGCCCAG CGG CATCCTCACAGCTGG AGAGCGCTGGG 

CTAGK3ATCCTTTAGTCTTGACTCTGACACTGACTGGCTGGGTGATTTGGGGTGAGT 
ACTTGTGAAATGGGGCAGTGCCCTCTAGTATCACAGGAGTGAAGTTGGG^ 
TGTGAACCTGCTTCCAGAATGTGAGAGATCTTGACTCCTC 
GGAGAGAGCAGGTAGCTCCAGGAATCCTGGGCTCTGAGCAGGGAAGAGGGAG 

6 5 AGGGTGG CTGAGGCTC CACCCTTCTCCCCCTGGG CTAGAATAGGGGAGTGGG CTGTGAAGGGAGGGAGGGAGGGAAGG AGGAAGAG 

AAGGCAGGAGGCGAGG CTGGGGAACTGGGG CTCCTCATTCCACACCTGGTCTTAG CAACCAGCAGCCCAGAGG ATCCATCTCTGCC 
TCTTTGTGTCCAATTGGGTTCCTCGAAGCCAGAGATCATTCAGGGCA^ 

CCCTCAGAGATGGGGAGCTCACTACCACTCCGCCCTGACCCAGGCTGCCTGTCCCATTGTTGG<5CAGCT 
CTCTGTAAGTCCTACCCACTGGTCTCACCCCGCCCTCATGAAGAA 

7 0 AGAGCTTTGACTGGCAACCAGGAGACTCTTGATACACGTTTATTGATAAATGAATAATCT 

GTTAAGTGTGACAGAGCAGTTAGGTAGGTGGGATGAGATTTGAATT 

TGCCTCGTTGTAATGAAAATCACAGTAACAATGATGACATCTAAC^ 

TTCTGCATGTATTTACTCATTTAATTCTCACACAATCCCACGAGGGAGAGGCTAT^ 

AGCCAGGTTTGC CCAAAGCCACCCAGCTGGTGACAGGCAGAG^TGGGATTCGATCCCAGGAG CCCCTGGCTCCTTAAACACCTCCC 
75 AGGCTCTCCCCATATGACAGCCTTT^CCGGGCTCTGC^ 
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GCCCCCATGGGATCCTCTGOU5GGCT 

CTTGGAGTTTGGTGAGATGACCCCGCCJVGGCrCTGGGTGCTCGCGTCCTCAC^ 
CAGCTTGGATGAAACAAGGACACTAGGGCTCAAGTCCSGGCCCTGTCACT^^ 
CAAGCATCAGTTTCCTCAACTATGATATGAGATAAACT 
5 GGGTTATTTTGGGAGCCCCCCACCCCCGCCAACTGTTGTCTC^^ 
CCCCAGCTCTGTCCCTTGCTAGGCTTTGTG^ 
GAGCAAGGGQSCXSCATGTGATACCCACCTGGTAAAGTC^ 
AACTTTAAGTCTTTTCCAAATGGACTCTTCTCAAGCCCCTGGTCCCTCT 
TACCTTCAGCATGCTGAGTTTTCTTATTCTTTTAAACAAOT 
1 0 GATGTCAGGGCTCGGGGACGGGACATCTAGGCTGGGGTGTCCAGATCTTTCTGCT 
CACAGAGCCTGCTCAGCCTGCCGGTCTGGGTGCTG^ 
O^TGCCCCAGGCACTCCGCOSGCTCCCCCCGCGC^^ 

CTGGGTCCCTGCTGGAAAAGCATCTCTTTTCCCTGAAAGGAGAGAAGGAAAATAC^ 

G ATTTGATTT ATT AGATGGGTTTAG AAAGT CATGTTCAGGCTTG GAGGGACAG AGGGT GTGTGCCTTT CAATGTGAT AATAAGGAT 
1 5 TGGAAGArcAGGACTGGGATATTGACTCCAGAGAGTT^ 

AAAGAATTGGGGGCAGAGAGGAGGGAAAAGCTGCCCCATCCACAA 
GTTGGAGCTCCATCGGGCCAGCTATGGCTCTAGTTGTATCATGCrGGCCT 
CATCCTTCCCACCAAAAATTCAAGCCACTCGTAGACTGCTGGGCTTCCA 
CAGGACGGAGCTCCTGAGCTCCCCACCAGCTCCTCCCTCTGCT^ 

2 0 ctcccctgtgcacacaggcaggtgtgtcttaattccagctgtgccatgtggc^g 

ccaacctt cagt ttcc ctcagctgaaaaatt ag aaacag tcacj^tttacatcagag cttggttgtagggagcagaggaacaaacac 
atctgaacactttggcaagtcaaaggcaccgaatgagcgtgtat 
aaagaagggagaacaatcccacctccaaagaggttgtgaggatttggggacaagtgtgccc^ 
tttcagcatcttctgtctcctctgccttccctgctcct^ 

2 5 aatcacagggctaacaagcccctcttct ctctccacagcaagagcgtgtccccacagggcaacagcgt tgaccgcatggcagctcc 

gagagcagaggtaacccccgcccccacgctggccaggctctcaca^ 
tctcctgctgggtgctgctgtgtgcctgggacagtgctgggcaccaggaggaagg 
gggtcccccgagcccagctaaggggagggcatgtgaccagcgtgcggagcc^gaggagaatg 
cagtttacctgggcctcctctgagtcagccctgccttaggcatggaagcrc 

3 0 agcctggggaccagggtgggtgctcagccgagagagttcgaa 

ccatcctggagtcagggagacctcggttcaattcccagct 

ccaagcctcagtttctacatctgtaaaatgggagactaaagtatgtaaggcacttctata^ 

c ctcctt actg cacgcttctcctcaggc tctatttg acttcactggaaacag caaactggagctgaatttcaaagctggag atgtg 
atcttcctcctcagtcggatcaacaaagactggctggaggtgagttcagaagtgaggatggaggtgagat 

3 5 gggaaattaaaagtgaagatagaggtgagggtggaagtgcaattggaggtgaggatgaaggtgagggtggagg 

gaggatggaggtaagagtggaggtgaggatggaggtgagattggaggtgaggatgaaggtgagattggaggtgagggtggaggtaa 

gagtggagatgagattggaggtgaggatggaggtgagattggaggtaaggttggaggtgagattgg 

ttggagatgagattggaggtgaggatggaggtgagggtggagatgagggtgaa 

tgaaggtgaggatggaggtaagggtggacgtgaggatgggggtgagactggcagtaaggttggaggtgaggatggaggtgagattg 

4 0 gaggtgagggtggaggtgagactggaggtgaggatggaggtgagattggaggtgagattggaggtgagggtggaggtaagggtgga 

ggcgagagtggaggtgaggatggaggtgagactggaggtgagattggaggtgaggatggaggtgagattggaggtaaggttgg 
tgaggatggaggtgagattggaggtgaggatggaggtgagcgtggaggtgagattggaggtaagattagaggtgaagat 
agaatgg aag t gggattggaggtgagtttagaggt aaagatg g aggtgagg at ag aggtgaggaaggtg agtttgcaggt aag gat 
ggaggtaagattagaggtgaggatgggggtgagtttggagttgaggat 

4 5 agatgaggttgggaataaggatggaggtgaggttggaggtgaggatggaggtgagaatgcaggtaaggttaggctgagattagagg 

tgaggctggaggtgaggttgcttcaggcattgtatgcttcagagactgaggatggtctcagaaggcctcagggaagagatgctatc 
tgaggatagccctggagatccmgcaggttggggagcctgcgggaacaaagcctttggggaggcaggtgttcx;gtggcagcagaac 
attcttggagctcatgctttggttctgacccrgaaaaatgggttgcttagggcaagtaagttaacctactga 
tctctaatgggtgtctcagcaggggcactattgacattggggccagatggtccot 

5 0 cagcaatatccctggcctctccactagatgccagcagcactgctaaattgcaccaatcacatggct 

ggggtgaaatcgcaaccacttgtgtaatatgggacaacagtacatac^ 
actcagagtcagcctgtttgggccatttcctagctctgtttttcactagctttgtggacct^ 

tgtgcggcgcctggccaggtactagggtcattaggagacitaaatgaaacttcaaagccagggac 

5 5 cctgtgtggcccacagaagggctgtgttctgactcagtgagtggagg 

ttttgtattaggtcctgtcitcagacttagtgcrrcctgtgg 

cccccaggggtcccaggaccittaagagttgggttgggtccctagcgtccaggccctggagaggagctagtagtgacc^ 
agccagtttagcagtttccattttaccctaatcatagatctccatctcaggaactgagggctcttacagaggcct 
ctgggccact ttccagaaatagaaaac^tgtattgcctt cctgttttccggaaagtcctacttat cagtatg ctggctaacttctg 

6 0 6cctcaggttgaaggtgggggcaw5ftgqggcagggagg^ 

ATCCATGTTCCATAGGGTCGTTGTGAAGATGAAC^GAACACGTGCCCAAAGCCCAGGCCAT^ 

CTACTTGGTCTAGAGTAGAGGCACAGTTGTATCTGATGAATGAATGAGTGAGTGAGCGGGATCAGCAGCAGGCCCTT 

CAGCGTGTCATTCCCCATCATACTGGAAACTCCGTGGGGGCAAGGACTTATTATTATT 

6 5 GAGTTTCACTCTGGTTGCCCAGGCTGGAGTGCAATGGCGTGATCTCGGCTCACTGCAACCTCCACCTGCCAG^ 

CCTGCTTCOSCCTCCCGAGTAGCTGGGATTACAGGCCCGCACCAACACGCTTGGCTAATT^ 

TCACCAAGTTGGCCAGGCTGGTCTCGAACTACTGATCTCAAGTGATTCACC^ 

TGAGCTACTCCGCCCGGCCCAAGGACTTTTATCTATTTTGAGAGCCCAGAGCCTGTCACG 

GAATGAATAAGTGAATGAGTGAGATAAAAG CAG CTTT CT ATAT AAATCCAGATTCTGG CAT ATTGTAAAAC CCAAATATAAG C CTT 

7 0 crrAGAAGTGACCAGCTCAGCGAACTTCCTCGCTTTCCCTCCTC 

CACTAGAACGGGGCTGAGGGCTCTGAGGCGTGG CTCTGCTGCCTCTCCTCTCACCAGCATGGCATCCCTTCTCTTCCTCTGCAGGG 
CACTGTCCGGGGAGCCACGGGCATCTTCCCTCTCTCCTTCGTGAAGATC 
GG CTGCGTTGCTACTACTACGAAGACACC^TCAGCACCATCAAGTCTGTGGCCT^ 
TCCCTACGACCACCGCCCCTCACATCACCTTCTC^TGGGTCCCTCTCC 
75 TGCTGTAACAAGCCATC^CGTCC^GGGTCGCCTGGCCAGCCT 
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CCTCCTTACTCCAGCCTCTCACCCCCTTAG^ 

AGCTCACMGGTGAGGGGCTGGGAATGGGGCTGGGGAGTTAGATACT 

ATTAAGGTGCIT3GGGTTTTGCGTGTACTTTGTCTTCT 

TAGAGA!TCAGCGGGGAAGCrGCAGAGAGGTTAjGGGGAACT^ 
5 GAGCTAAAGTCTCTCTGGCTAAAAGCCTTGGACTTGAACCATCCACTCTCAGGGA 

GGACAGTTATTTTACAGAATGGTAACCGCATTGCT^ 

TGTCAGTAACCATGTCAGAAAGATACAAAGAAGTGA 

CTTCAGTACTTACACCCCACACACACATX3CAC^ 

GGCCATGCGCTTTTGCTGGCCTCTTAGGATTTCT^ 
1 0 ATTTGATGACTTATCTGATCATTT CCTTTGAATGATTTT CAAGAAGTG G ATTTGGGGATT AAAG CATTTG CAAGTCTCAT AGG CTT 

CTGGCC CACACTGTG AAATTACTGGTAGAAAG CTCAGAT CAATTTCTACGTACAC CAGCCAAGTAGAAGGATGCG CATCTTC CCGC 

ACCCAGGCCAGACAGTTCCTGTCCTTACAAGGTTCTCCTCAATAGCCTCAC^ 

CTGTGCCC^TCAAACCCACTCTTCTTCACAACrCTTAATCCrTT 

TTCCGTGAAAGTCTCTTCTCCTTCTCCrTTTTTTTTTTGAGA^ 
1 5 CTCAGCTCACTGCAACCTCTGCCTACTGTGTTCAAGTGATTCTCCTGCCTC^ 

ACCTCGCCCAGCTAATTTTrGTATTTTTAGTAGAGACMGGTTTCAC 

AATCTGCCTGCCTCGGCCTCCCAAAATGCrcGAATTATAAGCGTGAGCCACCGCAC 

TCCCCATCTGTACAATGGGAGGTTAGATTACTTGGA^ 

ACATAAAACCCTTTTCACCCCCTCTCCCTACCCCTACCTTACGCTTAGGCCCTTTGATTAT^ 
2 0 AGTTCCAGAGAGAGGACATAGCTCTGAATTACCGGGACGCT 

ATGGTGCGGCAGGCTCGTGGCCTCCCCTCCCAGAAGCGCCTCTT^ 
CTACAACACGATGCCATGAGCTGACX3GTGTCCCTG^ 

ACCAGGAAAACCTGGGAGGATGGGCAGACTTCCTGTCTTTGAGGCTAATGGAC 
TTACATCTGATTTAAATAAACCATTCCATCTGAAAGGGGCAGGCTGC 

2 5 ATGAGCATCCACAGGGCGTCATTTATATCTGGCAGGGGCCCTGGCTCTGAGACCCCTGTCCTGCCACCCCATTTCCTTGCTC 

CCTGTGTTGGGGGGAAACCCGAGGGTCCCCAAGCCIATACCCTTTCTCCCT 

CCCACAGAAGTGATCTGCTCTGGGTGCCCTCTGGGGTTCCACCAATGCCAAGTCCTCGGCTGTGTGATGAGGGTCAGAAGCTCTCC 

TCTGCCTTCCTGTCTCTGCTGCTCTGTGGAGCMGAAGGCAGGTCT 

GACTCTAGTCTCATCCAGGAGGAGGGGGACACAAAGCAACGGGA^ 

3 0 AAAATTAAACAGGTGTGACTCCCAGAATGGTGGTGTGAGGAGCTTGGTGGACCCTCTCCCCAGGGAAACAG 

GAAATACTT AAAAT CAACTG TTT AAAGT CT CTGGAAATTGTT C TAAGGGCACAGAG CACATGG AG AG ACAG TCGTT CAAGTAGATC 
ATTACGTCTCAGTAAGAACAGTGAGTCGGTGGCATTGGAGAC^CATCGTGTTCCCGTCCCCCCACCCCCAGCTCCATGTGATGGAA 
ATTCCACTCCAGATCCTCCACTCTGGATGGATGCAGCCAGGAAGATG 
ACCCCAGGAGGAACAGGCTGCCAGAATCTCTTCTCTCTGCAGCCCTGTCT 

3 5 GACTGAGTTTCCTGCAGTTCCCATACAAAGTACCAGGAAA 

AGGTGTCCACAGGGCCATGCTCCCTGTGAAATATGTAGGGAAATCCTTCCTTGCCriTTCTTAG 

CATGTCTGTCTCTGCATCCAAATTCCCCCTTGTATA 

CTTGATTACCTCTATAAACCTCCTCTTTCCAAATAAGGTCACATTCTGGAGTACTAGGGGTTAGGACTTCAAC^ 

4 0 TTCC^CATAACACTTCAACCCATGACACTCCCTTTACCCACTCAGCTCCCCCTCCTAGGGCAAGGGCTC 

GGCTAAGAATG CTGAAC CTTAAC C CTCTGGTAGAG CAGGGGTT CCATGCTGAGAGGGGCAAT CTG AG ATCAAGGGATACCGTCCTC 
CACTCCAGTGCCCACTCACAGAACAGGACATCACTCCTGGAGAAG CAGGCTGATTTCC CCATCCCCAGCTCTGATGCAGTGGTGCA 
G AGGTTCTG C CCAGGGAGAG AGTCAGGC CT T AGGG ATGG AGAATT CCACAG CTTTTC CTGAG GGGACT AACTTT ATTTGGGAAGCA 
TGCCGAAGAAGTTGGCAAAAATAATGGAAATCTTGGTGGCCGGCAATTAAGGAGGGGCTAGATACTAATGCCATCAAGAGAAATGT 

4 5 AAGGC CAG AAATTTAACAG AGAAC CAGATAAAG AAATAACGAAG AAT AGTGGTTCT AGGACTACAG T CATC CCTGAGG ACTCAAAA 

GATTGTGCTCTCGTGCCAGGCTGCACCCACACAGCAGA 

T ACAG AT CCAT CAG CAAAACACG CT AG CTCAAG AGGC TG AAGCATAAC CT CTGAC CAAGCAATGGCTGAACAATAAGCTAATCTGA 

CCGATCACTCTCAGGAGTGACTGGAGA7UVGAAACGCTGGGCTATCAGTTTCT 

5 0 TAAAAGATGTCTTC!AAACCIACATTCACGGCCAAAAGTAAGGGGTGAAACC 

AATAGGTGGTTTATGCAGACCCAGGTGCAACCTCTAGGAAACCAGGATAAGAGAGAAAAAAAAAAAAAAAAACACAAGAG 
ATACGAGCGGGGGCATGATAAGCTACACATTGTAGGGGAAA 

AGATAACAAACCAAATAACAAAAACATTAAGCCCTAGAGGGAATAGGAGAGCAGCATCTAGAGTTGCTTCA^ 
TGTTCAGTTCCTGATATGGTTTGGATCTGTGTCCCTGCCCAAATCTAATGTCTAATTATAAT 

5 5 TGGGAGGTGGTTGGATCATGGGGGTGGATTCTCATAAATAGTTTAGTGCCATTC^ 

TCTGGCCATTTAAAAGTATGTGGCACCTCCTCTTTCTCTCTCTC^ 
ATCTTCCACCATCATTGTAAGGTTTCCTGAGGCCTCCTCAGAAGCAGAAGC 
CTAACTAAACCTCTTTTCCTTATAAATTATGCAGTCTCAGGTGTTTTTTTGTT^ 
TGTTGTTGTTTTGAGACTGAGTCTTGCTCTTGTTGTCCAG^ 

6 0 CCAGGTTCAAGTGATTCCCCTGCCTCAGCCTCCrrGCGTAGCTGGGATTACAGGCACCCCCAACCTTGCC 

TTCAGTAGAGATGGGGTTTTGCCATGCTGGCCAGGCTAGTCGCGAACTCCTGACCTCAGG 
GTGCTGAGATTACAGTCATGAACCACTGCACCCAGCrrGGTATTTCTTTATAGCAGT 
AAAATTATAAGAATTGCAAAGACACAGCAAAGTGTAAT 
GCCTTTGAGGGAAGGATTCAGATGCTGGACTTAGCAAACAATGACTTT 

6 5 TTT AAAG AATTAAAAGAAAGTATTGTAACAATGACTCTTCAAATAGAGAATAT 

AAC CAAATTG AAAT TCT AGAGTTG AAAAGAACAATAACTGAAATGAAAAGTC CACT CAGTAGATTTGAGTTGGC 
AAAAGAATGTTAGCZAGAGAAAGTATAAGCAAACTGAGAGATAGATCAATAGAGATGATGCIAA 

GGAGTGAAG AAAAATG AGCAGAG CATC^GAGAAATGTGGG ACC CATGAAGTGTACCAACAT ATG CAT AATGAGATT ACC^ 

GAG G AGAG AAGG AGACAG AAAAAGGATGAAAAGAAAT AATGAT TG G AAACTTTCCAATTTTTATT TTTTAAAAAAC CACCAAT CTA 

7 0 CATATCCAAAAAGCTCAACAAACTCCAAGTAGTAAAATACAAAGAAAGCCAC^CTTAAACAC^ 

AGACATAAAG AGAAAAT CTTGAAAGCAGAAAGAGAAAAATGACTCAC CGCAT AAAAGG GAACTACATTAAGACT AACAG CTGATT T 
CCTGTCAACCAAGAATCTTACATAAAACAAATCTACCACTCAAAAATGAAGGTGAAATTAA 
AGAATATTCATTGTTAAGCAGACTTGCCATTACAAAAAAATATTAA 
TCTGTAAAGACTCTATTTTTTTTTTTTTTTTTTGAGATGGCG 
7 5 GCTCACTGC!AACCTCTGCCTCCTGGGTTCCAGCAATTCCCCTGCCTCAGCCT 



2036 



WO 03/008583 



PCT/USO 1/51291 



CGCCTGGTTAATTTTGTATTTTTAGTAGA 

ACATGCCCCAGCCTCCCAAAGTGCTGGGATTATAGGTGTGAGCCA 
TTCTGAGATACTGGGGTTAGAAATTCAACATATCTTTTGGGGGAGA 
TTAAGAAAAATTTTAAAAGACCTTGCAAGGTGTGATCTCCCAAACC 
5 CAAGAAAATTTTCTGCCACAAAACAACCAGACCTCAGCT 

TTCTCCAATGATTCTTCCACTCCCCACCAAAAAAACTATG^ 
GGAGTGCAGAGGCAGTCCAGAAGGCAGAGCC^^ 

G C CCTAGATGTGAGAGCTGAGGTGATCTGGAGAGAGTGGCATTC CTG^ 
AGTATTTCCAGTITAGGCrrGGCAGGAGTTACCCTGATTAATA 
1 0 TGAGAATGAGCAGAGGCAATAAACAATAGCTTTAGAGCCCCA^ 
ATAAAGAATGTAACCATAAAGATAAACAAGCAAAAAAGAATGA 

CATCTATGTGTGTGCGTGTGTGTGTGTGTATAC^CACATATATATTATTCAATGGGTGAGTTGG 
AATTCTGATACCATAAACATAGATGGATTCCATGCATGAGTTAAAAGACT 

CAGCACTTTGGGAGGTAG AGGCTGGCAG ATCACGAGGTCAGGAGTTTG AGACCAG C CTGGCCAAT ATGGTGAAATGC CGT CT CTAC 
1 5 TAAAAACACTAAAAAATTAGCCGGGCGTGGTGGTGCACACCTC 

ACTCGGGAGGCGGAGGTTACAGTGAGCCGAGATCGCACCACTGCACTCCAACCTAAGCGAGAGAG 

AAAAAAAGACTGTC AG ATTGAATTTT T AAAAAT CT AAATCT ATGTAAGTGTCATATGT AGAAG AGACACACATAAGGTAAAACGTC 
ATGGAAACATTGAAATTAAAAAAAAAGTATGAAAAAAGATATTCCTGGAAACCACT 

ATTCACAAT CT AC TTTTGTGTAAACTTGGTACT CAAACTCACTTGG TAGCACTC CTTGACCCTGAG CTGAAT AGTT AACGCTTACC 
2 0 AG CCAT ATGTGT C TCTG CTATT C CCTTATTTTT TG ATGCTTTTGGG GGGATG AATTTTGATTTT ATGT ACAGCTC AAGTAAG AACC 

CGTCAAAATCAGAAGTGAAAAGCTTTTGCrrGATAATGAAGCT^ 
GAAAACCTTGGCTCTGATGAATTGGCAGTGGGCTTCAGTCTG 
GTTGACTGTTCT^GGCGTACGATGTGCrrACAGGAACACACCAGATACA 

TGTAGGATGT ATC CAAT TT ACCGT CTGTCAGTACACAT ACT AG ACACCATGAGT AACCAG AG AAAT ATTTT CTG ACTG ATT AATAA 

2 5 CATT TTTAT ACCAGCGGT AC AAACT CATTGCAG GGATT CTAGCTTGGAAAAAACAATTGTTAGATGTT AT CATTC CT AGG AAACTG 

CTATGCATACTCCCTCCCTAAACATCTTGTGGAAAAGCAGTT^ 

GTGCAAAGAAG TTT AGAGG AAGGGT AAACACTTTTTTTCAGGCTGTGCT CTTG CTACCATCACAGAAC CT AAGATTCCATGGCTCC 

CX3AAAGAGTCCAGTGTTAAGAATGTCATGTACACAATTGACAGTGTTTGGAATGATGCTG 

ATTCTGACTATACTGATGCTTGCAAACAAGTCTGTACATAAAGCT 

3 0 GACTTCATTCTTGCACAAAAAGTTTATCAGCTAAAAGTGGAAATAACTGAGAAA 

TTCAGCTATTGTTCATTCTTTGCATAGTGGGGAAGTTGCTAACAA^ 

AACACAGAGTAAAACATTCCCATTAAAAATATGGTGAATATTTAATTAGTTAATTATGGGCCTTGAGC 

AG CAGGAGAT AATGGAAAATT ACCCAATTAGAAACAGG CTTCT CAAGT AGGCAT CT CATGGAAT AGGATG CTGGAAG AAGTCATT A 
AAAAGGCAATTGATTGCAGTGTGCTTCTTGGTTTGGGGACCCTGTTCCTGGTACATCATGAGATGTCG^ 

3 5 TCCTCAAGTTAAGATAATTCAAGGTGTGTTTAATAA 

AGTT CAATATC CCAGGGATAGGACCTCAGACCTGTTAGATTCCTGAGCCCAAAGGA^ 

AAGG AGGGGATTGTGTAGACATGGCCTCCTTAGAAGG ACTGGTGATCTT TGACTGGTGGGCGAGAT TGAC CCAAAGTAACCCCGCT 

GGGAGGCGATCAGAGGAACAAATACCCCGCCCTCCTC!TCCTTCCCCATTCTGATCT 

ACTGGAAGCCCTAGGGCCAGGCTGTTGTAATCCACACAGGTTAGCTTCCCAAGGTAGA 

4 0 ATGTGAGTGGGGAAAAATGAAGATATCTGGCATCCTCCACCCGTTTTGCCCCT ' 

ACTCGATGTTCCGAAGACAGGGAACACAGGAATTCCC^^ 

CCTGAAACCCAAAGTGTGAGCTGCTACGGGTATTCTTTATATAAAGCAGTGGAGGCATGATGAATO 

TTAGAGCCCXTTGCATCCGCAGCTGGTCACAGGTCCAAAGCTGATATTTTGAGCTACCT 

TTCCTCTGCCAGTACATGGGTCAAATAAGGGTCTTTATGTGGTGGAGCAACTC^ 

4 5 TAGACTTGTCTTTAT GAG ATTGCCT CAGTATT C CATTG ACCAGGACTATTGGG TAAGAAAAT ACTG AG AG G CACCT ATGTG AATCA 

CGGTGGGACCACACATAGTCCTGTCTACCTCCTTTCTTTTTTCTTTTCTTTTCTTTTTTTTTTC 
TTTTGAGATGGAGTTTTGCTCTTGTTGCCCAGGCTTGAGTGC^ 

TTTAGTGGAAACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACT 

5 0 CCACCTATTGGCTCAGTGGCACAAGGAGGGTCAAATGGTTCAAATTTCTAAAAAATATCACT 

TG C C CTATTGT CATTTG ATTGTAACT ACTATTT AAATT CAATTAG G AT ATTATCTG AG CTATCAGTTACTTAGAAGTGTGTTTTAA 

TGGAGTGCAGTGGCACAATCTCGGCTCACTGCAACTTCT 
GGGATTATAGGGATGCACTACCACACCCAGCTAATTTT^ 

5 5 TGAACTCCTGACCTCAAGTGATCCACCCACCTCGGCCTCCCZAAACTGCTGGGATTACA^ 

AGGACTTTAAAAAAACACTTTTTTTTCTTTTTCTTTTTAA 

TCAGTTTTAGAGACCTAGACAAGCTAAGGGAGTGAATCAAAATCAACAGATAAC^ 
TTTACGTCTGGTAAAACCACCTCCACAGAACATCATGCGTCGTCAGGTATAAG 

TTTATCTCTTCCITTGTAGCCCATGAAATAAAGAGAAAGTTTGTTCAAGAGATGATGGTAGTTTTCACT 

6 0 AAAGTAAGTGAGAACAATAACCCCAGGACAGGGAGGCTGCTGCTCACTGCTCC 

AAGG CAGCAG AG AGACATT TC CCAAAATAAACAGCGAGGGAAAT AAAGGTGTAG GAGT CACAGTGAGGAGGCTAAAGTGACTCCAT 

GTGTGAGAACATATGCCTACCATAAATCCTGCCTTTAGATCAAATTA 
CTGGGGGATAATAGTGTAGAGATCTACCTGTCTCACAGCTGCCCAAGACCATCCT 

6 5 T AT AACAAGCTGG ATT CGT CTG CTT CCTT CTTTGGT AT CCTGGCTCCTATGTTTG C 

HUMAN SEQUENCE - mRNA 

GAAGCAGGCCTGAGCCTCCCCAAAGGCAGCTCCTGGGGACTCCCAGGACCACAGGCTGAGACGAG 
TGAGAGGTGAACTCAGCCTGGGACTGGCTGGGCGAGACTCTCCACCTGCTCCCTGGGACCA^ 

7 0 AGCTGCGGGCCGAGAGTGACTTTGAACAGCTTCCGGATGATGTTGCCATCTCGGCCAACAT^ 

TTCACC^AGCCACTTTGTTTTCGTCATCGAGGTGAAGACA 
TGCTTTGCAGAGCAAGCTGGAGGAGCGCTTTCGGGCCAGAC^ 
AAGTCTACGTGGGTGTGAAACAGGAGATCGCCGAGATGCGGAT 
GTCTGGGTGCTGATGGATGAGGACGTCCGGATCTTCTTTTACCAGT^^ 
7 5 GCTCCGCCCGCGCACCCGGAAAGTCAAGAGCGTGTCCCCACAGGGCAACAGCGTTGACCGCATGGCAGCTCC^ 
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TATTTGACTTCACTGGAAACAGCA^CTGGAGCTGAATTTCAAAG CTGGAGATGTG ATCITCCT CCTCAGT CX5G ATCAACAAAGAC 
TGGCTGGAGGGCACTGTCCGGGGAGCCACGGGCATCTTCCCTCTCTCCT 
CCCCACCAACTGGCTGCGTTGCTACTACTACGAAGACACCATCAGCACCATCAAG^ 
CTCCCCTATTGAAAGACCIXXrrcGAGCTCACAAGGCGGGAGTTC 
5 GATCTGGTT03GCTGCTGTCGGATGAGGACGTAGCGCTCATGGTGCGGCAGGCTCGTGGCCTCC 

CTGGAAGCTGCACATCACG CAGAAGGACAACTACAGGGTOTACAACACG ATGCCATGAGCTGACGGTG CAGTGAGGG 
GACACCAG CAAAAAC CT TCAGCT CTCAGAGGAGATTGGG AC CAGG AAAACCTGGG AGGATGGGCAGAGAAGCAGGCCTGAGCCTCC 
CCAAAGGCAGCTCCTGGGGACTCCCAGGACCACAGGCTGAGACGAGACG CAGGGTGG CTTGGAGG AAGTG AGAGGTGAACTCAG CCT 
GGGACTGGGTGGGCGAGACTCTCCACCTGCTCCCTGGGACGATCGCCG^ 
1 0 CTTTGAACAGCTTCCGGATGATGTTC 

TCGTCATCGAGGTGAAGACAAAAGGAGGATCCAAGTACCTCATCT^ 

AC^GGAGATCGCCGAGATGCGGATACCTGCCCTCAACGCCTACATGAAGAGCCrcC 
AGGACGTCCGGATCTTCTTTTACCAGTCGCCCTATGACTCAGAGCAGGTGCCCCAGGC^ 
15 AAAGTC^GAGCGTGTCCCCACAGGGCAACAGCGTTGACCGCATGGCAGCTCCGAGAGCAGAGGCT 
CAGCAAACTGGAGCTGAATTTCAAAGCrGGAGATGTGATCTTCCT 
GGGGAGCCACGGGCATCTTCCCTCTCrCCTTCGTGAAGATCCrCAAAGAOT 
TGCTACTACTACGAAGACACCATCAGCAC 

ACCACCGCCCCTCACATCACCTTCTCATGGGTCCCTCTCCCACTCCAAAGCCCCCAGTGGCT 

2 0 CAAGCCATCAACGTCGAGGGTGGCCTGGCCAGCCTG^ 

CTCCAGCCTGTCACCCCCTTAGGGACATCGCGGTGGAGGAAGATCTCAGCAGCACTCCCCTATTGAA^ 
AGGCGGGAGTTCCAGAGAGAGGACATAGCTCTGAATTACCGGGACGCTGAGGGGGATCTGGTTCGGCTGCT 
AGCX3CTCATGGTGCG<3CAGGCTCGTGGCCTCCCCT 
ACAGGGT CTACAACACG ATGCCATGA 

25 

HUMAN SEQUENCE - CODING 

ATGGCTGTGGCCCAGCAGCTGCGGGCCGAGAGTGACTTTGAACAGC 

GCTACCGCCAGTTCCATGCTTTGCAGAGCAAGCTGGAGGAGCGCTTCX3 
30 CCCACACTCCCAGCCAAAGT CTACGTGG^TGTGAAACAGGAG AT CX3CCGAGATGCGG ATACCTGCCCTCAACGCCTACATGAAGAG 

CCTGCTCAGCCTGCCX3GTCTGGGTGCTGATGGATGAGGACGTCCGGATCTTCTTTTACCAGTCGCCCT 
CCCAGGCCMCCGCOSGCrcrcCCCGCGCACCCGGAAAGTCM^ 
CCGAGAGCAGAGGCn'CTATTTGACTTCACTGGAAAC^ 
TCGGATGAAGAAAGACTGGCTGGAGGGCACTGTCCGGGGAGC^ 

3 5 TCCCTCtAGGAGGACGACCCCACCAACTGGCTGCGTTG 

GAAGATCTCAGCAGCACTCCCCTATTGAAAGACCrGCT 
CCGGGACGCTGAGGGGGATCTGGTTrcGCTGCTGTCGGATGAGGACCT 

AGAAGCGCCTCTTCCCCTGGAAGCTGCACATCACGCAGAAGGACAACTACAGGGTCTACAACACGATGCCATGA 
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Table 100 

MOUSE NOMENCLATURE 
ICSGNM Rac2 
Celera mCG13343 

5 

HUMAN NOMENCLATURE 
HGNC RAC2 
Celera hCG41833 

10 MOUSE SEQUENCE - GENOMIC 

CCTCCAAATGCCACCATACTCAGGGCTCAGGTACTCAC^TC 
GTTTGGATCCTTCTCCTTCTTCAGATTTATCTGACAGGTCT 
CATCCCCTCIX3GTTACATTCATATATAGTTAAGCAATACTATGTCT 

TGTATGTAATATATATAATATAATAATATAATATATAATTATATATTAATATAATATAAAATTTATACTAATT^ 
15 TATGTAATTTATAATATAATAAATATATAATGTATAATAGAATAGAATATATAT^ 

TGTGTGTGGACACATGTGTACATGGGTC!AGCTCCTTCCTTCTATTTTGTGAGTCCTC 

GG CAGCAAGTGTCATCTCGCT AGT CAGGCTAGCATTTTGG^ CAGAT AGGAGATT CT ACT ACT ACCAAGTTCCGAAGACTGTGT 
AATACATATTTGCAGTTTGTCCGCATCACATCTGCCCTCCACATCT^ 
CCTGTAGCTGCTTGGCCTTTTTTCCTGACTTCTGCCCCACCCACCCTGG^ 
2 0 ATTCACATCCTGCGGTTTTTCATTTTATGTGTGGGAAAAAAA 

CATCTCCCTTTCCCAGTGAGATGTCTGTCTGTTTGTCT 
CTTGTTTTAATATGTTGCGGGAGCCACATAACATCT 

2 5 GAGACCTCCAGCGGTTGGACZ\G AGTGAG ATTGTG TACAC AGGCAAAGC CAGT CAGTT CCGGGTTCAAT ATGTTCCCGC C C CT CTTG 

TGTGGGCTTCCAGGAAGCTGGACGGATTGGGACTGGGGCTGCTGTACCCCAGCGGG^ 

GT ACT AGTGAG AGGTGAG CAAGGGTGTT AGACCTATTTT AG AG AAGT AAG CAG GAAGT AGGG ACATG GTTGCAG AGAAAAGGCAGC 
AGGGCATTGGGTAAGGC^CTGTCCCAGGGTCAGGCCAGAGTCCCC^ . 
CCAGTAGGACAGTTCTTATCATAGGGCCANNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
CATACACACACATATATACATATACIATACATACACACAC^ 
TCTGTAATCCCAACGCTTGGGACACAGCAGGAGAATGAGGTTAAGGCCAGCCT 

ACC CCAAAAGG AAATGAAAT ATTTAAG AGAGAGAAG AGTCCTGG GT GCTGGAAC CCAGAAGG^ CTT 

3 5 AGGCCAGC!TCGAC!TTTAAAAGTAGTTTAGTAGGAAGC!AGGAAGCCTGTC^ 

T AATGGGGG CCATCTGG AGAAATAGGAAGT AGGATGT ATT CCACATGG ATGGG TGAT ATGAG CTGT CCCAGGCAAGCAACTGTGAA 
ATGTTAAACCAGGAAACAGGGCXSTAGAAATCACAGAAGGATGCTTCTAAAACACG^ 

TCGCACCCACCCAGAAATGATTAAGGAAGAGATAGGTT AATTAGCG CACX3ACCCATGAAGCTAGAACCGACTCTCATG CCTGATAC 
TCGCGATAGAAAGAGCGCATTTCCCTAGTAAGGLRAATCAGTGGGAAAAACCAACGTA^ 

4 0 CACTGCGGTGG^GGAGAAACTGGGTGAGCTCCAAGCTGACTGGAAAAATGCTAGCT 

CCCACTCTITCTTTTCTTTTTTGCCTGCCCGACTGTCAAAAACA 
AGGGGCTGGCTAACGCATTGCCTGCCAGCAGGAGCCTGAATGAGTCCT 
ATCCAATGACCCAGAGGATTGGGGGGGGGGGACGGGGGGNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 
NNNNNNNNNNNNNNNNN1WATGGGTCAGGTCAG 

CT ACITTGTGGG TT CTTCTTT ATTCCACCCTT CTGCAC C C TT ATTC CACCGTTT CAAC C CCTAAT ACTAGGCAAGAAAGAATGAGA 
GAGGAGAAAGGGGT AATGTTATTGTTAAAAGAGT T C CTGCTAATT AGG GC C^TAG AGTTCCTTGGGGCAAGTTTGAGCTT CCCTGT 

5 0 CAAGATATCTAACMCTTTCCTCCTCCTCCTCCTC 

AACAAACAG CAACCAATCAACAAACAACAGCAG C CCAC CAATGGGT CTTT AGCATTTAT ATATT CTCCGAAAAGTCCCTAG AATTC 
CAACTATCACACAATTGCAGAAACTATCTGCAGCTGGCA 

G ACAATCTG AAG CAGT CC CATTT CC CACACCTGGG ATT AAAACG AAAACATGT T CTT AT AGTATTTCTGTA rri"i"i "l AAAG AAT CC 
AAAATT CT CACTAAC T AG CCAGAGATATTCTTTTTG AAAATATTTT ATTTT ATTATCT^ 
55 ATACATGTCTGCAGTTGCCCCTGAAGGCCAGAAGAGGACGCTATCCCCTGGAGAGTACTCGCTCATAATC 
AGCCCCTTTAAATATGTTTGAGTTTGGTTGTGTGAAC^TATTTCCCACTCAAATGTAAC^ 

CTCTAAAAG CTTTT AAACT AT ATTG CTT AGGGTTCAG CTAGTGG GATGG ATGAG TGG AAGG CTAG AGTGAAGGATCATGGGTTAC C 
TAAGGTTGAGTGAAGAAAGGTCCTGAGAGTCTCACCCTGAACC^^ 

TG CAG CAG CT AGAGAGGTGATGT AC CTGAAAG AGGGGAGCTCCATC CATC CATCCAT C CAT C CAT CCAT CCATCCATG CAT CCATC 
60 XACATCTCTGTGCATCCACCCATGCACrCACCCATGTGTGTATCCATTATCCA 

AACCATCCATCGATGCATCATCCATCTGATCCACCTCT 

G(ZATCATTCATCCATGCATCCATCCATGCATCTATGTATCCATGTAG^CA^ 

ATCCIATACACTCATGTATCCATTCATCTATC^GTATATCCATTCTACCCATCCCT 

TTTGCZACACCATTTATCrGATCCATCACATGTACACA 
65 ATCCATGTATGCIATCGATGCATCTGTCTGTCTACCTGTCTGTCTGTCTGTCCATCCATGAATT 

TGACCCATCATGTGCATATATCTTCATACCATGATCCACCACCTAT CCATTTGTGCATG CATGCATCCATCTCTCTCTCTCTCTCT 

CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTCTCCCTCTCrCTTTC 

CTGAAGTGAG CT AGG GGAGCTAG CTG CAGAGGG G ACAAGGATG T C AG ATC CTAG CT CTG CCACATG ACTAGGTTTCTGGTG AGGAA 
AGTCCCTGATGGAGGGAATGGCCTGCGTTCTTCTGGGGATTTCCTGG 
7 0 AGCCTCCCAGGTGGCATCTCTTGCTCAGCTGTGGAGAG^AAGGAGTGAGCACGACAGAGTGGCTTAGCG^ 
AAGAACAGAGAGGGGCAGCTC^GTGGCTCTGGGGAAGGTCCTGCCAGGCTG 

TAATCTATC^UVGGTAGAGGGCACTTTGACCCATATGCTGCCCACCCTGAACTCTATCCCACACCCACT 
GCTTTCCATTTAAAAAAAAAAGGTCATTTAAAAAAATTATTCCATT^ 

AGAGCATTGG CTGCTCITCC^GAGGACCCGGGTTCAATTCCTAG CACGTACTAGCAGCTCATAGCTGTTCAGGTCC^GAAGATCTG 
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AGACTCTCACACAG^CTITACATGTAGGCAAAAACATCTATGTA^ 

GTGTATGTGCCTGTGCATGCCATGT6CAAGTTTAGAAGTCAAAAGACAA 

TTCTGAGTTCTGGGGATTGCATTCAACTCACC^ 

TCAATCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTI^ 
5 TGTGTGGAGGTCACAAGGCCAGCTCAGGACTCAATCCTAGTCATCCCT^ 
GGGTGTTCCTCCTTCCACCTTCTGTCTCACAGTAGGACT 
TCTGAAGAAGTTCAGGTTCTCCTGCTGTGTGGCAAGC^ 
ACCTCTTATTTCCACAACAATTTAAAGAGTCAGCTCTGGGGAGG^ 

TTAGGGAGGT ATAG CAATTCT AAAGGT ATGTGTGGCTGATGATAT AGCCTCAGGGTTT AGAAAG CAAAGATTGGATGGGGGTTTGG 
1 0 GCAGAGCTACTGTTGTAGTGGAGATAGGACACCTGGCC^ 

AGACTGAGCACTCTCAGCAAGCATTT CTGATGGT AAAT AGTTTGCAG CTAACTTACAC CCATTGACT AAGAATGC CACAAGCCTGG 

TACTCAAGTCTGTCATGCCCTCAGGGCTCAGCAGGGCACGGTACATGGTGaSTGGTT 

GGAGGCATAGGCTTCCCCTGGCCTGTGGACATGAGTGATACACCTTCAA 

CGGCGTAGCGCTAAAACATAAAACAA03CACACAGAAATATCAAGATTGGTAAATTGTGTGT 
1 5 CTCACCACCCGTGAGCCCACGTGGAGGCAGAGTGAGA^ 

GGGTGACAGACAGCACATCAGAAAATATTGTTCTCTGTAGCA.TCT 

GAAGCCACTGTTAGGAGGAGCAGTGGCAAGCAGAGGTAATGGGCAACAGCTCTCTC^ 

TOTTCAGGCC^GGACTGAGGCTCCACGCCTAGCACTGTGGGCrTAT 

CTATTATAACATTAAGAGGAAGAACACTGTCACTTAAAAAAAACAAA^^^ 
2 0 AAAACTTTCCTCTGTCCACrrGTCAGGTTTGTTAGCAAATT^ 

GAACTTCCTGCCAAGTCTGATGACCTGAGTCrc^ 

GTGCGTGGG CACGCA.CACAAATACATTTTCCATTTT AAAAG CAGCCATGGGTCTCTCTrTTGTGC CAG GTTT CCACATACACAAATC 
CAATATG CAAAGGCTGAC ATTT C CCCTTCAT CCAAGGAGTGGCAAGGAAATTCAATTTGAG AA 
AGTAATAAGGAAGCAAGAAGTGAATCTTTTGAAAAAGGTGCCTGACCTTTCTGGGAGGAAAT 

2 5 AAGGAGAAGGTGAAGATGCTAACAGG CAG ACAGAATGACT ATCACAAGGATGT CAGGT C CCTACTCGCTGCTGCAGGTTGAACG CA 

GGACAGCCCAGGACTCTTATGTCATTTGCIT'GAGC^ 
TCCTGAAGGAATGTAAGAGAGGCACGTTTCTCCTAGGCTTGGAGCT 
GGGAGGACTAGCAGAGAAAGGGGCOVAGGTTTGGTTTGCAGGTGACAGTCT 
GAAAAGATAGGTAGAACTTTGTTTAATAGATGGCACT 

3 0 GAAAGACATCTGGAGCACATCGATTTGAGGTGGGAAGATCAGCCCACTGTGGGTGG^ 

TGTTAAATGAAGAAAGGATCTGAGCAGCAATGCGCTCATCGTTGTTTCCTGATTGT 
TCTGACGACTTCCCCACTGTGATGGATTCTAACCIT^ 

GT ATTTT TTTT TTC ATAGTGAT AGG AAT AGAAACTAAGAAAGCC CTAGAG AAATCAAAT C AACATCT CACAAAAC CT ACTAGCAG C 
CCCTTTGTGATCTAAAATCCACTCTGTGTATCAAGGTC^ 

3 5 AATTG CATAGG AATG CTT AAGAAAG AAT CACTCAAAGAGTATCAATCATT AG GGAAGAAAAATTAAAGTTGACCACATACAAAT AA 

ACCTTT ATAAATATCACAG CTTCAAAAAAAATAATTC CACATACTGGAAATG AACT ATGAAACG GTT ATAG CAATTCACACTTGAA 
AAATCAAACAC CGAAC TTCT C CAAATAAATATG AGGGGAAAAAATACTC CAGT AGAGAAATGG ACAAAT ACACAAATAGTAACT 
TGTAATAGGAAACCCAAAAGTCCAATAAACAACTAAAAACACAAATGCCCAACAAAACACCT 

ACT T AT AAACACAAATG C CCAACACAAG G CX5AGTGT AGTTGAT ATC ATCAAAACGG CAAGAAG CCTACG ACAGTTTGT AGGT AGGA 

4 0 TTTTCGGTTTGGAGTGGTTTCCCAGCTAGGATCTTGCTC 

GGACTCAAGCTCCCAGCAATCCCTTTGTCTCAGCTTCCT 
CTTCTACCAG CTTGCTCTACTGCATCTGTNNNNNNNNNNNNNI^^ 
NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNN^ 

4 5 NNCAATTTCCCCCCTTCCTCCCCTTCCrCCTITTTCT 

TTCTCTCTACACACTGGCCCTTGCTTAGGC^GCACC^ 
GTTTTCCCTGCTGGTGACTCTGAGGTCACTCTGTAGCT 

CACAGTAGGCACTCAAGTGTTTGCCCATGGGAGTCAGGCCAACT 

5 0 CTGCCCTGGCTTAGGAAGTGAATCTTCAATTCTCTAGGGTGCCTGGGTTCCCTTTGGAGCCTAA 

GGCCCAACCTAGCACCTGTGGGGCCCCAGGTCCTCT^ 
AAGTTCCrCTTCOSTTTCCTCCCGCCTCTTGAGCTGGGGTG 

GGGTGATGGGTAAGTACCTGGCTGAGTTGTCTCCCACCCCTACCAGGATGCCTCTGAGATCCTCCCTGGTACA 

5 5 GGCTCCCAGGAGGTTCTGGGGAAAGTCGCTTCCTTTCC^ 

GTCCTTCAGCCrTACCCCACTCCGGAAGCTACTTGGGTGGAGGGCCTGGACTGTGGGTGGGGTGOT 
AGCTGGGACAGGGCTCCTACCGGGCCAGCATCAGTTCAGGGTACAGGTACGAGGAGGGAAATCAGCAGCT 
GTTTCCCAGGCTGGCCCAGGCTCTAGTTGCTGTCTTGC^ 
CA.GCCTCTGCTCTTAAGGACTCCAGAGGGCAGGCTGGCrcCAGCCCT 

6 0 GGCTCTCATAACCCAAGGCCTGAGTTTCTTCATGTGTCTCCCATATATTCCGA 

GTCCCGCTAGATCCATCCTCAC^TGTACTAGACAGCCATCTTCAGTTCGAAACGGGGAGGGGCTGAG^ 
CTTAGGTCCCAAGGTCACACACACTTTGGCTCTGCTGGTC 

GGAGGGGGCGGACACGGAGGGCCGGAAGTCAATGATTCTATGCATTCAAAGTT 

6 5 GTTTAGGGGGCAGGGACAGCAGAGCAACCTCCTCGAGTGGGAAGTGACTTAACGGAA 

TCTGG CCTCT CAGCT^CCTGCTG AATGACCra 

GTTTTACAGGTGGGAAAGACCAAAACTCCX3AAAAGTTACATAACCTG 
TCAAAGAGGGGCGATGGGCAGTCCCTAACAGTCATTAGTTGGTGCCACT^ 

7 0 TATTTTACTGG<3AAGATGGGCTTGTCCTTGTTGTCTA 

AGATG^CAAATGATGGAGGCAGGATCTGAAGCTTAAGAGAGAC^ 

CCATGGTCATCTGTGCCTATCACCTGTATCCCACACACTGGTACATGTACAGTACTGGAGTT 
CATGGCTAAGACTGAAGTCAGGTTCTTTATGGGCACTGCACC^ 

75 
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TGGCTTCTTGGGGGCTGTCCCCTGTGGGGGCCTGAGACCAAGTAGGCTGGTAAGGCT 
CTGGTGATCCGACATGCAACGCAGTGACTTATGCTAAGTC^ 

GAGGTGAGGGAGGAAGAGTACCTCS^GAAACTTAGCAGCAGTGTCTG^ 
5 CTGTGGGAAGCCTATCTTCTGGTGCAGGATGTAGCCAT CCXXITC CCACATGAT ACACAAGGAGT CCG CAGGTGTCAGAAGAGTCT 

AGAGCATCCCCATACTCACAGAGCAGCTCCCC^GCrTCCTGCA 

AGATGTTACCCACTGTTTTGCTTTATTGG^ 

CAGAAATGCCCTAGGGTGATTTCTTGTCTCATTCT 

GGCTAAAGGCCTGTCAATCGCCAGAGGAGCTTGCCAGCTCTCTCTCA^ 
1 0 AAGAAAGTGGTTG ATAC ATTGTGT AATATACATATGTC TCATAAGTG ATATCTGTG CCTCGAGGTGACGAGATCCAGAG AAGT CTT 

CACCACTAGCATCTAGTTAGAATGCCTTTGGGTTGGTGGC CCTGT ACAG GAT ACCACACTAGTG ATTGCATTGGGT CCTCAAGGGA 

AATGGGTGTTACAATATCCTGATGGCCAATAGCCAATAGCAGCTATGAGGCTTGTCCCCCT 

GTATCTCACCGGGG C CTT CATTTTT ATCTTCATGGAAG CC CTGCCTGGAGT AG AAAGCAGAGAT 

GCCCAAAGCTTCCCACACTATCAAAGGAGCCACCACCCAACCAGTGCCAAAC 
1 5 TAGGGATC CCTTTCTGGAACAGAGGATTCAGCTGCCTCTAATACCTGGCTAAG CTGAAGGACCCAGCAGGGGGCAGCCAGAGGGGC 

TGGGCAGGCAGCTGGACACCGATTTCTGCCCAACTGCATCCrrGATTTGAATGAAGACTAAAA^ 

ATG ACTCCTG CCATT AGGCTT ATAAGT CAG AGAG GACTGAGG CTAG ACCTG AGCAAG AACAGGAT CAAAGGGGATTATGG CTG AAG 
GTCACTGTCACCAGGTGTGCTGATGCTGTTCTTGTCCCTGAGTCAAGGCCCTAGCCT^ 
CCCACTTGCTCAAGGATTTCCAAGCACTCACTCTTCCTCT 
2 0 CTGGGAGATCCAAGCCTTGGGAAGCAGTGTCCCTGTGAGGAGGTGGGAGGTGGAAG 
TGACAGACCGCACTGCTGGCCAGCCGGTCAGGTCCCGAGGGCCCT 
TAGCTGGGGCCGGGCTGTCACTGGGACTTCCTCCTCTGAGATGAACTTTCT^ 
ACACCTX5CCCGAGAATGGTTCTATGTCAAGTGTTAGGGACACAGA 
GCTCTCATCCTGTCTCCTACCTGACTCTGTCCAACTGAA 

2 5 AGG AT ATATACTCAGG CTGG CCT TGAG CTTGTAGNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

TTTATTTATTTTCCCCCTAGATGAACTITTTAAAAAGAAGAACT 
CCTTTATGCTTAAAATCC!AGTACAAGGGGTTATCAAAGCTCCCC^ 

CTGTCCACAATCTC CTGGGCTCTCTTGGCTCCCTGCTCTCCCTCAGAGAGCCCCAGGAGCACG CCCCCCTTTCT 
GCTGGTATATCTAGCTCACTCCTGATTCTAGAGCAAAATACCAGTTTCT^ 

ACCCTGGCCGGTGGGTGGAGCCTGTCCAGTGCTTTGCCAACCCTCTG CTGTTCATTGGCTGG CACCCCCCAATGGATATCATAAGA 
AGAGCAGCCACTTTCCATGCTTCTGAGCCGACCTGGGGCCTCZAGTTTCCT CTCTGGTCT CCTGGGTCATGTGAG ACTTCAAGGC CT 
AATACGCGTAGGGAATCCCCTTCCACCTGACCAAGGAAGGCACTTGAAGGGCGCTTTGAAAGGTC 

3 5 TGGGT GTC CTG AAAGCTCAGAGGAG CCAAAAAGACAGATGCCTTTGC CT CAAG G CATCTTGGGACGAT AGAGGGTC CTG ATG TCTC 

CAGCTTGCCAGAACCCCACACAGCTAAGGCACT^ 

AAGGATGCTTCCCAGAGTAGGCACACAGCAGCCCTATGTGCCGACCACCTATTCT 

GATGCAAAGAGTCCTGTTGAGGGTGAGAAGCAAGTTAGGATACGTGGGACTC^ 

4 0 TTCTCCAGATGTCAGTGTGGGCIAACCTCAGGAGGTGGGTGGTGTGAGCCTGCCGGGT^ 

CTCGAGGTCrTAGTGAACCCTGTCATGAGACAAGGAGTGGGTGCATGTGTAAGCTCAGT^ 
GCIAGGCIAATGGAGGTGTCCCTAAATCACAGCTATGGCAGGGCATAGTGAGCCTGTTGGAC^ 
CACTGGCTGTGTAACCTCGGGTGAGTCACGAAGCCTCTCTGATC 
GTTCTTTTAACCTAGGTATTCGATTTTACTTTCAAAAGTGCTOTCT 
4 5 TCAGAGTGTTCCCTGT CCCAACACGAGCTTGCTGGGCTCCTGGTCTAGTGTTTGGCCCACACCAATAC AAGGCTTAGACATGTCTA 

TGGGTCTGTCTGTCCCrTGGAACTGGCCCTAGCACAGTCACTTGTCATATGAGTGGAGAGT 
CCAGGAGGCTCAGGACCTCCCITCACCTTCCCACAGGCCTGCC^ 
CTTCCTGTTTCCCCTCTGACAGCTTCCCAAGTACACCT^ 
CTCAGGGACAAGCCGCCACTCAATAGACACCGTGCGAC 



50 



AACAAACAAAAACAAAAAAAAAACCAAAACCAAAAACAAAAACAAAC^ CT 
CTTTGTTTACTTGTACTTAAGGCATTTGTAATAGTATCCA^ 



CCGTTCCrTTCTGCrCCTTACAGATTTGACAACTACT 
55 TACCGCAGGTC^GGAGGACTATGACCGCCTCCGGCC^ACTCTCCTACCC^ 
CCCTGTCTTGATCCTGTTCCATTTACTGCTGATGGCCTTC^ 

GGGGGAGGGAGGGGC^GAGCTAGGGTCCTCCCTCAGCCATGCTGCCCCGAGTATCACCATGCCCCG 
GTGTGCC^GGCAGKSGGCTAGTGTGCCTGGAGCTGGGGTAGGCAGGT 
GCTGGGCTCTCTACCCACAAGGGCAAAGGGAGTCGGCAGGTAGATGGTTATGGGAGCCGA 
60 GCCTCCCTATGCTAACCTGCCCTGTTCTGTATTTAGGATGTATTTCTCA.TCTGCTTCrCGC 

TGTCCGTGCCAAGGTGAGCCACGAGTGGCCTXSGGAAGGGTGCTGAGGAAAGGGGAATGGGCTATCCT 
GAGAGAGGGTCACTAGTACCACACAGGACCAATGAGAGGAGAGTCTCCTTGGGCTGTGTGTGTGT 
GTGAGGGTTCCTCCTGCCTCTGAGCCCTGTCCCTTTGTCTATCCA^ 
CTTATCTCCAAAAAAGTCTTGCCATTTTTTTGTTTTTGC^^ 

TGGTTCCCTGAGGTACGGCACCACTGCCCCAGCACCCCCATCATC^ 

CATCGAGAAGCTGAAGGAGAAGAAGCTGGCTCCCATCACCTACCCGCAGGGCCTGGCACTGGCC^ 

AGCTCAGAGAGAGAGGGTATCTCGGGGTTGGAGGTGCCTGAACACAGCCTTGTCTTAGCGC 

TCCCAACC^TTGTACCAGCTGTGTGACCTAGGCTGGTCCCTTGGCCTCTCTATGCCTGTGTTTCTCTC 
70 AGTGATGGCTTCTGCCATGTCAGTGCCTGGGTAAAATGACTTGGTCCACGGTCCCC^ 

GCCTATGATTGCTTGGAGCACAGNNNNNNNNNNNNNNN^ 

OTTNNNNNNNNNNNNNNNNNN^ 

NlJNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNN1TONNNNNNNNNM 
75 NNNNNNNNX«JNNNNN1^^ 
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NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNmiNNNNNNNin^ 
5 NKNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNKNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNKNNNN^ 
XO NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNK^^ 

NN*TCJNNNNNNNNNNNNN 

NNNHNNNNNNNNNNNNNNNNNI^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNN^ 

2 0 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNIW^ 

t^JNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNim^N^ 

NNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNN^ 

NNNNNNNNIWNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNm 
30 NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

AATGGC CAGCTGAATG CCTAGGGACCCAGCTTCAGAGTGCCTCACTGCTGGTCACCACAAG AATGG 
CAGAGTCATCTCTGGAACAAACGGCTGGATGCCAGGGCAGGACCCTGGCTTC^ 

3 5 TTCCAGGAACAAGAAGCACCTGGTGTGATGCACAGGGTCTGGCTTCCT 

CAAGCCCTGGCTTCAGTGGCTTCCTCTCCAGGGGTGACCGCTCTTTG 

TGAGGCGGGGAGAGCCACCAGTTCCTCACAGCTGTCAGAAGTGCCTGCCCCAGAAAGGAAA 

GCCTCCCTTTCTTTCCCTTTCCTTCCTCGGACCOTGCATCT 

TAGCCATGCAGATGCCACCAGGGGCCTGTCCTCTGGAGGGCACAGAAGAGCTC^CTATTT 

4 0 GCAGGCMGTCCCACAGCTGCrGTTATGGTTTGCCCATAAATTGGGGTGTGACTAAA 

TGCTCAGAGGTGTCAGCTCTATGAGGAAGGCTGTGGGGAACTTGGTC 

TGTGTCCrrGGGGTCTTTCACCATCCCTACTGAGTCTACAAGAAGAAGATGAAGTACAGAGAGGCATATTCCCATT 
CCCCACTTGAGTGATACTCCCTGTTTGAGTGATACTCCCCACTCAGGTGAT^ 

ACT CC CT ACT TGGG TGATACT C CC CACT CAAATGAT ACT CCT AGT TGAGTGATACT CCCTACTTGGGTGATACTCCCACTCAAGTG 

4 5 ATACTCCCTACTTGGGCGATACTCCCCTCTCGGGTIX^CACAGCTTCTCCATGACTCAGGCCCCT 

ATTCAGTCAAGTACTTGGAATGTTCTGCACTCACCCAGCGAG^ 

CCACAGCCCACACGACAGCAGAAGCGCCCCrGCAGCCTGCrCTAG^ 

CCCTGGGCCCAGCCCTTAGGTCACTGAGAGTAAAGTTAGACCAGGACAGAG 

T CTGTGAGTACCTGTGAAG AGT C!TT CCTTGACGGAT AGAAC^ C T AG AAC CTG CCAATGC C CT AAG CCAAGACACAAG A 

5 0 GCITGCCTGGCTCTCATTAAGCCTGTCCTTGTCACAAAAACCAGAGAACCCCAACCATACCCTGGGCAGGA 

GGGG AGGAGTTGGAAGAGAGTT CTGGGGCAGGCAGAAATTGAAAGCAGCCATGGTC CCGGATGGCCCTGAGCTGGGACCTTCTTGT 

CCACCAAAGCTCAGTGACGTGCrGGTGGAATTGCTGTTTCTCCGTGTGGCGTTCTTTC 
CTCCACGGAGGTCCCTGGGGATGGGCGGATGTTCATAGGCCACATCCTATTTGGGAGGTTCCT 

5 5 CCCCCAAGACGCAGCI^TCCATCCTCATACAGAGTCAT 

TCTCCATGGCCTGCCACTAGTGTGTTCGCCATGTTGGGATACCTTCT^ 
TGTTCTCCTTTTT^GAAAAGC^CAGAACAAATCTACTTCAGTAAATCTCTCATCT 
AACCCCTACTGTCTCTCGAGACCTCCTACTCTACTTGAAGGGTAGAGCTA^ 
GGACCCGATCCATAGCAAATGGAAAATACAGCTCTCTTGCTTACACAGAAAATAAATGTC 

6 0 AGTTCAGGCACAGCTTGATCAAGGTGGGGCGAAGGGATCAGATGACA^ 

CTGCTGCCAGGCTCATTTTTCTGGCCATTGATTCTTGGGACTTGATCCT 
CTCTAAGGAAAGGTCACTCTGGTTGAACCAGCTTGTCTTAGGCCCACCTTCAAT 

GGCTGGCTGTGAGTTGTGTTAGAGCCAGGAGC^GAGGTGATGCCAGGGAGCTGGGGTATAGAATGGGTGTGTGTGGCGTTGGGGGA 
ATAAATTCTTTAGGGAACZAACCTTCTTAAAACTGGG 
65 GTAAGTGTCCITTCTACCATTATCAGGGAGAGT^ 
TGTTTGTTTGTTTTGTTTTGTTTTGTTTTGTTTC 

CAGGCTGTCCTCGAACTCAGAAATCTGCCTGCCTCTCCCTCCTAAGTGCTGGGAT 

TTGTTTTTGAGATAAGATTTTATTATATAGCCTGGGCTGACCTCAGACTTGCCA.TCCTCCTGCCTCAGCT 
TACAGATGTATACTTCCGTGCCTGGATAAGGTTGAAATTCTTACTAGAGA 
70 CCCAGGGCCTGGGCCTGAGACAGTCCTACCTACATCTTCCCTGAATTATCAATCTATGGTC^ 
CACGTCCTTCTCCGAACAGACTTTTGATGGC 

CTCCACCCCTCCCCCACCCCCCAGCCCTGGAGTCTCCGAGGCTGAGAGAAAATGGCTCC 

AGCTCAGCATTTTCTGTCTCCAAGAGTTCCAGAGCTGCCCCCCAACT 

CTTGGCCCAGTGAGGGAAGCTTCTGTTCTTAGCTTGGCn' 
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10 



15 



20 



25 



30 



35 



CCCTCTTTTAGCTCTCTGCTCCTTTTCTAGTGCATGATACTAAG 

CCTCTOU^CTAAGCCrCTAAGTAGCT^ 

CTTGAGCCTCTGTGTGAGTTGTGGAGCACACAGCC^ 

TAATCTGAATAAAGTCCCCGATACTTTTGCAATTTGTG^ 

(XCTGACGGGACTCCCGACCACC^CTAGTGACCTT^ 

ACTCCCCTGTGTGCTGGGTTGGGCTCCTGCTGTGGGTGTTCAGCT 

AATGACATGAAATACCTGTTCCACCCCCTCACCGCACATGTA^ 

CAGAGGAGGAAACAGGGCCTGGAAAGCCTAGGGGTCAGTGCTCT 

GCCCCTTCTCCrrGTCrCACCCCTCCTTCTCTCTTCCAACCTCCCTACCOT 

CTCTCCC^CCCTrCCATCCCTCCCACTCCCTCCTATCCrCTCCCATCCCTCCCACCCATCCCTATCCTCT 

CCT CCCC CATTC CTCT CTCCTTGCCTACAGAATCCCAGTTT^ CT CAG CATTG ATG AAATACTTGTTTT 

CAGTGAGAGGAAAG G CTGTTTAAGTG 1X3TGTGCTTGTTG CTGTTAGAGCAAC CAAGTTTTGTTCCTAATTCCC CCTC CCCTGGAGG 

AGGATCTGACACCCTCTTTTGGCTTCTGTGGGCACACACACACA 

GTTTTCTCAGCCTTTGGGGCTCCX3AGGTCACACTGTTTTCA 

GCTTGCAGC 



CTACAGCATCAACAGCAGAAGCCAACGGCT^ 

CrTCOVGAATGLAATGCTGGAGGACCCAGTTACTCGTT^ 

AGGCCCTAGACCCTGAGTGCAGCGGCTGAGGATCAGAGGCCTCCTGAGCAGC 

GGAAGTGCTTGCTGGGTAGGCTCAGAAGTGAGGCriTGCTCCACAGCATTCACCTGGTACCA 

GGGCCGGGGCTCTCTACTCTATGCTCAGCCCTCTTT^ 

TCCATCTCTTCCAAAGCTCTCAGCCn'CCAGCrAGCTGCTTCCrGCTC 

CGTCCACACTCACAAGTCAAGCCTGGATGACTGTCATCCTAGGGTCCCCTAATTCAAGC 

GAGCCTTGGCCCTTTCCTCTGGT CAGCAGTGACGTCTTTCCCAG<5ATGCCTCCTGCTTTCTCTCTCTCAGTCCCCGCTCCAAATG C 
ACCGGGAAGTCTCAGTAGGATCGTGTGTCACAGACTGTGCCACCCCAAGTGTCATCCTAAGATCTC 
TCATCTGACAGACTGTCATGGAGTCTGGGGGCTGCACCGTGCCCTCAGGAGCTCACA^ 
AGAACAGGGAAGTGGTCCCTCTGTGTAGCTGTCCCCTGTGCTrTCAGTCCCATGTGOT 



NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNJJNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNN^^ 

GCCCATAATCTGCCATCCTGCTGAAGCAGGTTTCTACTGCCCACCAAGGTGTAGCCATGCCT 

TCAGCTGAGTCCCCTCTGACTGCCCCATCTACTACTAATGTGJVGCGAGGAGAGCGGCCT 

AGGGGGAGCAGCACAGCTCCTCTGCAGGCTCTGAGTCTGGAGAACCTTGATGTCTGC^ 

GCCTCTACACAGGCTT CATGGGATGT CAGTAAAGGACACAGTAAAGGGTCCTGCAGATGGCGAGGCAGAGCTAGATGCCAT(^ 

GACCCCCAGGAAGGCAAAAGGCCACAAGGCATAGAAGAACACTGG^ 

CTGCACACACAGAGAAGGATGAGGCCCAGTCTTTTCTC^CCAGGGGCTCGGAGGTCA 

CTTGTGATTTGT TCCACACATCAGAATCTT CTGTACAG AAAG GAAGAGCT CTGGT CTGAGGGTCAGTACAT CCGGG GTTGG TATAC 



40 



AATCTGTGTGCAGAGAAACTGTGAGACAGCTATGACCTCAAATCCTGGTAGT^ 

CTCAGGGCATGATGGGAAGAGGGAAGGATCGGGGAGGGGTGCAGATCAAAACGACACAGAGAAAGCCCACCTCCTGGAACTGGGGG 



CCrTTCTGAGCCTGCTTCAAGAAAGCCAGGGTGAAGACCCACACTGGGTTTGGAAGAT 

CACAGGGACAAGTTAGCCAAGATATAGCATTTAAGGGAAGAAAGAT^ 
45 CAATTCCTCCCTTTTAGGCCAGTAGCGAGGCTGGAACACCATGTAGGAAGGACATGGAGCAGAGCTTGCTC^ 

CAG CAAACGGATAAAG AACAAAG AG C CAGG AC CAAG GTTT ACC CTT CCGAACCACGGT CTGAGTG ACCTGC GCCGTACGTTCCG CA 

GTTCCACTTCCCAACAGTCTGT CTGAAATTAG ACTC TCGGGGGGTTAAATCAGTGGTT AACTCAGAGGCTCCATTATCTAGTCGTT 

TCTGGAAATGATCACACTCAGGCTGTTCTTTATTAATTTCCTAGGTGTCTCTC^ 

TGCAGTCCCAGCCCATGTCAGTTTGACACCTAATGGATTTCCTTAGATCACACT^ 
5 0 ACCAGTCTACTCAAGACGGCTGCCCTGTTCATACCCCGGACTACACGGAGGTTTTC^ 

CAAAAGTTCAAACTGAAACTGAG AGT C CAGG CAAATTGACAAAATCAAAAGT ATATAACIACCACAGAGTAAACAOT CT TT 

TGGGTTGAACAGGGTGTAGAGAGACTCTTTCGTGCACTGGGTG 

GACAGGAAGTAGGAGGTGGGAGTCTTGGCAGGAAG&GAGG^ 

AGGAGACAGATTTATGAAACTGAGAAGAGGTAGCCAGCCGTGTGGCACTCATGAAATAGAATAAACAG 

5 5 AGT CGGGG AACAAGC CCAAG CTTGTGG CTT AG CCAT TTATT AATT CAAGAAGTAATTC CGAAGGCGGGTTC TGGG C CG CAG CAATG 

GCGGAGGCX3GCGGGCTGGTAACGTGAAGGTGGGTTGGAAGGAGGAACAGCAGCAGAAATGCAGA 
TGCAGATGGCGCCTACGGTCCGACTAGCCGCTGCTGTCCTGGTAATC 
TTGGAAGAGGACTGGAAGCCTCCCGAAATCCCAGGAAAAGCATCGTGCCTTTAA 
GCCCTCTTCGTGCCTTTCCCGACCTCTTTGGAGGCAGAAATCGC^ 

6 0 GGTAGGGAAGGAGCTTAAAGTGAGCGGCTGCCTGCTGGAGGTACGCTGG 

TCAACTTTCTTGACCAGCTTTCCCTAGTGGTAAACACCATACAGCTTTTTGGGCCCCCGGT^ 

CAAGACTGAC CTTATTTCCAT CTCCTC CTAAG CAATATAATTTTCCCT AGG G AAAAGATTCTAG AAAT AGTAACT ACTGC ATAAAT 
GTGGGGGCCTAATATTTTTTTCCTGACTTGATCCTCAGGCACT 

AAATAAAGCTATGCTGCTAATCTGCATTTGGAAAAAAAAAAGAAGAAGAAGTAATTCA 
6 5 CACTGGT AGAAAAG CT CATGG TTAAG CAGGAGAACAGAAAGATGTGAGG CCAAAGACC CAACAGGT CAAACAT CAAAT CCTGT AT A 

TCATGTTTTGCATCTGGGGCACACGGTGTCA^ 

ACACGGTCTCTTTCCTGGGTTGGCrrCTGCTCACTGCTCATGGTTT 

CCTGGTCTCCACAGCATCTTCACCTTTCTCCCCAG^ 

CAGCCTCCCAGGATTTTCTTTGGAATCTGCGTGGTTGCCTCCGTGA 
70 TCGTATGAACAACAGAAGGTCTGCCACTGTCTCTGTGGACCATGACTGTGGCT 

TTTTAAAAG ATT ACATT AACAGT AT AG ACACTG CCAATGG CTG AGG TCTG ACCAATT CTTGAGATG GC CTTGAAGT AC 1TTT1 1 CCT 

ACTGGTCAGATACAAGGTCTTGGCTTCTTTTGTTGTTTTTGTTA 

CTGCACAGACACAACTGGCCTGGC^CTTCCTTTGTAGACCAGGTTGGCCTTGAACT 

AGGGTTGGGAATAAAGACGTGTGCCATTACACCAGCTATCCTTTTAATGGTGTG 
75 GC T AACTTT AAC CATT CC TTTT CTG AG CAATCTG CAAGGTTTT C 



2043 



WO 03/008583 



PCT/US01/51291 



TAGCAATAACCACGCTGTAACCTAAATG 

ACTCACAAAGTCTCATGGTATGGGGAAAACAGAGTTAGGTTCT 
TAGAGCATTC^TGTTCCCGTC^GAAGCrTCACAGGCACTGAOT 
GGCACCGTGACCAAAAATAACTTGGGGAOAAAGGGTTTATGC^ 
5 ATAGAACT CAAGCAGGGCAGGAAC AGGCAGGAGCTGATTCAGAGGCCATGGGGGCAAGCTG CTTACTGGCTTG CTCAGCCTGA 

TTTGTTATAG AACACAGAG CC CACAATGTGCTGGGCCCT 
TTTTITTGTCCriTTCAAATATCTCTACCTTGTATCA 
CAAACCCATCACCATTAAACCArAACCrTTCCTTGATTGTACT^ 

TCCAAATTTTAGAAGTTCCAG AGTCTTTAAATTTTTTAAACACTT A CAGTCT CTTTAAAATAG CCAG TCTCT TTT AAAAT 

1 0 CTGGTCTTTTAAAATTCAAAGTCTCTCGTCTGTGGGCTCCTGTCAAATAAAAAA 

GAACCAGGGCATGGTCATAATCTGAACAAAGAAACAC CAAACTCCAAAAGTGTGAATAAC CTAAACTGT CCAATTACCTGGGATCA 
CTCACAATCTTCTGGGTTCTTCTAAGGGGCTTGGOT 

AGGGCTTCTATTGCAGTGAAGCGCCATCATGACCATGGTAACCCT^ 
1 5 GAGGTTTAGTTCATTACCTTGACAAGAAGCAAAGTGCATGAAGGTAA 

ATCCAAAGGCAGCAGAAG ACAGTGTGC CATACTGGGTAT AG CTTGAGCAT ATGAGAC CTCAAAAC C CACTCCCATAGTGACACAGT 

TCCTCCGACAAGGCCACACCCACrCCAACAAGGCTACATCTCCTAATAGTGCCACTCCCT 

TCTATGGGGCCATTCCrATCCAGACCACCACAGGTTTCrrGCTGCAACTGA^ 

TTCAGGGACTCCAGTCAGGCTACACAGTGCCAAATCTCAGCTGCTCTTCATGA 
2 0 GGGTACTCTTGCACTATCAAGTTTGGTTGCCAGCATAAGGTACAATCTTGGCTCCT^ 

CAGAAGACACTTTCCAGAAGATTT CAC CTCAATAATG CTGT CT CTTAAT CACAGCT AATT C TTTAGCCGAACCTGAGCAGAACCAA 

AG ATT ATT CACAT AAATAG CC CAG CAATCTGCTTCCCTCTG AACTTCAT AAGTTGGG CATCCATTCT CT CAACAGG 

TTTATCTTC CAAGCT C CTAC AGAACGG CTCACTGAGCTCTCAACACTCAACGCCTTTTGCAGCC C3^^GTT CT CCAT 

AAT CCT ATTC AAAACAACATGGTCAGTTCTGTCACGG CAATT AC C CCACT CCTGGT ACCAATTT AACTT AG CTAAG T iTT CTATTG 
2 5 CK1AGAT AAAACACGGTG AT AAAAACCACACAGGGAG GG A^ CTTT AAGGC TTCATTT AC TCATG GTT CCATAT AATAT CT CCAT 

TGCATTCCAGTTTC 



30 



MOUSE SEQUENCE - mRNA 

CCAACTGTCATCCCTGGCTGCTGGGCCTCAGATGCAATGCAGGCCATCAAGTG 



TGGACAGTAAGCCGGTGAACCTGGGGCTGTGGGATACCGCAGGTCAGGAGGACTA 

AC AGATGTATTTCTCATCTGCITCTCGCTAGTCAG CCCAGCC TCCTATGAGAATGTCCGTGCCAAGTGGTTCCCTG AGGTACGG CA 
CC^CTGCCCCAGCACCCCCATCATCCTGGTGGGTACCAAGCT 
AGAAGCTGGCTCCCATC7UZCTACCCGCAGGGCCTGGCACTGGCCAAGGATATT 
3 5 ACCCAGCGAGGCCTGAAGACCGTCTTCGATGAGGCAATCCGCGCAG 
CAGCCTGCTCTAGGGGTGTCACCCAGTCCTCCCATCCCAATGGTCCT 

CGGATGTTCATAGGCCACATCCTATTTGGGAGGTTCCT 
CATACAGAGTCATATGCCCACCTGCTCCAGGCCTT^ 

GAACAAATCTACTTCAGTAAATCTCTCATCTGCCCAGCCAAGTGAGGGTCTGAGCTCAGCCAACCCCT 

CTACTCTACTTGAAGGGTAGAGCTATTCCTTCTTGGGACT^ 

ATACAGCTCTCTTGCTTACACAGAAAATAAATGTGTTGATTCTTTTGGTGG 



45 MOUSE SEQUENCE - CODING 

ATG CAGGCCATCyAGTGTGTGGTGGTGGGTGATGGAGCC^ CTACACCACCAACGCCTTCCC 

TGOAGAATACATCCCCACTGTATTTGACAACTACTCAGCCAATGTGATGGTGGACAGTAAGCCGGTGAACCT 

CCGCAGGTCAGGAGGACTATGACCGCCTCCXSGCCACTCTCCT 

CCAGCCTCCTATGAGAATGTCCGTGCCAAGTGGTTCCCTGAGGTACGGCACCACTGCCCCAGCACCCCCATCATCCT 
5 0 CAAGCTGGACCTTCGCGATGACAAGGACACC^TCGA 

CACTGGCCAAGGATATTGATT CAGTCAAGTACTTGGAATGTT CTGCACT CAC CCAGCG AGG CCTGAAG ACCG TCTTCG ATGAGGCA 
ATCCGCGGAGTCCTCTGCCCACAGCCCAC^^ 

HUMAN SEQUENCE - GENOMIC 

5 5 GCTACTTGCGAGGCTGAGGCATGAGAATCACTGGAACCCAGGAGGTGGAGGTGGCAGAGAGC^ 

CCTAGGTGACAGAGTAAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAAGGA 

ATTAGTAGTTTAAAATCTACCCCTCTTCCCrCCTCTCCCC(^TAATTCTGAAACCAGGTCCAGATACAGATGGCTTTA 

T AAAATATTT AAGGG AG AG ATAATGT AATATTAT C C AAACTGGCTGG GCG CGG TGG CT CACG C CTG TAATC C CAGCAC TTTGGG AG 

GCCGAGGCGGGCAGATCACGAG<7TCAGGAGATCGAGACCATCCTGGCT 

6 0 ATTAGCCAGGCGTGGTGCTGGGCGCCTGTAATCCCAGCAACTCGGG 

' GGTGGAGGTTGCAGTGAGCCGAGATCGTGCCACTGCACTCCACCTGGGTGACAA^ 
CATAATATTACCAACTCTCATAATGAAACTAGTATAACCTTGATAACAACAACAACAA 
GACCGGCCGATCTCCCCACCCTACCTGTAAATATTGGCAATTTCAAGCC!AGCAATA 
GTGAGGGTCTGACTGAAGCTGCTGTAACAAAAAGATCTGAAAACGCAGTGG 

6 5 TCGAGCTGAGTGGTCTGGCTCTAT<3TTGCCCCCACATGGTTCCAGGATTGTTCTGGCG 

GGAAAATGCTGTACAGGATAAATGGTTTAGCTTCCAAGGAGATGATGCA^ 
CCCAGAC!ATGATGATAGGGCCCACTCTGCTGCAAGGG 

TGGGATC TCTGTT AT T AAAAAGAAGCAGAG AACGGTTG CTT AGGG AT ACTCAG CAATAGT C CC CATAT TAACTAAATT AGACTT AT 
CTGGGAATGCAAGGAGAATTGGACATC^GAAAATGTAT^ 

7 0 AAATTAAACGCTTAATAATGATAAAACAAAACTGCCGTTTCTAAGAATAGTAG 

AAAG CCTGCAGTAG ACATCAGGT TCAATG GTGAAACATTAG AAACAGTCTCT TTAAA^ GTTG C CTG CCAG 

CACTG ATT C C CTTCAGCCCTGTAAT AAAAGATCT AG ACAAG CCAAGGAAT TAACGT TATAAAAATTGGGAAGGAAG AAATAAAACT 
GTCACTTTTCATAAATGATAAGACTGTCTATAAAGACAGCTGTAGAC^^ 
TTTCCATCCAGGTTTCTATATAAGTTCAATATACAAAATGTGACATTTTCCC 
7 5 AAAATATTAGTTATAATGGCACTGAAAGTAATAAGGTATCAAGGAATGAATCTTTCAAA 
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ATTATAAAACTTAATTGAAAGATATAAA^AAGCAATATACC^ 
CCCAAGTTCCTCTAGAAATTGAATACAATTCTAATCA^ 
TATATGAAAGAGCAAAGTGTCAGTCGTAACGATAACAAT^ 
TCTGATATAGAACTTCAGTAGTTGGCTGGGTGCAGTGGCTCACGCCTGTAA^ 
5 ATCACCrGAGGTCAGGAGTTCAAGACCAGCTTGGCCAA^ 

TGGCAGGCGCCTGTAATCTO^CTATTTGGATGCTGAGGCAG^ 

ATGGTG CCGT CG CACTCCAGTCTGGGCAACAGGAGTGAAACT C CGTCTCAAAAAAAAAAAAAAAAAAGAACTGC^GTAATTGAGGC 
AGTGTTTTACCGGCTTGAGGGTGGACCAAGAGTCCAATAGATGGAGAGGCTA^ 
GTGATATGATTTGGATCTIGTGTCCCCATCTAAATCTCATGTCAAATTC 
1 0 ATTGGATCATGGGGGCGGAGTTTGCATGAATGGTTTAGCACCATCCT^ 
GGTTGTTTGAAAGTGTGTGGCACCTCTCACCCCCTTGGTCCTGTTCTTC 

TTACCCATTCCCAGATATTTCTTTATAGCAGTGGAACTGTGAGATGOT 

AGAG CTACTCAATAAATGGTTCTGGAACAATCAGTCATGGAG CGGTGGCT 
1 5 CACGCCTGTAATCXTCAGCACTCTGGGAGGCCGAGGCGGGC^ 

GAGAACCGC1TGAGCCCCGTAGGCAGAGGTTGCA.GTGAGCTGGGATCATC 
AGTCTCCAAAAATAAAATAAAATACAATAATAAAAAAATACATATCACACAAAAA 
AAATTCCGTCTGAAGTGTGTAAAGACCTACTCATATTTATTTTCTTTT^ 
2 0 TGGAGTACAGTGGT ATG ATCATGG CTCACTG CAGCCTCGAC CTC C CAGGCTCAAGCGACCTGCCCAC CTTAGCCT C CTG AGTAG CT 
AGG ACT ACAG GCATGCAC CAACACATC TG^CTAATrriTGT A1TTT 'i"i'GTAGAGATGGGGTTTCACCATATTAC C CAAG ATGGTTT 
CAAACTCCTGGGCTC^GTGATCTATGTGCCTTCATCrCCCAAAGTGCTGGG^ 
TGTTTATGAAAAGCAAAACTTTAAAAGAAAATGAGGATATTCGTCCT 
TCACTCCAAAAAACACAAGTCATTATGATGCAGGGG&GGTGAGCCT 

2 5 C^GGAAGGAATTCCGTGCTGAGCTGACGGTGGAAGAAAA 

GAGTAGGGCTACCTGACGGGCAGTGTACrGAGAGCAGCAGCTCAGGGGCAGGTCCACCGT 

TGCAAATTAAGGGGTGGGGTATTTAGAACTTTCTAGAAAAAGGGTCOT 

CCGGGTGTTGCCATGGCAATGGTAAACTGACATGGCGCIX3GTC 

GCTAGTCCTCAGTCTGGTCTTGAGTCAAACC CTGCCTCCT CAGATAAA 

3 0 AATTGTGTCTATTAAATGTCACAACATATTAAAAGACATCCCACGGACTGGAAGAAA^ 

AAATCAGAATATGCAAAGAGCTTCTCCAAATAAATGTAAGAG 
CCAGCACTACOTGGGAGGCAGAAGTGGGTGGATCACCTGAGGTCAGGAGTT^ 

TCTACTAAAAATACAAAAATT AGCCGGGCATGGTGG CGTGTATCTTTAATCAT CG CTACTCAAG AG GCTGAGGCAG AAG AATCGCT 
TG AACCTGGGAGGCGGAGGTTG CAGTGAGC CGAGATCACGACATTG CACT C CAGC CTGGGCAACAAGAGTGAAACT CCAT CT CAAA 

3 5 AAAT AAAAT AAT AAAATAAAAT AAATAAAAT AAAAT AG AT AAATGCT C CAATAGAGAAATGG AGAAAAG TGAAG AAAAAG CAATTT 

ATGGAACAGGAAACACAAGGTCCAGTAATCACTTACAATGATGCT^ 
TGAGATGCCAATTCACIACCC!ATCTAGCTGGCAAAAGTATGAAGTCTTACAGGCCA 
TGGAGATGGGCGTAGCACTGTCCGTTGAAGTGGAAGATATGTGTGTC 
GTTCACAGCAGAATraTTTGTAGTGCAAACATTGCAAGCCAC^ 

4 0 TTGCACTCCAC^CTGGGCAACAGAGTGAGACCCTGTCTCAAAAAAGGAAAAACGA 

GACAGAATTTAGGGTGTGGCGATATATTGACTGTTTTTAGTTTTTGTGTGATATGTAT^ 
AAATAACATTTCAATACATTCATATACrGGAATATTTACCAGAAAAATA^ 
ACTGCTGGGTGGGACTGTGCCTCCC&AATTCGAAGGCCTGTGGTGAGGAGGAGGC^^ 
GAGGTAGCACCTCTATGAGGACAATCCAAGGGAAGGATTTGGGCT 

4 5 AGTGAAAG AGACTGCCCCTTAAG GGAATG AG CT CCCTGTCATTGAAGGTATG TAAG CAGG GACTGG GCAGCTGCAT CATGG G CAGT 

GAGC CAAG CATTGTC TATCAATGGCACAAGGGTGGGTTGAAT ACAGATGG C CTTAGGCCC CTTCTAAG CATGC C CAGAGG C CT CGT 
GGTTCTGTGATTCAGTGAGTTTGCAAAGTGTAGTCAAAAGAGGATGGGACTAAAT 
GGACACTACTCACTAATAAAAGGGACAGACACACCATGACACATGTAACAAC^ 
CCTTGGATGAGATGCCIACACCGTGTGATGCCGTTTATGTC 

GCTTTTGAAAGTGATGGAAGTGTTCTGTGATGTTCTGTATCTTG^ 

TATATGCTTTAAATGGAT CACTT ATTGT ATT CAAATTATACTT CCACATAGAT ACTTTTTTTTT^ 

TCGCC!AGGCTTK1AGTGCAGTGGTGCGATCTCAGCT 

TGAGTAGCTGGGAjCTATAGGTGTACGCCACCATGCCCAGCTAATTTTTG 

5 5 GAACGGTCTCC^TCTCTTGACCGCGTGATCrrGCCCGCCTCGGCTCCGAAAGTGCATATGGATACTTTT 

CG^GAGGGCITGGAGCAGAGGTCAGTGCCGCTGCATCZATGGACTT 

TTCACTCCTCCATCTGGCCATCCTGCCAACCCTGGCCTGTCCACACA^ 

GTGGTTCCAGGGCTCCTGTGTCATTCTCGGTGTGGGTTAGAGTGCTTCAGGGATAGCTT^ 

ACAGTAC CAAG CCCAGAG CTGGGTATTCAGG TGGTGT ATCfiTTG ATG CATG GAG AATG AAGGAGA^ CT AAG 

6 0 GAG AATTTT TG CTTGGG CTGC CCTGGAGAAGTGGCTTT CTTGGG AGGG AGG G CAGTTG CAATTTTAGGAC CGT CATGGC CATGT AC 

AGTTGAGTGGGTTGTGTACTGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGCCGGGCGCGGTGGCTCATGCCT 

CTTTGGGAGGCCAAGGCGGGTGGATCACGAGGTCIAGGAGATCAAGACCAT 

ATACAAAAAAATTAGCTGGCTGCGGTGGCACGCGCCTGTAGTCCGA^ 

GGAGGCGGAGGTTGCAGTGAGCCGAGATCA.CGCCACTGCACTCCAGACCGGCGACAGAGCGAGACTC 

6 5 AAACAAACAAACAAAACAAAACAAAAAACAACAATCTCTGCTCTAGTGGGTGAACCAAGGA 

CCAAGTCAGGTGGCTGCACCCATTGGCTGTGCATTCCT^ 

TTTTATTCTGGAAG AATTTTAGATGTACAG GAAAGGTG CAAAGAT AG CAGG AGTGTTCACATACATTTT CACCCAGTTTCT CT CTA 
TG CT AACAT CATTTGTGT AACCACGGT ATATTTGTCAAAACTAAGAAATT ACT^ CTTAACAAT ACT AT CAAAAATTTCAGACTT 
GATTTGGATTTTACTACTTTTTCCACTGATGTTCTTATTTO 

7 0 CTTGTTGCCCAGGCTGGAGTCCAATGGTGCGATCTCCACTCACTGCAACCTCTGCCT^ 

ATCTTGAGTAGCTGGGATTACAGGTGCCCGCCACC!ACGCCTCGTTAATTTTTTGTA 
GGCCAGGCTGGTCTCGAACTCCTGACCTCAGGCGATCCACCroCCTC 

CACCCGGC CTGATGT T CTTITTTTTTT AAAGC TAATTTAATCGTTTAAATTGCTT CAAG AAG CTGCATGTAT AG ATGAAACACTTG 
TTTTTTTTTCCCCCTTTAAGAACTAGAAATGAATTCrrTTTAAAA 
75 CGT ACATATTTATGGGGT ACAATGTG ATGTTC CAATGCATGT AT ACACTC 
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CAT CACTTTT AATATTT ATCATTTC! TTTGTTG CXIACATTTGAAACCTTCTCTTCCAG CTGTCTTGAAATATGCACGAGATTGAGAT 

TTGCTGTAGTCACCCTACTGTGTAACAGGACACCAGAACTTA 

CCAGGTTCTAATCTGGGGTACCGCACTGCATTTAATACATTATTAT^ 

agcagtggatggaaaaagaaaggggcatagagaagtatccagaacaaagtcct 
5 gtcacggggcacacctggcctctctgacccacagtctcagcacagcgt^ 
ctttaacagttgacttgcccagox:ttgaggaagt^ 

AATTCCTCTGAAAAGGTTCAGCCTCCGGGTGCCTG^ 
CTTTGGAGTTCCTGGCTTGTCCCXSAGTGACGAGTCCGGGGT^ 
CCCGGAGCTGGAAAGGTGTGTGGGTGGGGAGTGATAGCAATGAACAGTGA^ 
1 0 AGTCCACGTGGAGAGCCCTCGTGGCGAAGGGAGCGCCKrrGCAAGCTG 
CAATAGCTCCAGCTTGGAGAAGAGCTTGGAGTCACCTG 

CCTCCGACCCAGGGACTCCCACACAGGCCCTTGCCrGGG 
15 CCTCTTTAGGGCGC^CGTTTCAAGAGCCAGCTCCAATGCTAGCTT 

TATAAACTCCAGGGACCAGCTGTCTCTTGCATCCTGGGATCACCAGCACCTGGTACT 
ATGCTCTGCTTAGAATTGAAGTGCACTGAATTCTTGGA^ 

AAGTGAGCCTCCCAGCACAGAGGGCCCCCAGGTCTCCTGCTGGAGCCTCCACGCAGGGGG 

2 0 CAGCACCTGAGCGCCCCTCCCACCAGCTGCTCrGCCAGACCGCATGCT 

TAAGTTCCCCTTCCGTTTCCTCCTGCCCCACCACCG^ 

AAACGCGCCTGGCCCTGAGCTGTCACCACCGAC^ 

TGGGTAGGTACCTGGCTGGACTTCCTCACAGCCCC 

TGGCCCTGCAGAACTAGGCGTCCCTGGAAGGCAGCrTGGAGGGGACGCTTCCT 
25 GGCTGAGGGGAGCTTCACCCTCTTCAGCCCCAC^^ 

CAGGGCTCCTCACCTGCACCCCAAAAGCTGGGGGCAGGTCTCCC^ 
AAGTGGGAAACCTGAAATTCAGCCTTTGTCCCCAGGCTCA^ 

AGTCACAGCACCTCTCTGAACCTCAGCTTATTCATCTGGGAGGAGGCCTATGCCCTGGGGATGCTGTG 
TGAGTGGCTGTCTTGCTTGTAGTATGTGCCTTCTAGCACCCCCTCCACTTTCTCT 

3 0 ATGTGTGGGGGATGTGGGTGAGGAGCAGAAGAGGCAAGGGGACATTGAGAGACA 

GTGATTGTGGGGTCGGCAGGGGAG^GGC^GGCTGGGCTGAGACCCTTCTCAGTTGTCAGATCCCAG 
CCCTCGGGGCCTCAGTTTCCTCATCTGTAAAATGAC^ 

AGGCCAAGATG^AGGATTGCTTGAGGCCAGGAGCTTGAGACCAGCCTAAGCATCA 
TTTTAAGCCAGGAGTGGTGGCATATGCGCCTCCAG 

3 5 TGAGGCTGCAGTGAGCTTATAATGGTGCCACTGCAGT^ 

AAGGAC!AGGTCCAATTTGTCAATCTGTCATGCCTGCCAGC^ 
CT AG CTGGAAGGGGGCCTCCTGTCCAGCCTTCCCA 
GGCCCJTGTGGCCCCGACTTTGGCTGTGACCCATCTAGGTG 

GCTCCTTATCTGCCCTCGCCTGCCCCAGCCCTGTTCACTTCTGCTTTGTATCTGATCGGAGGAGGGGCGGGCGGAAGTCAAGGGTT 

4 0 AACGTTCCAGT CATTCAAAATTACCTCGGTGGCAGGCCCAGGGGAGGGGGATGGTAGGAGGAGAGGATGAG CTGTCTTCT CTGAGC 

AGGAAGTGACAAAACCAC<3CAGGGCAG^GGAAGGCCCAGAGC!TGT 

CCCAGCCTCGCATGCTGAGGGTGGAGGAGGCGCXSCGTTATCACTTCCT^ 

ATTTATCCTCTCACGAAACCTAATAGGTCCATT^ 

CACAGGATTTGAACCCAGACCACTCTGAOTCAAACCCTGACCrGGTAACTGCrr 
45 GGTCAGGTCTTCCCTAGCAGCCATCGGCTGCTGCCATGCATCAGGCCCTGTGCTCAGTGCTTTGA 
CTCAGCACACTCCTATGAGGATGGGCTCATCTCTAT 
ATCATACCACTGGTAGGTGGTACAGCTGGGATTTGAAGCCCAG 

CACCCTCTGTCCCACATCACTGGCCCCTGGAGAAGCCrrGTGGGAAAAGCGGATTGCCAGCTCrGCGCACACCTGTGG 
TCrTGGGGCTGGACCCTGAAGTCTCCACTTTAACAAACATCCACATGACTTGCAGGCTOT 
50 TCTCACATGGGGAGACGGAGGCCCAGAGGGCTAGGCTGGGTGGATGCTGAG^ 
TCTCCGGCAGAGCCCATGACCGGCGCCCTCTTCCACTTCCTGTCTTTCAGGGCC 

CACCAACGCCTTTCCCGGAGAGTACATCCCCACCGTGTGAGTTCCTGGGGATATGTGGGGTGGTGGGAGGAGGGGAGAGACCGTCA 

CCAAGTCGACCCCCCTTTCAGGCA<nTCCTCT^GCC^ACCCTGTGTGGGCACCC^ 

AGTCCAG CCAAG AAGG CAGCACTTGGCX3CTGAGAGGTGCTGGGGGCTCCCAGAACCCGGG ATGGGGTATC 

5 5 GAGGAGGGGTGGCCTTTAGGCTGGGCCTCCAAGCAAAGGGACGTGGA 

GAGCTTCTGCGGAAGCAGGCTGCTGGAGGAGGTGATCCAGAGAGCCT<^^ 

GAGGGGTGGAACAGAGCTGTCTGC^GGCCTGGGAGGCCCTGCGCTCTATTGCAGGATGCGGAGGCAGGGCTGTCATCGGACGTGGA 
GGGAGCGTGTTGGTGTCAGACTCAGGAGGGAAGGATGACrGGGGAGGCACCAGCCC 
ACTCCTAGGACATTTCCCCGACTCCCTAGCATGGCACTCAAGGCT^ 
60 CATTACTTTGCTTCACACTCTGCCTCCTCCGGACCCT^ 

AAGCTGTTCTCTCTCCCCATAGTGCTTTTTCTCCCCACAT^ 

GGGAAGCCTTTCTGATCTACCACGAAGGGGTCTCCACTCCCCAGAGTGACTTTATCTC 

GGGCAGGGAGCCATGGCCAGCTCTTTGTGGGTCTCCCATTATCTCAGCACAGTGTCTTCTACATGGTATGACACATGCATTACCtT 
ACACACAACACATGTGTTTTGAGTTGACCTGACTTGGACA^ 

6 5 AGGCCTGCCACATGCTGGCCCCTGCGCAAGATGTC^CACATGTGTC 

CAGGACACCCATGGGGAAGCTGAGCCTCACAGCCACTAAGAGGCTTGCCTCCCAGGOT^ 
AGGTCCATGCAGAACCTCGCTGGGGCCCTCATTTCTTGATCTTAATGAACCCCCCrrrCAGGGGCAGGGAT 
AGAGGAGGGCAGGC&GAAGGGAGAAGAGAGCCTGGAGCCGAGATGCTTCCA^ 
AACCCCAGACTTGGGTGGGCTGGGCTCTCTCTGCTGCTGTCTCTCCCCGGGACACTCT 

7 0 GGCTAGAGGAAGGCAGGCAGCCAGGAGAGGGO^GAAACCCTA^ 

ATGCTGGTGAGTGCCC CTCTCTGGTCCTGTCCTTATATACAAAATGGGAACAAGGATT CCTG CCCACTCACCATCAACTATCAAAC 

GTTTATCAGGTGCCTCTAAGTGCCAAGGGAGGGCTGGCTGC^AACCTG^ 

AGGCCATGGTCACCAGCAGTGTGTTGTCACTGCCTCTGTCCCTO 

ATCTTGGCTCAGTTGAAGGACCCACTTGCTGAAGGACCTTGGACCAATCACTTCCCCTCTCTGAG 
75 GAGGTAGAGGCTTGGAAGTCCCAAGCTCCTTCCCCAGGCACCTCCCTTGTACGTTGAGATGGATGGCTGAGGCTGCTGAGGCCAGG 
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TTATGGCCAGGTGCTCTCAGCAGCCTCX5AGCTCC 
AGCTGGAGAAATAGGCCCGACCAGJVTCTACCTTCCCAAGTCTCT^ 
GCGTGTCGCCACCTATCCAATTGCACAGGGCCCCATGCrrCA^ 
AATTCTTAATCATTTTTGAGCATGG^ 
5 CTCATATGGCTTCAGGCACZAAAGTTTCTC7U5CCCCAQA 
GAAGGGAGGGTAAAATAGTCACACTGGGCCCTTGA^^ 
CAAATCTCTGGCACCTGAGATTAAGGTGGAGAGAGCTC 
CAGATACAAGATCAGATAAGCCTGAGAGCTGGCTCCCCATGCTGCCAGCCA 
TCGGAACTGACACCTTCCTGAGCAGCCAATCAGTTGCCAGGACTCGTGGGGCT 
10 GTATAGACCCTATATTGGAGCCGGOIGGGCCCTGTGAGTTCTTTGCAT 

TCTGTGGATGGCCCCGTGCTAGGTGTTGGGGATACAGAAACATGGCAGGAAGACA^ 
GCCCCCTTACACCTCACCTCGTTAGAACCAAGCTCGACGGAAGCCCTGG 

G CTCATGC CTGTAATCC CAGCACTTTGG GAGGCTGAGGCAGG CAG ATCGCTTGAG CCCATGAGTTCGAGAC CAGCCTGGGCAACAT 

GGAG CAG AAACCCCATC T CTACAAAAAATACAAAAATT AGCCGG G CATGGTGGTG CATG CCTGTAGTCCCAGCGACTCGGGAG ACT 
1 5 GAGGTOSGAGGATCGCTTGAGCCrrAGGAGGGTGCGGCrGCATTG 

GAGACCCTGTCTCAAAAAATAAAATAAAATAAAAAGAGCCTCCCAAAAAAC^ 

GCTCCCTTCACACITAAAACTCATCTTAATGCGCAAA^ 

CCCACTTCACTCCAAGTACACAGACTGCCTTGATGATCTCTC 

TCCCTGCTTCCTGTCATTCAGTTCCCTGC 
2 0 TCTCCATCCCACC^CCrGCTTCGCTGTCTTCATAGGCCC 

CTCTTGGATGGAAGCTCCCTGAGGACGGGGCCCITATCT^ 

ATTCCTGTTGCTCCGTCATGACrGCCTCCCCGACCCCGGGGAGGA^ 

TTGAGGTTGAATCATTTGGCACTTAATTTTCACTTTCATTTGAAATCC 

GTGGACAGGATGACATCAGCCTCCCTAGCTGTGTGCX3CCAGGCCC 
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TGGTGCGATCTCGGCT CACTGCAACCTCTG CCTCCCGGGTTCAAG CGATTCTGCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAG 
3 0 G CATG CACCAC CACAC CCAG CT AAGTTTTGT ATTTTTGGT AGAG ACAGGGTTT CAC CATGTTGG CCAG G ATGGT CTCG ATCT CTTG 

ACCTGGTGATCCACOTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGO 
CAAATCTGATCTCATCACCTCTGCCrrGCCTTGGCCCCTTG^ 
CCTATGACATCAATCCTTTCTCACCCCCAAGTCACTAGCCATCCX^ 
TCCGTGGGGGTGGGCTTCGCTGGTGGTCAGGTGATACCT 

3 5 TTCTAGGGCCAGGCCTCAGAGTTCrACAGGCIAC^ 

GCAGCGATGCTTACCATGTGGGCCTTTCTGAATTGGGTAAC^^ 
CIAAGTCCCTTCCTCTCTGTAAGCCTCAATTTC 

TCTGGTCCTCTGGGGAGGGGCCCTGTGGCAGGGGCATTTGGGGACATGGA 

4 0 GGGTGGCTTGAACATCCTCTGTTTTTTTGCATTCTCCTAATGGCT 

GGCTCAGAGAAGTTAGGTCACTCACCCAGGGTCACACAGC CAGGA 
GAAAGGTCCTTGACCTCTTCAGGCCTCAGTTTCCTCATC^ 

ACTTCAAGACCTAACATACAAAGACCCTCAGCCCCCTGCCTGGCCTCAGAGAAGGTGGCTGT^ 
CCCCATCACCGTG^CCCTGAGATTCCAGGGATGACCCAGAGGAGACAGGTGAATCTTGTGGGG 

4 5 TGGACATTAOAGGAGAGAGTGTCCAAGCATTTCCCAAAAGTGTG 

CCATCATGCCTCACK3TGTCTCCCrrGCA.CAGCGCCTTCITCCTGAAATC 

CCAGC!ACCTGCTGTGCACCAGACCGCX3GGCTGGCCCCAGGGTGCA^ 

TGAGTTTCCCCAAGGAGTGCTGGTTTAGGTGGGGTGGTAGCC^ 

TTG AG CAGGGGCTGG AGGG ATGAGT AAAAG TCAG CCAG C CTGG CAGTGG GGAGG AAGAG AAGGG AG AGAGG AAGGAGAGTGGGAAG 

5 0 TCAGGGGCCACTGTTTGCAGGCACTTTCCAG^ 

• GGGCTGGACAGGCAAAGTGTATTAAAAAGGCTGGGC^ 

GTGTTACTCTGAAGTGTTAGCGCTTCTGCAGCCTAACATTGAGTAGCGT^ 
TTCCACCTGCATTCACTGGGTGACAGGCCIXIA^ 

GAAGGGGTGGAGACTCACCAAGTGCCTGCTCGTGGTCCAGGCAAAGTAGTCACCT 

5 5 GAGGACG ACGAAG CTCC G C CAGAG AGCAGG ACTT GTCTGCGCAG G^TT TTG AAGC ATGAATAGAAGTT G GGAGTG AAATGGGGCTG 

GAAGCGTATTCCAGAGGGAGATACAGACAGGATCAAAGG CTTGG AGGGCTGGGCAG CCTGGAGGGTGTGGGGGCCAGAGACCAGGT 
TGGTGAGAACATATGGACAGCTAAGGTGGGAAGGACATTCTGGGCAGGGGCT^ 

AGCCTTTCCCX^GAAACXaGCTTGGCrGGTGTGCGAGGGGAGGAGTAGGAGGAGAAGTCAGAGGAGCCGGAGGT 
GTCAGGGCCTTGTGTGGGGGACCACAGCGGGAAAAGTGACAGTGATTCCCTAACCCTCAGGGGCAGGCG 

6 0 GCACAGCTCTGGCCTGGGGCAGTGAGCA 

GGCCTTGGGCAAGTCACTGCATCTCTCTGAGCCAAGTTTTCC 

CTCTGATATTTCTCAGGCCCCTCACCCCGTCTCTGGCCnTCCGCCAGCTGTC 

CAGCG CTCGCTCTGCACCCGATGCATAGTAGGTGCCTAG CAAGTGCATCTGGAGG CTCGTTCGATGGCCCTGCAGGGCTCTCTGGG 

6 5 TC!TGGCXKK3AGACATTGGGAGTCAAGGTCAGTGCCTCACACCAT 

TCCCATGGCC^GTGACTAACAGCTTTCACGCACACTCTC^ 
GGAGATGAATGACAGCACCCTAGCTTCCTGCCTTCTCTGCGGACGCCCTGAGC^ 
GGACTGAGAGC^GCAGCAGTCCACAGCC^TGACAACTCIAAC 
CTCCCCTTCTGCAGCATCCTGGAGTCACTTCCCGAATAA^^ 

7 0 CAGCCTGAGACGGGATGAAC!ATTCATTCGACAAAATATTGCGGCT 

GC CGAGGCGGGCGGATCACTTAAGTTCAGGGGTTCGAGACCAGCCTGGTCAACATGG 

AATTAGCTGGGCGTGGTG<5TGCATGCCTGTAATCCCAGCTACTTGGGAGG CCGAGG CAGGAGAATCACTTGAACCCAGGGGGCAGA 
GTTTGCAGTGGGCCGAGATGGTGCCACTGCACTCCAGCCTGGGCGACAG 

AAAACAACAAAAAGCAAACATTG AGTGGCAG CAT AGGG AG AG AGG T AAGTTT ATTCTCAT CAAGACTGGG AGGAAAAG TTACACAG 
7 5 CCTCAATTACCTTTCCAAAGGACAGGTACGGGTAGGAAAA 
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TTTTAAAAAATATAATGAATCAGTAATGAAACAAATAAAGAAACCCAATTC 

TCTCTAGGTCTCAGCCTCCGAGGAATGACATGGA^ 

TTGCCTGCTCCCCACAGGTTTGACAACTATTCAGCCA^ 

TGGGCAGGAGGACTACGACCGTCTCCGGCCGCTCTCCTATCCACAGACGGTGTC 
5 GCTGTCTCTGTTCCCTCTGCTTGGGATGCCCTTCCAGCCAGCT 

GAGTCTGCCTGCCTTGCATGTCCCAGGCAGAGAAGGGCTTCCTCT^ 

GCCCATCTTCCCACCCCATTGCCTCCTGGGCTCTGGCACCCCCT 

GAGGCAGCCCAGCATGGTGAGCCCTGGGAGTGCTGGGGGAGGGGAGAG^ 

TT CCTC CAGGG TATG CAGGGAGGCGGCTAAGGCCGGTAAT CAGATGGTGGATAGTAT ATGGATTGGAG T ATGGACTCT AGAATCCC 
10 AGATCCTCCCCTCACCAGAGTGCACGTTGGCAAACTCTTTAACCTCrrGGAAGCCT 
GCAGTGCTTGCTGCAGAGTGACATGTGAGGA^ 

CGTGGTAGAGGCGAGAAGGGTATCTCTGCTCTGCCCAGATCCTAAGTCCCCAGCATTCCCTGGG 
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AGGGTCCCCAGTGTCAACCAGGGCTGGCAGAGGGACTT 
AGCTATACCCCCACTTTCCATGGCTGCATTCTCCTCCTCCCCACACAACCT 
TGCTTCTTATGTCTGCACAAAACCTCGCCTCCrrCCTTTCCTACAATCATGCGCCCCCTC 
TCCCTGTGCCACCTCACGTCTCTCTCTCCTCACCGTCGTGCCCACCTCTGATCT 



20 CCCTCACACTCCCTCTGATCCCCGC^C^GCCTCGCGCCCCCTCOTGACCCATACAACCTTGTGTCCCCC 

GTTCCCAGAAGTGCGGCACCACTGCCCCAGCACACCCATCATCCTGGTGGGCACCA CGGG ACGACAAGGACAC CA 

TCX3AGAAACTGAAGGAGAAGAAGCTGGCTCCCATCACCTACCCGCIAG 

CTCAGAGGGGGGATCTGGCTGCGGGGGAGGCCCTGACTTGGCCTCGTGGTGTGATTGTA 

2 5 AAGCATGGGATAGTGATAGCTCCTGCCTCCTGGGGTGGTTX3GGGGAATTCAGTAACTTATT 

CTGTGTGTGAGAAGCACTCTGTGTTAGTAGTAGTCGTCCTAGTAGTATGAGTAGTATCTATTAATAGATACTGAAAATA 
GTGCTTTGAGCAGACAGAAGCAGAGATTGAGTGGGGGTGCTGT 

CT^CTTTGGTTTCT 

3 0 AGTAGGTGGCCCAGCCAGATTTGAAGGAAGGACAGCAAATTATTGGCTCAC^ 

GCCTGCATCAGGGAGGCGATGACCATGGGTCTGGCTTCATGCATGGTACCCAGGCCTGCr 

GGTCCCAATGTGTCTGTAGCTCAGGGATCCTTGCCCAAAAGCCCAAAACCACCT^ 

CCTCACCACAGGACTCTGCAGGCCCAGGGTTTGGCCCC 

CCCACCTCCCCCTGCCGGAACTGGGAGGGCCCTGGGGACTCACATCTCTTCACAGGAAGAGAAAC 
3 5 TCTGTGACTAGCGTGGTGACACAGCCATCTTCCCCGGGGGCCTCAGACTATGGCTCACACCCTGGCrCCC^ 
CCACAGCTCCCCAGCCTGCAAGTCTTTCCTTTTAAATATCAAGGATT 
AACCCCTGCCCCTGGACTCCTTCTATTCAGGGTGCAGCTCTCCAGCCTG 

ATGCTTTGGGATCATGTGGAGGGTGCTGGAGGCCTGGATTCTGGTCCTGGCTCTGATACTGATTGGCTGTGT 
GGCTGCCCCTCTCTGGGCCTCAGTCTCCCTCTCCACATAAGCTGACCTGGACAATCCCCAAAGGCCCTCTCT 
40 TGACTCAGAGTTCTTGCCTTGGCCTCTAGCCAGT 

TGCCTCAGACCAGCTTAGGCCTAGGGGGAGTCCTGGGGGGACTGGCAGGGATAGTC 

GACTCCCCAGCCCTGGGATGACAAGTTTTGGGGGCAGCCCTGAGCCCAGCTGATAATCACAGCTGATGTGGACTGA 
GGAGCCACTTCATTCTCACAGCAGCTATGACGAAGAGCTCTTGTC^ 

TTGCTGGCTTAAGGCCACCCMCCAGGATTCTGACAGGGGTGGACTGGCTCCCTAAGCCACAAAGCT 
45 TGCAdCTCCACTGCCCC<^CCGCCCCCTCACCCTCCTTGCCCTCTCTGTAGCTCCCCTC 

GCATGGCAATGAGTCCTAGACAGATGGGAGACTGAGGCCCGGAGAGGGGGAACTGACTAAAGTCCTAGAGCAGTGAGAGGC 
CAAAGTGAGCCCTGGCCCCCCAGCTCAGGACCCTTGTCCTGGGTGAG^ 

TGGGGGAAGTG<3ACGCAGATGTGGTCCCTTATAAGGAGACAGTGGCCAAGTGCCCAGTGACACCTCCTCAACA 
TGACCAGACTCCGGGCTGAGCCCAGGGCACCCTGCTCTTGCCAG 
50 TATATGCCTCCAGGTAGAGGGGAGACGCCCAGCATGTGACTCAGA 

CCCCACCCTTCCmCTCCCCAGCACCTGGCCTCAGTGGGCTCCCCTGTTCC^ 

GAAGCACCCAGCTCAGTGCCIXKSCACAGGGTAGGTGGTGACCA^ 

TTGGCGGAGGGCCCAGAGTTCCCCTCCCCTCCCTTCTCCTTCCTCTTGGTTCTGACATCGGCCCCCAGGCCTG 
5 5 CAGAAGGGGTTTTGTGGTTTTTCACTTCAGGGAAACTGTGCAGCTTGAAATGG 

TCATGGAACGCCGTCCCrTGGGGGTGGGAACTAGGCCTGCCTTTCTCCTCATCTGTGCCCCGGGCTC 

GTGGGAGAAGGCTGGGCCTCCCATGGCAGGACAGAGGAGCTCTCCAATGCCACTCCTCCCAGAAGGCCCCCAGCTGGGCCCAGGTT 
GAGGGAATACATCCAGCGTGTGAGTTTGGATGAGTCGCATACTCTCCAGGAG^ 

CTAAGGAAGGGGCTGATGATGTTGGGAAGTTCCTGGGGAGGGGGCCCTGAGGGCTTCAGGCCTCCTGGGGATTATTTGC^CCCTG 
60 GGAGAATCCTGGAGGGATGGGCAAGACAATTTTCrrAGAAGGTGAGGCTCCCAGAGATTTGGTCACTGATGCTTGAGCCCTTTCCT 
CACAGAAAAGCATCAAAATTCCCGCCCTCCTTCCACGGACTAAGGGCrrATGTGTCCAAGAAAGACACAGACCTGCCCTG 
T AGTCTCT CACTGAGTTCACG CTCGCTATT ACAG ACATG AGG AAACTG AGG CACGGT G ACGTTAAACT C CTTGCTC CT GGTGACAC 
AGCCTCCAGGTGACCCCACTGGGACTGGATGGCGGCCCCCAGAGCCTGGTGTTC^GTTGTATGGCTTGGGAGGGGGTGTACGGCAT 
GGAGGGGTGTATGGCGTGGGAGGGGGTGTATGGGGTGGGAGGGGTTGTATGGCATAGGAGGGGTGTATGGCGTGGGAGGGGTGTAT 
65 GGTGTGGGAGGCGTGGGAGGGGTATATGGCGTGGGAGGCATGGGAAGGGGTGTATGGCGTGGGAGGGGGTGTATGGCGTGGGAGGG 
GTGTATGGCATGGGAGGCGTGGGAGGGATGTATGGCGTGGGAGGGGGTGTATGGCGTGGG^ 

CGTGGGAGGGGTGTATGGCGTGGGAGGGTGTATGGCGTGGGAGGGGTATATGGCGTGGGAGGGGTGTATGGCGTGGGAGGGGTGTA 
TGGCGTGGGAGGCGTGGGAGGGGTGTATGGCATGGGAGGG^ 

GGGAGGGGTGTATGGCGTGGGAGGGAGGGTGTATGGCGTGGGAGGGGTGTATGGCGTGGGAAGGGTGTATGGCACGGGAGGGTGTA 
7 0 TGGCGTGGGAGGGGTGTATGGCGTGGGAGGGGTGTATGGCGTGGAAGGGGTTGTATGGCATAGGAGACATGTTGGCCCAGAACCCC 
CCCCGCCACTTAGGCCTTGGCTCTCCTGTCCATCTCCCCAACCAGACT 

GG CCTGAAAACCGTGTT CGACGAGGCCATCCGGGCCGTGCTGTGCCCTCAGCCCACGCGGCAGCAGAAG CGCG CCTGCAG CCTCCT 
CTAGGGGTAAGAGTCCCCCACTCTAGTGCCCAACTCCCATACACAGCAGGCCCT^ 
GGCATTTAGGCCAGGACAGAAAGAAGGTGGGTATGGAGGTTGCCTCAGTTTCCGCA 
7 5 ACTCTCTGGAGCTCATCTTCCAGAGGCAAGGCCCAAGGTATGAATGAATGCCTGGTAACCCCCCACCCCCGG^ 
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GTCACXXJAGCCCCACTCTGCCCATTCCTGCM 
AGGGAGATCCTGCACCCACATATGTGGGTG 
TGGAGGCAGX:CCACAAGGCCTCCrGGAGGTGTGGGAC 
CTGGTGGGAGTCAAACCTTGAGCAAAGACCTGGAGGTG^ 
5 CCCAAATTCTCTTTCCCTGCTTATCTTCCTTCCAGGTTGCACCCCAGCGC 

CCCAAAGCCTGATGGCACCCCGGCTGGCCATGCTGTCCCCTCCCTGTGGCGTTTCTTA^ 
TCTTTTCTGTAC1T3GAGGCCTCCTGAGGCCAGGAACGTC 

TGAGGGCXIAGAGGGGAGCCCCAGGACCCATTAAGCCACCCCCGTGTrCCrrGCCG 
CCCGTTCACTCCAGTCCCACCCC^CGCCTGACTCCCCT 
1 0 GGGATGGGGCTCTTACTCCCTCCTGAGGCCAGCTGCTCrrAATATCGATG<3TCOT 
TGAGTGTCTCAGAAGTOTGCTCCTCTGGCCTCAGTrCTCCTCTTTTGGAACA^ 
GATATCTGCTCAGCCAATGGAAAATCTGGGTTCAACCAGCCCCT 
CTGCACTTACCTGTGAGAGTCTTCAAACTTTTAA^ 

GCTT AAGCAG AAAAT AAATTT ATTGATTCAAGTTTGGAGATAT CGTGATTT CAGGTG CAG CTTGATCAAGGGATT CAAATG ATGTC 
1 5 ATCGAGGCTTGTTTAGCTCTGGGCTCTGCCT^ 

CAAGTTCAGGGTTCATCCTTCCAAGTTCAAGTTCAGGGCCT 

CCAGG CAAAGTTCTAACT AAGATTTACTGTGATTGG AC CAGCT TAG GTC C TG GG CCCCCTTTT AACCAATCAT ATTTGT CAGAGTG 
GTGCAGTGCTGTGCTCTGACrr<3GCCAGGCATGG 
CTCGGGGGGAGGGTGAGTTCCAAAGTGGGCTGAGAGCCTC 
2 0 GGAGGTCATGGGAAAGTGTTCATTTCAGCAG 

AAGAGAGGCGCGGTTGTTGTGAGGATTTAAGAAGTTCCGACATG*^ 
CTAAGTTCTGTGGAGGTGCCAGACCTCCCCCTCCTC^ 
CATTTATTGAGC^CCTCTTCTGTCCCTGGCACTGTTCT 
TGACACTCTAGAGAGGCTAAGAGATCTAAGAGGCT^ 

2 5 TTCCACATTGTGCTTTCCCAGAGCTCCCAGCCAACCCCACTATCCCCAT^ 

ACTCACCTCCTGGGAGCCCCCCTTAAAAGGGATTAGGTGCAGAGGGGAAACCCCTC 

CATGATGT TTTCAAAGT TACTT CTCTTCTGAAACAGAAAGGAAAT ACTAAACCCCAC CCAT^ CG 
GCCAGGACCCCACTCCCTAGTATGTTCGATGTCTAAGATCCCTGAGGCACCCCCTGAGG 

3 0 CTCCCACTCTTATTGCACTCGTGCCTCTACCATCAGACTGGACTGTCATT^ 

CCTTTGAACATACTGGACCCCTTTCCTTTGCTC 

CAGCCCGTTAGGTtX^CCCCAGTAGAGACAATCT^ 

TGGCTCCCATGCACATATTATCTCATTTAAT 

TGAGGATGCGGGAGGGCACAGGTTTGGCTCAAATTGACAAGGCAGACACT^ 
35 TTGCTAACAGAATCCTAGTTTTGTGCATAGC^ 

GGGTTTTCAGGCCACATAGTTCTGGCCGAGAAGCCGATGTGAATGAAAGTCAGCGGGA 
TCCTTAAAAAAATGACAGGCCCAGGCCGGGCGAGGTGGCTCACGCCTGTA^ 



GTGGGCGCCTGTAGTCCCAGCTACGCGGGAGGCTGAGGCAGGAGAACT^ 
4 0 TTGCGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCAAAA 

CTGCAG CAGCCGTCT TGTG AG CTCG AAG CCACAAACATGAAGAT ACAAGGCCC CGTG CTGAG AGGGGATGAG CAGAAAG ATAAAAA 
GAGCGTGAGTTCCTGGCAGCAGCAGT<^GCAGCTGAGCCAATGCTAACAGCTGCCTTGTTATG 

GTGCTCTTCCCACTGTATAACTGTTCCC 

4 5 GGAGGCCCCTCATATAGGAAAGAGAAAGCTTATGGGAAGTCAGTGAGAACCTGTTTTGAAAC^GGAAGACCTGCCGGTTGTGGG^ 

GGAGCTGACACCATGGTTTGCTCTTTCATGTGACAACAACCCCTCCCAAGTGCCCACCTAGTGC 
TCCATGCCTCACTTCGCCCCTTCTTGC1AATGAAATCATGGCCTGGTGOT 

CCCTGGCATTCACTGGCCACCAAGCTTGGACAATTCAGCCTOT 

5 0 TAACCCCTGCCATGTCTCCCCTGGACAGTAGCCACAGCCTCCTC^ 

CCACCAGGTGG CCTGGGGATTTTCCTCGTTTACAAACAACTGTG CAGTCAAAAATGCTGATAGAACCACAGACTTATTCAATCATC 
GAATCCTTCAGCAAATAGTTATTGAGACCGTGTGTCAAGCCCAAGGCTTGGGGGTGG 
ACAGGCCAGTGGGGAGGCTGTCCAGCCAGCAGAGAGCCCCAGGAAATG^ 
ACACGCCCCGCCCAGCCTTCATCACTCAGT^ 
55 CTCACCTTCCACTTCCAACCCATCACCGACAATCTCCTCAGT^ 

CA.CTGTATCCTCCCTGTGGTAGCCTCCTGTCATAGTCAGCAATTGGTACAATAATGCTGTGTAACA 
CAGAGGCAAGAAGCAGCAGGCCTGTATTCTCACACTCACTAGTTGTAGGTGGACAGTGGGTCTGCT 
GTGACTTGGCTTCGGGCTGGGGATCGGTGGTCTCGGTC^^ 
TAGACCAGCATTCCCTGGGGCATGTCCTTCTCATTGTGGAT 

6 0 CTCTGCTCATGCTTCACATGGCCGAGCCCAACACCACCAATGGGGCAGGGACAGGAACTCTTCCGAGA 

TTTGCTGAGCACCAACTGAGCACCAATCCTAATGCT 

TTATCTCCACGGAGTGGCCAGAGTGATCGCXjTTAAACTACCAATCAGATC^ 

TCTGGACATTTGGCACGAAACCTCCTCCCTCACCTACAAGGCCTATAG 

TATCTTCCCTTTCCTTGCAAAGCTCCAGCCACAAGGGCTGTGATCTGGTTT 

6 5 AGTTCCCATAATCCCAAGGGAGGAACCCAGT 

AATCCCATAATCCAGGTGGGAGGAACCCAGTGGGAGGTAAT^ 

ATAGTACGTGAGTTCTCACAAGATCTGATGGTTTTATAAGGGTTTTCCCCTTTTGCTTGGCACTTCTTCCTG 

AGGATGCGTTTGCTTCCCCTTCTGCCATAA 

TTTCCTTTATAAATTACCCAGCCTTGGGTAT^ 

7 0 TTGCCCAGGCTGGAATGTAATGGCATGATCTCAGCT^ 

CTGAGTAGCTGGGACTACAGGCATGTGCCACCAOVCCCAGGTAGT 

AGGCrTGGGGGTATGTCTTTATTAGCAGTGTGAGAACAGACn'AATGCAGGCTG CTCCCAGGATTTT CACTGTGTGCCAGGCCCCTTC 

TACCTTGGGACCCTTGCTCAAGCCGCTCCTTCC^ 

CTCGATCACCTCCTCCTCCTGCTT 
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CTAGTGAGTCCTGGCCACCTAATCCATGCAGAA^ 

AATAGAGAGACTGAACCATGCCAGCGACCCAGCTGAGCCCAGCCTCCAACC^CCCC^ 
CCTGGATGTGCCGGCCTAGCTGAGTCTCCAGGTGACTCGC^A^ 
AACCCACAGAATCATGAGGGATAATAACATGGCTATTC 
5 AACCTTGACTTTTTGGAGTCTAGACTCTTCGTT^^ 

TCCAGAGAGATACKIACATTTGGGGACATCTGACTCCATCCAGCTCTC^ 

GGGAGAGG CAGATCCAGG ATATC CCTAGAAGG CTTC CGGAGAGGAC CTCAAGGAAGGTAAAATGAG CAGTTTTTTGCAGAATCCAA 
AATTCAAT CAATACAGTTC CCACCTATTAACTATGTG CTTCACACCAAGCTGTGG CAGGAG CTGGAGGTGCAGAGAGGGGTGAGGC 
ACAGCCCCITGCTCACCAGCAGCTCAGAAGCCAGTGGGAGAGACAGACGTG^ 
1 0 TCACAGAAATATATT C7UVTACAGAAAGGAAGAGACTCTCGTCTCAAG GCGCTAGGGTAG AGATGTTT CCACCCCCG CCTCCT CGCT 

GAAAAGCAGCCACGACAAGAAGGAGAACAAGAAACAGAAACACAAACTCCCACC 

AACCACAATGTATGAGCAG CAAGAG CIAG ATGG AGGGTGGT AAATG ATTGAACAG AGCTGAGGAAGGCAAACCT 

T ACATGGTGGGGTGGAGTAGGGG CTTTGGAGACAGAAAAGATG CAAATCAC CTGG TCTTG CAGAACCCTGGGGAGGCTTAGAATTA 

1 5 GCACCACCCCAGGTCTTCTCTTTTCCATTACGTC 

TCTGGACCCAGAGACATTCGCTACAGGGCAGAGTGGGGGTGAGGCACC^^ 

AAATAAT CACT CCTCCACTC CT ACC CTCC CT ACCCATCACTCCT AC CATGCTCCAGAGAGAGATTGACACACCCTCTATGGGAG GG 
GCGTGTTTCCCCTCTCCCCTGATGCTGGGCTTGGCCATGTGACTTGCTGTGGCTACT 
AGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTAC^ 
2 0 ATGGGGTTTCTCCACGTTGGTCAGGCTGGTCTC 

AC^GTGTGAGCCACCATGCCCGGGCTTAATGCCTCATTCTGAAAGAGGAGTC 

AATGAT AG CATTG CC C CAAATGAT AAATGAAAGAGAC C CAATCAAAT AAGTAG AAGAAAAGGAAT AT CAGTAAG CAATAAAGATG A 
TG C AG TGAGCAG AAGAAAATTTTGACAAAACCATCATTG CCATC C CACAAATGAGT ACAG A 

ATGCCATAGCAAAGAGCAGGCAGAAATCAGGAGAGAGCTGGTTGGGCTCAGTAGCTCATTCCTGTAATCCCAGCAAT 

2 5 CGAGGTGGATGGATTGCTTGAGGCCAGGAGTTTGA 

TT AG CTGGGTGTGGT G G CATGT G CCTGTAATC CCAGCT ACTCAGGAGGC TG AG G CAG AAG AAT CACTTG AACCCAGG AGG CGGAGG 

TTCCAATGAGCTGAGATTGTGCCACTGCACTCCAGCCTGGGTGACAGA 

AAACAAACAAAAAAATAAAATAAAAAACAGGAGAGAGCTCTTGGAAATGAAAAATAGCT 

GAAAAACAGATTTTTTT AAAC CT CCCAGGAAGTAGAACAAAATT ATAAAG ATG AG CAACAAGAAG AAT AAAG AT TTCATC CAGGAG 

3 0 ACCCAATATTCAAATATAATAGGAGTTACAGAAGGAAATAACAGAAAAATGTGTCTC^ 

AAAGGCTACTTGGTGCCCCCGAAGACAAATGAAAAAAGAATCAAAT 
ATCCTAGAAGATTCGAGAATAATAATAATAAAAAAATTAGCTTAAGTTAC^ 
GAGACACCGACAGACGCCTTCCATCAGCTGAGTCAGAGT^ 
GAGAGGGGAATTCACTATTTTGAAAGATGCAGACCTTCTCTGAAAGT^ 
35 AGGCGOSGTGGCTCACGCCTGTAATCCCAGC^CTTTGGGAGGCTGAGGCAGGTGGATC^^ 
TTGGCAACATGGCAAAACCCCATCTCTACCGAAAATACAAAAACrrATCT^ 
AGAGGCTGAGGTGAGAAGATCGCTTGGGCCTGGGAGGCAGA 
ACIAGAGCAAGACCCTGTCTCAAAAACAAAACAAAACIAAAACAAAAAAACCC^ 

AAAGGAAGACATGGAATCTAAGGAACTGGATTCATACAGGAAAAAGCTGAAGGGAATCCCCAAACAACAATGGAGAGAGA^ 
40 GGATGTCAGCTGCGCACAGGCCCAAGGGGCAGCTGGCCCAGATA^ 
TGGACTCTGGAAACTCCACTGTGTGGTTTTTACGAAGCT 
AAAAACCtAAAAAACGAAGCAATTATGAACTCCTGGCTGAGGAGGAAATAA 

GCATTTAACCG AAACT CATATT GAGAAGATGG GGAGAAGGGGTG AGAGTG CGTAAGAAATGTAAACTTCT CCTGT ACTGT AAGAGG 
AGTCAGTAGATAATGGGTAAAATTGAGGGATTAAGACATAAAAAAATCAATATGAT^ 

4 5 CAATTGCCGAGCGGGTGGGATTACCTGTGGGATCGGGGGGTG 

TTCAGCGCTACAGTAGTCGACTTAGCGCAGTTTCACT 

AAGAGACAGAGAGGTCACCATCATATAACTTCTGTTATGATATATTGTTATATTGTTTTATT 

TGTTCGTCTCTTAACTGTGCCTAATTTATAAGTTAAACATTATCTTTCTCCACCATCGTGGTGTGTGGTTGA 

ACAGCTTCTCAOVAGACTTTC^GGATCAAGTGATTCCTGCACAA^ 

5 0 GAAAACTGGTAAT AACATCA.GGT ACAGCTCCAAAAGGAG ATATTGGAAAG AACCAAGCTG GG T C T AT AAGGAATTGAGCATGAG AT 

GCTACACATATTCATGTTGTATCAAGGTTGTGATCIA^ 

GTTTGATTGGGAAGGTACTTTTTCTCrTTGTTCATATGCTCTGCACTAGTAGGC 
TTTAAAAAAAAGTTATCAGCCTGTAATCCCTGCTACTCGGGAGGCT 

AGTGAGCCAAGATCGCACC!ACTGCATTC<^GCCTGGGCAACAGCGCGAGACTCCGTCrCAAAA 
5 5 GG CG AGG TGGCT CATG C CT ACAATGC CAGT ACT TTG AGAAGCTG AGGTGGGAGGATTG CTTG AG 

HUMAN SEQUENCE - mRNA 

AACAAGGGTCACCACCXBACACTCTCCAGGCTCCGGACACGATGCAGGCCA 
AGACCTGCCTTCTCATCAGCTACACCACCAACGCCTT^ 
60 ATGGTGGACAGCAAGCCAGTGAACCTGGGGCTGTGGGACACTGCTGGGCAGGAGGACTACGA 
AGAGACGGACGTCTTCCTCATCTGCTTCTCCCTCIGT 
GGCACCACTGCCCCAGCACACCCATCATCCTGGTGGGCACCAAGCT 
GAGAAGAAGCTGGCTCCCATCACCTACCCGCAGGGC 

CCTCACCCAGAGAGGCCTGAAAACCGTGTTCGACGAGGCCATCCGGGCCGTGCTGTGCCCTCAGCCCACGCGG 
65 CCTGCAGCCTCCTCTAGGGGTTGCACCCCAGCGCrCCCACCTAGATGGG 

C^GAGATACGCGCCGTGGTGCGCACGGAGCCCTTCACGGGACGCTACAGCCTGTGCCGGG^CGGGAGCTGGGCAGG^ 
CAGTGGTG AG AAAATGT AT AAAG AT AGATTCTGGGAAAG AATTTG CTG CAAAGTTCATG 

HUMAN SEQUENCE - CODING 
7 0 ATGCAGGCCATCAAGTGTGTCGTGGTGGGAGATGGGGCCGTGGGCAAGACCTGCCTTCT 

CGGAGAGTACATCCCCACCGTGTTTGACAACTATTCAGCCAATGTGA 

CTGCTGGGCAGGAGGACTACGACCGTCTCCGGCCIGCTCTCCTATCCA 

CGAGCCTCTTATGAGAACGTCCGCGCCAAGTGGTTCCGAG^ 

CAAGCTGGACCTGCGGGACGACAAGGACACCATCGAGAAACTGAAGGAGAAGAA^ 
75 (^CTGGCCAAGGAGATTGACTCGGTGAAATACCTCGAGTGCT 
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ATCCGGGCCGTGCTGTGCCCTCA(X:CCAaX^ 
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Table 101 

MOOSE NOMENCLATURE 
ICSGNM Mbnl 
Celera mCG19416 

5 

HUMAN NOMENCLATURE 
HGNC MBNL 
Celera hCG28028 

10 MOUSE SEQUENCE - GENOMIC 

TTAATAAACCATTGGCAGTGATTTGAGTTTGTTGTTAAATATGGTTOT 

AGAGCGTATGATGCTGGTCGAAATTCATG^ 

TGTTACAGCTTTTCCAGJ^TGXA(^GAACCTTATOT 

CAGATTT CAAACTGCC CTGCTTAATTTGGTTAAT ATGT ACTTCl'l' I'lAGAAAGT ACCAAATTCTGATAAATTCl'ri'GATTTTTAT CT 
15 CCTGAGAACATGCCnTTAT/TTCTATCCTGACIAACAGCATA 

NNNNNNNNCCCAAGTCTTAACATTTTCTCCTT^ 

CAACAGAGTGAGGG^AAGGTGCTTTTATTTTTCTAACAAG^ 

GTATTAGTTTAACCGGAAAGTGCAAGCAAGCTGCTTCCTGAAGAGCCT 

GTGTATGTGTTTAGGCTTACAGTGGGTTTGCGGATTCTGTTGTC 
2 0 ATGACCTTTGTTCAGTCTCTAATGAAGGAGGAGAAACAAGGCATGTTCTA 

TTAATCAT ATG ACATGAT AG TGC CTGAAAATCTTCAT ATGTAATGTTTTTGT AG GT ACCTTACACTG CTT ATAT ACTATT AACAAG 

AAG CAAACATTT TCTCCAT CCCCACGATTGTCXiArGGACTGTTT ATT CAAG ATTCT 

ATTGTAC AT ACT T ACTTTACCT ATTT CTTTTCTG AG CAAGGTGGTACTTTACITTGAATTAGCAAGAGCTGGAAACTGATGT TATA 
AATGAC CCATGGG CTGGCTGACGTAAGTGGAGCAATACTGAACCG ATTTTGGAG^ T AG ATT ATTTTT ACAGCAA 

2 5 AAGGCATTTCATACTTCTCTACATATCCCCATCAGAGAGCATCGCAGTGGGGATTGAGGTATAGTGT 

AATTT CAACAGTAAGAGTATGCrTTAi rG i' 1 GAGCAAAAGCTCTGGTAAATGCTTT CTTGG AAGCAG T GACTTGTT CAAGTTCAG A 
CTT AAT AC^TG AACATGCCTTATGGTCCAAG GATTTATGTTAGT AAAGTGATGCAAACA CTTCCTAT GAGCACTTG 

CAGTCTCGCTGAGAAGTCAAAACCTGACATTGATTTAGGT(ZACTACT 
TAGCACGTAATTATGTTGGCCTGTTTCATTTGACA.TAAATCCTTATAACAC^ 

3 0 TATTAGAATAACAACATTTGTTGCTATTTAGCATCTTGGGTCTAGGTA^ 

ACATTGAAGCACCACAGTGAGACTGAGTTACTGGTGAGTCTT^ 

TG ACTTAAAAAAG CTCT AATTTGTATAATAATCT AATATGG AAT AACT AATATATGAAAT CT AG ATTGCTATT AAG ATCAAAT CT A 
TTTATCAGAAGAAATATTTTTAGGATTGAAATGATGAATTTTGCAG CAGC CATGTGG CTTGTTTTTATGATTTTGG AACATGAAAT 
ATTTTGGTGAAAAATACTTCATTGATTTTTATGGGAAGAAGAAAATATTTGTG 

3 5 ATTTCTTAGAATATTTCCAAATAATATATACAGATAATATATACAAATAAAT^ 

GTCATGGTTT CATTG ATTATATTCT AATGAAAAATACATTTTAATTAT AAACAATGCTTTTATACT TTTCTCAT AAT AAT AAC TGA 
GCAACATAT AT AGGGT AATTTT CAAT AGACATAGTCCTGAAACTTTTTT C T AAC CTAGCT T AC CGTG CTAG AATTT AAAAC CAATG 
T ATG AAATTGAAG AG CTCG CTGTTGAAAATCAC AG AGT AGGAACCCAAAT CT AGTTGATAGTAGTG AG ACCAATTG C 
AGAGATCT (^TAAAAT ACAGGG CAGAGT AAGACTGG TGCTTACAG AGG T AAAATTATATTTTGG GTAGGT TTTAAAC TATATATG C 

4 0 CATTTCTTCACATGTCATTTTTATAATTTATGCCACTTGAGTCTGCTAAAATGAA^ 

CTAGTCAAAGGGAAGCCAGAGCATCTTAGTGAAGGGTCTCAACCGCA.GGTATCACTGA^ 

CCCTGTTGATCAACTTCATGCTTTTGAATTGTGAAGTGACATC^TTGCCACTCATCTGTAGCATCT 

CTTTCACCAACTTCATAAAGGAGCrrTGGTGATGCTATTCACAAAAAGAGAAGTTGAATATTA 

TATGTTG AAT AAACG AT TT CTCCTTTGAGCAGGCCTT ATAGT CAT AGT C CACX5 ATCAGAGTATCACTG GGGC CAT CACC 

4 5 ACCAGGCTTCCAGGCTTGATTAGCAGTTTGGACAGGCTCATT^ 

CCCTGAAAATTCCAGGGAGGTGATTTTCCCGATTTCTT 

CTATTTATCACTTTGAAATAGTAAATGTTACAGGCAGATCAGATATGGTAAGCGAAGCT 
AGTAAAGGGCTGGGATCTCCTCAGGAGGAACAGTAAAATAGCATGCAGTTCTCTCTCCT 

TTCAGCGCTGGCTAAGTGT CAT AGGAAACTG CAAAGG AATTAT ACTTT AT ATTAAAG CAAAT CTTTT CACTTCACAATCAGGGAGG 

5 0 CATTTAAATGGGGTTCTATTGCCTCCTATTCTCTTGCCA 

CAGGCTGTGGAACTGGAGTCCATACAATGTAACTTGTTGAATGCATT^ 

CTCTATCTATCaTCCATGTCTGGCTGTCTCTATCTCTGTCTATCGCTATCTTCTCrGTGTTCCT 

CTATCTC CCT AC CACCT ATGTACAGT CTGGAAGCAG AG ATTAT TTGTTT CCTT T GG T CCAGTGG AGTT TCAGT ACTG ATT AT AGAG 
AAAAAT AGCCTGTG CTCATGGG ATTT CTATGCCAAGACTGAAACAGT C AATGAG AT AAATGTTT CTACTTCTCTACTTCTTCTTTC 

5 5 CGTTATTTCTTCTGGTTGATATATGATGTGATATCATGTACTTCTCTTACAA^ 

GGT CACTTTT GAT AAAG GTG AACCAATGGACTG TTTGGCTACAAAG CTAGTG ACTAGTAT ATGAA^ CCATCATT ATGAT AA 
CTATCTTCTCTTGATGTTAATATTTTAAAACTTATTTTCTTATAGACATAAATTCAACA 
TTTTACAGTCTAGTCATTCCCCCATCCTGGTCCTCCCTCCCACAGTTCCTCATCCCATTCCITCTOT 
GTTCCCACCACCCTAACCTCCIXXrCAGGCCTCCCCACTACCTGGGACCTCAAGTTTCTGG 

6 0 TCAGACCAGGCAGTCCTCTGCTCTCrTATATATGTTGGG 

ATC^TTGCATGCATTTAAATATGTATTTTGTTTAAAATTTTCTAATTTGAATACAG 

ATTAAGTAACCTGTACTCCTTCATTATCTGTCCTGCTATAGCTATTTGACTGTTTAGGGAACCTGTGAACGAATCATTATTTT 
AGCTTGTATTGACAAACCTCAGGTTCTTTGTGCGA^ 

CCAGGAGGCCGCATTGACAGTCTCTGATAGCCAAGAGGGTGTGTGATTG™ 

6 5 CAGTCATTTGCATGTCACC^TGATTTTATTGCCAACACATTAGCTTGGTCT 

GTCTGTGTAAGTCATCACATGACATATTACAGGCTGTACAAGCTGGTGTCAGCACTOT 
GTCCAGGCACACCCTGACAAGTACTGAGTTCAGCAGCTCTGCTCT 

ACAGCCTATCACAAAG C1ACAGTTGTGACTTTAAG CTGG GACAGT ATGG CTGCAGAG CTCTG CAG AAG CACTTAGT ATT ATTTTCAC 
CCTGAAAT CTGT TAAT ATTTGGGATAAATGCTGAGAAAATTATTGTG T G AGGAATCAGTCTTGTTGAT AT TGGCAAC CAAG AAAAA 

7 0 ATGCITTGGACAAGGTCTTCCTTTTTGTTAC 

AAT AAATCAAAATTGTAATG AG AC C C CACAT TTTAATTC CTCAATACAT AACAT AT CTTAAT ATCAAAGT ATCAG 
TCAGATGTCTTATTGATTACCTTGGGATATAATTGGAGTTACTTACTTAAGTTTGTCTTATAGCTTAAAGGGCTAGTAAGTGGGTA 
GCCTAAGACATTAAGGTCCTTCCTAAAGGCTGTTTAGT^ 
CACACTGTGATCCATGCCCTCCCTAGTCAGTGTCCAGTGCCGTC^G 
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TCATGATTTTCTITTCCCTTTGT 

GT AGCTGTCCTCAGACACTCCAGAAGAGGGAGT CAGATCTCATTACGGATGGTTGTGAGC (^CCATGTGGTTGCTGGGATTTGAAC 
T C CTGACCTT CAGAAGAG CAGTCGGGTGCT CTT ACC CACTGAGC CATCTCAC CAGC C C C CTCAACACATTTTTCrriVrriTTCTT 
TrATTTTCTCTCTCTTTTTTTGTAATTTTTACTTTC^ 
5 TTGTGACATCACTTTAGGTGAAATTGGTTTTTGTGGCA^ 

GGTTGGAAATTGATCAGCGAGTCACAGAGAATTTGTCCTATAAATATCAG 

AAACAAGCATTTTGTTTTTAAGGT AACTTTATTTGCTTATAGTATA T CAGAAACATCACAGAGGCAATTTAC CTGTT 

ATTAATCATCTTAGAAGTAGATTAATTTCAC!ATGATAATTATTAAAA 
TTATAATTTGTATATCACTCATACATACAGTTTTCTTTTCCTCT 
X 0 TCTCCTCTCCTCTCCTCTCCTCTCTCTTTTTCTTCTTTGTTCCT 

TCTCTGTCTCCTTGTTTATGATACAGTTTTCATTAT ATAGCTCCAACTGGC^ C CTCAGA 

CTCTTGGAAGTCTTCITACTTTAGT C CCCT AAGTCTGAGGGCG CAT ACACTGTT AAATATTATCT CAGATTCT CTCTG Crrri'AGG 
GCCAGACACTGTTACTAGTC7JLAGTTACATTCT 

ATGAAATTAATTACCAAATGTGTACAGATTGATGTCCATTCTCCCTATTCACT 
1 5 TCATCTCTGCCTATGTTCATAGTGATCAATTTACAG 

GAATGAAAATAAAATAGACAAAACnATGAGCTACAGGCAAAATTTTA^ 

GAATTAAATCACACATATGTTAAACATATAATAATATCTGTTGTGTTTTAAG^ 

GCTGTCAGGGAGATGGTAATTTTAGGCAGAAGGACCTAAAGTGAGAAAC^ 

TGGTTGGTCCAGCrTTACAGAGAAGATGGTATCTGAATAGCAATTAA 
2 0 ATTCAG CCAGATTGAGTGATAGTGTGTATGTAGTATGCrCCT 

CT CTGTAAACATAGGG AT ATGGGATCAGAAG CTAAAGAATGATGTGTGTTGGTTCATTCATGGAGAAAGGCCTCTTTACAGTGG CC 

TCTTCAACACTTGTGAGGACTGAGAACCAACTGCTGTTTGATAGAGAATGX3GCGCCTCT 

CAGGGGGAACTCCTACCCAAGGGTGACGTTTGAACAAAGGAATAAATC 

2 5 CATGGAAGAGGAG AGAAATGAGGT AGAGAAGACIT 

TGGGCAAAGGAGGGCTTGCATGACCCTGGGATTTGTAGTATGTATTTC 

CGTGTGT ATGTT ACTTTTGTGT CCT CAGT AAT CTGAGT C CTGCT GT AGT AAACAATCATTT AT ACAC CT ATGCTTTTGG G ACAGCT 

TTCTCGCGTTTGTTGTTGTTGTTTTGTTTATTTGTTTATT^ 

TACrGAGATGATGTATGTTGTTTCTGCCCCTGGAAACGTGGTTTAGTGGCT 

3 0 TGGAGATGTGGAGGCAATGCAAAAATGGTGATATTAGTGTGTAAGCGAACA^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNCCTTTTTTTTAAAAAA^ 

CCTTACTTT CCT CACAG CGTT CAAAAGGAAAACAAAGCAAAAACAT AC ATG CTTT T Arl l"i l T TGT AACCAGTT AA ri'I'l^ri"!' TGAG 

3 5 AAACATi _ ri"l'i _ rCACTTGATTGAAAGAGAAAGTGAGCTATTTA^ 

GAGAAGG CAGCTGCCTCAGCCCCTACCCTCCAG CATAGTGTGGGGCTGAGG CCCTG AAGTGCATCAGACTGCCCAAATCAAATTGC 

TCACTTAATAATTAATGTTCCTGCAATGGAATCGCITTCTCCCTX 

ACCCCTTATTCTCGACACAAGACCTC^GCTTCTGCTT^^ 

TGATAGAGTGATAAAGTGTCTGATGGCTGAATGGAATAGTTT 

4 0 TTAGGAGGAAGTGAATTGGATAGGTCAGGAGAATGTACCATTTGTAAACACCCCT 

CX5GCAGGCTGCTGAGTGTCCACCCCTG CACAG AACCAGAGGGAAAGCA 

ATTACTCGGGAGTGATTGTCAAGTCCTTGGACTTGCGTTAGAAGCT 

GGGTAAATTCAAAACCCACATGCTCGTCCTCTTGCAGTGGAAT^ 

TGAACGTCTGACAG GCGCGAC CTTT CAT AAGACAAC C CACACTATTG G CTTTCTG CC CAGAAT ATTG CTG CAACACT ATCTGTG AT 

4 5 TGGTTATCTCTTCTATCATGCATTGCTTCACAAGGGGAAACGGCC^ 

ACTGTAAGTATTTGTGTGTGTGTGTGTGTGTGTGAGAGAGA^ 
AGAGAAGCAGAAAGAGAGGAGAGAGAGAGAGAGAA 

CTG CT ATAGGACATCAGTG AATGCTACTTAGT AAAT CTTAACCTAT CT CTT TTCTCT ACAGG TTGGTACTAAGAAGTG C CTTTC CT 
GACGTCTCTGCTGCTTGGAACCGCrrTCTAGAGCACKICTCTGCTTTTGCCTTGC 
50 CCCTCCACCTCTAGCCCAGACACCCCATTTCTACTTCTAATCAG 
GTTTAGAAGAAAAGTTTGCrGAAAAAGTAAGATACCTT 
CTCTAAACTGCTGCATCTGTCTATGCCAAACTAATCAATA 

CCCTGTCAG ACAACTTTG CTGAAAG CAGCTTG GAAATTCGG TGT CAAAGGG TCTG C CACGTT T TCATG CTTGCATTTTGGG CTCCA 
AATTGGCACTGGGAAGGGGTTACTGAGCACACGGCTGAGTCCA 

5 5 CTCAAAACCAAAACCTCTTTGAATTAACAGTTT 

TCCATGTGCCGTTAAATCGGGGACGGGGGAAGCAGCCTTTCGGACATTTTC^ 

ATTTTGTTTAGTTTrTGTCTTTTGTTTTGGT^ 

TTTTTTTTTTTTTTTTTGG 

TGTCACACCAATTCGGGACACAAAATGGCTAACACTG 

6 0 AATGTAAATTTGCACATCOTTCGAAAAGCTGCCAAGTTGAAAATGGAC^ 

TGTTACCTCCTTTTAAGGATATGCAATGATTGATTAAGGGGTATGGGTACATGG 
TAACACTTTAGTTACTCAGAATGTTCTGGCTCCTGAGGCTC 

CTTTGTTATAAATAACTTTGTCAGGCCTTTAACTCCGAACTCCATGCCTCCCCTCCCCTCCCTT^ 
TTTTTTCCTAAAGCAGAATTGTTACACGTTGATGTAGTTGTGACT 

6 5 GTTTACATTCAAGACTTTC^TATATAATAAGGCCAAGATTGAATGAATAT^ 

GTTATAAATGAATGGCAAATT CTGTGT CAGCTGATCATAGGTAACATT TTTACTCT 'ixiTlTTTTTTTTTTTTTf TT TTAATGAGA 
TGATTTTAATGGCTAAAGCCATGTCCTTTTTGGGAAAA 
AAAGCATATACTATATTAGATGTGGACTTTGTAAA^ 
' AATGCAGATGTTCATTITCrGTGAGATTCTTC 

7 0 GTG AT AAG AAG AAATGGTTTATTT ATGTT AG AATT TTTG ACAT TCAAA TGT CATTCGACITGT AAG CCCTGTAT CAGGTTTC 

AAGTGGGTCTTGTAAAGATAGCACACATGTCTTTGTTT^ 
CTTGATTTAAT CAG ATT AATTTTTGAATACTAAATG CCTACAGCT AATT CT 
GTCTGTGAAGAAGTCTCAGAGACCCATCATGTTTAAAACC^^CCATACA 
GTTAGAAGTTAACCGTTTCAGCTCACTTTGTGATGCTCGATCT 
7 5 GGAT ATGAGGAGGT ATGTGTT AGT AACCACAGACCT CTGTT AAGATGG CATG C!T AT AT AGGCAGAGT CCT AGTTG AGGTGAATAAG 
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GGGAAATTACCACTTTCTCTCAGCTTGGCCTTiWSGGATGCCTTO 
TTAATGCTCACCAATTCAGGTGTAATGAGAGAAGGATCCATAACAGGAGGAG^ 

TGTCCAT AAGTAACTCCTTTTTGACCCAATTC CTAGTAAAT CCAGTTTATAATCCT ATAGATAATTGGTGGTAAACTCCTAG CTCA 
GAAATGAAGGACATTAAATTTGGC CGTGACATTGA^ 
5 TTAGTATTCAGTTGCTATTTTTCCAGCTCCAAATATTTT^ 

AAGCAATCGTCCIGCTTACAGACACCCAGTGTCCCTAACGAGGTTCAACCCCCACAT 

AAC CACAAGAAAAAAAACCCAATAACAAATGAATTATATC TTACTAT AC CAGCAAAT 
CACAGTAAACTTCATCCTTTTGTTTTTTACTAAGAAAAGCT 
TGTCTGTGTGTGTGTGTGTGTGTGTAGAGGGAAAATTC 
1 0 TAGATGTTAGTGAAATAACTGTTAAGAATCAGTCAACAGAGTTTCT^ 

AG TCCTATGGTGTT CAGTTTTGTGG CTAAAATAGATCCAGTACACATCT 

GTTCTGAAGTTTGGGAGCATTCTGAGACTTGGCAGCTAG 

AAAACTTATAATATCOT^CTTCTTCAATGAAATGTGCT 

G C CATGAGATGGAAGTAG CTTGTGTAATCTACTATTTTTAGAAAGTT AG^ CAGAG AGG ACC CTGTGA 
1 5 AATGG CTGT ACC CTT CAG CCATG CTGTTTCAGGCrTTGG CTGTGCTC CCAGGTG ACIT AGGTTT ATAG CTGACT TCTGGTTAGAAG 

AGAAATCCT CTCT C TCT CTCT CTL^C^Cl^TCrCTLTl' UTLTA' C Tut lti CT CrCT CTCTCTCACT CT Ci CTCT CTCTCAACACACACA 
CACATACACACACACACAC^CACAC^CAGAACAC^ 

TAATTTTAAATGAAAGATAGGATAAAGCAGTCCAAGTTTATAATAAATGTGCTGC^^ 
TGTGTGTGTTAGGCCTTTCCAAATGCAAATACTGAAAACAAAAGAG 
2 0 ACTTT ATTAACTTAG TTTAAATTCATTGT AGTCTAAGAGTAT ATT CT AGCCATT ACTTTT AAGTGCTGATAACTT AC AGGT CTG GG 
GGCTCTAGGCACTTCTCAGAACAATTGATGAAGTGCATTTGATTCTGT^ 

GTACTCCAAAAG AGGAGG CAAACGAT T AAAAGAGAAG ACT C CAACACT G CAAGCTGCCAATAAAATGATGCCATGTTT CAG AAATG 
GTAATTCCAGCAATCTGACTGTCATGATCTCTTCACAGTT CTG AGTTTGTGGG AATGG AGT AG ATG GG ATACATTTTT ATG TGACG 
CCAAATGT ATTACAGTCTCAAAAATAAGTGTT CACTAT AT CATCAGTAGAAATAAAC CTCTGGTGTTAACAT CAGG ATT ACATGG C 

2 5 ATGACIGACTTGAAGAAGTGCCCCCGAAGTGATGGCAGTTAATT^ 

GAAATGGTTAGTCTCATTGTTGTTCTTACAAAGGCACCATCGTGGTCCTTTC^ 

TGCAGAATGAAGCACAGGAGATTGGCAAGTCCTGGATAATCTAAATTCTAG 

CTGCCTTGTCTTGCACACACACACATACGATGCTAC^ 

GTGTGTCTTTAGCAACAGGTTGACACAGCAGAGTTTTTTCTACAAAAGAAAC^^ 

3 0 OTGTTGTATCCAGTAAGCAGAAAGTTCACAACTCTGTCAAGTTCTTCCTGGGCCT 

TATTT CCTT AACAAAATG CTCAGCAG CT ATAGCTGCCTCAG CAGT ACT AAT AAT AGCTTT AG AC ACATGCTGTTCACATTCTT AAA 
TT CTTAAACTTTCCTTAAATAC-GGACTTTTATTAATTTTT CTTTT AT TTTTTGATT AT AACATATTATATT ATTTT CCCTTTCATT 
TTCTCCCTTGGAACCCTCCCATATACTTCTTGCATTCCTTCAAGTCCATGGCCC 
ATGTGCCTGTGTGTGTGAGC^TTTAC^CACACACACACACACA^ 

3 5 ACACACACTAATATTTTTATTTTTTCCTTAAATACATTAGTGCCACCAG CTCAGT CTGTATAATGTAATTTGCATGTACGTTTTAA 

CCTGTGGCTTTTTATGTAGGGTCCAGGCCTCCGGGTATTTCCACITTGCT 

CCATGGT CAG CTTG T ATTTAAGGG CT CAT ATT AATT CCTGACATT ACT AGGGGATGCAGTCTCAT AG CAACGTT CCTGATCTGG CT 
TTTAACAATCTTTCTAACCCCTCGGCTTCAGGGTTCCCTGACCTTTAGGTGCAGGG 

4 0 AT TCCACAACTCTGCTTTATGTTTGTTTGGTTTTGGTTGTGGTTTAA 

CAGGCAAACAAAATT CAAGACAT AAAG CCTGTT AAGGTATAG ATT CATGCTATGTTG CATT CTT TTGT AAGATTTTAAT ATATT TC 
TCCTTTTTCTCATTTAGTCITGCTTGACTGTGTTAGCATGGTCAACT 
AGGTTCAAAGTTTACCTGTGTGTTGGACATCTAAAACTACT 
TACTGCTGAACATGGTTCATACAAGCTTCCAAGTCCAGAATGCTTCTC^ 
45 TGTTTTGTTGTTCTCTTTGCTAAGGGTAAGTAGAAGGTTTGAGCT 

TCTCTGCTTCCCTCCAGACCTAGACAAAGCCTTTTATTTTCTCTAAAGAGGTTCTAATGAGG 
AAACITTAGCAGACACACATTCTGTGCTCT^ 

TAGCAATGGGTAACTAAATAGGTGACCTTTGACTGTTCTGCTGATAGAGATTAGTGTGCAGAG 
GGCTGGGTTGAGAAGACTTCGTTGTACGGTGCATTGTATGTTAGAGCGGT^ 

5 0 ACAG ATT AACTCT AGTAAGCATGTAG CTCAACACTCATTTACACCATGCTGG CCCACTAAAGAGTTTTGTAACTAACACCTATCAT • 

GTAAGGTAATCACTAAATACACTTCTACTGATCAGCTCCTTTCTGGTATTAATATTTG 

GATTTATT AGAAG AAAAGTTTTTT TTTTCTTAATTTT i u iTCITT ACACAAGTCTATG CTTTG AGAGTTT AG TGAGAG TTTC CAAG C 
TGTAGAACCCATTCTATACTATACCTTTACTGGCAGACTAACCGGGTATTTTTGGG 
ATCCTGGTGACTGCCCACACAATGTCATTTCACCTTCCACCTCACCTGACAACTGCCCT^ 
55 AACCCGTACAGTTCTGGAGTTGTTGGCTCTGATCTTTCTTTA 

CGGATTTTGTTGTTCCTCTAATTTTTATTTGTACTTCCTTCACAAATACATTTTTTTTACTTCTTTAGTTTAGAATAAA 

AAT AGTT ACT AAATTG CT CTGTGTTGATCAT TGTTTT CATTTT ATGAT AACCT AAG CCTATACAACAACTCTGATAATAGTATCAA 

AATTGACAGACAGGGAAGTACAGAGGAGCCAGGCGCTGCTCAACTGTAAGCTATGGACCTGATGCCTGGT^ 

CTGTACTCAATCACACCATTCCAAATGCTGCTTTTAATTTACATATCC 

6 0 GTTGAGCATAGCTAAATGATGATTCCCAGCACATTGAAGGCATAGG^ 

• GTTAACTGGTAGGCCATGAAGCTAATCCAAGGTAGCAGGAAGATTGTGGTGGTGTTAGACACAAAGATTATG^ 
T ATAAGAAAATG AGCTT AAGTT TACT AC CTCAGACAAT AAAT CAATGTGTCAG CT AGAT AGCTG CCTC C ATGTCTAG AAAG GGGG G 
CTGAAGAGGTGGCTCAGTGCTTAAGAATACTGGGCTGCCCTTCCAGAGAAGCCAGGCTTAG 

TAGCTTGCTGTAACTCCAGTTCCAGAGAATTCTGTGCCTTTTTCTAAACCATGGTTACCAGCCGTACATGTGGTAC^ 

6 5 ATGTAGG CAAAATAC C CATG CAT AAAAAT AATAT CAAT ATGGGAG AAGAAAAATGG CC C AATGTCTT CATTGTTTAG AACAT AGTT 

AAAACATCAGGCCCACTCTGAGAAGCCAAACAACTATAAGGTTCTTGAGCTATGCTTTAAAACCC 

AGACTCCACAAATATACTGTGAGCTCCATMNNNNNNNNNNN^ 

CAAGAGCAGAGTTGACAGCAGTGTCCACATTTTGTGGGCATCACCT 

AAGCCCGCTGGGCTGGGCTGCATGAACTGTTGTAGCrrrCATGAAGGTAGTCCTGGTGAAATACGTG^ 

7 0 G C TCAGAATTTT ATTTTGATTTTTTCT TGC CATTTTAT AACAG 

GCTAAGAAAGTATAAGGATTTAATATTTAAAGAGGAT^ 

TGTCCTCATGGTGCTGATTGTTTCTAAGGGATCCTGTGTTAGGAGCGATGAGGTGGAGATTAGGAGGGAACT^ 
GAAGGTCAGCGTGAGAGCAACTTGAGGAAGGGGGAGAGGGATGTCTTTAGTGTCTAAGCTTTACrrOT 

AAACCACAG ATGGG AACAGAC CAG AGAGTACCAAAAAG AAAGAAGTCC CAAGTCC CAGG CTT AGG C CT ATGTCTTGGATGACCTTG 
7 5 CTCAGAGGGGAAACTCCACTTTAACCTGTAGTCTCAGGTTCAGGGCTAGAAGAATAC^ 
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TATG^CTTTTAGAGTTATCTCTCTGCCTCCAA^ 

CATCTGCGGTTAATAGTTGACAGTAAATTACACAAAATAAAACATGGAAAAT^ 
TTTCAATTTCTCTACAGATACGGACATAATTAGTGTTTAGGATCGCTTGT 
5 GTT ACTG CATTCAGAGAATAATTTG CT TTCAGAAATCATTG CT CAT ATAATATTCT AAAATCGTTCTT AGACTGAAAAa^GTTTA 
GTAGCATGTCCCTATCGAGTAAGCTCTTATATCCATCAGT^ 

TGGTGGATGATG GGGTTTCTT AAAAGATTATTCTCCTTCAGGGGAGGCAAC CACAAAGAAAG ATGG AGAAAGAAGAACCACG AAAA 

AGGAAATACTTGAAGAATTAATTTTTAAATGAAGAATCCTACT 

CTGTTTCTCCATGGCTATGTGATGACTAATGAAATCTAAGTCTAGCCATGAG^ 
1 0 TTTTTCTCTGAAGAACTTTTATTACTAGAT 

GAACTCAGAT ATATTTACAAGTAACAATCAGAATAAG CAAATCTCCAAACAC CACTAAATCTCAAAACTAGAACAAACAAACCCCC 

CAAACAACCCTTTAGCCT AGCCCOTGATGAG CTATAAATCATCATT CATATTTAAGAAAAAT CATTTATATTGC CAAGTTTTACAG 

TGGGATCTATTAAAATAGTTCCTGTAAAACTCAAAATCAATGCTTCAAGA 

AATGGGCAGAACAGTGAACGCAGAAC^GCCTTAGTAGCTACTGTCTTCTG 
15 ATTTTCATGAGAAATATCTATTTTAAGAACCAGAGGGTTGGGGGGTGGGT^ 

ACCAGAATCCAAATGCCTTACTTGGGGAAGAG AG AAG CAG AT CCT AG AAGCCAGCTGGTT C CGTGAG CAG CACTGTGT AAAGGCCT 

CAAGGTGGAGTTTAGCACCCTTGTCACCTGCATGGCCACCAGCTCTAACCCC^ 

GCACATGTTGTGCTTTTCACTGTTCACT^ 

TCT CTGGTTG TTGGT AG ATG AGTGGCTATTTTAAGAT C^TAAAAATAATAAAAATTTGATl"lTATTCrrcAATTTGTriTGAAACAC 
2 0 TGGAAATTTTCTGTATAGGAGAAGCCAAATACTCGGCATGTTTTTATTGGAATGG 
AGCTAAGCTCCTCATCACAGTCTGCCATTTTGTGACATAAATTGAGACAC^ 

TTT CCTCTGCTGATTTCTTCACCAG CTCC CT ATTX3T AT AGATTGGGAAAGTT CAGAG CAGTT AAGT T ACTTCTC CAAG C TTGCTCA 
GTGAGAGGCAGCTTCCCGGGGCTTAACACAGGCA^ 

AAACTATTAATACTGAGAAATCTT ATAGCATGACC CTGTC CAT ACAGCCT^ 

2 5 TGAACAAGTATCGATTCCGTTACAGCAGCTC 

ACAATACCATCCCTGAATAATTGCTAGCAAGGTATTCrCTTTC^ 

ATAGATCTCTGGTCAGGTTACATCTGCATAGTTTC^^ 

GAAATTGAAGAAGAGACACACTGGAGTTCCAACCTGGCAGTC 

GTC CTTATAT CTTTGGGTTAAAGTGGAATGTCTCTTTTTATT CACTG AT ACGT AAGC CCATGC^ TTG TATG 

3 0 AGGCTrGCAGGTTCAGCTTATTTTGTAGTTTTCCTAGGAACT 

TTCCATtjTATTGATGTTAGAATTTAGACGAAACAACTGAAATCT 

ATTT ATG ACC CTAAAAGG AT ATTTCAG AAAT TATCTAGGACT AAAATTTTT ATTCTTTGCAT ATCGT CAAT ATTT ACTCACT AT AC 

TAATTATG CCAAAG T AATTTT CATGTG AT ATAAATAATTTT ATT AAGT ATTTT AAAACAAT AT AT AT ATTT CTC 

ACCACAG TTAGACAT AATGAAAATG AC T ATTAATAAAAG AATTG ATT C CATTTTT AT ACAAG ATGTGG AATAAAG AGTATGTTGTG 

3 5 CT AAAAACTTTGGGTTT CAG AATATGAAAT AAATT CTTATGCTT Cl'T AGAT ATTT AATGAACACTGTGT AACCCTGACCT CATATT 

ATGAGCTCGTGAGACAAGAGCCCCTACCCCAGAGGATCAGCIAACTC^ 

GTTTCTGTTTAAAGTGGAAAG CTG CATAG GT TTGTGG ATT CTG AT ATG C CAAACTGACTTTTCTCT AG AGCTCCAGAG ATTACAGA 
GAAAAGTGTGTAGTGTTCTTTGTCCTTAGATTTCATTTTTCTGG CT TGTCTGTTTCCTTTATTAATTTGCTGTAGATTGAGTTTTG 

4 0 - CirGCCrTGTTCCCTTTGGTGTTTTCTGAGTGTCTAGTTT 

TATTATTGCAAATAAATATTGATGCCAAGAGCGATGAGCAAGTGGCCACC^ 

ATC CTAGTCAAATTTGGATCTTCATAATG AT CACACAAAG C CAT ATG CTAGG CGT AATGT AAGTT AGAAACAAAACAAACAACAAA 

CCTATTTTCTATGGCIACCGACAAGGCCTTGTAAATAGATC^ 

ACTTCAGACTCTTTATTGTCTTATGCATTTTTTTTCT 

4 5 ATT ACAAACAT CACTAAATGTATGGAT ATACTCTACACAAGAGG C!ATTCAT ACTGTT CTGG 

TTATGATTTCCCTCTGTATGAGGAGCCTCGCCTTAGGTTAGAATAAAT^ 

ACTGTGCATTGTTTC^TTGTGAAATCTAAACCATTCTCTATGACATAGGTATAAATGT^ 

CTTAACAGGAGACAACCCCAACTGAGCATGTGCACAACTTC^ 

AAATCC^TTTAATAAAGGTAGTTATCTATGAGTGGAAAATAGTTATAGTCATCTGTTCTG 

5 0 CTCTTT AGTT CT GTGCTGTT CTACAAGAAATAATT CAT ATTTTACATTTCAAATG CAAT ATGT GGATTT CAT ATTAAAAGACT CAT 

ACCAGAGGTATTAGATCTCTGGAGATGGAGTTTCAGATGACCAT 
TAAAACAATGGGGTGGACTCTTAACCACTAAGTCATCTCCCTAT^ 

GTTCCTGCACAAACAGCATCAACTTCAAGTCTTCCTTTTTGTATTTTATTATCTTCTTCCTTT 

5 5 AAACT AT AG ATT CAGTT AGGATTT CATT AG ATT CCAAAGGGTT TCTG ATAG CATAAAGT ATAAAC CTT AG ATCTGTCT ACACT CCT 

TCCTTTATGGTTTAAAAGTAATATCC!AACTATTCTGATTGTTCCAACC 

CTGTTAGGTATTAATATAACCCAAGGTATTAACCTAATTCATAAGTGTAAGATGTGACTTGGGTAAGAGAAGAATAA^ 
TTTTTA TlT Tl T GTGAGCAAGGGAACAGCCr^^ 
TTATTGGTTATTGCCCATGCTAGAGTGTTGCGTATTGATTTTGT^ 
60 TTCAAGTATGTGTTC^GTGGATTTGCTTGTAGAGTTCTAA^ 

GAAAAAAGCAGATACCTCTGAGGGCAGGAAAAATAAAATTTATTTTCAAATCT 

GCCAGGAATT CTGTAATCG CTTTGTAGTGAG TT AG CTG ATG CCT AAG ATTTGAAATTTCATTTACATGTTT CTT ATAAAAAT TAAT 

GGTTTAAGCACCACAAAATGCTTAGTTTTATOTTGAGAAAATCTCACTTGTCTTTCAGAGTGGT^ 

GGATCTGTAACAC^AAGGAAACTGAGCTGCACAGTGTGTTGATATTTGGGGTGTAGTAGAGATAGAAAACTAAGCT 

6 5 TTGG GG ATTAGATGAG AT AG TTTGTTT ACTTACAGGATT ATTT C CrGTTTCATAATTTTTT ATAAAATGGAACAG AT AGGG CTAAT 

ATTTACTTATCTTCTTTGATATTCGTAAACTGATTTTCCA 

GTAGGTGCACATCCACATTGTGCAAACCCACACATAT^ 

TTGCTGAGGGATGGTCTXSTCACTGAACCTAGAACrrTTCTGATT^ 

GTGCACGTGAAATAAAGACACATAGATACCTATTCCCCTCAACAGAGCCTCCTCCATAGCTCCCAAA 
70 CCITCTATAC1ACACAC!ACATATATGTATACACACIACAC^ 

ATATATGTGT AT AT ACATACACAT ACACACACACACCTAC CTT AG ACTG CCACT CTG CTAATGAACGCAATT 

TTTTATTCTAACAACTrCACTTGGCTCCAATTCATC 

AAGAACCATTTTCAGAAGCCAAGTTTCTCTAGATAACATCA^ 

GCTCTCTCTGAATAAAGGCATGCTTTGTTAATCCTAAGTGACATTATG^ 

7 5 TTGGTTGAACTGAAGCAGAACACTTGCCTAACAGTAACACGTGCCAGGGCATTCT 
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TTGACAACTGTATAAATTTTGAAAATGTGACCAAATTTTAGAATG CGTATT AAATATTTAC CT (^GTTGT CCTCATTGGGATTTTA 
GCAGTAGACTrGTGATAGTGTGTATTAAAGGTAGAGA 
TCACCTTCCAAGGAAGCCATtJTTTATAATGAGT^ 
GGTTAAAAGGCAAGTCTGAAACTACATGCACITTG 
5 ACCCTGTTTAATGCATTTAAAATTGATGAATAGOTCATGAAACCCAG 

GGGTCGTCATGTTGGAACTATGGAGATTTTACTTCGTGAAjGT^ ct agtaacat aaataaaac catatcag cataa 

GCAGAAAGATAAAGTGTTTTAGCTGTACAGAGTTATTGTAA^ 
GTCTTGATCGATGGAACGTCGTTTGCTTATGGGTTTAGAGATCT 
TTCACCTACACTAGTGCrTTGCGTGAGCTTGATG 
1 0 G CAATCTCAGTT CT AAGCTGTGACCCACTATTAAGCCAATTTG AGGAAT AG CAATAAATCAATACGGACTCATAGAAAAT ACAGAG 

ATTGCAGCGAGCTGTCAAATTATAAAGGCTCTTAGCTTTT^ 
TJirATAGCCTACCCTATTTTTATTTTACTAGAAAAGATACTAGTA^ 
CTGCATGAAGCCCTTATCAGAGTAACTAATAGAAACAATTCATTGAGTAG 

AAAACAAAACACCTT AAGATTGAAAACCTTGAAGGTAGAGAAG AATCT TGG TTT ATGTTTAAGGTTTGCT ATT CTTT 

1 5 GTACTTAGATTATTCXSTGTGGTTTTTAATGGTTTTTCrTAGTGCCT 

AAGTGAACCACAGACTG CTGCAAGT ATTAATT AG AAGAAGCT^ CCATGTGGCACACC CUT'l'l'T ACATATTACCGT 

AACAAGGAGTGGAAAGGGACAGGCTTAATTATGT^ 

CAAATAAAAAACCTGATGATACTGTCTTCATTTTATGAGA^ 

CT AGC TGAGAATTTTTTAAAT AATCTGAT ACCITGTGTGTT AGTAGTGTCA 
2 0 TTCTCCAAACAAATTTATGCTTACATATAGATACATAC^^ 

ATGTCCACATTATAAGTTTTTAAT ATCAG CTGTGAT ATCT AGAATGTTAAATT CTACATGAT CAGTAGATT CTT AAAG AATG CCAT 

TTTGATTTCATCAAACTGTAACCTGCTTGAATAACCTCCTCCTAAAAGCCCTGACATTGTGATAGC 

TAGTGTGGGTATTTCTTrCrTCCACATTCCTTAGATATTATGTATTATA^ 

ACCTATTTTTCCTCATAAGCTCTAAAGGCCTTGAACTTTA^ 

2 5 TGATAATTCATTGAATTGAAAG CAATAT TATTT C CTAATCATGGC CAAT AATTTTT AG CTG ATCAAAAATG ATTTTAAATT CAGTT 

TTAGGGAAATCATTGTTCTGCAGTTGTCAATCTCTCCTCrAAGGAGCATAGCT 
ATATATAGTTCTCTGCAAGGGTTCATGTACCTCT 

GATCAGGCCCCTTTGGGGGGGGGGGGTGCATGCTATTGTTTATGTTATCri^ 
TCTGTAGATCGATACATTGCTCIATTGCAGTGGCATGATTTAGTTATTTGGGTAAAAATATTAG 

3 0 GAGGAACTGTCGGGCTTTCATACAGCACAATGTTTA^ 

T CACAACIATAGG AACACT ATGTT<1AGAATAATGTG AACAC CITT AAT AGTTTTGTTACIATT AT ATGATT AACATCT AG AAT CC AGA 

CTACAGTTATATCACTGTTGCCTTCAGTACAGAATAAATAACACTAAAAAG 

GCTATGCATAAAATT CTTCCATATTAT AATAAT AT AAAAA 

TT ACATTTG AAGTTCAAGGAAG AAC CCMAG ACATAG ACTOT CTT AACCTGCTCACAG CCCT 
35 TTAGAGAGGTGTGATGCTAGAAGTCZATGCrrGTGTATGTCATGAGAGTTTCTAGAGAGAAAAACCCT 
GCAOTTCTGCACTCAAGCATCTGACTCTGGAGCATCTAtZAGTTATCCACTTTC 
CACTTGAGGCTTATTAAGGGTTAGAGTGGAATAATTATTTATGTAGCTGGCG^ 

TGGGCACTTC CTG CC T CTTGTCAAGGC CT CTCACAG'i u ri"rr ATCT CCCT AAGAATAAGT CTTG CTGTT CCTTGGTTGTT CATGCTT 
CCTGCTTTCTTACTGTGTATATGATCAGAGGAGATAAATCT 

4 0 GCTGTGCAG CAGTTG T CT AC CT CAGAT CAATGAAC C TTTGTT CT C TT13GTGCT CAAAGGAGATG AGTGTGG ACT AGG C CTT CAG AC 

TAC1ATCCATTTCAGTTGTTGCTAATTATGGAGCCAG 
AGTGGTCGATCTTGGAATGCraCTCTTC^TCAGGCTGTGAG 
TATTAACTCTAACAGAAAATTTAGTCAAATCTAGTTTTAAATTCT^ 
GTTTCTTCCTAAGAGGCCCTCTTAGTCCCTAATCTGCAGTTTCTC 

4 5 AGAAATATTTTAAACATTTTAAAATAT AATTTTT 

TATGTCTCACTCCCTGTGTAAATTCATGGCCTCCITTTCTTTTACTC^ 

CAT AAACATG ACAT ATGTAGT CTGTATAATAGT AC C TTTT ACTTGTG TGT ATAT ACATT CAGGTCTGCTT AGAACAGAAGAAAT AA 
TCTACCCGAGAGAAGGTACCAATTt^TTATGTAATACTAAATGGTCAGCCCTGAA 

ACTTTT ATTGTGT TATTTT ACTGT AT AAAATTCG G CAAAAGAG AGACACGG CAAAACttt ututt CTT CCACT AACT AT CT CTGTG 
50 AGGTTTTTAACTCAGTCCTGTCTGGAAACTTATAAAAGCAAAGGAT^ 
GCACAGGATGGCTGCAAATAGATACCCTTTGACTTGTC 
CTCTCTCCTCATGCCCAAGCTTTCCTCC^TTTGTATGATTAAAA 

T ATTGAAGAAATAATG ATTTCG AAGACAGTGTCCAAGC T ATG AAAAC AGTC^ CTAAAAATATTTT AATTG AAC 
AT ATT T AAGT AAGTTG G AG TAG CACAGTG ACAAAAATCAT AT ATGT AT AAT CTGTTCTC^GTCTGTACTGTTGGACTGAAAAATAC 

5 5 ATTC ATTTG ACTAATGAGATTGT CTGAAT CATTTT ACAAG CTAG ACAG CAAT ATT ATGGATTGATTTTTGTCTTTAAT CAG ATTTG 

TG AGTTTTTCTTCCT AATG CAG AATACTGAAAACATGACCITT CCCTTACCATTT AT AAGC AT T AGGAGTGT AAAT CAAT AAT T AA 
ATTCACCCCATAAAATATTTTGGGTAATACAATTGGGACTTTTAA 
AC CTTT CAT ATAAGTGTGATAAGTX3T ATCAGTGT AATGAATCACATC 
{^GereTGGGCTTCTGAAAACCAGATTTAGTCAOU^TCAGCAACA 

6 0 TTCACTTTAGCTTTCTGATCACTGGTTGGTTTTCTTTAGTATCATA 

GTGTGTGTGTGTGTGTGTGTGTGTTTGTGTGTGTGTGTGATTTACACAGGTCAGCCTCTAACAAAC^ 
GCTGTATCTAGTATAACAATTCTTCCTTCCTGTTTTCCCCTA^ 

C AAC T AACTG CTTCT AT AT AGCG TTTCT ATTiTC'riT ACT CTTTG C^TACT AGAAAATGAATCCAAAGT TGAGTTT ACTTCT AATG 
CCATTTGGTTTTGTGAGATAATCCTTCTTACAAGGTCCTGTATACTTGTTATTTCATCATAGCA 

6 5 CCACT AAG GAT ACTG AAAT CTGT CATTTCTTCTTGACTCTTGGGAAAAG AC CAAGGGTTGGTTGGAC C CAATGTTTTT ACTT CATT 

GTTGATC ATtSTTCTAG AAC C7VTG AT ATTTTTTAAGG ATTACAAAGGT TT AACTTT TG AT CATGCAGTTCCTCCTATGTT ACT ACT A 
GAAAATTTCTAAACAAATGCTCTTCTTCIAGTTTTATGAGACACT 

TAT CAACATTTAAAT CAAAT AT CTTCTAAT AT AAGAT CT AAACAAATTT ACACAT AC CTTCAAATTATT CT AGG TT ATG ATGCTTC 
TTAGTGGGTAATGGGCTATCTTGCTAACCACAGGACTGGACCACAGACCTGGAGCCT 

7 0 GGCATCCCAGGCACAGCCGAACCCAGAGCTGGCGGGTCAG^ 

ATTCTAATATATTTCAAGACTTTTCTTTTGGGAAAAATCTTTGAATTCCTTCTCCTGATCT 
CAATGCATTTCCATCATATCAGGAGTCGTCrGACACTrCGTTTCTC 
CTCTTTTTTTTTCCCCATTTGGGTTTGTTTTGTGTTCATGATGCAAA 
AATCTGGTTTTCCTTATTCTAATATTTGGTTTCATATATTAAA^ 
7 5 TAATATCATGACAGTCAGAGAATTGGGGGTTGGTGAX3TAACTCCCACTGACCCAGAGAAACA 
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TGTTACAGTAAAG<3AAGTTTCATTTAAACTAATAACT^^ 

T AATGCAAATGGG7UVGAGCCCACGAGGCTTCCATGT AGTGAT AT CTGAGCTT CAGCTGG CTTCTAG CCTT AGG AAAT AATGAAGAA 
ATGTTAGTGGTTTTGTTATCTCTrATAATGTCT 

AAACACATAGTGTGCTATGGAAAT CTGGG CTT AGCATTTATG ATTCTGTTTTGT^ 
5 TATTACTGTTTTAGGTGATT CATCTGTAGTTGATTAT AC CAAGGTG AT CTTGTG CCT ACTTAGAG AGCT AGTTTGTAGG GTCAGCTC 

TCTTCTCCAATGTTCAGTAGACTAGACAATGGCAAATTCTGGCACT 
AGCAGCCCCGGX3GATGTTCTGTCGATTTTAGCATGTA 

CTTTACTTATGG TTAGGTACTATAATCCAGCTGAGGTTAAAATGAAAGATAATTTCTTGGCA CAATAGGAAC 
, ATAGACAATATAGAAGCATC^CATTGATTTTTTTA^ 
10 TTTTAAATCAAATGGTATTTGTATCCTTGGCTGGOT 

GTCTCTGG TAACACCAAGGATTGGCTGAATTTAAAGTTAAAG ATCT 

TTACCTGGAGGAAATATTTTCATATGAATAGCACATAAGTTCATGACAAAGTTCACT 

TGGAAATGGAAGGCTAGTCCCAAAGAGACAGGAGTTCCAAAAACAACT 

GAGAAGAGCTGATAGACTACAACAG<3AAGACTAGTCACTTGTGGATGGAAAAGGTG 
1 5 CAGGGTGTTGTCTAATATGCACATC CCAAC CCCCTGGGAG CTGCACT AG CAAAAC CAGGTAATGCCACAACACGAAAGACACAAAT 

ATAAAGTITTCAAAATCAGTGACrGGATTGAAAGCAGG^ 

TGGTGOTCCTCAGCAGAGCTGAGCCTCTTAGCTCAGGCTGCT 

CACGCTGAGGCTCCGTTCAGATATATATTGCMG^ 

TTG C TTAT AGAGAACATTT ATCCACAACTT AAGAAGT ATAGGT ATC CAAAT ATT C CAGT AG CTATTT CT GTGAGCTTTT AAAAG TA 

2 0 CTAGTAGTCATTACCTGTATATTTTATAAAAACGGTTCTCTATAACTGGAAAGTAGACCT 

CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCrrCTCTCTCCCTCCCTCCCTCTCTCTCTCTCT 

TTTATTG AAATTCTATT AATTTT CCATTCATAGTAAAAGGAATAGGG^ CT CCTCACTTGT 

CAAATGTGAATAATAAGATG CCTGACT CAAAAACAGG CAACAAGAATAG AATGACCTGGATG CAGTAATATAAP^^ 

TGCT ATGTTTTTATTAGTATACT CTTT AGCCAAG T ATGGC CCAACTGACTTAAATG ACGT ATTT CT AGCTTACAAG GAAG GTGGGT 

G AACTTGTTCATCTCTT AG T AATTCTT ATAAATTATGTCAAATATTTTT AATATT A CTG CTCTATTTGG AAAAAT A 

TACCTTTGTCAGTTAAAAGTAAAAACAAGTGATGCATATTCAAATAAGAT^ 

CTTGTTCTGTATAGCTTTATGGAATATATAATTTTGTATCATTGTGCTC^ 

TTTTT AAC CAATT ACAAT ACTTCTGTTGTT CATGAGCC CAAAATTTT CTGAGATAAATTCT AAC CTCT AACAACATAAC CAGGTAA 

3 0 TACAAACACTGAAAGTAGACTGAAACAACCAACT 

AAACTGTACAGACAAATGCTATCTCCX3ACAC 

GGACTTTATTTAAAGCATAGAATTGTGAGGACAGATGACTC^ 

GGTTCCCACACAACTGACTCCCTATTAAATATTGGGGGAATACTTTGTTTTC^ 

ACTAGTGTGCCCTTTACTTGTTATAATTGTTATTAATACATTTGTTTCCATATGAAGCCTATTCAAAGAATTAAGCAGAGGGAATC 

3 5 GGCCCAGTCATAGCTC^TGAATATAAGAGGGGCTTAGCTTATTTGTACACAGATACT 

G TGTGTGTG AGTGAAAT ATT TACATAACT AGTT ACTTGAG CTTGAAT CTT AACACTT TCACTG^ CCCAG CATCAGAAATGACTATA 
TGGCGATTGAACAAGTTCTGGATATGCTGAAAAAC CAT AATTTAT ATGT TTGT ATG ACAG CATAAATAACTG ATG GGTT CTCCTAG 
TTATCACTCACTAATGCCAAGATAATAAGATAATTTGTTTAGGGGGTTGGATA 

AT CATG AGAT ATGTCTT ACAAATAAATCCTTT AATCT AT CTTATG CAAAAAAAAAATCT AT GCAGTTACTTT ATTT C CCTT ACTGA 

4 0 CAATATGGAAGAATATACTCTTTAGCATATT^ 

ATTAGCATGAATAAATCAAGC^CATTTTATGTGATATCAGCACC^ 

T T TG ATTTT AG AAATAAT TG AG AAT AAATAAACTCTT ACATGGTATGCCATTTGAAGGGAG TCC ATG TAG G CAC CAAAAT AAGTTG 

TTGGTAAATGCTGCCCAGGGGCTGCATTTCATAAAGCTTATGGCTAAATTATAAC 

TAAAGTTTCGATAGATTCTCCTCCTTAGTTTTGCCn'AACCAAGCAGTAGCT 

4 5 GAAAGAAACAGGTTGG CT TCTGT ATTATTTTTGGCTT AGCATTAAGG AC CTGTCTACATGATTTGAACTrG TCTGG TCTGACAGTCT 

TATGTCCTCAGCAGCTCATTTAAGGAAGGTTACTTCAGCTGCTGA 

ATGTCATGGG ATGAC C CTT AAATTTCTT AAG TT CCAT CTGAACTCAT AATTATGGTCTG AG C CTTGTATT T AATGAACTGCATGGG 

GGGGC CGT CGAAGT CAGGGT CT ATTTTG AAAAATGTG ACAACT AG AT ATGT TCTTGA G CATAAAACAT 

50 GTGAGAGAAATTTGTCCCTCTGTGTAGACTTATTCCTAGGTCCTT^ 
CGCTCTGAAGAGCTCCAGTACTCAGTTTTCAAGAGTCTACAGA 
TTTCCTGCTTTTCAAGGATTATGACATTAGX3TGACACTTCTO 
AACATGCCAATAGTCTTAGTTCACAACTTTGTAGTTTCACCAATAAGGCC^ 

T CATAATT CT CT AAT AAGTCTGTGGT AGAAAT ATGTTAAT CT TTTTCATTT C CT CTTGAAGTTCTT AAAT TCCT AAG AAATAAATC 

5 5 TAGGTATCTTTTCTATTTCTAGGATTGAATTTCTAGCAAATCATCTAGATACCT^ 

T CATATT CTGGTGAAAGGAAATGATCAGTAACATCAGAATG GAAAT ACAGT ATG GGAAC^ CATTGCT CATTTTG TAAATG 

TTGCAAGAACTGATATCTCCTCATOTCGTCGGAGTGTTGTACAGATTATGAAAACAGTAACCCTTGAGAACTTTCTA 

GGCAGATAAGAGTAGATTTTAAAATTGAGCAAAATCTCAGGGTAGCAATTA^ 

AGG GCAGTTAATTTG CTT ACT AGG AATGAGAAGGAAAT AAATT ATG AAAGG G CT AGTG CAGT TG CCATG AGAT AAACCTTAGAGAG 

6 0 CAGTTTTCTGTATAGAATTGGAAACTGAGTAT CTCATGGT AACAGTGTTTAT AT CACTTG CT TG GAG AGCAGATTTAGGT ATGATC 

ACAGGACGA(ZACCTGTCITGGCACTTGGTGTAGGATAAGTGAAAACACATGG 

TTGGGCTAGC^TCTTACTG^TGTGGATCTGACAGAACTCAAG CTACTGCTGCCACCCAC TGCTCCTGATACCGGGGAATGG AAGC 

ATGGCTGACCAGGAGACCCCTAACTCTCAGTGCCATA 

ATGGCACCTTGGCAAGCAAAAACTGTATTATCCATTGAAGATGAAGCATC 

6 5 AACTGGAAGCTAATTAACTAAGCTAAAGTGAGGCAGGCTAGTTGTGCAGAATTCCATGGGA 

TGGGAGCACGTTCCACACTTTAGTGTCCTCATCCGTAGCCAGCTTATGTTTCATGAGCACTGCTG^ 
TGATTCCCLAAACTGATGCACAGTTCAGCAAATG{X3GTATAGATTTCCATA 
GAGTCACAGTTATACTTTCATAGATGAAAGTAATTATTTTTTTCATTTCT 
ACATTTTAAATGAAATTAATTGATGTGCATGATTTTTGCCAGAAATGTTATTTTTAGACAGAGA 
70 TGATATATGCTTGTATGAATCAGATAGTTGTACAGGCrGTACCTTACTATTGTATTTATGCAT^ 
TTTCTATCTTCAAAATAGTTTATTTTTTATATAAAATTATATTTACCATAATTCATC^ 

TGTGTTCATTTCAT TAATTGGAAAAAAT ACT AATTTAT AAGG CTACAG ACT ATAG ACTGCAAGTTTT CCT AGCAT AATCAT C CTGT 

ATTGATAATGTTTAAGATATGGTCTTTAAAAATCTGTAATAATTACATTTGAGAACTT 

AGAAAACATGCTGAAATTCITGTTGGCCCTTTAGGCTTTATG 

7 5 GGT ATT ATGT AGGTGATGCTGGGATATAGT TTTTG ATGAATTTGTTAGG AGT ATTTAATG CTT AAC ATT TATT CAAACATCTCAT A 
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TGCTAACTTATTTACTAGC^CATTTTATTTACACAAAACAA^ 
TTTATAAATAAATACATTAACITTCCCATTTTAGATCAC^ 

AAAAATAT AGGTGCAGT AATCCAAGT AAAATTGATCTGTTG CACATTATGGAAG CGATTG CCCTC CCTC 

CTCCAGACATAAGGGTCCAACCAjCTCTTTTTTAA^ 
5 AATTTAAGTGTACTGAAGCAGGCTTGCTACAGGGAAACATAGAGGCTrTG 
AATCTTGGAGATCCACACrcATCTCCAGAAGTCATAAACAAGTC 

ATGCCATGTGCTCCCACATGCCATGT ACACG CACTCGTGTGTGCAAACACACACACACACACACAC^ CT AAAA 
ATGGAAATTGAGATTACTTATCCTACTCCCCTTTATAAAAATCT 
GACTAATTTGTGTTGTTAGTTTTATTTTACrTTATA^ 
1 0 CTCAACTAATGAGGTTCATTTCATTTATTCTTAACT 

TTTTAAATACTCATTAAATTACATGGGTTTTAATGTACCAACTAA 

G ATATTGAAGTGAAGAGTTACTCACACGGACCTG CTA CTGTGTCCAAATGTCTAAAACAAG T 

CTAAAATACTCTGTATAGTTAACTTGCTTCTAATTTTAAAAACGTCAGGATGG 
GTTAGATAGAGTTAGCAGCTTCTCAGTGCTTATGCTTGGTTCCCAAGTTGCAAACAGAGTC^ 
1 5 CTAGCTCTCGCAGTTCATAGTAAGCGGCTCA^ 

TCTCTGCACCTTCCTGGGAAGAGCTGCGTTGTGAGGCCCAGGC^ 

GGCACACTTGATTCCCCTCTGGTTTTTCCTCCTTTCITAGTTCCATCTCTT 

CCTCTTTTCTrATCAAATTCTTTCCTTCATCAAACT 

ATGTGTTGCACAATT AGAAAGT AATT ATTTTT CT CAAG AAG ATTTT TAAAAAG AAACT AAT TG AGTTT AATCTCTGTTCTGTG T CT 
2 0 ATTTTCTCTTCCAATTCCAGACAGCTTTGGCTrTTTACCT 
CATTAATTTAATATTAAGGTTGCAATGACAGAGTATC<3 
TATAACATGTGTCCCTCACTTTTCTTTCrAGA^ 

GCTGCCCCCTTGTATTTTGTTGCACTTCTCCATTGCITCTTGTGTTCTG 
GCTCTGTGCTGGTTGCTGCTCATCTCCGAGGGTCTTCT 

2 5 CGTGGTCCCTTGTTGGCTGCGTCTAAACCCTTGTGAGGGTG<3 

TACCACAGAGTCCTCTACACTTCCTTTCAGTTGCTAA 

AAAACTTTCCTAGCCTCGACACTCAGTGGTAGCAGCAGGTGTTGGGATTT CTC CAACTTCTAAGGCCTTGGGAGGAAGAGAGTGGC 

TGTTGACACAGACCTCAAGGATTTTCAGAGTACACCA 

AAGCAGTTTCAGAAGCATAGAAAACAAAACCTCrrAAGCATGGGT^ 

3 0 AACATAGACAAAACTTGTCCTGCATTGGGCCTCTACTTACTGCAAATACTTCTTC 

TTCTATTCTATTTTTACCCCTATTAAATAATTGGCATTN^ 
AGTAATTGTACTCAATGATTTGAATTGGAATTTTG^ 
GTTTTTGAGGTTTAAAATATTAAAGGATAATGTAGCAATCCTTTATTT^ 
ATCTAATACCAAAAACTTATTAAATGAGTGTTTCOTTACATTCTGCCCAGGAAGCAGAA 

3 5 AGAGCACACACTTGTGCCTTGTTGGCATTC 

CTTTCTCrCCCTTTCTCTCTTCCCTTCCCTCTTTTATTATTGT^ 

AACACCCACCCTTCTGAAATTTACTCTTACAGTCCAGCCAGCCCCAG CACTTCGTGATCTTCTGTCTCTCAGTCTG CAGGGCTGGG 

ATCACAAGTGGGATAGAGTGTGGGCCTGTTTACAGTGTTCCT 

GCTAGTGTTCrcrGTTTCTGAGTTGTGGGAGGGCAGCAGTA 

4 0 CTAACTCGTTTCTAACTTACTCTTCACTGTGCACTTCACAG 

TT GATAGCATCTTTTTACCTT CITTGATTTTTTT TGAAGT ATG AAC TTAAT AT CTTAAC CAATGCT CTAT CATTG AAATT AAGGCT 
CTTGTTTAGAAAACCAACCTAAATTTAGTACTAAAATTAAAACATATACTTACCAATA^ 
GTATTTATTTCTGTTCTGTCCTATGTTATATTCCCTTTAGACTGCAAAGAATAACTAAA 
GTTAAAATGCATATTTGGTTTCTTATGGAAAAATAG 

4 5 TTTAGTCA.TTAAATAAAACAAAAAGACTTGATATGAAGTCCAAGGCT 

CCGCTTACATTTCTCCAGAAATCTTATTGTTCCTTAGGATGAGAAATCAGATGAATATAAGAATGTGAACTCT 
AACTTGCAAATTAATTATGAATGATAATGATTG<ZAGAAGGAGCCTAGTT^ 

TGGCCCTGCTCTCAGAATAAGGCCCAGTTAGAGTTTCTGCATAACTCAGGTCAGCTTGAAACTTGTGCT 

CT C TTC CTTTGTG CAGCAGAAAC CT CCAAACAG AGGGGAAT AGGCAGAATATTT ACAGTAGATG CTTT AATTAAGCAAAT^ 

5 0 TCTTATTATTT AGTTT ATG TT ATGTAAT AAG AAAGTCAAAG TG TG CAACCTTCACT AG ACATG C CCATGT AGAAAAG GAAG AAAGG 

AGAGACCTGCX3TTTGAT(^GAGCAACT 

GTTTT AAGGCT CAAAAGC AAAGCATrr CTATGTATTAT CCATCGTGTGTGGAACTAAAAGCT TATTG GTT AGTTTTACAG AATT AA 
AATAATCTTTTAGTTAATTTTCATATTTTATAAATTGCCTACCTTAG 

ATGAGAGGGAGAAGCGAGGGCTGGGGTGGAGGGGTAATGGGAGATCATTTTATATCTGTAAGACATTTTCA.GATC 

5 5 TATAACAGAAAGGTACTOVAGATACTATTTTTATC^ 

TCTTCACTCCAGGGCTCACTGACGCCAGAAAGTGTAGCTGCAGAAGCTGTTGGC!AGGATCATTGATC 
TAGTGCCCCCTTCTCTCCTCATCCTTGGAATAGCACAGTGCCATCAAGG^ 

CT TATT CTGTATCTG GCTTTGTGACCTTGGG AT ATTTACTT ACAGT CTTTTT CAGCTGT AGT CTCACT CT CTCT AAT ATGTGAAT A 
ACATTATC T CCTTGT CAGAGTTTTTATGAAG ATTTAATTT AATG AAACAGCTTGTG CCAGTACT T AGCACAGTGGCT AG CAT AGG A 

6 0 AAATGGCGATTGTGTGGATGTTACTATGTAATTTTCCCAAGGTCA 

TCTCTGAATAAAATAGTAAATAAAACGGGAAAAGATACAAGTACATGCACACCCAGTATG 

TACTTC CAACT AAT AAAAGATATT AAATG ATACGTATG CAAAC CACAGG CTCAAAT AACTACTCG CTTTT ATTAACAGTTG ACC T A 

GATATAAACACCTCTTTAAAAGCATAGTGTCTTTATGAGATTO 

CCATTCTCTAAGAACCAAAAAGTTGTTATGTCATGACATAGACTGTCIAGTGGGAAGGG 

AGT TG CACAG ATGAT AGAATAT T CTT ATGAT CTT T ATTCT CTGAAG CAGAGCAGAAACTAGCAGTGTCTGCTCGTCCTGAAGAGAG 

ACTCATCAAAGCTGAAATGTCAGGATTTGGCTTTGAATGCGATGGATTCAA^ 

CTGCTCTTTGACATCCTTTACTAGACTTGGAAAAGCT^ 

GTCTAGGAAAGAAATGGACATT CAACAT ATG CTT AATT ACATCAAAGGAAATGTATTG CAG AACAG AGT AAGAAGACT AACGTTAG 

7 0 GGT CT AATG GCAC CAAAGGGATTAGAAAG CAAGGACT CAAGACT CAGT CTAAATGC AGGGG CAAATT AGGG CAGGCG CTTCAG CGT 

GGGGAGCTCTGTGTTTGTATTTTTTGGTGCCrcGCTAAGGATGTAGAAGAACTAA 
AATGTTCAGCATCACTCATGAGCACACAGTGCAATTGGAGAATAGAGAGGTAAAGGGACTGACCT 
TGCCGAAAAGTATTTCTGGCCTTAGAACTCAATTTGGTGCCCAGAGTATCGAGGGAAAAACTAT^ 
ATCTTCATTAAGAAATTCTGAAAATATGGTTTAAATGTACAATATTGTAAATTC 
7 5 TCGATGACAG AG AAAAAGTAT AAT TCTAAAAAG CTGAAAT ACT TAAT AGGGAATTATTGG AATAG CATTG AT ATTTGTTTTGTAAT 
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AAAACTCTCACTGATCTTTTTATGTTTATTC^ 
AAAGAACAAAGGAAATTCCTTCATTTT^ 

GGGCTGCTCCCCTTCCCAGGAGGCTGTAATCACACCCTTCACTTTCTGTT^ 
GCATCATTCACTGACACCAra 

5 GGATCCAAATCCT AG CGAGTGTGTGCTGTCACCCG AAAAGAGCATATTTAG AAAAT AT CAGATGAGAGT AAAAAT CAAACCACATG 
AGCAAATGTTCCAGGAACATTTGTATCTTAATTTGAAA 
ACTTTCCAGAATATCATCAGAACAAACTTAAATTGACTGCA^ 
AAACAAAGAACrGAGTGTGAATGAAGCAAACTTAGAACTAAC^ 
TTATTGATGCAAATGTGGCTGTATTTAGGCCITTCCATTT^ 
1 0 CAAGGCTGTCTTACTTCACTTAACCCATGGTGAGTTT^ 
TGGCAGTTTGCTATTTGTGTTTTAACTTGGCTATAAT 
TCTTAAAOTCTTTCACATGTGAAAA 

TGG CAGTTTAATTTGCTTTATTCTGC TTTGTTATAAA CATGGT ATAAATAAAATCAGTGAGGC CT CTT ACT AAGGG 
G AATTTTGT CATGAG AAGAAAAAGAATTTTGT CATGAGAAGG AGAAGGT CAACATATTCTAAAGTGAAT ATAAACATTT AGAG TAT 
15 AT AAGATATAAATT ACAGAACTTTT ATG CCATTT AT AACAGTT AGAAATGT TT AGCCTTGTGGTTAAGAAT ATCAGTTCCACGG C C 
AGAGAAGCCTGGAAACC CTGGATTTACCATGTAATG GTTGTGC CGC CTTGGGAAGTTACTTG ATTGTGTTACG TT AGGGAACACCA 
GATTTTCAGTCTTTGCAATGGGTATAATCCTACACGATAGGATTCTCT 
CTAGAATACTCCAAATACTGACTAAGAGGTTGTTTTCTTCT 

TGAAAAAAAAATGAACAT AAT AAAAG CTCAACCTGT CTAGAAGT CAGAC CTGTACAGTAG CATG AGTT CAT CAT CACGGG GGAG AG 
2 0 AGACAGAAATCAGGCCAAGTCTGTGACAGCGCT^ 
TTACTCAAGGTCAGCCCTGCAAGCTTACATCA 
CACCTATCACAAGCCAjGATGCATTAATT^ 

TTTTAT AGATTTTTTTTTGTTGTTTTTC^ TGGCAT AAG CATGGTGACT ATCACACTTACT ACAT AT ATAG G 
ACTACCACTATTTTTC!ATGGAGTAACTTGAGTGTTAGTT^ 

2 5 TCTGCAGTAATATTTTGAAGTTTGCCTAGGCTATTTGG 

GGACAACTTGGTGCAGTGGGTGTGGAACATGGGTCATCCAGCCT 

ATGCITCATAGAATTCAACTAATAAAAAACACTGTTA^ 

3 0 AAAGGG CAT CACAG C CCAGAGGT AGTT AACTTAGAACCATTCTAGAACAAAAATTATTTT AAAAAT ATTT^ 

AACATTTAAAAT AAACAATAAAAATTTGAAG CTGGGCTT AGTGGCAGT ACTGTAAT C CCAACT ATGTGTGAGGTTGAGGT AGGAAA 

TT CAAAG CCTGCATTGGT CT ACAGAGCAG ATT CTCCAG C CTG AT CAACCT AGCCIAAAAG CCTTGTATCAAAGT^ 

TTTT TT AAGTGTTGAGGTTAG ACAATGATTGTTTAGTAT ATT CAAACCCT AG AT ATG ATAC CT ACAAC C CCT ACAG ACACATAT AA 

CAACAAAAAAAGAAAGAAGATAATAACCCAGAGACTTGATTATTTTAAAACT 

3 5 CCTTACGTTGTATTGAAATTATCCTTCTAATAATATTTAAATTA 

TTTTCAATAGTTTTTTTTTCTGCTAGTAGTTCATATATGCT 
TATTTGGAGTTTTTGAAGGAATCCAAGAGCTTTAGAGTGAAGGAGACTGGAAGT 
TGGAGTGAGTTCrCCAGCTTTCTAAGGGAGTTTGGGTTCTTCAGAGTGCTGCCTTAAT 
TTCTITACCCAAGCTGCTTAGAGCTAATGATCTACCCCATACAGCTTGTAAA 

4 0 TGGAGGTTTAGAAGCAGGTTCAAGCCAGTTTTTCCCCTCAGCTTTACCATT^ 

TTAGAATTTG CAATGT AAATTTTCCAC CTGTTTAGAATAAGG AG AAGACCT CATAT ATG 

TTTTTAAAAGAAGACAGGTGATATTTAAAATTTGGTCCTAAGGAATTTCACAGACAGAGCCATCTGCC 

TTTT AGCCAACACTAGAGAATGTAG GATGT AGTTT C CT AAAATTG CTG GATACGCCAAATGTCTGTT AATGAAT AAATGAAT AAGC 
AAGT AAAAATTTATCTGGGTGGACAAATGAAT AACT AAATTT ATTAGTTG TGTGTT CAT ATG CTTCT TTATT ATTTTT AGG AAAAG 
4 5 AACTCAGCTTCAAGTGCTGCAAAATTTATTGAAA 

AGGGCCAGGGATGTTTGATTGATGGTGCTTCTGTAGGTGGAATTATCCTGC^^ 

AGATCCTAAGCAAGGGGGCTATTTTGAGGAAAGGAAATACTAGTTCGAGTGATCACAGCCTAAGAATAACC 

AT AAGATCACACTGGTTGAAAAGGT ATT AATGTG CTGTGTCCTGTTT CATTTGGTGTTGGTGGCAATTG TTAGT AAAGGAATACCA 
AG AGCAATACTTTTT AAG CACAGGT CGAGATT ACAG ATT AAGT ATT G CTGTTTAGAGAGTTGGGCTTATTGAGCAT CTT AG GACGT 
50 TGTTGAAATAAGACTGACTGCAGAAAGGAACCAAGGTTGAATGTTGAAGAAGAGATTCTGCTGAGTGTGTATTAGAGGC 
AGGTATGGTTATTTGAAATGGAGCATATAGCTATAGAGCTCTGACAACTCTTGAAACACCTAA^ 

CAT AG CTG AATAAAATACTT AGTAT TGAAAAAG AACACAGAAAT CATTTT AGTAACTAGAGTG ACTG AAAGTTCATTT CTGATT AC 
T CTAAAGTAGTAATTT AATGTG ACATTT CACT AT AATATGAAAAAAATAAAAAAAAG ATGCTTTGTTGTTT CTTACGCAT CCT AAT 
55 " AACTATTCAGTTCCTATTTTTCATAGTCATAAAATCACACAGTTCACTCATGTGTTATGT 

TAGTAGCCCTTGTAAAGTAGGTGCAGGACATGTTTTGTATTATATTAATGTTGCITCrr 

CAGGT AATAAT AACC CX3 AAGGGTTTGAATG CTG AT AATTAGGCTC TGTTACT ATT AGAAATATTCAT AAAAAT 
TATTGAGAATTTAATAC!ACGAATACAGTATTTATAACATTTTCCT 

6 0 TCACCCCT AGAG AAGACTGAGTCTCACCTCCTCAGTAACTT AAT 

CTG TTTG CATTGCATATG AATTTGGTGGTGGCAG CT CTT AG ATAC CGTTT AGGAAT AG AAGTG AATGTCAG CTGCAAACAT AG CAC 
TACGGAGACAGCTGTTGAGAACAGTGGGACCATACTTGCTGAAGAA(^ 

TGTTGAAG AAAAG TG GAATTACAGATGTTT CTT C CCCACTG AG TAAGGT CGCAGG CATCAG TGGAGAGC CAAAG CATGAAAAGT CC 
CT AACT AGACAATGTGGGTGAAGTTGGGGCT AAT CAGAATATGG^ 

6 5 AGAGAAACCAGGAAAGAGATT CTGG^ CAAT AAG ATTGT TTTT CTT AC^ 

TGTGATCTACTGCCTCTGTCCAAACTTTGTTAATGGATTGCTTTAA^ 
GACATAATGTTGTGAGACAGAGGGCCTGACCTGTCCATACACACACTGGCTGAACCTOT 

ACACCATGAGTGGGAGTTGACTACCAGG AGATGAGCGTGCAGGGCTGTT CCACA^ G CTT CGAGG CT ACTT AC CT 

AAG CAAAACTTTGTAG C AGTTTCCAAAACAAATCCTT AATT C CT AGTGAAT AAGACC CATTATTAT AT CAGACATTTATTGTCATT 

7 0 GTCTT CA.G CCTGT ACTCAGG AAGGCACAGACC TTCACT AGAATAGTT AAATTTATATGTTAGGC^ CTTGTGGTG AAAGTTAAAA 

AATTAAAGTTGTGTTTGCTGTAATGTTTTTTTTTCCCAAGAATTAAAAACT^ 

AGATATATTTCTGAAATCTTAAGAACTCCTTTGTTAATGTTTATTTATATATTTATGTTATAGTAATG 

TGTGTAAAAACACAGGGGAGGAGTCTAAGATGGTTGGCTTAAAGGTACACTGCTGGTATTTGA 

TATTATAGATGACAGTTGTACTAACTGTAGTTCAGGGCTACATAAAATGGTATT^ 
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CITTAGAACTCTGTGTTTGATACCCT 

TAGGAAAAAGTTTCTAGGCATAGACTGTCTACTATTTTCACTTT^ 
TAATTTCTAAATTATGTAAATTACAGGTTTTTAA'l"i"i~rAGACACTTGA 

AAAGTAGT AAAGGAGGAAAGATGGAAAGAGAAGACAAAATAGTTAG CATGAGTAAAGAAAG CTGACT ATATTCACATACACGTGTT 
5 GCTACTGATTTTAATCACTTAAAATACTGATTGATCI^ 

AGGTCAGAAGAACAAAAGGCCCTTTCCTGCCCTGGGCAGTGA 
CTCAGTCTTTGAACCACTCTCTTGACTGTTCT 
CTCITITATCTCTTCATAAAACCITITITGAA^ 
GGACACTGAGCGATGTTTCCACATCACATCATCCCACATT^ 
1 0 TTTGTGGTGCGGGAATTTCCTTATTCTGGAATAGCTGCTTAGTC 

GGCAGAAATGTTCTGGGCTATGTCAGTAAGTCTGTGTAAGTCAGCTTTGTGAG^ 

TGTAGTCAAGCACAGACTTCACCTGTGAATGTCAGATAGAAATGT^ 

GCAGGTCTCATCCTAACATCAGCGCTTGACTCCAAACAGT^ 

CTAGAT CTG AATGTGTCGT AC CT CCATTGAACATGC AATGTTGAGAATGGCTGAGG CCTTGACAGAGT^ 
1 5 TTCTATTTATCTCTGCTCAGAAGTGAAGCCCCC 

AGAGACTGGGAGAGGATGTGTTTTGCATGCAGAGATATAGAGGAAATGCC^ 

CTTTTCCAGTTTT CATT AAAAGGGTTCATG ATTGCAAGGGGATGTTATGAAG CAAATG CTAT GTGG C CAT CTCAAGTAAG AAATAT 
TAATTCrCrGGAAATTTATTGAATTCTTGTTATAATTTTGA 
GTAGC^UVTTTGAAAATTGAAATGTAACAAAATACAGGGTTTACCTTGCA 
2 0 GTGGTT CTCAACCTATGGTTTCTGAC C CTCAAAG ACCATGGG AAAACACAGATACTTACAT TACTATT CATAGCAG AAGGGATATT 

ACAAAT ACAAAATAG CAACGAAAAT AATTTT ATGGTAGGTG GG CACCAT AGCATGAGG AACTCJU^TT AAAG AGT CACGGGATTTG G 
AAGGGTGAGAGGCAGTGAGTTAGAGTTTGAGGCCATACTCTGTTAGGAACGA 

GTGGAGG AG GT CAATG CATTGCTGTCTAGAT AGGAGCTAATGGAGG ATGGGAG ATGGAGAQ^ CAT AAT CA 
GAAGCCAGCTGAGCAGGCTCTCTTGCGGTCTGTCCTGTCTGAGACrGTAATAA 

2 5 ATTGCAG AG ATGGCTGTG CT CTGTGTTAAAAC^TT AT ATATTT AAGTTTGTGGATC^TAG AG ATT CATGACAG CAAT ATAAG AC CT 

TTC CATGACTT ATTTGT ATATTTGGGTTTTCCrr ATT CTTTAA CAAAT AC CTTTTTTTTTTTTTTTTTGCAACAGA 
ATTTTCTTGATTGAAGCAATGTGGATACAAGTGAAATTAT^ 

GTGTGTGTGTCTGCTCCACrGCCCAGGAGACAGAATGGCATCTTGTGCCTGTTTCTGTATC 
AGCATTTATTCCATGAGCCTCAGCCCACCGTACTTATTTTCTAAATATCAT^ 

3 0 TCT ACG CTG CT AGT AGATGTGATTATAGTGAG C CCCIAAC^CGAGGCACATTAGGATCTGG AGTAATGGT AG CT ATT 

CTATTTGTGACC!AATTTTTAGAAGTCCGTGATTATAGTC!AAGTCCA 
ACATCTTGGAGCTCGTTTACTTACCTCTAGAATAAAGTA^ 
AGAAAATAATAGTGTGTTGTGTCACCAAAGAGTTGCTGCTITTCTCAGTACT 
AGTTGAGCCAGACX3GTTGTCTGGGAAGGCACTGTC 

3 5 TG GG GGGGT CTATAATAGG AAG AAAGT ATGGAGGTAT AACAAGAGG AAAGGCATGT AGACAAC^ CAAGG AATGG AAGCTT A 

GGGAGTGAAGTGCCTGACTATGGAGATTTAAGCTTCAGTGAGAGGAGAAAAGATGA 

GGTTGGAGAT CTGGTGT T ACAATTT AG CCAAG CAGAGAG AGGGGG AAAG TCA.CATTG AAGTGTG AAACC TGGGG ACTG AGTGGT CT 
GTGGGGCAG AAAACAGAGGTGGG AGTAAAGGCAGAAGCTCAGG AC CTGAT AATT ATT CTTTTT AAGGTG AGAAC CAATTGAG ATGG 

4 0 TTGTAGGAGTGTTAACGACTTTTCCTTTGATTCATAATGATT 

AAAGCACAACTATGTCTTTTCrrATCAAAGGATCTGGGTTTT^ 

T AATTAGTGT ATTTGACTAATTTC CTTTGAAAAGAAAACAACGTG ACT CAT TG AAAAGCT AG ATTT AC C TCT CATAAACAATG ACT 

AAAGTCACCCTGGAGATTAAAACTTCIAAATACCCGAACrAAGACATCTTAAGCTGCTA 

CGTGTATAAGCAAGAAAGGGGGTTTGCTTTCCGGTCTTTGCTGACCT 

4 5 GGACAAATCACATTGCAAATGTCAAGAGGCATCCAAGCCTTTTTCCT 

ATACAATGTAACTTTCTCACATTTITCAGTC^ 

ATGCAGTGGTACATTGCTTAGCATTTTCCCCTTGGAGGGGAGAGAACG 

G CTCACGTTGTTATTGATTT CAT ACT ACTTCCATAGG GAAGG CACTGGACCTGGAGTTGGC CAGGGGTG CCT AGTAAG CTATGG AG 
CCCTGTGGACTGGTCCTGGCATACCGCAGGAAGGACAAACATCCTAGGAGGATATTGGAGCTAAGGTGTGGC 

5 0 . AAGGTATTGCTCCAGTGGTTCTCAACCTTCCTAATGCTGGGA 

at ctcataact ctg attttg ctg ctgtt aggaattgt catgt aactatcagataggtatgatgtc tg aaatgagac ccctgtg aaa 

gtgttgtttggctactaaaggggtcacgattcacac<;ttgagaacctcgggttattggctgtgatc^ 

ttctcagttactgaccatgattttaatcgcttgaggagagtctagactct 

ATTT CTG CAGTTAG ACTGATTT CTTAATATAAACGTCTGAGAAT AGTT CTTTT C CT CGGT TCATCTT AG AAT CCATTC CTTG CG AG 
55 GCTTCTCTAATTAGGTGCTTCTGTCTCCCAGACCTCAGTAGGGATTCCTCTTCGGCGTGTCTCCATGTGGTTCCCACT 
TTAATCCTTCATTTTCTCATTTATTATTTTAACTACTG^ 
AATGATGTGTGTGTGTGTGTGTGTGTGTGTGTATGGCATTCATGTAAGTT^^ 

CTTT AGTGTG CATG TGTATT ATGTGT AGGTAT ATGTGTGTGGTGTGTT CATGC CATG CCCTGCGTGTG AGATTAGAGGACG ATTG C 
AGGAGTCATTTCTCTTGTTTTCACCGTGTAGACTTGAACTC^ 
60 ACGAGCCACATTTTGATAATTTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCT 

CTTTTGTTAGTTAACTTTACCCAGATGCTGCTATGTTTGACTTAGTTATACTTCACAACTTTTGAAGG^ 

ATTTGTCAAACAATTAGAGCTATAATTATCAGGTCCTGACGCTAAGCT 

CAGAGACAGAGACAATGCTGGGTTACCTTTGAATGCAGTTA 

TCCTTTCTTCCGTCACCCACCTTTATCATTTACGTTGTAACAGGCTCTTTATGTGCTGCrrAAGAGTC 

6 5 ATCTCTCTCCAAGAGCAGAATGGATATTTACTTATG 

G CGATTTGT AGTAAG GAGGAATGAACTAAG CAGATAAGG AAAACACCATTG TATTTG AAG CCAAAGAAG CATG AAACATACAG AAT 
ATCCT AAGGAACCAGGGAAG CTTTATGCAGAG CAGCATTTGCTGGGGAGCTGTCTTCAGT CAAGG AAAAGC CGCAG CT TGG CAG GT 
GGTGTGGC!AGAGATCTCCCA(X»CTCAGCCTAAGTGTATAAA^ 

TCCACAACCTTACCCTCAAAAC7VCCGAGCAAAGGGCTGATCTTGCCGTTTTTAATTCTTGGAGA 

7 0 AACG CTGG T ATTACCT AGTT TCGTTACTTT AACAACTTATGCCCTTTT ACC CATTC CTG AGG AG CTGTT GGTAACTGATAG CTGCT 

GCAC<1AGGGCAAGCC!ATTCTTCATTGAGCCTGTGACCTCTGCT 

TGG AG AGAGGGGTAATGAGAAG CTG AGAGG ATATGGAAG GAATGGGAGGGAATAAATGAGGACTG TGGG TGCATATGGT AGTGTTT 
ATTCTC^GAGAGTAAAAACAACAACAACAACAACAAAAACCCCACAA^ 
7 5 GTTTACAGTTGCTGCATTTCGCAGGGACATAGCATTGqCCTGGGTTTCTCCGTCTCAGCTCTGTC^ 
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atttgtcctaggtccattccttgctttgtagagtc 

agtagtttcctcaaacctccag ctgtgaaaccatacatgtaaagatgtcxrrtgattaaaag ctaggcacttaattaaaaaaataag 

TGTGTGTGTGTGTGTGTGTGTGCCATGCGCACATGTGCATGT 
5 CATGTATGTGTACTTGTGAGCAGATGACATTGTCTAATAAGTGTTATGAA^ 
AACACAATATCATGGAAACCAAATTAAGAAT<3T<3ATGCT 
AGCGAGGTCAAGAAATAGTCCTGGCCACTCACGTGTAC 
AATATTTTTGTCTATATTTTGAAAATTATTAATA^ 
GCAGACCTTGCATACrrCAGAGCCCGTTCTTTATTCrA 
1 0 GCTTCTTAGCCTTCCTAATGCTGTGACTCITTAACCC^ 

CTCTTAACTGTAACTCTAGCCAGGTGTTAAGCTGTTTTACATCCTTTATCT 

TAGGCCCCAATTTGAAC CAAACT CT AGGCC CAAGCACACCAGGCAG CX3CTCTAACACTGAG CACAC CTGACCCAGCAGTACT CAT 

AAACATGTGATAAATTGAT CACACAGCT AATGG CC CAGGAAATACAAG TT ATATAGACACGGTAAC CATGTTCTATCCAAACCTTT 
1 5 TTATTC LTriTlT CTAAAATGGGTATAAGACAAGTAGTTTTTTT 
ATAATTCrCCAAACCGAATTCCTCCTTCCTTTTCCTTCTCCATCT 
TCCTGTATAACAGATATTTACATTACAACAGAAGAGAAGCAAAAT^ 
GGTGGTTACCACAACATGAGAAACTGTATTAAAAGCT^ 

GTAGTATTCTGGCTTAGCATTGCCTAATCCATTTATGATGATGATGATGATAATGATC^ 
2 0 AAGG CACACAATAACAATG C ACACAATGCAGCTCT AGTGTG C CTT AGGAGAACTTTATGTCTTTATTATCTCAGCG ATT CCTTGCA 

AGGGCACTGTACAGCCCTCCTGACAGCTGTTA 

TAGAAGCTGGGCAGCAGAAACCCACCCTGTCAGAACACAGGAGGAAGCTC , 
CCCAGCTGAGGATGCTTAAAGGACATTTGATTTGAAGATCGGAATCCACACA 

2 5 TTTGACCTGAGAGGACAGAGGGTGAGT CAG AGCCGGTG GTGGG CTG CAGAAGAAT CG AGAGCAAG C CTGT CCT AG CAGGG AAAGG A 

GTCCAGTTTCCAGTCCTTGGTCCTT CCTTTTGTGGGCTGTAATAACTGGGAGCTGTGTGTGTGTGGAGGTGAATGGACAATGGCT C 

CTGCTGTTGTTTTACC^CGTGTCATCCATTTTATGCCTTGACTG 

ATTCAGCACCTAAGGACAGAATGGCTTG^TTGGCGCT 

GATACCTCACATTGCATATATGAGCCTTTAAATCGAGTTT 

3 0 AACTTGGCACATCCTCAGTGAAGTTCTCITGTTCTTGGGTT 

ACCTTTTCCTAACTTCACTTTTCATCTTA 
ATTTTAACTGTGTGTGTCTGTTAAGC!ATATTTTCTTTTTAT^ 
GAAACTTATGCAAAAATATAAACTAGTACCTGCCACAGTACrCGGTATCT 
TGAGTCAGCGTGACACACAGCGTATTGCGGATCATTTGATTCAC^ 

3 5 ACTTGG CGG AAAAACACTAT CATTGT ACATTCCTGTACACAT AATG T ATAAATGT CAACTAT ACATT AGCACGTCATCTTTTTG AAA 

GC TTAT AATGGCTT AAAATTTT CAAT ATATT CATAAGT CACTTTATT AAT CAG ATTTT TGAT CAAC CTGAATGACATTGTGTGTGT 
GTGTTTGTAAGTAGTATTGAATTCAGAATGGAAACTTTCAAAGGCATAAAACCAGTCT 
AAAACTACACAAGCCTTCAATTCTTGATCAACAAACTGAGATAATTGTOT 
TATCACACACACACACACACACACACACACACACACAGAGAGAGAG 

4 0 AGAGAATGACTTCAGTGATTACAGTGAAGCTAAGAGGCAGGAGGATCTGAGTTCGAGGCC^ 

GCCAGGGCTACACAGAGAAATCCTATCTCG AAAAACAAAAACA CTGTTAATCATAG CAA 

TCTCAGCCAGAATTTTTCCTGCCACTTGATTCCAGAGTTACA^ 
CAAGACAGTATGTCTCTGATTTCAATGGCTTTTAGAGCAGGTAGAC^ 
ACTTAATTTTACTCAATTTTAATCCCTTTTAATTTTGCT 

4 5 ACCACCTAATAAGAAGGGAGATACATTTCCATACCTGA^ 

TAAAGACATTTTTATTCIATTGTTAAATTCTGGAAATGAAT 
CAGTTTAAATACATAGGCACACCIAGAGAAGCAAATAGTGCATC 

ATTGTTT AGT CACTAAG AAAATG AGAGGCAT CAG ACTAATTG CTTC CTGCTG TCAGTTAT TTTGAATGGTG AAT AAAGACAGC CAC 
TTCTTTACATTGCCTTCATTTGCTCAGCTCATTACTGATGCTGTAATAG^ 

5 0 TTAT TTT ATGTGG TAAAGAGAAT AAATTGAT ACAGT ATT CATAATT CAGAGATGGAATT AAAGATGT ATT ATATGCTTACT CAAG A 

TGACCAAAGCTGTAATTCCAAAGTAAAAGTGCGTTAGACACTGTTTATATGTGGTCTGCTAAGTGAT^ 
TGTTGATTTTTCCAGGGAAAAGAAAACTTATGTTTCCATTCTAAAAATG 
AAGACTCGGTGAATATTGCAAATGACCATAAAACAGTGTGGA 
CTTTTGGTTCAGCAGTAGCCTGTTGAAGCTCATGCTAAGTC 

5 5 TAGACTTTGTCAGGATGAACATCTCCrrTTATCTTTTTTATTT 

AATACCCXSAAGACCATACACACTCTCATTrCTATTTTGAATTGCTGTTTGGCG 
ACCrCACTTTGTGAGATGATTTGGATGGCAAAGGGAATTTTTTCATT^ 
GCAGTGGTCCAATGCAGACAAAATGCCTGGAGCCTCCTCAGCAGAGCCGT 
TCTCTGCAGTCGTTCATCAGCACTGTGGCAAATCACT 

6 0 CTGTG CCAGACAACC CAAACTT AGGGTAT CAAG ATGGAACTG AAGG AAGTAGACTGG AAAATG GGCACAT TGTG ATTCTCAGG G AA 

GTGAAATGGATTCGT AACTCATG AAT AAAAT AG GG AAATTTTACTAACCACGTAAAAGTTTT CAGTGTGGACCTG AG CATG CAAAA 
GCAGTTAGAGGAGAACTTAGCATGCTTATGCCTrrAACAGTGG^ 

AATAGGGACTGG GATGTCAGAGCAACTGTTTTG TCTGTG^ CATC CATTGTG AG ATAAAAAAT AAGAGTACTT 

CTAAGATGAAGACATGTGCATTGCAAGGTGGTGGAGTTCTGAAGCCTCCTTTAAGAACTGCTCTTG 
65 TTCCTTGTATAAAG C^TTGACACTAAAAGACTCCTCTTGTCAACCAGTGTATAATATCTG CACAATAGAGTGCAGAGGGCTGACGT 

GAATTCCCTGAAATCATAATGTGCTGCAGTCAGTAGATCA^ 

TACAAAT ATTTG C AACAT CC TGAGTAATT AACT CATAAAAAT AAACAATT AC CAT AATGG AG ATTGAGTGAATTTCCAT AT ATT T T 
C CACTCCTAAAAAT ACTATCCAGCTGGTT AATACTATT AT ATTTCT 

TTCCATGG CTTTAG AAAAATCTCAGAATTTCAAACGT AAGG^ TTTCTATT AAATTTT TCTGGT ATATA 

7 0 CAAAACTT AGCTATTTG ACTCTG ATATCTGAG AAAAAT ACTTC CACATTTTG T ATTG AG AT AA^ TTT ATCT CT AT 

GGTGCTTATCTTTTGCCTAATACAGTGATATCATTCTGTATCTGTAGTTAAGCTGCTGTAAAT 

ATGATACTTTTTTTGGACACCTG ATAGCT AACTCTG CTT CAGG AGCAAACGGAG ATGT TTACACTGG ACTCAC TTGCTGACTCATG 
G CTGCTGGAG CAGTTCCAAT TGTGAGATAGTTACXSTAG ACAAT CCT CTGT C CT AT ATTTGAT CTCG CTTAAAGGGG AGC CAAACTC 
TTTGGAATATAAAAGGATGTTTTGTTATATAGTCCTGTGTTTTCAGGAATAAAAGCCCCTGTAAGGATC 
7 5 AAGGCCACACCCACCTGCTGCTGGCAGATGGTCTGAGGGGTGCT 
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TGCACCCCATCTCTTTATATATCCCCTCTT 
ATTCAAAGCAGGATTTGTGGTTGGTGATTCGCATGC^ 

TGAAGCCTTAGAAGTGAAG CTGTCCC^VAGT CATTG AAACAT CAACGT AGTTAGAAG CAATT AATAAAGAATATAACTGTTTTTCAA 
AAAGGT ATTCATTATTTGTAGGATATTATACCTT ATAT ATAAAATTGTG CTCT 
5 GGCCTITTGTGCTTATCTGCACCTTTG 

ACTAGGGATAACACAGAGAATTAGTAT CTGGGAC CCATGGAGTGGTGATAGAGTC CAGTTCAGCCCTGCCATGT CCACATAGCCAT 
AGAGCATTGGCAATTTAGTTTGTTTTCAATTGAAGTCT 

ACTGTGAAGTCTTCAGTGAGCrAATCTGTGTGCAGCATGAAAGCCATACTCAGC^ 

ATCTGTC CCAAATCAAGTTGTTGAT AAATTGGAAACCTGTGTTGTAG CACACATC TTTGT ATTTAGAG ATCAAATGCATGACTCTA 
1 0 AATCCTTGG CATTTCCTCATTTAGATTT ATCT AGCACTTGCAC^ CTAAATGTTTCTGTGATCAGAT CTGT CAT ATT AATCAGCT 

GGTTATTGCATAAAATGTACAGCTGTCAAAGATAGGCACAAAAGCAAAT 
GATGCTGTATTATATATGTTTTAGATAGATAG^ 
TTCTTCCAATTCTAAGGCATCACTTTTTTTGATAG^ 
CTATTCTGAATTCACrrTACATGTTCAGTA 
. 15 TATTTATGTCTTTGGTTTTTATTTGTTA 

GATTTTTTTATAACAATTCWUVTAATATCAGGAAATTCT^ 
CAACAGAGTGAACACTCTTAGCATCTACCTTTCTAATTG 
TAGGTTGAAAACAATGGAAATTAAGTATAAAAGCTTGTGATATAAAGTGATC 
AAGAATATAGAAO^TAACAATGGGAACTGAAGCAGCAGTTACrATAGT^ 
2 0 TTCAGCT AGG CAGTGAACATTTATACTG AGCAATTACTT ATTGTAG CATGTGTCTTG CAAT AG ATGTTT CCTATG CAG ACACAG CA 
ACTTTTCTGAGTAGTTCCAAACTTAAAGCCATTTTAGCT 
GTTGTTTGTTTTCCTGAAAGCTTTTCAAATTGCTTAGAATTACCATTATTA 

ACATTTAATAAT T ACTGCAAC T AGG CACrGAAGAATATTGACT CT G GTTGGG G AAC CCTCCAGATAATCATTGATG AGCAAGTGT A 
AGCCCCTAAGAAGGGX3CTGGGAGAGATTTGGTGCTTTGCCTCTTTGCTCCT 

2 5 T AAAT ACCTTT ATGG ATT AT AT CAAGTCTTTTTTCTGGGTGTT AC CGTGTTTTCACATG AT ATGTTCT AGTTCAAGTTTCACACGT 

TTTTCTGTTTTAAATCTCCTCTTAGTTATTTTTATTCA 

AAATACTGGT ATGCCTT AATTTAT CATTATGTAGAAAT ACTGGAT AAATCAACTATTTG AGTG CACATTC CCAAGTAAATTTTGT A 

TATGTGTTTGGCTCGGCTTCTAATAGCAAGTTCTTAGACTTTAATTTT 

TCACGTTAACTTCX3CTTGTTTTTGAAAAATTCTAATGGCAA 

3 0 ATTATATAAAGGGGAAGTCTGAATTCCAGCTACATCCCAATTTGTTTGTAAAAAGCTTGGGTTGAGA^ 

AGCATTGCTGTTACCTGGGAAGGAGCAGGTCAGGGACCTCT 
CAGGGTTTCTTTTCACAAGGTGGATGAAAACTTTTGTTATTTCTTGTCAAGA 

TAATCTG CTTCAGTGCAGGAAGCAGGGTTTCTAGGGACTTCCGTGGT ATGT^ CACTTT AAGT AACG AAGT AGTT A 

GTGTAACTTAGCCCTACCTTACTTTCTACAAGGAAAGTATATTTAGGCATGTTTAGTGAGCAGAAGTGAT 

3 5 AGCCTGTGCACCAAGTCCTTCATCTTTCACCCTACTTGGCAATCAG^ 

TGGGAGAGGAAAGGAAG CT AG CTTT CCACTCATCTG AT C CATT GCTAAATT AAAT ATGTT CAAG AAGTT AT AAAACCAAAGGAAAC 

TTTTTCTTTGCCAAAATCCTAATAAACTATACAAAAACATAT^ 

AGCATCAAAG^SGGACAAAAGCTAGAGAAAGCACAT^ 

CTAGATCTTCAAGTAATCTGAAGTATGGAAAAGAAAAGGAAATGTTAAAAATCTAACACGGATGAG^ 

TCGAGCAAGGGCTAGTCTTCCCCTGGTCTTTCCTTATCTCTTAAAGTCAGCTTT 

TTATTGCTGGTTTGTGCGTTCTGGACAAGGAAACAGGACTTTCCC^ 

AAGTTGCCTCATTCTAGAGGACTGCTTCTCAGTTACATGGCAAGGACGC^ 

GGACAAAAGAAGCAAAGGTTCCTAGAAGGTAG CCCTAG CCTTCGGATTTGG CACCAGTGTCTGCACTGTTCCCCATAGCTCCTCAG 

4 5 TTACCCATGCAGATTCTTGTCTCTGCTCT 

AACCGATACCTGGGATTCTTCAAAGTACACCAAGTCCCTTGTGAAGACACAGC 
ATCTAATGCCACATGTCTTATGTCTTAGGAGTGACACCATATGTAAACGCTTGCT 
AGATATTATTCTGAAAATAGAAGAAATCTGTAAAGTTTGCCCAOU3AGAAGTACTGTTTCATGAGTAGG 
CTAGCTGAGCTCCTCTGAAGGATCCATTGTTACCAGATCA^ 

G AGAATTG CATGTGTTCTGGG AT ACAGGT AT AT CATG AAG AGTTTG AAG C CTT ATACTT ATAAAGCTGTTGACAAT TT CCAGTTGT 
TGTTCACTACCCAGTGACTATGCACCATGTCTCT 

GAGTAGTCCACCATATCCCATCCTGGGGATGTCCATGGTCATCTGGCCTGTTCTATGTTGTT^ 

5 5 AACATAGAGTGATACACTCAAAATATTTTTAACATGTGGTATTTAG 

CCTGGAGGCAGGGTGGTTCTAAAGGACTACTCTCTGGA(^CTTTTATAAAGTGCTTTGGAAGAAA 
ATATCATACCACACACTGCAGTGGTATA^ 
TTTTCCTATGAATAAGAGAATGTGTTTGATGCTCCCTCAGGG^ 
CGTTTTATGTCCAAGTGTAAATGATGGAAGAGATAGCATAAC^ 

6 0 CAACGTTTTTTAATTGGCAAATTCTATTCTGCTAGTACTCTCCAGCCTATTTTTCTTGGGTATA 

CATAGTAAAAGTTAGACTTAACTGTGCTCTTTGACAGGAA^ 
TGTGTTTTCGTGGTTGGTTCCTGTGGCAATTATGTGTCTTT 

TTGT CTG AGCITACAAGT AG GTGAC AG CCTTTCTTAACTGAAAAG G AAGG AAGG AGGG AAGGAAGGACAAAAGAG AGA 
AAGAAGG AAGG AAGG AG AGAG AGAGAG AGAAAGGG AGAG AGAGAAAG AAAG CAAG CAAG CAAG CAAGAAAGAAAGAAAG AAAACAG 

6 5 TGTTTGAAAATCCC CCAAAT CCATTT AAG CAGGGTT CTGGGTG ATT CT TG ACTGT AACAG AAT AAGCT AGACTGATAT ATCAGAAA 

CACAGTTTTCTTTG CT CATCGTACACATACTGT CT CTTGAAT CAGTGTGGTGCACAATGT TGAAGCAGGT ACTGGGTAAGT TGG AT 
TTGGCTACTGCAGTGTTATAAAAATAGATTAGGTAAACAGACAC^ 
AGTATAACTTTAATAGGCTGTAGTTCTAAAGAGAACTATGAATCTCTAG^ 
TTATAGCAAGGAATC!AGGTGTTATGTCTTGTAA 

7 0 GACGGTGGACGTCTTTATTTTTTGCCTTTGCTATTTTTAGAGCCCT 

AGTTGGTGTTGG ATGCAG AATTGCT ATT AGATG AAAGCT AACAAAG CAAAT CCTTGGAAAGAAGGAGCCTGT CTG CGGTAAAGACT 
TTCAGTATGAAAGTTGTAATGTAAAAATTAGAATGCTCAAGGATAAAATAA 
AGAGTAATTGTAGCCACTCACTTTAGCrrACrCr^ 
TTTTTGGATTTTTTTTTCCCTTTTATCAGAAGTC^ 
7 5 AATCCACATG CTTTCTG AATAATTATAG CACCAGT ATGTGTATAAATATTCAT ATTTTATAGGTTGATAGGACTT ATG CATTATCT 
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CCAQTAGTCTAAGAAAAATAAAATG 

AGTAGTGGAGATGAGGTGGGCTATAACATCTGTGGAGTt^ 

GAATT CTTTT AG ATT ATTTATAAATCT ATACTTTAG CTGTCTGGT TTTAAATATTT AAAAG AAT AC CAAAG ATTCATGACATAATT 
AACCGTGTTGCTGCTTAAGAGGAAAAT/TAAAGCTC^ 
5 TTTCTTCAGGTAGACAGCAACCCTQATGAAGTGGAGGGGATTTGTCCATTT 
CtAATTATCAAGGTTCTTTTCATAACCAGACTA^ 
TCAGTAGAATTCTGATCTTTGGATTTTTAACCCTC^ 
ATTCTCCAGCAGTTCCCGTGCTGAGAGCAGATCTCT 
TCTGACTTTXIAGCATTACTG^TTGCAAGTACT^ 
1 0 GACGCAGCAGGCTAGACAGCTGTACATCTGGA 

TTCTGTGACAGATAAATTAGCTGAATTATTAAAGAGCTTT 

AGCAGGCAATGAGTTTCAAAGGGTCTAGAAACAGGTGTGTGGTATAAAA 

AGTTGCATTTGTTTTCCCATATTGATAGGAAATGTATTTTTCA 

ATT ATG CCATG AG ACACTGATGCCATTCCTCTGT ACACACAAG CC CTGGTC CACATG CAT AT AAG CTCACCTGT AACA 

1 5 TTTAGGTATTT AGGGTGAAATCAAGTGTTTGG AAATGAAATG CTGTTT 
ACAATCTTATCAGATTGGACTCCTTTGTGAAAGAGTTTTA^ 

G AAAACCGTACTTTGTATAAAGTTC CGGTTGAGG CAG AACAG C CTGAAGGTTT CC!AGACTGAGACTG AAAGCATC CAACAAATGCA 
GATAAACCTGCCTACCATAAACACTGCCACATCTCCTGGCAT^ 
CACTGAAATGTACCCTTGTTGCCTTAGTAGATAGATAATATAGTTACT^ 
2 0 AAGTAATAAGTACAACAGAGTCACAGGTAGGAAAGTGCAGGAT™ 

ACrATTAGAATTTATCCCCATATCAGTTTTAAGGAATCAAAATGAAGAAGAGCGGTCC 

GTGTGTGCTGGGTTACTGTTTCAGATTTGTTCAGTATTCAAAGCT^ 

TTTCCAACTTCTCATACCAATGAGCATTCCTCCCTTTGACCT 

2 5 CTCTTGTCGATATTGTTTTGCTTTTATAGCATACT 

TTACACACTTCAGGGCGTTACAGAGTATATGTGiAGAGCCAGAAA^ 

ATGAGAAATAGCACACAGTTGCACG AG GCGAGCTGAATGTCTGT C C CTTCCCACCC C CACCTGATAGG CTCATAGT AGAGCACATT 

TTGAAAATTCTGAGGAATTCATTCCAATTCTATTT^ 

ACTGTCATCTCAATGAATCAGAAACTTTAGCTTCT^^ 

3 0 TTTTAAAAGTGT AT CTGTTCG T AATGG AGAGAATTGAAATTTATACGTGGCAGGG GT AG CAAAGT CATTTCTCTTT ATAGT AT CAG 

GTTTTAGGAAG<rrCCATTCTCCC!AAAATATGGCGCTCATT 

GCCAATGGGTGTTGACTGTTGACCAACCAAAACCAGTGCTGGCCTTTAGGCTCTTTCAGT^ 

TTTCAGCGCTTCTCATCCTACCTCCTGCTTTAAACTCC^ 

TCTTGTGGTCTOGACTCTTTCMATGCCTTTGGCTGTGTCCnT 

3 5 AGTAGTCAAGTCTTTAGTTTATAAATCTAGTGCCTAAAC^ 

TGGATCCCTCCCTT^TCAAGGGGTCTTGCCTTTCCTAAGC^ 
TTGTTTTTATTTATTTAATTCTAGAGTCTTCCTCITCCTTGCT 

GCTCTCTGGCTCCTGGAGCCTG (ZTTCCCCCTCCTGGGTGCCCTTCACAAATAAAGGCTCrrCTCTCCCT CCACCTCCTCAGTCCTCT 
TCTATTGAGTAAATTTAACTGC^GCACTATCTGCTTTGTCTCTGTTGACTTGCTGGA^ 
40 CAGCCTCCTGGGCCCCTGTCATATAGAAAATGGATGCCTCTCCT 

CATTTGAGCATGCCCCAGATTGAAGCTCTTGTTGCACAGAACACT^ CCTTTCTCCTTCACTG CATGTT CTC 

GG CTCCCTCCGAGCTTCCTCCTCTCTGCTAG AAGCTGCCTGCGTGGTCT CTOTCC^TAGCTrGCTAGAACCCGTTGTTTCTTC CTT 
TACCTTCCCTGACCrCTGTGATAGACrACATCCrrCT 

CTi\ATTAGAACCCTCCCTCTGGTGGAGTTCCTGATTTTTCCTTTTTCTC^ 

4 5 ATGTCTGGCTGAAAAAGGTGACCTTAGTTTAGTGCAGTAGCGCCAAATCAATTTGCAA^ 

T C AT ACAAGCAAAT ATT CG ACAAAAGT CTCTTATCTCTAAGCTT T TT CT ACTTTCCACT CrAGAAAATC"l"l"lTATCT CT ATG CTTT 
TTGCTCTTTGCTCTC^AAAATTATTTTAAAACTATTATATAAACTCTGTA^ 
AAAAGCTAATACGTTAAAAAAAAAAACAAAAAACAAAAAACAAGACTATACCT 
TGTAGCTCGGATTCAAC^TGTTTAGGGAAAACACACACACACATACA 
50 TGAATATATGAATATGTATGTATATATATTCAACTC1TGTTAAG 

CAC CAT CTTCTTT AAGATGACCACGTG CTATAAACCAGCT AAGGCAG CACCCATAAAACATTTTAGAC CTCT 

GTCTTTGTGTCTCCTTTTCACACAAAGGAAACAGCTGCAGGTCTTA 

AAAGAGTTAAAATTTATACATGGCAGTAATAGCAAAATGACTGTTCTTCACriT 

AAGGACCTTAAGGTGAGCCTTAAACAATGCTCTGCTTTAGGAGAGAAAGAGAGAAAAAGGGGGTCTGGAGA^ 

5 5 TATGTGACTT ATTTTATGAAAT AAAATTTCACGCATCTTTC TAAGAT AA^ 

AAAGGGATAGAAAGTTTAACCATATGAGCCCTTGTACATATTTACACTTTATTTCAAGTGCTG 

GGGATGAATGGAAATCTTCTGAAATTCAGCATACTTCTCCTACCTCTGCCCATGAATAAACAACTGTCCT 

CAACCACAAAAGGGCAGAACACCTAATGTTAGTCATTAGTTTTT^ 

TTTTAG TATG CC CT GG CAAATT ACG TAAGAGCCATAATTGGGTGAG CTCT AATGTC!TC^^C!AAGG AACTCACTTGAGAAG AAGAGA 

6 0 GATCTGTTACAAAAGCAGCAGCTCGGGACATCTC^ 

GTACAGTAATGT AAG AAAAG CTT CTAG AAGATG CAAACATATT C CTCAGGGTCCACACTTCTGT CAGGGAC TTTTTTATGT CATAT 
TTAAGAAGAGAGTGGCTACAGTTTTCTTTTTTAATAGGG 

CGGACTATTCACCAAAGAGTAGTCCTGGTAATTTCTTATATTAATAATTAGAGGGGTTTTTTCCTGCT 
GCTTTGTGTGTTCCCTTGGCCTGCTTCACTTGATCAGTTTAAGCAGCTGTTTTGCCT^ 

6 5 AACTG CTATT AACT CITGGATTTGTTTAATG AG TGCAGTCTT TTTAT ACC CTTAGG C CATCTTG CCTGGC CTAGGTTCCTGACATC 

TGAACAGGCAGCTACAAAGACACCAAGCACAGGAGTAATCTTTGCTT^^ 

ATACTCC^TCTTAACTCTCTGACCTGGCTACTCTGCACAAAGCCTTCTCCCTAGCCT 

CTCTGAAAGCTCTACACAGTGTTCTTAAGTCTATTTTACTGTGAA^ 

AAAGAAAATAT AGTTGCAG AATG AGTAC CAGAC^GATTTG AAAACT AAATGTTTT CATTCTTT CTGAT CTTGGG CAGGTTATTTAC 

7 0 TGTCGGTTTGACTTTTTTGTTGCTGTGTGTTTTGTTTTTGTTTATG 

TTCTATTTTATGAAATAATATTGGACTCCACCTGTGTCATC^ 

CCATTTTTCTTTCTTTCATTCCTCCATCCTTTTGTGTTCCATTCCCCCC^ 

TGAGAAAATAACAAATTTCTCAGAAAGTTTGAAAAATATATATTGTATTGA 

G AAAG CAGGG T AGGGGG AAATGGT CTTTATGCATCAAAAACG ACTTT AG TTAAAG ATC CCGTGGAAAT AT ACAT ATATATGATTCA 
75 CTTTT AGTTTAT TTAT TGAGT AATTTTCTTGCAT AGACATGTG CAT CACAGC C CTAG GGAGCAAGAGT CAG ACAGC CTGT AG CTGC 
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AGTTGTGGATGTGGGGAGCTCAACCCAGGTCTTCT^ 
CCCTTCCAAAAGATCAGTTTTTTAAAAGXn'C^ 

TGGTGAGOTGTCAGGCTAATACAACAGTGATAATAGCTAGGTGTTCTTTTC 
CACGTCTAAATTGAACATTTCACTACAGTTTTTGTTTC 
5 TGGAACTCTGTATATCAGGCTGGCCTCACITACAGAAATTAGATTGCCT 
CATTGT CCAGC CCTTAAAAAAATCTTTAAGAAAAAGTGG^ 
ATTGAGTAGAGTTGTTTTAAAGATGAATGTTAAAGCTATAATC 

CAT AAAT AAACAGGTTCAATGGTGATAT AT AAC CAGGCATCGG ATAT AG CTGG ACGAACTCTTAACAAAGACTTAAAAACATCTTG 
GAGGAGATGATATTTAACCAAGTCTTTAACTTTACTGAAGATTTAGCCAAG 
10 TCTGGGT ATACATGT ATGCAGAGATATCTTGTGCAAACATGGTTTAATGTGGC CTGT AGACAGTG TGTGTGTGTG CATGTGCTAGC 
CTTCCAATCAGATATGCATCCACTTTTCTAAATAATGTTATAG 

ATTCTTTT AATCCCTCAAATAGG C CACCTGAATC CATTGGTGGGATATATG ATTAGAGGCAG CACCT CTAATGTTATAACGTACAT 
TGTCATGGGCITGTGGGACATTTAACGGTTGGAACCTCA 
1 5 GTCATGAAGATAACAGTAGGTCTTATGAGAATAAGGAGACACCGTACTITCAAATGGAATG 
TATTAAATGAAAGTTACAAGGGCATTTTATTCATTTGTTTTCCAA 

GAAAATTTATGT CAATGTACTTTTATGAACATTT ATTGAAACT CTAAACTTCCATAAGATGTAATTGAGCTG 
AAAAAAAAAAT CAGTATTAG CTAGCAG TT AACATAAGAATT ACCAGAGATGTGAATG ACTAC CTAAGTTGTGTAGGATTTATT ATT 
AATAT AATTTGTCATCTGT ATCATGACTATGC CTT C!TG AGG ACATTTTG AC CAACAGAAT CCTTAAACTCAGGTGCATCCAGTGGT 
2 0 AACAT AG CCACT TCCAT AAATTTCITTTAGATTTTTTT ACTTCCT AAGAAC^ CTG 
TTCCTGATATTTGTATGGGATCTTATGTCTAAAGGAGATACAGGCAGCAGATCTC 
TTCTCCCATTCAGAAAATCTAATAAACACCTTCAAAAC^AAAAA 
GCAGAAGTTAGTCTTAAAAGAACTCTGCAGGATATATTC^GC^ 
AGCTTGATAAATCAGAATTTCTGGGAGCAGAGGATAGGTGTTGCTGTAATTTTAC^ 

2 5 CATTAGGTTGGATCTTGATTATATATTACCTAGAGTAGGCAAAATGTCT 

GTGCATGTCAATGAGACTGTTAATCCTATAGTC 

GC T ACTGAAGTT AAT ACTATCTGTTTATTCT AC CTG AC CAAAAACAAGAAATTGAAAAG AAT C CACAAAAGAGGG CTATTGAAGAT 

AACCACTTGTCATTAAATTTAAAAAAAAATGTGGAAACAAT^ 

TAATATAAAAGAAGTATGCTTGAGTTTTTCCTCTTC 

3 0 CAGGCAGAGGCACCATTTGCTC^GAGGCTAAATGCCTCAGTGAAGTATTGCT 

GTTCACCTTGGCTCTGGX-K5AATATCCTTC 

ATTAAAAGAGACTCTAACTCTGTGTAAGATCTGAAGGAAGAGGAGAGAAAGATGCTTTAAGTAAAGATA 

TG ACAG AAAGCAAGG CTTG CAAGGTT C CAGTGATGGGATAAAT ATGCTAGGACTGGG GTC T AGTCIACTTATGTTCA 

AACTGCATGTGTCGTTTCTCTTACATTCACATTTTAGTGT^ 

3 5 CT CCATTCT AATGACTTAGAACACG CT CCTTTCCAGTTG C CTTTTAGAACAT AG CACTTTGGGAAGACTTTGCTTTTGTGGCGATCT 

TAGGCTCACAAACTCATCTATACTGGTTTITTTAAT^ 
AAATGTGTCTACTGCTCTATTCCTTGAGATAAACGTCAGA 

TG GG AAGCC CACAGGAG AC CTGGG GGTGAGGAAGG CACACAGAG C AGGAAACAGGCT CACATAACTTGTGTTGTGATACT CTTTTT 
TTTCAAGATCTTTTCTACCCTAATTGCTTATGCTGCTGCAGCCAGCACCC^ 
40 TCTCTCTTCATTGCTGTCTCAAGTCAGAACTGATAGCTGTGCACAGTTT^ 
TTCCTCATAGGAACAGAACCAGTGTCAGTAAGCACATT^ 
GCCCTGAACACTGTCACCTCACCTTTCCTTCTGTCTCTGTCAATGTGATCT 

GGCATTTAGGAGCACAGAAAGGGTCAG TG CATTTT CAG TT AG GGAAT CC CAT AGG C TTG CAAAGATTGCCACTGGAAT ATTGGGAG 
CG AAAG AATCTCATATTTGG CAAAGCTAATG AGG CGTG T AAAGCCGGGAG AGG AG AGTGTG AGGTG ACTGT AGTAAATAAAAG CAG 

4 5 TTAAAACTGCCCGCTTAGTGTAAGAAATTGTGTACGCACCAGTTCTATGCGTTAGATATGT 

TTTAAGGTAAAAATATTAACATTTGGTGAAAGTGTAACATAGATACTTTGAAAATTG 
TATCACTTTAGTATGTGAAAATTTTACTTTAATTTT^ 
AGGAATAAATACAGTGTCTTGGTTGTATCAATACAAGCAGATTCTGAT^ 
CGTGACTCCTAAAATTTACCATGGTTTGTTTTTCCCTAGATGCTCTACTATATGTG 

5 0 AATTTCTGATGGATGCAAATCGAACAGGCCTATGAAATACATGGATAATTTGCCCACTCA 

GAAATAGAGTGCAGACCATTACTTTAACTCATTTGGTGGTCCTC^ 
ACATGAGAGATTCACTGAACTTTTAAAAGCACATTTAAGATTTTT 

GTTG CT T ACAT CTTGGACT'iTTT TTC C C CCCTTTGGCAG AACT CACCATAGTTG AAT C CTTCATTT AGTTAGGACCTGTTCTTGCT 
GG TC CCATT TGT AGTTTTATTTTGAG AGG TCAGTGGTTAG GT ACTT ATACTTAAAGATAAACAAG TC TAATGCAATTGGTATG AAT 

5 5 GTGT ATCCT AAT ACAATGTCT AGG AG CT CCAG C T CACTG AGT AGACAGGGGTT CAGTG ATG T ATTTATG AACAGTCAAAATACACC 

TTCTGTGGT TC CCCAAAGTAACAGAAAGGTAG AAT AACTGATG CT GG AG CATT AAGTTGTCATGT CAACATCCATAATGTGCTTAG 
CAT AGCACTT AACACAG AAG CCTCATG CAAGCT T AGT AAGT ATT AATAATT AATAAT AAATATTC CTTTAT CATAACAAGATGCAT 
CTTTTGGTTTTAGATCAGTGCTGTCGGGAGGGGAGGATACAAGCTGGAGGTTTC 

TG CAAACG CTG C CTT AGT AATTTT AAG AG CCAG AT AGG AGAAG CATGC CT ATGTT TT AC CT AAGGTT ACCT CCTG ACTTCTTC CAT 

6 0 GTT ATCATC CA(^TTGT AATG TCACAGTG ATG AAAAACATG CCCT T ATT ATGTAAACACTGGGCCT 

TTTGATTATTAGOTGGTTTGATTTGCAATCCCTAAAACATCATAAATGGCTTGGCCA 
GCAATTTGAATACAC^CACACACAC^TATGTACATATATATATATATATATATGTA 

TGGCATTTC CAGTTT TATGATG C CATAAAAAAG CAG CCAGCACATGTTG GACTGTTTAATGGT CCTG GCACCT ACCAAATG T CACT 
GG ACTTT ACAAGGTCATAG AAACT CCAC1"1"1GGTTCAGGGGTATAGACTGGATC CATCTCTGTAG C C CACACAGTATATGG AACCA 

6 5 GG AATGTTG ATGGTGCTGGTGAAG C CTCCACAG CACTAGTG TCTAT ATATTGCTAGGATTCTGTGGCAT AGAGACCACACGTTT AT 

GAGAAAAGCAAGACTGTTTCTTTGTGTGCCATCCTGTCAGAGGCAG 

GCAACCTGCTGTCACAGCTGCATGTGTATGTGATCTCTCATAAGCAAAAGCT 

TGTCTGTGATTTGCTGCCGTTTCACTTCCTCCCCCCT 

GAGGTATAGAGTTTCACTTTGGTCATTTCCAAGATGATTAATGGAGTTGTGC^ 

7 0 GTATCCCAGTGAGGTGGAGTGCCTTGATACTGCGTCAGCAGGGAGGGTA^ 

TACAAGAAT CTTCAT CATCAAAC CTTAGG AATGT TC T ACAAAACACTGC CTAGAACAG CTATTG AGTTCTATAAATG ACTCTTTAA 

GTAGCTAGAAACTGTTACCTAAAATCTCCCATTTGTGACCCAGTTAGATATGGATGAAGACAGTTTAGGGCT^ 

AGTATTCCTTCAAGTAATCATAAATTATAAAGTTAACTGGCTGTGAGTGTGTGTGTGTGTC 

CCAGAAAAG AGGAAAGC CTGAATTCCCAGT C CAT TC TGTTTCAGTTG AAT ACAT ATTTTGTATCTATGGTAAACTAACTGTAAGAA 
7 5 GTTGAGATATAATTTATCTATCACTCTTTTCTAGTAAATATCTTAA(^ 
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AATATTGAAACTCATTCATTCATGCTT^ 

CAGCTGCATGCTCL"rri T ACTTCATGTTGCTCAGTATAAACTTAAT ■ 
TGAAAATGGTAACATTTTTAGAATGCATAGAGCTCTCTTTAGA^ 
ATCTCATGGTAGATTTGTAAATGAGGTTTTTAAGACTAGAACTAAGATTAA 
5 TCAGTGCACTGACAAGAAGAATTACAAGTGAGGTTAGAGTGGTAGTGTCTCTC 
CTTCCTGCCCTCTCTCCTTCCTGTTTCTGTGAACACTAAGGGAGTTC 
TTCTGACTGAAAACTGGGTAGGATAGGACCTTCATCTTGGTTT^ 
CCAGTCTGTGACATTTTTCTTATGGCATCCCTAACTGACTAATAAAGCCTCCATT 

TGTCTTTCAACTG GTAATTATCTT AATGATGTCCACTGCTG CCTGTA 
1 0 GTACTCCAACCCACACAACAACATTTATATCTGGCCCT 

TGCAGGTCCTCTTGCAAACTTGCAAAAGCCCrrCCATGGGTGCAAGCAAGCT 
TGTTTTAGTTGCATTGACTTTTTTCTCTGCACTTGCA 

AAAAATCAAATTGAAGGATTTAGATATGGGAGTTGGAAGGTCTGGACAAAATAC^ 
TCGATCTCCTAGCATTAGATGATTAACTAGACAACAAGGTGGATTCATA 
1 5 GGTGACCACrGACATTAATAATCTTATTTTGGGAAAATATATATCATC^ 
CTATTGGTTTCACAGTCAGATTTTCAGGCC^ 
CATTTTAACCACTGAAATAAATGGTTCCAGTTTGAGAGC 
GGTGGCACCCCTAGAATTTGTTC^TTGCCTCCGTATTCTAGAGCCTA 

CCATTTGGCAG CATGACAGTATGTC CTTTTGGATTCTG ACTAT ACAAAT CATGATAACCCTTTTATTTCAGGTTT AAAAAAACATT 
2 0 G AAAATTT AATTTTGACATT TTT AT ATTTTTAAATTTACTTGC CTT CTTTCT ATTGT ACTCITTT ATTTTCAATTATTAGTAGGTT 
TTTAAAAT AGATATTAAAAT AAAGAATTGATGTT AAATTT AGTTT ATAAT CTATGTTCCT AAG CACT AG AATGAAGACAGAG CACA 
ATACTTTGAATGTGTTATAAGGAACTTATCCACTCTCrGTGCTTCCTGGGTAGGGGTTA 
ATGAACAGGGATGACACTGACTATGGCTTTACCATAATTCTCAAAGGAACTAATCCTAC 
TTCCTAGATTTTTAAGTAACGCTCrAGGTTGGGATATTTATTCTGATTTACTCTTAA 

2 5 ACAAGTGCAGTCAACACCTAACCTGTGCTCACTTTA 

TATGTGTAGCAGTG C^AATTCCTGTGGCITCACACAAAATGTCTTCTATGGTCTTTTCTGCT CCAGAACTAACTCCTTGTTAA 

ATTGATTTGGCAACTTCAACTCTGGTGGAATTGCTGCTTTAGTCT 
GGATGTGTAGGAATGTCCTGTTCAGAACATATTCACATTATTTT^ 

3 0 AATATAGTCAAGTGATACTTTAGCCACTGAGTAAATGCACAATAGAAACCATAT^ 

CAGCAAAATGGCTCTGC^GGTAAAAGTGCTTGCTGTCAAGTCT 

AGAACTGACCCCCCCCCCCCACACACACACAGGTTGTTCTTTGGCCTCCATTAGAGTGCTGTC 

AAT ACAAAAAGTTTG ATGG G ATG CCC TG ATG CAGAATTT ATAG GCAG CACCTAAAATCC CTTT CTGT T ATTTCAG ACCAAGCTGGG 
TTGATAGGTAGAGGGTCTGGGTGCTGTGTGGTCAGGAA^ 

3 5 GTCCCAGGCTAGGTCTGTGCTGTTTGTAGAGCTAAAGCACCTGAACTGACTCT 

CTACACCTCAAGCCAGTTTTGTCAAAAGTTTGAGGAGGAACT 
CTTCCTCAAGCTGCATCCCTTTAAGACTCTGGTT^CTCATGTTGAGGGGACCCC 

TCTTAGGAATCATAATGTAAACACGTGTTTTCCAGTGGTCTTAGGTGACCCCCAAAGGGATCATGACCT 
TGTGCTGCAGCCAAGTT C JrrCTT TA TATTGAAATAGGTATTATGTCTCTGGGAGAA 

4 0 CTTAGGAGCTTAATTATTTTGTCCAATCAT^ 

GTTGTTTTTCACCGAGATGCTTTTATGCTAACAATGTTTCTGTGAGCACAGCA 
CCATTTTATGAAGCATGGGTCTATCTGTGAACCTATCAAAAGTTGT^ 

AATTCTACTTCCCTTTGATATGCATTTAAGCTGTGTTCCAGGGATGTGCTTATAGTGTTAGTGTCTTATG 
GGTGTTTATAACCTTCGTGTCTTATGCAGTrTTGCAGAGCTGATGGCTCTATAAACTGGGGGATTTAGGC^ 

4 5 AGAGGCATCTGGAAGAGTCCAGAGCAGGGAGAGAAAGTAGTAGGAGG^TGAACATGGCCAGCAGACTAGAGTTC 

GT CAG GAGC CAGAGAG GAGGG CAAACAGGGTGTGG ACAAAGGAGG C CAAG AG AGG AAAACAAGAGAGAG ACAAAAGACTGAGAGCA 
AAGAGGGTG CATGGCCAAGATGteCAGGGTTATGTGGGGAAA 

CTGGACTCGGAAAAGCTTTGAAAAGACACAGACAGGTACTTGTGCTACTGAGAAAGC 
AATAGGCACCAAAGATCCATGTATGCCTTCTGACAGCAGTAAGGAATGGCTCCTTTGGCAGAGGG 

5 0 GTGGCGTTT CTG TTGGAC CTACX5 AATGAGAAAAGAAG CTGTAG ACTGTT CACAGT CCT AAACT^ CAGG CAGTGTC CTTATT AACA 

CAGTAT AGAAAC CTGTC CTAAAACAG CTT AAAGTT AG AG ACCCTAAAT AG AGCATTTGG GT A 
AACATCAATTTTCTAACTTATTCAAAGTAAAATTCAACAGAATTTACTTTTCT 
AATCTGTATAAGCAGATCTGCTTAGATTCrrCTGGTGTAAATGTGTAGTTGTTGAGATTGACTA^ 
GGAGTCGTTCCCAGGGTCTGGCATATTACAGACTTTAATAAGCGCn^CT 

5 5 TTATGCTGCTTTAGGGGTTGATGTTTCTGACTTTCTCATTGTCAGTTCTTTGCCGGATC 

GTTTGGGGTCATGGGACAATATTGGAAAGGCTGAAGGCTCATGTCTAA 

TGAGAACCAACTAGCTCTCTTTTCAGCCAGGGGATT^ 

CCTTTGGAGCCATTTTCCTGGGGGAACAATGT 

GACAAGTGTTTT CATGAATTTCtSCTTCTCGTGACTTGGTGATGTTAATG AAGATGCTGTAG CAAGCATT CAAGGTTCTGTACTTGA 

6 0 CATCTGTGTGTGAGGCTTACTTTTAAAGCAGTAACCTTTTAAACTGGAAACAGTAACCCT^ 

OTGAGCCTTTAAACTTTTTCAAATATCTTAAATCT^ 

AAAAATGCTGTGAGTGATAAATTGTGAGAAAAACTCATAGATTTAGATAACre 
ATTTTCTTTAACAACG^TTCTTGCTTCTTTTATCTTTTAGGCTCGTT^ 
ACTTAAAAACACAGTTAGAGATAAATGGGCGGAATAACTTGATTCAGCAGAAGAACAT 
65 G^CAATGCCATGATGCCCGGTGCCCCGTTGCAGCCCGTGGTGAGCAC^ 

GTCTGTCCTGAAACAGTGATTGTATGCAAATAGAGGCAGCAGTGGTAATTG 

CG G CAAACTTGG CAATTT ATAGT CTAGT ATTGCTT CTG CT AAAATT CTCTTAAAAGTTAACCCTTTAACCTGCGACTGTTCACCT 
GTGTTT ATT AAAG TAGC CATTAAACT CAGTTT AGTT AAAATACATT C CAAATTTC CTTTGAAGTTG ATAATGATATTTTTCTT AAA 
AACAAAACAAGAAAGCAGATAAATGATAAATAGTAGTCTTTAGCTTTGAGGCTATTTCTTG 

7 0 CCTGTTATTTTTTT/TCTCTCTCAAAAAGG&GACTTTGTGG 

TTTCACAGTGGCGAGTTATAAGCATAGCTTGGAAGTATCATCAGAAGAGAACT 

AAATAGCTATGAGTGGTCATAGGTACTAATATTTTCCTAGCCATGTGTGTTGGTAACTTGTTTATGAATAAA 
TGTTT AGATTT TTG ATGTGCATGTG ATAAATTACTTTAAT AAT CT T ATTT AT AT AATAGTT AATGTTTGGTT AAAATGGTGTTT AT 
GTTAATTATTGTTCAAAAACCCTAAATTACTTACATATGTGATAACATTTATATGATCAGAATATGTC 
7 5 CATATT ATG AACT ATTAT AAGGT CATGAGAA(^CATGT/TTTTC CAGTTTGTT AATTG T AGCATGTGTG C TAACTG ATTTTTTT CCT 
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AATACIATTTTAACACCTGAACTCAATGTAATTATTATTCTCT 

TGCATTTCGAGGTGGGAGGGATCCCCATATTACACCGGCAGTGAT AC CAATTATCAAT ACTGTACAGT CGAGG TGAGAGAATATTA 
TAATT AATT CT CT AT C7VTTTTAATATGATTTTT ATTTT ATGACATT ATTGGGGGCT CAGTTAAAATGTGATATTTTGATTTTTATG 
GCATCTCCTTATTTTATGTGTCCTTTATGATGAT^ 
5 CAGACAAATCGGTT CTAAACATATCT CTTACACTTGG C CCTCATGTATGGTTT CAAAGATCTGG AACAGTTTCTACTTTAG TCAGC 

CT CCTTGATGTGAGGT C CTTTCTC CCACTTAAG CCTGT AAATCACACTTACTGACATT CTGTGCTTT C TGT GACTCAGAGCACGTC 
TGAGTCAGGCAAGTGTAGATCTGGGCTGGAGGAGCATAGTCCCCAACACG^ 

ATTTTTTGACCTTCTCCTTAAATCT CCTCG CATGGGTG ATC CAAGGTCATATTATGCTT AG C C C TGGAACAAACTGTCAAAGATCT 

TTTATTCTCTTTGAAATTCTAGTGA(ZATATGTCT^ CACTAAATCATAGAT CTCTGAAGCCATCCGTACTT AGACTC C 

1 0 TGGTCCrTCTGTTAACATTTTTCTGCATAGACGAAGTAA 

ACAT CATTACAGTTCAATGGGAATT ACC CTCTACAAAGGTACTGTGAAAC CACAGTTG AAAAGTCTCCGTTCWM ATGATGl"TTrr 

GTAAT AAGTATTAAAAT CACTTCAATTAlTCAAACCniAAGCATTTTTT A G CTTT AATCTAGG AAA 

TTTGCCTTCAAAACCCACCCGAGACTCTGATCCTCTTTTGAGT 

TGGAATGACTCTGACTAGCTGCTTTCTGTGACITCAA 
1 5 TTGAGAAGGACTCGGCCATGGGAGAGCATGGAC1ATAGATCAGGCAGGAGTAGTCT 

ATCAG AG CCT ACX5 AGGACAAAGATTTCAATTT AG CACCAAACTTCAT ACAG AGGACAGAGGT AG CGAAATG AGAG CAAACT AAATT 

TATACACATCCAGTAGTGTGTACCTGCATATGGGAGAJSGTTCATAACT 

CAGC CATCAACCTGACAGAT ATTCATAGTG GTTT CTCTGCC^ 

AAGTGGCAGCTTCATTGCCACTACAGTGGGCTACZAGTGATCAGCAGTTGTCA 
2 0 GAGCAGCCACTGTCCCACCAGAGCATGGTATGGGGGTAACTTGGTTAATGCAG 

TTGT C^TG CAGTATAC CCATT AAAAAGGAGTAAGATTTTTG AG A CAG CAGG TTC 

TGTAT ATG CCATTCTCACAACTTTGCTG CGAG CT AG GTAATAT AATGTCCAGTTT AGGGATTG AG CCCAAGAGGTT CAG AGAGGTT 

AACTACATGGCTTAGGTTACACCGCTTGTTTTAATCCAGCCTGGCCAGAT^ 

CTAACT AC TTTGGATCTATGGCTGACT CTTTG AT CTTCAACATGGAAAT AAAAT ATGAAATATATTGTCCAGAATGT CTTTGTAAT 

2 5 TATATAAACATAATTCCCATTTCACAAGATTAGTATAAAGGGAGTTGGAT^ 

AAAGTTCACACAATAAACTGTAGAATTTTTTATATCAACT 

ATGTTTTTT ATCATAT ATGACCTTTGAGGAT AAGTGAG ATTGTAATGA CT CTTT 

CATACGTT CATCATGACAATGAGAAATTGACATACACTAG CTCACCCTGACCTGCATCTGCTGGATGACAACCTGGAAGCTTCTGT 
GTATTTCTTTCATCCCCAGGATGATGTTCCTTAGACTCTCTGATCCTCCAA 

3 0 GGGGACTGAGAACTCGGTGTCAGCTCAGGCGTTCCATTAG 

ACAGAGTTGCTGGAGCAGCTACTCCTCAGGTACACACCTAGCACCTTG^ 
ATCTGATGTGTGTGTATGAGC^GTTCAGAGGCCTTAGATTCAGTTCAAGTCT 
TTTCATATTCCC^C4GGTTCTTTGGCAGTAGGTGAAGAAGGCTC 
CAGTACTTTCTGTGCTTTCTTGTCTTGTTACCCACCTCA^ 

3 5 TACATCAGCCACAGTGACTTGTCATCTGTTTTGCCA^ 

AGAATAGACCTGTGCTCATTTATACTGAGAAGGTCTTAAATGAGACTTCACTATTTAAATGTTTGTTTCA^ 
GGATAC^GTGAATCAGGGTAAAGAATGAATTCCTTCCCCTTCCCA 
ATTCTTAGACAAAATTGAATATGGGTTTATTCATAATACCTGTC^ 
TC!TGTATCTGTTTACAA<^TGGTGCTTGCCTAGAAACAGTTGCCCATACT^ 

4 0 CACTGTTTACAAGTTTGACTGAAGCATACATGTAT^ 

ACCAAGCACACACATGTG(^CAGTTTACTAAGCATACATATGTACACATACTAC^ 
GTATGTCTCTCTCTATATATGCACATTCTATTAAATATACATGTGTACATATGTGCACATT 

GCACAT ATT CTG CTGAACAT ACAC1ATGT ATACAT ACTC ACATT CT ATG AAG CATATATACGAACACATG CACACAC CCT AGG AAAA 
ACTGACTT ATAAAATTT CTGTT C CATAAATTTCT AAATGAT AAGTTT TG GAT CATTATAT ATG ACCCTT CTTTTTTTT AT AAAAAC 

4 5 ATACTATTTTCAGCATTTTAAAATAAACAGCCTTGTGATATT^ 

AATATAAAAATTAGAAAAAATAAACTTTGAAATATTCACT^ 

AATCTACCATCTAGATATCTGTGCTAGAAAATATATAAAGCATTTGCTACTCTTGCTGGCAAAGA 

AGTGAATATGTTATGCAGTAACTGCAGAGGTGXSTTACAGTGCCATGTGCCGGGAAGTA^ 

GGTCAGTGCTACATGCAATGCGGGTGAGCITTATGAGACAGAGTGTAGTGGT 

5 0 CCTGAAGCGTACCTAGGAGTCAGACAGCTAGGCTGAGGGCAGGC 

CT AAG AAAC CACTT AGAAAC CT T CTGTCCAAT CGG AAAGAAATCT ATTT C CTT AAATACATG G AAATGG AAT CACC CTGAAG CTTT 
GAAAAATACTGCCGCCTAGATACCTTTCCTGGGAATTCAGACAAGGAGGGGTCG 
TTTAAATTCACATTTGGTGATTATGTGAGAAACAGACTTCCAACCAGACT 
CGTTGATTCATTTATAGTGGTAGACATGTAACATTCGTGAGCATCTTCTTTAGAA 

5 5 GCTC^TTCACTGAATGCCTACTGTAACCIAGGCATTGCTTTGTATGGTGG 

CTGACATCC CCCCTCCCAAAGAGG CCAAAGAAAGGAAAGTCTT ATAGTAAAAAAATT AAACATAACATAAGTAAAGCAC^ 
AGTGTGGGCTGCCCGGGATAGGGAATTACTAGTAATAATCATGTTTGTGTAGTTAACTAGGATGGTAATGTGAAATTGAATAA 
TTTCAGTTATAATGAAGCCTAATAACTGTAATGTAAGCAGAATGCT^ 
TTAGCAACTGGAAAATGTCCTTGGAAAGGTAATGCTAAACCAACCITCATTGG^ 

6 0 TG CACT CAGTATCAAAT ATACCAG CT ACATAGTT AGAC^ 

AGCCATTCCTATCTTTGAGATCTTACGAGCCCTCTG CAGGTCAGCTCCGGTAATG CAGGAGTGGCCATGGTCACTGGCTTTCCTAC 

ATTAGGATCCACTCTGAGCTGTAACCCACCCCTAACACAGTGTTCATAATTGGTC 

ACTGTATTTAACTTGGTGAGATTTAACGAGGCTGATTTTATTGGTGTGCTGTCATTATTGTGTCA^ 

T CCTG AATGTC TTTATGAAG GCAT TTGCCTT AAGTAAT TCCTAT AGTAT ACAGTGACATAAA.CATTT CAGT AGCTTGCT 

GGACTCTGCTAAATTTATAAC1AATT CTT CTATGTGGTCTGAGATTC CAT AC CCTC 

GCCTAGAATGTTCTTGGACAGAAGTTTAATGCATGATCACCATGCAGGACTGACCCTGTCATGGTGCTGC 

CAAAGGGAAAGCCTGAAGTTAGCTGCTGTTCAGGGTTTTGTGGAGAGACAAATGCCACTGGACCCAGTTTTACA 

TTCATAATACACIATGGAATTTTTGTATTTCCTGAGGGCAG 

7 0 TTCTAGGCTACTTCATTAGTCAAGGGCCACAGA 

ATGTTAGGGTGAATGTATCAGTGTGAGCTATGCTATC 

TCATATTGTTCTTAGTACAG ATGTGTACACAATG CAGAGC T GT CATGT AGGATGGATG CT CAACC ATGGTAT AGTC CTTT GGGACC 
TTATGCTCTCATATTAAAGAATGCCATTTACACATAGTAGATCCCTGGCTTCT 
TCCTGCCCGCTGACAGCCCATCCCATGTTTTTTAATAATTCTGTGGCATTTCCTGGGAAATGTTCT 
7 5 CATTTTCTGGAGAAGGTTAATTTCCATTGCTGAGTGCCCCACAAGTATGGGATGGGCT 
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GGTCAATGTGCTGGCTGTGGGGCTCTGTGATCCCCTGGGACATCTGTGTCTGCT 

TGAGAACAGACCTTTCCACTATAT CTGACAG CAGTTACACAG TGTGTTTCTCAGACACATTTCTAGGATTG AGTCACAAA 
GACTGGGTGAGCACAAACAAGTGTCTGTACTAAGATATTTAAGACGT^^ 
5 TCTGGAGATATGAATGCATATTCTGCCTGCTAGATTTGGAGA 

ACTAAATTAAAGACAGAATTAAATAGAAGTTAAGGAAGAAAGAATC 
TATCAATTTCGAATG-GAGACTTGCCATTAW 

ACCAGTG CATCAGCAG C CTTTAACCCTTACXZTGGGGCCTG TTTC C CCAAG CCTGG TTC CAG CAGAGAT CTTGC CGACTGCACCAAT 
GTTGGTCACGGGGAATCCTGGAGTTCCAGTGCCAGCAGCTGCCGCAGCTGCT 
10 TAGGATAAAGAGTTGAACC 

MOUSE SEQUENCE - mRNA 

GCAGCTTGGAAATTCXSGTGTCAAAGGGTCTGCCACGTTTTCATGCTTGC^ 
GAGCACACGGCTGAGTCCAGGCCTCCTCTAAACACCCATCT^ 
1 5 TAACAGTTTCATGCTGTGAATTTCTAGCGGAGGTCT^ 

GGGGGAAGCAGCXTITTCGGACATTTTCACAGTTATCrCACAC 
TTTTGGTTGCTGATTTTTTTTTTTCTATTT^ 

GAGAGGGGTTGGGTTTGTTGGTTTCATTGAACATTTAACTACCrrGTA 

CACAAAATGGCTAACACTGGAAGTATGT AGAGAGTTT CAAAG GGGG ACTTG CTCACG ACCAG ACACGGAATGTAAATTTG CACAT C 
2 0 CTTCGAAAAGCTGC CAAGTTGAAAATG GACG AGTAATCG C CTGCTT TG ATTCACTGAAAGGT CGTTG CTC CAG AGAGAACTG CAAA 

TATCTTCATCCACCCCCACACTTAAAAACACAGTTAGAGATAAATGGGCGGAATAACT^ 
GGCCCAGCAAATGCAGTTAGCCAATGCCATGATGCCCGGTC 
CCACCAGTGCATCMCAGCCTTTAACCCTTACCTGGGGCCTGTTTCC 
ATGTTGGTCACGGGGAATCCTGGAGTTCCAGTGCCAGCAGCTGCC^ 

2 5 GG TGTGT CGAGAGT ACCAG CGTGGCAATTGCAACAG AGGAGAAAATGACTGT CGGTTTGCTCAT C CTG CTGACAG CACAATG ATTG 

ATACCAATGACAACACAGTCACTGTCTGCATGGATTAC^TCAAGGGGAGATGCTCT 

GCACACCTCCAAGCCAAGATCAAGGCTGCCCAATACCAGGTCA 

GGGAATTCCTCAAGCTGTACTTCCCCCATTGCCAAAGAGGCCT 

GTATTTTCCAATACCAACAGG CTCT AGCCAACATGCAGTTACAG CAGCATACAG CATTTCT CC CAC CAGGCTCAATATTGTG CATG 

3 0 ACACCCGCTACAAGTGTTGATACCCATAATATCTGC^GAACATCTGACT 

ACAAAACAATCATACAAAGAGGAAAGGACAGTGTGCTTGATTAGA 

CAACACATACAAAAAT CCCTCAGCCACAAGACATCCACATATTG CATGTTAACCAGAAGAAACGACAACATGGGAAC CTGCTGCAC 

ACTGTTGCCTACACACTTTGTACATTCAATTGGTATTTGTGCTGAGGTC 

GTAGCACAGAGTTATGCATTAAAATATGCATACGTAAT^^ 

3 5 ACCATKIATTCTATTATGTATTGGTACGTCTGTAGACCAAG^ 

AACCAAGGTGCACCrTTCTATTTAAAATCGCCGTTAGAGCTGAGAGCGCGCAT^ 

CTGTAAT AGATTGT ACG ACATGGTGAG GG AGGGAACT G ACAG ATG AATGTGCCAAGCAAAACGACAACTGT^ CA 

CACCATGGCTTTAAGTACCATGTTGTTAAGGATTCTCATGAAGTGCCATAGACT 

TTATTGTTTCTGGAGCCACATTTTGTTGCTTATTTGCTAGTACTAATC^ 

4 0 ACCAAAGCTGTCTCAGAAATGGCCAATTTAACTTTACAGTAACA^ 

T ACTGG AGAT ATAT ATGAT AGATATATAAAATT AT TTT AATG CATTGTAGTGT AAT ATTT ATG CATACTCT ACTAT AT AACATGTT 
ATTCAAAAGGGATATG C CATTT CTGAG ACACAAT AACAAAAAATGTTTGAGG AAATT ATT^ C C T CTGTCAA 

AAGTCAAAAGACTAT AAATG CTTTGCAGAAATGGGTT CACGTTTGCTTAAACGCTTCAT CACAGTCACATTCAAAATAGTGACTCT 
AAACAAAG AGAACAG CACTGTCATCAGATGCATGATAAACCAAAATATGAAAATGGGAAATGTTTAAT^ 

4 5 GG TT AAGT ACATGGGTG AATTTTATATGTGATT CTTTTGT T CAG ATTAACTG CTT ATAG CCTTAG AAAGCCTTTT AAAAAAATTTT 

AAAAATAGATGTGCATTCAGTTTTTAAGAATGGATTCATCCAAAGGAATTCCCCTTTTTTC 
GGCTATTTTTGCTCTTCTTCAGCAGTTCTAA 

G AGTTCTGTT ATAGAACTTT C CATACTT CCAAGTTT ACTGCAAGTTTTT ATG CTTG AG AG AG ATGC TTT C T AATAT AAGACTGATG 
TGTTGATTTTCCTGATTGTACTGTACATCTATTAAAG CCTTAGATTAT T ACATT ACGGGTTGGAACCCATACCAATGTAATTT CAA 
50 TCGTGTTAAGAGAGTAATGGTGACTTCACATGTTATTGTAGTTAGTTACGTTATAGAATATTACTTATTTTTCTT 

GTTTTT CATTTCCTACATTTATTGG ATTTT CATTTT CT ATTAACAGTTG AAT ACCAT TTCAG T TTTTAGACTATTGrr nATTAGA 
TTTTACCAAT GAAT TTTTCAAAAT ACAAAAAAATT AAAGT AGTTTTTT CTTCATAACATACT CAGTTTTAAATT ACATGTAGTGT C 
ATATGAATATCCGTATTATTGTTAACTAAATGATTTATATTTTACTGATTTAAT^ 
GTCTAGTTTTCATTAAAAGAA<3AAAGATCTTTTATATG 

5 5 GCTATGGCAACAATCCTGGCAGACAATTGAGTAATATTTTGATGATTTATTTTG 

CCTAGATATTAGAATAAAATTTATTTTCTACTGTATCCATTTC^^ 

CAGG CCTTG TTAT AAAT AAG CTGCAT AAT CAATAAATAGAACAAGGGACTTTT TGTTG AT AAT C CAAATACTCAAAGTTTACGT AA 
TGAGAATTTTAGCGTGTGTGCAAACTrCnTGAGGGTTGATGATGCTGCAATTTAGC^ 

TTTG CACTTCTGT AT AT AGTCAAAAGAGAG AAACCTG TAT AAT AG CAAGATCTTATTTTG AAT AAAAACGTCTATAATT ACAAGGA 

6 0 GTTTTGTTAAGGCTAATGAAATGACAGACTGAGCAAAATTGCTTG 

GT CG CTTTG CT'ITT TGTGG ACAGCTTG TAGTTTGCCAGGAiT ri'T CAG CTGG AAAG ATTTGC CATCCTT CCAAG AT CTCATGACTG 
ACAAAACTCCATTGGG C CAAAT CTG CCTGAAG ATCATTAC CAAAAAAT AG CAGGTACTT CAG C CACT AAG ATGAAATCATGGATCA 
GAT ATC C CTT ACAT TGTTTTCAAAACTACTG CATG TTTAAAACTT CAACAAAAAGAGAG AAAG AACT ATGCT AAGG ACAT ATAT TA 
TT CAGAT CG AT ATC TAC CAATTTCAG TGGTTT AATGTTCACAAAATG AAAT CTTX3AAAATAACT ATTGACTTTCACAAAATTTTAA 

6 5 CCATAAACAGC^AAACCAAACAGCACACCTGTAGTTGTTCTGTGATTC TG CTGTAGATCATGTTCTTTCCGCAGGTG 

AAAAAAAAAAAAAAAAAAAAAAAAAA 

MOUSE SEQUENCE - CODING 

ATGG CTGTT AGTGT CACACCAATT CX3GGACACAAAATGG CTAAC^ GACTTGCTCACG 

7 0 ACCAGACACG GAATGT AAATTTG CACATCCTT CGAAAAG CTG CCAAG TTGAAAATGGACGAGTAATCGCCTGCTTTGATTCAC TGA 

AAGGTCGTTG CTCCAG AG AGAACTGCAAAT AT C TTCAT C CAC CCCCACACTT AAAAACACAGTTAGAG ATAAATGGGCGGAATA^ 
TTGATTCAGCAGAAGAACATGG CCATGCTGGCCCAG CAAATGCAGTTAGCCyUVTGCCATG ATGCCCGGTGCCCCGTTGCAGCCCGT 
GCCAATGTTTTCAGTTGCACCAAGCrTAGCCACCAGTGC^^ 
TTCCAGCAGAGATCTTGCCGACTGCACCAATGTTGGTCACG^GGAATCCT 
7 5 CAGAAG TTAATGCGGACAGACAGACTGGAGG TGTGTCG AG AGT ACCAG CGTGG CAATTGCAACAG AG G AGAAAATGACTGTCGGTT 
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TGCTCATCCTGCTCACAGCACAATGATTGATACCAAT^ 
GGGAAAAGTGCAA&TACTTCCATCCrCCCGCAC^ 
GCACAGGCTGGAGCTACTGCAGCTGCCATG 
CAACGGTGCCACCGCAGTCTTTAACACrGGTATTTTCC^ 
5 TTCTCCCACCAGGCTCAATATTGTGCATGACACCCGCTACAAGTGTTGA^ 



HUMAN SEQUENCE - GENOMIC 

ACATCTTGGGCTCCTTTTTAATTACATCCTACCTAATATAAAAAOT 

TAGCT ATATATTTCTTAAAATTTGT AATTTGATTTGAATCXTT CAT ATTTTGAAGAG ACTTGAAG CTTAACAAGAGCTTATrTTTTA 
1 0 AAAAACATATTCTGTATTTTAAAATTTCTAAT^ 
AGGATAATTCTCATGAAGTCATTTATTA^ 
TCTCCAAAATCTTATGCCTAATAACAGATTCTAAAGCCrGT 
TTCCTGCCCTGATTCTTCTAAAAAGTGTGTGGGTCATGTGCAG^ 

TGTTGCCAAGAG CAG CCAACAACAAAGG CAGCAGGG aAGGTTGC^CAAAGACCAAGGTTGCAACATCAATAACTACTGCCAGTG 
15 GAAACT AG C CCATGTGT CATACTTGGTATGTATATATCTGTTGCCAAT AACC C CAAAGTAGACAGTGTTTTCCITTT AGT C^CTTGA 
GTAATCTAGTCATAAACAAAACAGATTTACT^ C CACAT A 

TCGTCT AAGCATAC CTT LTTl'CT AACAATTAATTTTGGCTG C CTT CT ATG AA 

AGTT CAAGAGT AATCAAG AAAAGTTGACACCTTCTAGTGTCCACG TG GG CT TCTTGGTATTTAGGGACCTTGGCAAATTATTT ACA 
G CTT CTTTCAGACG AGC CTTC CAGAG CTTTGT AAG AATAAACAAACTT CAAAGGTCTCCG AAGGGGAAAAACTGTCCTGG CTAAGT 
2 0 CTAT AATTTAGGCT ATCTTTAGAGAG TACAACTC CATAAAGAGAG AGGT CCAAGAAACCAGCTTCTTTTT ATA 

TTT ACTTAATTGT AAAGACT AAATG ACAAGT AGGAAAGTG CTTTTG CAGAAG CGTGG G AGTCTT AAAG CTGTAG AAG GAAG AAACA 
CTGTG CATGGGAGT AAGT ATGTGGGTGGG AGTT ATAGATT ACTGCTC TT TTGCT AGTCT AGAAT AACT ATG CAT AAACAT CTTTAT 
AGT AGT AACTTG AACAGAAT CGTTAAAC TTCTATTAAAGG CACTTG AT ACATCATTCAAAATAAG CAC AGGTTAC CTTAT ATCCAG 
AAACAGAAAAAAAT GATTTT ACAT CCAGC CAAAAAAGATTTT AAAAGAGCCTTTG^ 

2 5 AATTTTTATTTTATTTTCAAATATTAATCCAAAAGC 

CTGTAGATATCATACCCTTAAAAGATTAAGCCCAGGAAGTGAATGACAGTAAC^ 

ATG ATCAGTCAG AG ATTT CTT C CG AC CC TGC CTTTAGAGGCTATGTGATGGACACCTTG CGACTTAAC^C^ 

ATG CAACT AGTAAT AGAATTCTGT CCITACAAATG AAGAAG AATT CAATGTG CACATGT AGT ACAAATACTGTGAAG ACTTTCAAC 
TATAATGTGAGACATTTGAATTCAATAAAAAAGGGAGAGAGCAATAAAATAATTATAATT 

3 0 CACACTCTCATTTTTCTCATTCACT 

TTGT C CTTGT T AT ACTG CAT ATTT CAAGAGGCTTCAGGTCTAGl"ri'i" T CTT ACCATGT AG C CT C AG ACTCT AGG AAAGACTGGAAT 
TATGCCAGGTGCTCAGTGACTATTGTAGACTCGATAGGAAAGTTC^ 

GAGCTG GTGAC CT ATGG CCAGAAAAGTT TG AAAAAGGAAAAGAATGC T C CACATTGT CTAAAAACACATTCATTAC TGGGAGGGCT 
CGATGGAATCAGGCCCTGTCAGCAGTTACTCATTGGTAAGCCGATT^ 

3 5 GAGGGTCAACTGGACTGACGGCACATTCACGAAGGCCATAAACTAAGATGTCTAGCATTATCTTCAA^ 

TG TT CTGCTAACTTGTTTGTGTTT AT AATTT AACT ATG TTAATTT AT TGAACX? AATTT AT AAAAT AT ATTGAATGT ACAGAAT AGC 
TTTTGTTGAATACACATAATTACATGTATGAGTATGTAACTGGTGAAGATA 

TTT ATCTCT CAG AGG CTCTG ATGT CACT CTCAAGGGGGTGCGT ATAAT CACAATT TCTTAG CAAGAGACAGGGTATAGAATTTTAA 

4 0 GG AC CIACTAATGCTTACCCGGCATAAAGTG AAACAT ATT TTGGTTGCCTA^ 

TACCACCTTGCACCACCAGTTTAGCTATCCACAGCTACT 

TCTAATTCTATATGAGATCGAGTTTCCTGCAGATAATTCTGTTTCrrATTGAGCCAAATAAACT 

AACAGGTCAATGGATTG CT AC CT ATTCTGGTTAATTTCCATTTCCTAAACl ClAGCTT L.TTGCAG AG TTT AG G CAAGAAG CTGGCA 
ACCTTGGCTGCGCTTTGAAAT CATCT AAGGGGCTTAAAAAAGATACAGACGCTTGGATGCCCCTCC CAGAAATTTTGATTTAATTT 

4 5 GTCTGAGTTGCAGCCGGCCTATCCAAAATGTAGTCAGGGTTAAGA^ 

TCATAATTACCTCACATACCTCTTGTTTTAAGTTCAAGGA 

CTITCATGAAAGACAAATAAAAAGGAAGGACATGGGGTTGGCAAGTCTGGGAAGGTGAGGAG 

CAATGAGCAGTTCATTAGGGTTAACAAGCTATCCTTTGTTAAAGCAAACGCTCTTGAGGGATTAA 

GAATGACTTTAATCAGATAATCTCCTATCIACCrrAGTTGAGGATTTGTTACCGAA 

5 0 GGCCAGTTTGGGATTCACTGAGGCAAGCCAA(^ 

TCTGTCCAATAGCCTTTCATGACTTTTTTTCTGACTTTCTT^ 

GTTTGTTGAATTTAACTGAATGGTTTCATTTGCTACGCTCTTTTCCAACTGTGATTTTTCCCTCACGTOT 

5 5 CTATGAACACCTTGCAAAAGACTTAATAACTTCTTTATCCCCAGTGCCAAGCACAGCCCTTGGCA 

ATG TGTTAAATGT ATTAAGGAACAAT AGC CAGTGTACTGATGC C TAAG TTAAG ATATTAAAATTAATTT ATGACTGT ACTGCTCAG 
TTCCCACTCITTTGGACAAGGCAAAGAGGAAAGAAGAGAAAAAGAAACA^ 

TGTTGTTCTT ATT CAAAGGT CT CTTTAGAAC!ATCTTTCGAGACATGATTTGAAGGT AATTT CAGG AAACT CAG AT AGGCAAGAAGG 
AATAAGAGGAGCTAACAAAAGATCCACAGAGAGAACTCACACAATGGAA 

6 0 GGAAAGGAAC CT CAAGAATACAATG AGT AAGT AG ATAG AT ACAGC CAACATTTAGGAACATTGAC!ACTG CT ATTG GCATTGTCAAA 

GATTGGTTTACCTTTAGAAATAAATAAGACAATGGCC^TATTTTTTCATATTTGT^ 
ATGTTTGGGTTACAGTACATAAAACAGTTTCACTACGACATTTATATT^ 

GGAATT CTGGG CTAAAT AACAACACAGAATTTT ATGAAA C 
ATAAATAGATAATATGTCAATTTTGATCTTCTGGCTGTATTCTATATTTAT^ 
65 TG CAT ATTT ACTTGT CTTTAACAACAGT C CAT AGTAG AAGGT ACGCAT ACATC CCAGG T AGAGTGTTT ATAAAT AT ATTACACATT 

AGACAGATGAATTC^GGAGCTTGGGGTACTGGCTGTGAA^ 

GT CAAGGGCCAACT CT C CAG CTTCCATT ACCT CATGAAAAATGAAGATTATAT AT AAATG ACAT GG GG G AGGTGG CCTTTGTTTTA 
TAAGATTGTCTTAAAAGATTAAATTTGGAAAAGAAATTGCATATATAGTACAATATTAACTATC 

G C CTAGAAGGAAATAAACTGTAATACAAAGAGTTCCCT ATC^ CAAAAT ATTCTAC CTTCA 

7 0 CT AAT TTCG AATTTT CT ATATTGTGTTG ACT CCAAAT AATAAATAAGT CATACCACATCTCC CTTC C CTCAA 

GTTATGTTATCTCGAGGATCATGAGTCTTATGGCTTCTCTTCAACT^ 

GAGGGTCTTCAACCAAATCACATTCCGCTGCTGGGTCCCCAGCCATTTGCTGTCTGTTGCCTCCTAGACAG 

TG CTGTCATT CT CAG CAGTGGTGGACTT CATG CAGTCT CTTCTGAGATTAGACT ATCAGCCTTTAC^^ CT 
AGTTATCTTTGTGAAAACAAAATTTAGATAATGTTCCATCACCCCACCCCACCCCT 
7 5 TC CCAGTGAGGAAAAGTATCATG GTTGTAGCTTTG CATTTTGAGTGTGATT AATTTGATGG AG AT T CATTTTTTC CCT CTTCCTTG 
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ACTGTAAGCTCCTTGAGGT TAGGT AGTTTTTGTTCACCATTGG CT CT CTAGCCCTACCATGAACCCTGGCCTT AGGGCTCTTG CCA 
TGCAGTCCCCACTGCTTTGAATGCTCTTTCCCCAGTTATTTCT CACTT CACCGAGAAC CTTG CCTGCATATGACCAGCTCAGT CCT 
CCATCCTGTTTCAC^CAACCITCTCrCCCA 
ATGTTTATCTTTT ATlUTri AT^^ 
5 TTATCTXjTGCTGTTCATTTCTGTAACCACAGTACOTAC^^ 

TAGAGTCATTGTGTGAAAAGTGTTTTCTGTATGGAAAGAAGAAAG 
AAAGAAGGAAGAAACAGAGGAAGGAAGAACTATACTC 
GGCCTATTTTAGAGCTTCTTCITGTCCTTATTGTCTTTARG 
GACACTGATGTCAGTTCTGCTCrGCCATGGTGC 
1 0 GCTCTCTAACATTCCATTCAATGCTAAAATTCTATGAT 

CAGATCCTAAAAT AATGATATTGAAGATACAGAT T CAACTTTT AGGT AAAATAACTTAATTC AGAA^ 

TT AAATTT CATAAACATTTAGTAGGAAATATATTGATTTTATATATTTTCACATTCAT GTGCAAT ATACGTGTTATACATACAAT A 
TATTGTATTATATATCAAATAT ATACATAGT ATCTATTA CACAAAGAAATTGTGTT 
GTGTATGAAATATATAAAGGTCCITGAAGATAAACTAAATTCATGTAATTCrTTCrCT 

ACTTGATTCT CCATACAAG AATGTTTGC^Cn'TTTGATTG CCTT TTAG AGTCATCTI^CTCT^G ACCATCTATATTT AAATATTGAT 
TAGGAAGCACAGGACATGAGACTAGGTTCTGGTTTTGACACCATC^ 
TTTTAAAAGAGTATACCTCCTTGAGATATTTTTATTrAGGTGATTTC 
GTGTGTGTCAGGAAAGAQ^GTTATATTGCAGAGGCAGTGAGAGCTTAT^ 
2 0 AATTCACACAGGCACTCACACACTTACATAGGTACCTTAAATA 

CTTTATTATTGGTAAACAAGGAGCTGACATTATCTATAAAAACCAT^ 
TAAAACTTGAATGGACTGAAATATTTCAGGAACCTGCAGTGAACACT 
TTGTTTTCTATAATTTCCCTCTCCACCC^TAAATGACCTTCTTTTTATACT 

CACTTCACAG AAATAGGAGAAACCAT AATGTT CT CCTGGCACTGAGGG AATAT CAATGGT ATTCT AAATGAGGAACTATTAACTTT 

2 5 TTATTACAATAATGATTTTTTAGTAATT 

G AATAGTTTTAACATGATCTGGTCTTTGTT ACAGTGCTTTAAAAT T ATCATT CATGTTAT AATGTTATC CAGTGAT A 
GTCAATTAGGCCAAATCAGTGGTAATTTAGTGTTATATCA 
TTTCTTAGCCATTCTGTTTTTCTGTATCTAATGTAAAATAACATTTTTTCT 
TCTAACATTAAAGGATCATCCTTCTATTTCTTTGTATTCTGTCATGGAGTAAAA 

3 0 TGCTCCCAGAAGTCCCCCCTCCTCCAGTGTTGAAGTAGAAATATAATTT^ 

TTTC^CCATGTTGCTCCAAACAGCAAATTTTCTAACCCACATC 

TGAG AACAATTGAAAAT AAATTT CTAC CAC TAGG ATTAGT CCT T AGTGGTT CGCAGGC ATT CTAAGCAG AGGGGCATGG AAGCT AG 
CTAGGGCACCAACCrrGCTAGCAGATGCAAAGTTTGCTCTACTAAGAA 

AAGACATAAAAAGGCAAGTAGCAAGCAATTCAGTAGGCTAAGCCAGGGATTTGAGAGTCTCCC^ 
35 CTGCATGAGTCATTTTTCGAGTCTGTCCACACCGA^ 

AACTTTCTGCAATACAAGTCCAGACTCCATCTCTGGCC 
GACTCGTTAGATCAACACAGACCTAOTTTTAGTGTCATOTCCTGCCACT 

CTGAGAAGGCTGGCAGACCCCTTGTATATACCCTGCTCCTTTCGTTTTCTTTGAGACAGGGTCTTT 
TCAAACTCCTGGCCTCAATCCTCCCTCCATGGCCrCCCAAAGTGCTG 

4 0 TCTTTTCTCTTCTCCACCTGGACTCrACTCCCTTTTGATCTTGCCAACTTCTACACAC 

CT C CAGGAACC CTT CCTTTGCCATC CT CAGA.GCC CTC!ATTGG ATTGCTTCC C ACAT ACAG CACTT ATC C CTTATG CAGAATTAAT C 
GGTTTGTCTCTCACCTACTAGAGGCGTTATCCTTGAGTGTAGK5GA 

AATGTAGTTGCTGAATGCTTAATAAGAGGAATGGAAGTCTTCGTAGCTTC<5GGTAGATACTAATGGGCAGATAGAAAGTTCC 
CCTGTTCTCTGAGAAAGTGTCAGTGTTCTTTTAGTTTG 

4 5 AGAGGCACCATTTAATTAATTAATTTGTTCTGGAAGGGATCAGTGTACAATT 

AAGAACTAAAGTCGGCTATTAGTGCCTCTGGAAGATATTTCTGCA 
CCTCACTCTCTTCAACAAAACCGAACCTGCAGGAAGAATC 

TTTTTCACTAGAATCATCCTTGAACAAGTTATCCGAATTCCTTACATTCAAAGGA 

C ATCACAGACG TT CAGG ATTCTTTCCATTCAAAG CAAAGC G^GTT ACAT CTT AT AGCACCCAGTGAATTATG AT ATGGGG CTGCG 

5 0 GTTTATTTGCT CTGG CGACAATATGG CTTACTTTG CTOTGGAATAT C C CGGT ATT TCAGAAGTAACATATTATTTT TAAAG AGGGAG 

CTTTGCTTTCTCTAAATCACATGGTGCAAAGTTGGGAATACAGTGGCCT 
CTGACATGTGTTTGTGCCTGTGTGACAGTTTCACTAAGGAATTCCAAAACT 
GCAGTCTTGCAGATGTTTTTCCCGCCTGTTAGCCCTCCAGGTTCAAAAACr 
CGCAGGATTOTAGTCTAATTGGCTTCTAACCTrTCAATCTAACCTTGCCGGCTTCCTTG 
55 AATGAACCCACTGGTt3TGCCTAGCAGTCGGTAGAAGAAGC^ 

ACGCATGAGATGGAAGATACCAACGGGAGGCCGAGGGGATCCACGGCGCCCGCGCGGGCTCC^ 
CTGGGCGACCGCCCATGACCCGCTCTTGCGGGCTCTGTCCGGTTGAGAGGCGACCCT 
GCGGAAAGTTGAAGAGCGTTTTTCTCGCCGCCGCGTGCATTAGGAGCTCGACGAGTCCGCCCTC 
GGCGGGGGAGGGGCCGCGCAGCAGCAGCGGAAGCCAGACC^ 

6 0 CAGCTGAATGAGTTCTCGCGCCC^CAATGCT 

GACTCCCACAATCCCTTGGGCACTTGGTCGACAGTGGGGCCGCCT 

TTCCCCGCGCCGCTTCATTGTTCGGACTCCGGCGGGTCTGCCCGTGGCTGAAGGAGGAAGTGCGAGGAGGTGCT^ 
TTCTCCCGGGCAGGGGCGGGCCGCTCGCGGTAGTTGGTTT 
CGCAGCCGCTCCTGCGCCCTTCCCTGCCGCGGGCTCT 
65 ACCCAGGGAGAAACAGGGGGGCGAGAAGAGAAACAGGTGCGTC 
GAGTCCCTCTGAGTCGAGCGGCGCGCGGGTCTTCCCGGCXM 

TCTGGGCCAAATATGTGCACGCAAAGGTTAATTCTCTGCTTGCTACCCATAAATTACGCATGACATGTT^ 

ATGGAGAGTGCTGTATTTCAGGCCAGTTTGCCTTTrGCCAAGTT^ 

TTTTCTTCTTCTTCTTTTTTTTTTGATCGTTTTC 

7 0 GGAGC03CAGTT<3GTTGAATTGCTTTCATGAAA 

GCAGGACAAATATGTAGCCCCCCCCCCAACTTTTTTTTTTTTTTTTA 

TGAGAATACTTTGATTCCAGAGCCTGTAACAOT 

CAGGGGTGTTGCTGCCAGAGTAATATTGTATTCI^AAGGCTCTCT 

TTTCCGTGGAGTATGTCTCCCCAGGATTAAAGTTAAGAAGCTGCCGGAAAGGTTGTTGCCGCTGACTT 
7 5 GCACAGG CACAAGT CATAGCAGATGGATGG CAAACT CTG CCATAT ACT CTGAGGAG TTTCTT AAATT AATACG CTAGGAGAGAAGT 
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CATAATTAATGCTTTTiVGACTTC 

TTTGAACACAGTGTCTGTTTATACAavGJUSGGTGTTTAG 
TCTCTGTGTGGATTCCTAAGTGTGTTCGTGTTTCT 
CGACTGTAAAATTTATCCCAGGAATGTGCAGTTC^TACTTAGAATGT 
5 TTTTGTAGCCTGAGAATGCACATCGCTGTCACTTGTATCT 
ACCTAATGTTCXAGTTTTTATCTGTTTTGTGTA 
AATGTAGGCATAGCTTTTTTAGTCTTATra 
GAACAGGTTGC^TCAGCAACAGAAAACCTGTTCATCTA^ 
AGAGACTCAGAAAAACCTCAOVTATGCCT^ 
1 0 GTGTGGTACTCTGCGC^GTCATGCCAAAGAGAAA 

TTGAGGCAAAGTTAGACTCCAAGTTGJATTCATTAAATTTTGATTC^ 
ATGAAACACTGGGTTTTGTTCCAGAACAAGAACAG 

TCATTGAAAATTGTATTTCTCATTGAGQAAGX3GTGTGTTACTTTATTATAT 
CTTGCATTTGTCACTGTTGCCATACATTAGGCCTGTATTTGATA^ 
1 5 CTGGCTTTCTGCTTGGTGATAGATGTGCGGATTAGAGAAC^TCTCAG 

TTTTTAAAATTCTGTATGCAAAGGCATTAGGGTGGGTAGTAGATATGAG 

CG AAGTGATTGG CC CAAAATAAAAT CC CATCAACCATATGG ATCATAG AG AATGTGACA.T AGAAAG ATTTT AAAGTGTATGGTTCT 
TTTAGGCCAAAAAGCTTCATATGTTTGGAAATCTAGAATGTGCTGTCCM 
GTGTCTTGTCCAGATTGAGAAGTGTAAAATACTACrc 
2 0 TTTTTATATTTATGATATGTTTTAATGATAATTTTTGATATATTGGTTCTAATA^ 
TTTTCTCTCrCTCTCTCTCTCTCTCTCTCTCTIT 

CAGTGTTGTTCTAGG ATAAAG CAGG AAAACAAT ATT CTGGG AGAAAG CTT AAAGATTTTGCTGGTCAAAACAAT AAAATT GGCTT A 
TTTTATTATACATAGAAATACCTCGTCTTTCTTATCCATTGTAACCAA 

TCCTCAAAGCTGCTATTTAGTTTAGAGATTTGATGGTGAGGTAGTAATAATAGAGAAAAACTTATAATTA 

2 5 ACCTCTCAAAGGCCAAAATAGAGTATGATTGTTACTTA 

CAT AAACGAGTCACTAAAATTATTTTA^WVTAGAGATTTTAGT AAAT AG CAATTAT CTTCAAAT ACTT AATATGTGTTTTCTTTTG 
CTAGAACT1TCTTTAGAAAGTTAAAAAAAAAAACCCATATCCAAAGT 
AATGAAAGATTTTTACCTGCAGATAGGAGGGTATTCTTTTGAATATTAGCAAAG 
CTCTGAAGAATTTCTATACATGGAGACACTGTCTITATAATTTC 

3 0 ATGGCATAGGTCCTGGAAACATTCCTTTCAGATATTTTCATAACTGAAGTACATTTTG 

AAATGTGGTTCTTTTCTAGCCATATAAAGTATGCTTACTATTATTTTCTCGAAGGATGCCAAAA 
TTCTTCGTGGGGAGGTTAGCTAGTTTGTATATAGAGCCTCAAGAAGAGTTGCCT 
TAGATGAGGCATAGCTTCCATTAGTGTCAGTGCGTGTGTGGAGGGTGAGGGGAAGAATGATCCCC^ 
TGATGAGTCATGGAGCATATTCACTAATGGTTGGAGAATTATTGATTGCA^ 

3 5 AITAACTGTCTCTAGACATATTTTTTATTTTTAAATTTGA^ 

TAAACGTTTATATAAGCATTACAGATAt^TCCTTGAGGAAAAAAATAGTAAATACATGGT^ 
TTTTCATAACAGATCCAGAACATCTCTTTCAGATAATCGT^ 
TAAAAATATGTGTAACACACATCTTGAAATATTTGTGTTAGGAGAATT 
TTATGTTCTCTAGGTTACTATGATTATCTTTTGTTAAAACAGTTCAGGAAATATT 

4 0 AGAGTAGAGAATTGCTTTTCCACAGAGACCAGTTATTCCAAAAAGG 

AAAGAAAAAC CATCArCTCTTTTGACATTCTT TTAGGAAAC TG AGAGATGT ATGTTTTATATACCAAAAG AGGTGAAT AAT AGTTG 
ATAAAGGTATGCAAAATACaTACTCCTGTGAAGTGTGAACATACAATTTG 
GACAATGCCAATTTATTCAAATCACCTGAATGTTAAATAATATCATAC^ 
CAGTGATGATGTCCATGCTGATGTCAACTCCATTCCAGTCCTGCT^ 

4 5 TAGTTTTCCAAAAATTTGTATTTGACACTGGATTTTTTCCTACGTTTTCACTTTTAAAAGTTTAGAA 

CnTGTTTTAGAGAGCTAAGAAGATTCTAGATTTAT 
TTATTCTATCATATAAAGTTGATTTATAAGTGAATACATATTGTGAA 
ATCAAGATAGATATACGTTGTCCTCAAAAAAATAAGAATCTGGTAGGGAA 
GATATTTAATAAACAATGGCTGGGTAACATGGTATGAAAGTTCAGATAAGGGAAAGTCT 

5 0 CATG TAAT AAAGGG AACAGTC CTT AAATGCAT AAAT AATGGGAACAAAAGGTATTCAGT AT TTTT CAG CCTTAG CAAAATATTATT 

CTGT T AACGC AGT ATT CT ATT CAGCAATGGTT CCAT ATTTT CCAT ATGTTT AT^ 

TTTT AATTCAT TT ATTT ATTT CTTTT ATCTAGG AG AAAT AT AATG GTTTC CAAATG TCCTAT ACTAGATT ATTGAT AGATTTTGTC 

ATTAGTTTTACTATTGAATACCATGAAATATCAGCTGTTCC^GCCACT 

AATCCATGGTTATATGCTTGTGCTATGCTAAATATAAAGGAT^ 

5 5 ATCAAAGATATTTTCTTAACTTGTCTCAGTATTAGAATTTACAGCCC^ 

ATTTTAGCACTCTTTGTGGACGGATAACTCTTCTAGTAGAAAGAAAGCAGGAGGGTGGTCAAGCT 
GGAGGACATAGGGCTTTCTGTGGGTAAGAAATGGATGA 

CAAAGGCCCCCTTACTGATGTTTCTGTTTACCAGATCTTGGTTTCAAGATGCCAGGTTCTGAA 
60 AACACTATATCTAAATTTGGTTTGAAAGTCTAATGTTTGGCTGTGTGCAGTGGCT 

GAGGCAGGCAGATCACGAGGTC^GGAGATTGAGGCCGTCTTGGCCAACATGGTGAAACCCCGTOT 

GCTGGGCATGGTGGCGCATGCCTGTAATCCCAGTTACTTGGGAGCTGAGGCAGGGGGATTG 

AGTG AGCTGAG ATCAT GC CACTGCACT CCAG T CTGG CAACAG AGCAAGACT CTTGT CTCAAAAAAAAAAAAAAA^ 

G CT AATGT TT ATAC CAGG CAGG TAGAAAAACAAAAACGGACCTCATTAG AGGATACATGAGGTTT CTAAAAGTC 

6 5 AAATGTAAGAAGTGATATCAATATTATTATTATTTTA^ 

AACAAT AGATTGGGG AAAAACCAT TTACTCTTTTCATG C C CT AGTTT CrAAATCTGTCACATTATAAAGGTGT AAATATTTATGAA 
AATAGTAGAAGTTGCCACTGGCTGTATTCTTTTAAGTAAGGTATCTCATTTAATCTTTTCTACTG 

AG ACTT TT ACCT T AAAG AT CATG ACT AG AACAATTT TTTCATTTG CGGAGATATGAGGCTTG TCATT^ CAATGT TCA 

TGTTTTTAACCAATTTTTCTTGTCTTGCCTCCACTGATTATATATAAAGAATAGCGATGAGAATATAGC 

7 0 A(^TCTTTGAATACTTTCTGTGTGCTAAGCTAAGGTTTATAATTAATTCATTTATTCCT 

TTAATATTCTGATTTTACAGATAGGGGAACTGAAGCACAGACAAGTCCAATTAATTOT 

AAT AAGGAGTTG AACCAAGGTAGC CCAGCTTT AGAAT CCATG CTCCTG TGTAAAAGAATAAAAACAG ATTTT AAAAATACTTATTT 
TGGATGCTGATAGGAGCTTGAGAAC^TAAACGTTTGGAATACrTTAAAAGACrTGTACTT 
AGAATGGGCAGAAAGTAAGAGGATAGAGAGAAAAGGCCTGGGGTGAGGGGTGGGAAGCATTAATAGGA 
7 5 CAGGTGTT C AAGGG AGATTG GTGCT ACACTTTTGAGGTCATTGATGACCAGCT 
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ACAGACACTTGGTGTTTAGGGTAGfAACCTT 
TAGAGATTGTGGTTATAOVGAAaUSAGATGAGGGACTAG^ 

GAAATGTGATAAGGTTAGGCTGCATAGGATGTACCAGCTGACTGGATGTGGATATTAAA 

TAAAATACAGCAAGAATTTAAAAAACTrrrTAATCAA 
5 ATTTAAAATTGTATTTTATCAGGGATTATAACTTAAGTAGA 

CTTT CAAAGCACTGGATGGTGATCTAAAGGTC^CCAAATTCTTTGTCC CMAT TGGTAATTTTAAAGT AAACAT AATTTCAACGTT 

AGGCCTATTTTGTTATTTGCTGAGAAATGTCTCAATGTO^ 

GTGTAACCTATCCTTAGTGATAAACAAGGTGGCAGATA 

ATACTTCTCAGACCTTGCTATTTTAGGGAAGGTATCAGTATAACC^ 
10 GTCACTATTATAAACAGAATGATTTACAT^ 

TATATTTCAAATAGAATAGAAAGCTTTATTGGAA 

GGTTTAAAATAGTTTAAACAGAAAATGTCAGAACATTGATAGATTAT 

AGAC^VGTTTTCAAGAAAAATATTCTATTTATAAATATTTAC^ 

GAAATTACACTATTTATGTCTTTAAATATAGCATGTTTCTATAAACT 
1 5 GTATT ATGTGAAAGT CACACTTATTACATGCCTTTTTCT ATTATT AG^ 

T ATGGTGAAAATCTATTTAG CTATGTCATTAAG AT AATGCTTAGAAAATG CTAATTTTT AAGT AACT AAGATT AAGT AGTATTGGA 

TATTCTTTGCCTTCTTTATTATTCTGTCATTGGGCT 

CTTCATTGCTGTAACATTCAGGAAAAACTGCATTGTCTACC 

AAGACTTTTAAGAGGTTTGATAAGCATAGCAAAAGTGAACGACGGAGGATA 

2 0 AGCATTGCATGACCTTrCCAAAAAATAATTTGAGAAGTGCCTA^ 

TAG AAAAAAT AT ACT CAAGT AG AATTCTG CAT AATG CTTTCTGGG GG ACCACATTTG CTG AAAAT TGC CTACGGGAAATGACTTT C 
TCAGAGTGATATAGTTTTGAGGACTGAGATTTTAAAAGTA 

AGAGG AATGAATT CATCTTTTGTGTT TTGCTAAATAATCAGTTAACAAACAT ATTCT G AG CACGTTT AAAGT CAAAGGATTGTAGT 

ACTTAATGCAAGGCACT ATACAAGTT AGT AC CAAAGGAGGAAACTGAAAGAGGGAAG G AAATTCAAAGAAGGAAGTTTTTGTGG AA 
CTGAATGGCAAGGGAATGCTGTGGGGAAGAGGGGGC^TCTGTCATAGGCCCAA 
ATAGTAAGAATAAGGTGGAAGTAGTGATGAAAGCTACCATATTTGAGCAGCGTCTTTTCrATOT 
TTTTTTGGAAAAAAACIAGATAGACAACCTGGTAGTGATAGTAGACAATGAATGAATTGG 

3 0 T AGAGG ACACAGACTTAG AG AAATTATCTT ATTTGC CTAAATC CACATAGCTGG TG AT CAG CCG AACTGAACAGAAGCCAGGT CTT 

TTCATACTTCAAAACCAGCATTCCCCAACCCCATCCCTCCCCATCCCATC 
AGTAATTTTAAAACTTCACCCATACAACAACTGACATAAGTC^ 

ATCTTTTTTGCTTTGATGATGGGTTACAAATAGCGTATCGTTTGACAAATGCTTATACTCTGTTTCTTGG 
AAACAAACTAGGCCTCCCATCTT'CAGTAGATATGGTATGGAGTTGCAGTATCATGATGTTGT 

3 5 ACCCATTTGGGATTCCCTCAGAGGAAGCTTGTGAAGTTTGAAGATAATGTACTTTCTCACC^ 

ACCAAGAAATGATAGGGGCAAAAGTTTATAGATCATGGT^ 

CAGTCTCTGCTCTCTGTAACTTTACATGTGATGTAGCCTGTGACT 

ACCTGTCACTTTGCTAACAAATTCCTCTCTTCATAGAGCTACT 

AGAAAAATTTATTT ATAT AAAATGAAAGTCT TTGC T TTTGGCTGC AAAAGAAAG CAT C CT T AAT AG CAAAT ATTGATTGCAGTG CC 

4 0 AGTCTGCCATGTATCATCCTATACCCTTTCGAGGAAAA 

COTAAAACCATTCGGTAGACAGGTTAACTCTTTAGATACTTTGTATATATTC 

ATGATGTGATTTGATATGTACACCATTTAGTTTTTCTGTT 

ACTAGACAGAAGAAATTAATTTATTTCTGATGCTC 

4 5 CTAAATTCTTGATTGGAAGCAGGTTAAAAGACTCAGGT^ 

TGTATTTAACATT AATGACAGAG AATTT TAAATATG CACAT G ATT CT CTTT TTCCT CCAG CT TTTCTTT CTG CACAGG AAATAAAG 
CTGTATTTTGTAGGCCAGCTTCCAGACTTTGAAATGGTGAAAGTGCAAOT 

CGTGTTGTTGTTTCCAAGATTTAATCTG CATTCTAGTTTAG AAAATTCAGGATTTTATTG CTTATTATTATTCATTTTTATTCATA 
AATATTTGTTTAAAGCCCATTACATATAAAGTTTTACTATAAGATTTTAATTG 

5 0 GATTTTCTGAAGTTTATGGCCTATAGrAAAAAGGTCC^TTCATAAG 

CCATTCTAGTAATT ATT AAAAT AGOTACTTTCTTCGTAGGAACTTATTATTTT AC ATGTG TGAT C C CATAGCTCTTACACAATTCT 
GTGTGATTTAGGCATTGCTATCTTTCTGGCTCCCAAGTGAGAAGATTGAGGCT^ 

GTCATATGATAAGGTACTACTGTGAGAGCTGAGATCTCAACCCAGGAGTTTTGCTGTTTTTCTGCTTGT^ 

GTATGAATT AGGCACACAAAT ACT CATTGCATG T AGATT TGGG ACCTACT CCAAG TTAATT CTGGAAAG ACTCAGTGAAG AAG ATA 

5 5 AGATGTGGAAGAGG TTTTTTTT AAGCATTATGG TCATG AAAG AGCAGACT ATG<5AATG G ACTTTG AAGTTTC ACTAGCTTTGAAAG 

GAGGGGTATATCAAATGGCCTAATCTGCCTGTGTGGTTTGCTCAGTATTA 

CGCCAAGATCAGCAAAATCTACTTTTAGACTCAAAATCATT^ 

CTGTCAATTGCTCTGTTGGATGGCTTTGCTGAATGGCACCTTTTAAGAAGTGCCTT 

AAT CT ATT CAG CTGCATT AT TG AAGAT ATT AGGGTAATT CAATT CATTTTTT CAT CACT AAAAC ATAG AAAT CTGTTG C CAACATT 

6 0 AAAATAGACAAACATATCATTGAAAATTTAATAGCTCTACATTCA.CTTGGAGATTTC 

CCTTAATAATTAGAGCATGTCCATGTGTGTATGTGTCC^^ 

GATTGTGAAGTGGTACACAAGTTGTAGCTAAAAATGAAATAGAATAAATGTATGA^^ 
TAGTTGACAAATTTGCTCTGAAATATTGATACCATTTTGTGGTTCCCTGTATTGAA 

C C ACAGACC CTAATT AG AAACAAAAT AAAGGTATTCT AAT ATACCT^ CATATAACAGTTTT 
65 CAGCTTGTCAGTGACCTGATATCAATGGATTTGAAAATGATTTTACCT 

TGTATTC^GATTAAAGATTCCATGCTAATATACTAATTTTTTGGCAAGAA 

GTAAATG AAAT AT ATTAAACATGCAGATTCTGTT ACAT T CACC TAAT ATT AATTTG AATTT TTAATATT C TAGG CACTG TG CT AGA 
T AT TGAAT ACAT AATGGTAAACAT AAACAGAT AGT AAACAGT T ATAAAATT T AG ACATCT C T CAATTTTCAAATTGG C CATTGCAG 
AAATAGTG ACATT TTATTAG GT TTTGTTGCCACAAG ATTTTCAATAATT AAAAAAAAATT ATTT^ TTAT 
70 TTTACAGCAT C TAGATG C CAGT AAATATTTATTTT ACAGCAT CTACC AATGTT AGATTT ACT ATG CACGAAG G AAGG ATTTTATTT 

TCATG TAAAGAAAAATG C CATT TT ATGCTTAG AAT CAATG CTTT ACAATTAAACAAGAG AAAT AG CATCACAAAAT AATACGG GAA 
GAGAAAACJTCATTTTGAGTAGCAGCAAAATATTGTGAGTTT'AAAAGGTCA^ 
TGCAGAATGCTGTGACAGCCAGGGCTGATACAGCTGAAAATATCTGCAGCTTTGGAAGCCT 
CATCATGATAGGAGCACAGATTCACATGGAAGGTCAAACTGGACAGCC^ 

7 5 TTTTAGAGCAATCACCAACTGCAGTAACTGAGTTTTTTAATTCAATGC^ 
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TTACTTGGTGGGGAAATTATAAATTACCTTCTCT^ 

GTTACTTTAATTTTArrTACGTGTGA^^ 

T0K3TGGTGGTGTTTTTAATCAATTAG 

TTCCTGGCAACTGAAGCAGAAGCTTTAGGAAGTGAAAAAGTCCTGAGTT^ 

TATTCITAGCCCTTTGGGTGTTTAGTTAAATATACrniA.CT 

AAGTGAAGGCTGAATTCTTTAGCCTGCCTTTAAGGCCCTTACTGATTTGGCT^ 

CCCATTACATACAGOTGTTCCATACAGACCAGTTCTTCCAGTGTTGA 

ATTTCTACAGTTTGGATTTCTACTCCTTTTTGTCrrCT 

TTTGAAACTAACCTTGACCTCTCTCT^ 

T AGTTATTTGTGTG TTAGTTTTATAC C TTTAATTAAACAT CAAATT CTTAAAGGCTGGAT CCAAGTATTCTCG CTATATCTAGG CT 
ACATATAAAAGGTCGTCAATACACAGTTAATATAATCAAT^ 
TAATAAAGTTACTGTTGAATGTATTTGAATTTCTTGAAGGAAAACAra 
TAAAAAAAGAAGCACAATTAGAGAAACACACTCCTGTAATTT^ 

TTTTTTTAATTGTTTTGACTTTGGG TTITCT^ CTTAAT ATTTGTGATTTCTCTGTGC CAAAAAGTAACCAATAT 
TAATAGAGAAGAGCTTTAGTACATO 

TAATGGTTTTATCAGCTGTTAATTAGGTTGGTGCAAAAGTAATTGC^ 
ACCACCCTGATAAAAAATTGGCCTGACTATATTGCTTT 

G TTTTAG CAGTTT ATCTAGATCAGTCTTACTGTGTG CATAGTCAGCAT ACAT TT TT ATACTTTTAAAGCGGCATTGCATTTTA 

CCTACGTGGAAGTTTTCTGCAGACATATTTTAAGCTTCTCTAAGAAAAGCTCT 

TGATGAATAAATTTAAAAAAATCATGCTCAAAGACCACT 

TCAT AATCTATT CAAAATTGAT T AACAAT CTT AATTG AG AAGACATTTGT ATCTG AAG T GTAGAATTTT CTTGTCTCCAGGTATAC 

ATTATTAATGTTTAGTGTCTACCCCATGGACATAAAAGAGCAATAGCATCCGTTCTCAGCTCTGTA 

TCTAGAATCAATATTCAGGGGTTTCTCCTTCACTTACACaCACCTCACTCTGAGCTAACC^ 

AATAAATCCCACTCACCCCCTATCTCTCTCCCrTCTTTTGGCTTCTGTTGTATATATTAGT^ 

GGTATTAAATTGCTAGCATGTATCGTGTATCGTGAATGAAGTTTTACATTTGAAATCAGAAGAGCT 

CCATTTATTTAGCATAGGGATTTTAATGGGTCAGTCACTTAACCATGCAAGCCTCG 

ATAATATAACCAACCATCTCTTAGAGTTGTGAGGAACAAATAGTAATTAGGAT 

ATTTGCTATTTTTGAATAATAGAATGTATAAGTTGTTGGGAAAGTAAATTTCTATGCT^ 

TTCAATTTTTTTTTTTTTTTTTTGAGACAGGGTCTCCCT 

CCTCCCGGGCTCAAACAATCCTCCTGCCTCAGCCTTCCAAGTAGCTTGTACT 

TATTTTCTGTAGAGACGGAGGTCTCACTGTGTTGTTCAGGCTGGTCTCG 

TCAAAGTGCTGCGATTATAGGTATGAGCCACTGTGTCCATCCTGAAACTTTT^ 

C AG AT TTTATGTATAAG CCAGAAAGAAAG CTCAG TACATGT AT AG AATGTAGCACAAGC CAACTTTTCAAAAATT AATCTTTTT AA 
GGGG CATTATGTAATG AAG G AGCAT ACATT TTTAGTTTT AT CCAGGATAAAAG T^GTTCTTTTTC CCCCATTAACTTTAACACAG A 
GGTTTT ACTTTAATTTAAATTCCAAG G ACT AGGTTTTTG TTGTAT ATGAAGAAAACGATTTGACTAAAT CCAATGTAAATGTTTTT 
CCTTATTCTTTGGGGGCACACAGATACACACATTAATC 

TATAATACAAATATTTAATTATATTTTATTTATAATATAAATATTTATATTT^ 

TATAATA TAAA TATTTGTATTTTATTTATAATATAAATATTTATATTTTATTTAT 

AAATATATTTTATTTATAATATAAATATTTATATTTT^ 

TTT AT AATATAAATATTTAATTAT ATTT T AT AG GTTGAATAGGTTCCTTGTTT AT CT TTTTTG CAAAATAATTTTATTGTT ATAAA 
TTGTTCTTATT ATAAG CAATAAAT AAG G AAC CT AAAT ATC ACAAT CTGT AAAG CATTTT ATTTGTGTGTGATTTT AT ATGAT AGTT 
CATTAGACATTAGATAGGAAAGTATGGAAAGTTTTGAGATTGCTACATGAT^ 
GTTCTGAGGGATTTTTTTACCAGTGAAGTAGCTT^ 

TCAGGCATGATTAATAGTTGTATATGGTGGCCGGGCGTGGTGGCTCATGCCTGTAATCCCAGCACTT^ 

GCTCACGAGGTCAGGAGATTCAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTGAAA^ 

GTGGTGGGTGCCTGTAGTCCCAGCTACTCAiSGAGGCTGAGGCAGGAGAATGGCATGAACCCAG 

TCACACCACTGCACTCCAGCGTGGGTGACAGTGTGAGACTC CG T C TCAAAAAAAAAAAAAAATGGTTGTATATGGTGTAGTATCTT 
AGAGTGATGTGCAGTGGCATGCATGATTTAAAATAATATTAATAATCATAACA 

CTTGCTGAAGTAGTCG ATGGAATATAATCAAAG ATTG TGTCACCAAAT CTTT AAACAGG ACATTT TATCAG AGAAGATACT CTGGT 

TTCTTAGGTAAAGTTATAGGGTAGAAACACAGTAAGCAGTCT 

AATTT AATGG AATTAAAAATAACCTGGGAAAACATAGA 

ATAT AT TTTTAAT AGTTACCCACGAGT CACTGATT CAGAT AATCCAATAAT C TTT CTGG AG CATCAGT ATCCC AGAGAAACAAATG 
TTGCCGTTAATAAACAATTTGCAGTTATTTCAGTTTA 

CTTTTT AAG AGCATATGATG CTG ATGGAAACATGTAGAG C TTTAATCG CTGTCTCATT AAAAGGATGT AGTTTATTCTTTGTGTTT 
CCTTAT^GTTACAGCTTTTCCAGAATGCACAGAACCTTATGT^ 

AAAGGAACTAAATCT CATG CTC CCTTATAGTTT CTG TTCAATATGC ATT CTCTTTTG AAAGT ACAAATTTGT AATAAT^ TT 



CTt^CATTCTGCCAACCAACCATATTTGAAGTCTTTTA 

AAG AAAGAG AGTTTG AGG AGG ACAAAGTGAGGGCGGAATGTTTTT ATTT^ C AT AAT CATAT AA 
GAGTATATACCTATCTTGGTTATTTTAATTAGAAGTGCATGCTGCT 

GTGGAACCCCCGCCTCAG<3GAAAATTACTACATCGGAAAATATGCTTGGGCCTACATGAGTTTTCCAGA 

CTAGTTTGATAATGCTTTGCATTATGTATAGCAGTTGTCAGAAACCATTTCCTTTA^ 

GCAGATATTTTCTGACATTTGTAGATATA^ 

AAATACAACATGCCCTGCAGATTCTTAAAACTACACATTTTCCTTT^ 
TTGATACCCTGTCAACCCAAACATGTATTCCAAAT^ 

TTT TAGGAATGAAATG CCATGTTTTGATG CAACAGG AGTTGAAAAACTGATT TTACAAATGTTT CCATTG ATT AG CTGCTAT ATGA 
GGGCTAACAGTTGAAAATATTTTGGAAAAATACAAA^ 

GTTCCCACAAGAGAAATTTACAGTGTAGATAAAGTACAG CATAGGCTTAACAAATATAGAAAAATATGTT ATTG ACT CTTGGTGGA 
AGTAC^TATT CX3T AAAGGTTTTCTTAAAG TTTT TTAT TCT AGTT CAAAGTT AATACATAT ATG AATACACTTCATGTT CTAAGGAT 
TTATATTATTAAAATCACTCATAAG CAT AGCTTTTT AT ATGCAATTTCAG CATTG ATTAAAAG AGTG AAAAATGGG CATTAACTTA 
GTTGACTACTGATAACAAAATGGACATATAGTGGAGTAATATATAATTGTTAAAAAGTAAT^ 
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GTGAAACCTTCTTCCAAGGATATTTAAGTTACATTGAAGTGAAAATAGGAGTTC 

AGTCTTTTAAACCAGGGAAAAACAACTGGGTCTTGAGTCAGTGTTAGTCAGTGAAAACATTATT 

TT^GTTATTITTTTGAAAAAAAGTCAAGTTAAACreAT^ 
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AACATTGCTATATTTTTTAAAAGTATCTTTGM 
TCCATAGGATTTGTAATCCTAAAGTGTGTAGCACCTATCT 
ATAGGGACTTTTACrcATTATATGGAATGGAGG 
GGATACATTCGAGCAATTCCATACTCATTCGTCATTGGT^ 
5 ATTATAGGCATAGATTTTTCTATACTTTTTTTCATAATGACT 

CTAATACAATTTTCCTGATTGAGGTTATCACACCAGGTTTAAAATCAGA 

TTCTGTGGTCAGATTTAGTGGGTAGTACTGAGGGCAATTATCACATTAA^ 

TTTGATCCTAATGAGATATGCTTTATATTTTO 

ATTTATCCTACTTGAGTCTGCTAAAATAGGAAAAAAATAGGTTAATTATGGAGAAG^ 
10 TCTGTGTGAG AGGGAAAGTTGGAATAG AGGGGCAAAGAGGTACAAAATGC^ CAG ATTTCACCAATGTATG AGGTG CTGTGGAAAT 
ATTTTCTTAGTTCTGTGATCAATTAACTTCATACTTATTT^ 

ACCATT ATTTAAG CAGTGAACATCATATTTTAAATGTTTTAGGCAG AGGTG CCAC CAACCTCAG CAAGAAG CTTGGTG ATGTTCAG 
AAGATATAGAGGTTAAATATGAAAACTATGATTTTC 

GGATCAAAGTCT ATAATCAGGACATTGTAACGGG AATAACAGTATTAC TGTGTGTTG CATTTCATTTAG CGTTCACATTTTTGTTG 
1 5 TG TTAGTT CT ACATTGACAGAT TCGGGTTGGGTCACTGTAC CATTCTTCCTGAAAATTG CCAAGATGATGAGATTT^ 

CCATTTAGCTCTTGAGCACATAACAGCATTT^ 

ATGATATGGACAGATCAGATATGTCTACCTGAGGCTAACAAGATGATGTCTTTGTC 

GAGTAGTGCTTTCT CTTCT AGAACAATT AAGTAGCAT^ CATT AT ACCTT CTTGTAGAAGG AAATGTT AAAATA 
CTTCCT CTT TGAATAGCATCAGAAAAAATG CAGTGTTTCT TTTG CTGGAATTG AAGACTG GTT CTGTCAGAATTTCACAGAT AATT 
2 0 ATTTAAATATTTTAAATTTGCATAGCCACAT 

AGGAGGTATTTAAATATGTTGCTATTTTAACTTCATCGCCTCATGTTCTCTTTCCAATC 
CTTATAAAGGTGACATCCCCAATATACTAAACCATATAGCTTAAGCATTGTAATTTC 

AG AAAAATCTT CAGTT ATAC CACAAAT ATT AATTGTTCGTTT CCTC TGGG C GAGGCATTGTTTTGGTG AT CTGT ITCT ATT AATGG 
AGAACTAGTGTAACCACTGCCTTCTCTCATGGAGCTTACATTTTTGTC 

2 5 TAATATTACTCTGCTTTCTTCACAGTACTATGTAGTATT 

TGT^ACCTAGGCTAGAGTGTAGTGGCOTAGTCATAGCTTACTGCAGTCTTGAACT 

TCCCAAGTTGCTGGGACCACAAGCACATGCCATGCCATCATGTGTGGCr 

CACTGTGTTGCTCAGGCTGGAAGTGTAATGTTATGTACT 

GTGCTTTGGATGGTGGCCTGAAAATCTTCTAAGAAGATAACTTTGCAGTAGGGACTTGGACGAAGGCCT 

3 0 GT ATG CAAAGTGCAGTG ATT CATTGTG ATAATTAT CTTCTAGTGAG CTTAAAC CTTTTACATT TTT AAAAATTTTG CCTGAT ACAG 

GAATTAATTCAAAATTCACTTTTAAATCATTGTCTGTAT 

AAG^TCATTTTCT AATTGGAAT ACAGTTTAGTTG CCACTGGAATTOT CTAACCG ACAC T C CT 

TCATTAT CCTT CCTACAGTAG AC CACT AGG TGATTTAGGGAAC CAAACTGTTG AATGGAT CAG AATTCT AAAAG ACTG CTTCG ACA 
GAGCTTAGATTACTTATTTGCGTTAGTAAACACAATCTTCCTTTCTT^ 

3 5 CATTGG CAGTCTCT ATG CCATT C CAGAGGATGTGTG ATTGAAAGTT AT L^rrrr rCTT TGCAGT ATTGTT AAAAC TT AG TCAT TT AC 

AT ATCTCCTTGATG ATTTTCATTT ATGTGTAGTCT CTTAGGTAT AT TTGTGTGT AT ATG TATCTATTCTTT AAATCTTTTTTAAAA 
ATTTATTTAATTAATACTTTTTTTTTTTTCAGACAGGGTCTT 
CTGCAGCCTTGACATCCTGTGCTCAAGCAATCCTCCTGCCrrCAGCCTCCrr 
GGCTAATTTTTGTATTTTTTGTAGATATAGGGTCCTACTATGTTGTTCAG 

4 0 CTTCAGCCTCCCAAAGTGTTGGATTAC^GGCGTGAGCCACGGTGCCCGGCCCTTTAAATTTGTAAAT 

AAGTTATAATATTG ATGT AATCACAGAC CGG ACACACTAGTTTTAGCATT CTAATC CATCTGTTAAAATT ATGTTGTT TGTG CATC 
ACCTGAAATTGCCATGGGTACTGCTGGTAGTGTGCTTACCCCTCACTGAGAAATACTTACCTCTC 
ATAGTGCTGAAGGCTGCATTAACTCGTTCAGATGGCAGTTCTTCCCATATA 
AGTGAATACAGAGATGGCTGTACAGCCTCCAGAAATGCTAAGTTGGTTGGTTGTATATGGAAA 

4 5 CCCTGGGAATAATTGTGTGAGCAAACTTAGTTTACTAAAGAACTGCATTGCTAATTATGGAOT 

AAGAAGTTGTGGTTTTGGGCAAGGGGTTCTTTCTTATCAACTCCAAACTT 
AAACTATCTTAATAAACAAAAATCATAATAAACGACCTCAAAAAATT^ 

TAAAAGCATATTAAATAAAATTCTTCAAAATTACCAGCTAACTACTCTGGGATATAACTGGAC^^ 
AGAACTTTTGCAATGATTAGTAAACATATGTGAGCAAGA 
50 TCATTTTCATACCTTGTAAGCATCTCTTTATCT 

TATATATATGTAATTTTTAAGAATAATTGTTGAAAGAATAAGAATAATTTCATTCTTCTTTATCCTTTTCT 
CTGTCArTTCAAATrTTATTTCTTTCCTTTTrATTTTCTCCCTTATATTTATCTTAAAG 

ATT CAAAAAG AAATTGGT TCTTGTG G CAG AAT ATTAGTGTTGTAGATGAT GT CAGATAAAGCAAGAAACT ATAG CGTTTGG AATGT 
GATCAGTAJGGTTAAAGAGGGACTGCTGTTACCTATTATTCAGATTTTAAAGTTTGATTCTTTTTTTTTT^ 

5 5 CAGAGTCTCGCTCGGTCACCCAGGCTGGAGTGCAGTGGTGTGATCrCGGCTCACTGCAA 

TCCTGCCTCACCCTCCCGAGTAGCTGGGGCTACAGGTGCCCGCCACTATG 

GTTTCACCGTGTTAGCCAGGATGATCTTGATCTCCTGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGC^ 
GTGAG CCAC CGTGCCCG^ CCTT AAGTTTGATT CTT ACAATATTTTTGTTGTTCAGAGAAGT 

TGAGATATTTACCTCAGAAG CAATGT ACC CATT AGT AGAAATCTTT C CTTT AAT AATT ATT TACAG ACAAAATTTATTTT AGGCCA 

6 0 AATATGTTTG CAAG AATTCT CAT AAT CTGG ACACT AC CCACAT ATC^ 

TT AGTTAATG T TTTGTG AATTCAGTATTG AT CCAAATTCT AT AAAAT CTTAATTT AC CAAAAGAG ATTT CT ATT TTTTG AAATT CT 
TTTTTTAATATAGAGAAGGAATATAAAACAATAATCTGATATCAGCAGTTTAGTGAAA 

TT AC C CACAGTGCCCT AT AT CAAGGG CAGG CACAT ACATT ATCAGGT CTAG CT AGACT AATGCAT AATG GCCAGTATCTATTTT AA 
TTGTT CAGTC TCTATGCCTG AACTAATT ATT AATACTT TGATT ATCAT CCT TGTCTATG TC CTTT ACAATT AAATGGTACAAAT CT 
65 ATGTTGAT AATG CTTTCAGT AGCACTT TTG AAGTAAATAGAATTTCT CTG CTACAG AAAGT AAAT AT AAT CAGAACACTGATTTG C 

AAGCATTTTAGTAATGAAAGTATATTTTGTACTTGAGGGATTC 

AA1TJLTT AGT TGT ATT T AG AT ATGACT ACCAAG CAATGCAGATATTT CTTGGGGTACCAGT CTTCT TACAT AT ATTGTCTGG ATG C 

CATAGATGATGATATTAAATAATCTATTTCATCA^ 

GTTTACTCATTTATCAAACATTTTAAAAAAAGATCTGTTATGT^ 

7 0 GT CAGGGCAT ATATAACACTG AG GGGG AGAGGTACG AAGAGCTAC^ CAT 

AGTTGACCCAGTCCTATAGAGAAGGTGATTTTTGAACAGCAATT^ 
GTGTGTGTCAAACCAAACAGGGAGTTAGCATGTAACAAGTCCCAGGA 
ACTCCTCTCCAGGGGAGATACAAGAGGTACAGCACTGAGAGAAGCAACAAGATAAA^ 
AAGAAAGAG^GAGTTGTAAATCATGTTACAGGTCAAGTAAAGGCAAGAACAAAACATAGTCTTTG^ 
7 5 TTGATGAACTTTGCTGAGCAGTAAAGTAATGGGTGTGAAAGTCAGACTGATAAGGATTCAGA^ CAG 
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AAAAACAGCACGGTAGATGACTCTTTAGATATTTGCT^ 

CAGGGTCCTCTTCCTCCTGCCATGTCACCGTCTGCATGTTGGAGTTTCTTCAG 

AGACAGGGCGTGCATGTATAGAGGAGGCAGAAA^ 

ACTGTGGGCAGGAGGAGGGAGAACTGGGAGAC CAAGAAAGATACAGAAAGATAG GTTAAAGACTTGTGAGT AGGAAGGTAAGAAAA 
5 T^TT CATGCTTAAGGGCT ACTGTTATGGCTATGAAfiuTATT CGTGACGC^ 

AAGAGTAATGAAGTTTTAGAAAGGCTGCCGTGTGCAA 

GGGCCTATGTGACCCTGGAACATAATAATTTATAGAGGCAATGATTGTGTGTGTG 

GTTTCAGTGTGTTCATGTGTTGTAT TCAT ATATGTGTTCATGTGTGTGT^i riGTAT ATTTTGTACCCTAGAAGTAATCCAAATCC 
TGATGCAG CAGTTGATAATTTAGTTTACCAAT ATTTTTGGAACT CCAT CTATTTTGCAT CATTATG CTAAGCCATGG GGT TACTGA 
1 0 GATGAATAATATATTTTTTCTGTCCTTAAAA^ 

GGTGCAGAGATAATTCAGAGAAGGGAGTATTACGATATGG^ 

AGTAACATCTTAGCTGGCTTTAGAGGAAGAAAATAGTTACATGTGAGGCACTTGAAGA 

TGAC CATGCAGGTG GTC CATTCTCAAGGCATGAAAT CATTGAGTAATTCITAATGCAGAACAAAGTTTCAGGAAGAT AG ACACAGC 
TTACAGTTTATGACrGGACCriT^ 
1 5 CAAAATATTTCTTGAACACTGGGGATGAGGTTTTTTT^ 
TGGCGCAATCTCGGCTCACTGCAACATTC^CCT 

TTACAGGTGCCTGCCAGCATGCCCGGCTAATTTTTTGTATTTCAGTATTTTTAGAGA 
CTCGAACACCrrGACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTA 

T TTTTTAAAT T^i-iTi^I'TAGACAG AGTCTTG CTCTGTCAC C CAAG C TGGAGTGCAGTGGTG CAATCTCAGCT CACTG CAGCTTCCGC 
2 0 CTCCCGGGTTCAAGTGATT'CTTCTGCCTCAGCCTCCTGA^ 
TTTTTAATAGATATGGGGTTTTACCATGTTGGC 

GTGCGGGGATTACAGGCGTGATCCACCAAGCTTGATTGATTTTTTAAATCT 

AAATAGTTTTGGAAAGTAGTTTTTTAAACGAAATTTAGTGTTAAAA 

GTGTTTTTTAATGTACGTATGTATTTATATCCTATACTCTAGTTTCCAAAAATTAT 

2 5 GG CT CATGAAGAAGT ATCATCAAATACTAGTT ATCAT ACTGTGTCATTG CCT CTTT AAAATGTG AAAATTTTACTTT TTAAAG CCT 

TGTTTTCCTCATTGGCAACATTCAAACAAAAAATAAAGAAGAAAAA 
TGCTCTAAGAAGTAGTCTTGTGGTGCTGAGTTAACTGAT 

GGAATTGTACTTCTGAAGTGCATCACGCTGCTCAAATCAAATTGCTCACTTA^ 
CCTTGAGGTTGACTTCAGATTTACCAGACACACTGACAA 

3 0 AATGTACGAATAAAGGATGCTTTTGTTCATATCTTGCCTGCCTGTGATATATTGACAAAGTGTCT 

TACAGTATG AGTG CrrGTTTTCAGTGGAGGGGCAGGACTTGGG CTTTAG AAGG AAG TGAATTGGATAGGT CAGGAGAATGTACCATT 

TGTAAACACCCCTTTCCTTTTTTATTCACGTGTCAGGACAGCAGGAGCAGGCTGTTC^ 

AAAGCAGCTTGGAAGTCAGCTGCAATTTTTAAACCTTGCTTT 

AGCTGTTTTGAAACTAACATGGTTTAGTCCACTAACTGCATGTTGGGTAAATTCAAA^ 

3 5 TAAGTCACATCTGATGGACATTTTCTGTGCTTATAGCATAGTAATGAACGTCTGACAGGCGCG^ 

TATTGG CTTT CTGC C CAG AATATTG CTG CAACACTATCTGTGAT T GGT TATCTCT CTAT CATG CATTGCTTCACAAGGGG AAACGG 
CCCCTTTTTTTAATAAATCTACGATGGCTGGCTGCAATATGGCTCACTGTAAGTATTTTGTGTA 
AGAGGCTGACTTGAATATTACTTAGTCTGGTATCACATGGCAAGGTTGTGATACTGTAGGACATCAGTGAAG 
TCTTAACCTATCTCTTTTTTCTAC^GGTTGGTACTAAGAAGTGCCTTTCCT 

4 0 TCrCrGCTTTTG^CTTGCTTtSCTGCCAGCTAGACTGTGACGA 

ACTT ATAAT CAAGAGAAAAGCT CTAAGTATCTGG CATTG C CCT AGG CT GCTTT AGTGTT AAAAGAAAAGTTTGCTG AAAAAGTAAG 
ATATCTTCTG^GAGGAAATCAAGGAGGAAAAAAAAAATCATTTT 
AATCAATACCGATTGCACCACCAAACTCCATTCCAAATTCAGCTOT 
GGAAATTCGGTGTCGAAGGGTCTGCCACGTTTTCATGCTTGCATT^^ 

4 5 CAAGGCTGATACCAGGCCCTACTTTTAAACGTTCATCTACrrTACAATCCTAGTATT 

CAGTTTCATGCTGTGAATTTCTAGTCGGAGATCr^^ 

GGGAAAAGTATTTTGAGGGGACATTTTCATCATCAGTTCIAGCTTTTTTTTTTTGCT 

TTGGTTTCACTGAAACATTTAACTACCTGTAAAATCTAAACATGGCTGTTAGTC 

CTGGAAGTATGTAGAGAGTTCCAGAGGGGGACTTGCTCACGGCCAGACACGGAATC 

5 0 AGTTGAAAATG^ACGAGTAATCGCCTGCTTTGATTCATTGAAAGTGAGTAACTATTATATTCTTTTAA 

TG AGGGGT ATATGGACAT ACAGTTCTCTGCATATGGGATTATGG ATTG CTTT GTT CAT AGT TTAG TTATACAGTGTGTTGATG ATG 
TTGGGCTGGGGAACITTACGAGAAGCAGTAGAATCAAGTCTGGCCAAACrTTGTTATGAATAA 
GTTTTTTTTTGCTAGGGCAGAAAGTGTTGACIATATTTTTCACT^ 
TAGCTACCTGTGAGTGTTTAGCATATGCATATATTATATATAA 

5 5 AAAGGCATTGTACAGTTGCAAAAGAGAAGCAGATTTAAAATCAGCTAAACTT 

TTTAGATTTTCTTTTTATTTTGTGGGCATTTTATTTTCAGGCCAGAGCCACGCATTTTA 
TGAACTGCTGCTACAGCACCTTACCTGAGGCACAAACTTTGTTGCATGTGA 
GTTTATTGGATTTTTCCAGCTTCAAAAATTCTAATGTGAATTAGCAGTTTATT 
TAGTCTTAGCTAGAAATGTTAAAAGTTAGCTTTTGTGGGCACCATAATACT 

6 0 ATGT CATTGGATTTG T AATT C^GGTATCAT ATTTG AAAATGAGTC TTTTAAAAGAT AACAT AAAT ATCTTTATTTTGACAC^ 

GTCAAGACTAGAAATGTGTTCCTGGGTACTTTCAGCCTACTTGGTTTAATCAAAT^ 
CTTTGGACCTTTGAGGGGAGGACACTATCACITCTACATATGTAGAGAAGTAAAAGTCTCATAG^ 
TATGGTGCTATAATAATCTTTTCTTTAATAACATCAGGGTGAGTTATGAGTTAATTGTT^ 
AGGCTTGTTC^TTTTGCrrAAAGTTTTTGCCAAAGATTTC 

6 5 AAT ATTTT AAAATGTT AG ATTCTT CTTAATGTCATAGTCCTAAAG AT CAATAAGCAAAAG TT ATAAACT CC CATTGAGACTTTC TT 

CACCCTTTCCTTTGAGAGGCATCAGTGAAGGGCACAGTGGTTGTACCTCOn'GAATGTGTATTCCTAAATT 
CAGGTGTAATTAGAGACTGAGACCCATAAAATCAGGGGCAAGGAGCTAATTACTCTTGTCGGTAACCAGCT 

C CTGTTTTGAC CCAATT AAT AGTAAAT CCAATTTTTAAT CCT ACAG ACAGTTG ATGGTGC^ TAG CTG CAAAATGAAGG ACAT 
TAAATTTGGCTGTGACATTGAGTGGTTACACAGAAGACCTGCTTCTAC^ 

7 0 CTATTTTTCCATCTCAAAATATTTTTTTTTAGTTGCTGCTCACTTATC 

TGCTTAATGACATGCTATGTGGTTAACAAGATTCAACTGCCACATCAA^ 
AAATGAACTCATTACAGCAGCTAATCAATGCAAACTTTGTCCTTTTTCTTTGT 

AAATT ACAGTGTCTAAAACTTG GTAAAATATTTTAAATT TG AAT AAG AGGAAAAAAAAAACATGAATTT ATTTAT C 
ATAC C CAGGACAGTAG CTTCTTAAATGTT AGT AAAAT AATT ATG TT AAG CATTAAAACAATTTTGAATATGTAATT AAG CATTTAT 
75 GTTAAAAAACATGTTTCTTACGTGTGTGTGTATATATT^ATAACTGACACATGTGAT 
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TCAGATAGCAGTCCTCTTGTATTCACTCTTGTGGCTAAA 
ATGCTTQAAGTTCTGAAATTTOGGAGCGTTATGAGGT^ 

TTCTGAGTTCTTGAGCCATQAGATGGAGQTGGC CTGTGTAATCT ACT CAAATACTCTCTG 
5 AGAGGACCCTATGAAATGGCTGTACCCTTCAGCTGTGCrcCTTCAGACT 
CTTCTAGTTAGAAGAGGAGAATTTATTTCATTAAAAAAAA 
AAGTCAATTGCTATGTTACTAAAGATAATAGGAAGAAAATTAGCTCACT 
ATTAGATTTAAATATACCGTATCTAAATGTAGAAGTCTGTATOT^ 

AGGAATGTAGCTATTAAGAATGTCAGAATTAGAACGAGAACAGTAAAAGAGCAAAAGCTGTAATAATGTCT 
1 0 ATACTCTAAATACTTAGTTTCTCTAATTTTCCTTTTACAAT^ 

CTTAAATTG CTT ACGTGCT AT AG ATTT AGGTGTATT AGCTATTTCAGAAATC CAGAACATTTGCAAAGGTGCATTATTTT ATGC CA 
AAGAGGATTATGTTCCCCTATGGCTTGTGAAAGAGATAGGA 

TTATATTTCAAAAATGATGTTT CAAATCAATGACAAATTG CCTATT CAGTCATGAG CT CTCAACTAAAATTTGGGG AGTCAATGTA 
AAGAAAGCCTACT^GAATGTATTTTAATATTTGCCAAGCTA 

1 5 AG ACTTATTGTTTACTGAATTCTATTTCAAA 

TTTACTAT CTATTTTCCAAAAATCTAGCAAAATTLrrri TACCTTTGAAAACTTTAGT CACTTGTT CCrri ATCAAATTGAGCCATT 
TTATTTAAGATAATT AAAATTT CTTTTTG CT ATAT ACAGTGATATCT AT CT ATAAAT AGCACC CCGATCAGATTTATCATAGACAC 
ATTTG AG ATCAG ATTCCACTG CAAAGTGG CATACAAT C!AAGGGTGCTGAGAACAGTTTTAAAGAG CCTCTG CAGGCT TAGCAG AAT 
ATATG AAGTATGGAAATTAGGTATTAGAAGG CACACAAAT AT ATGT ACATACAGG T AAATG AACAACTAAAGCACTCAAAAT CAGT 

2 0 AAGGCAAACAAGTATCCAGATGGTCACATATGGATTGGATGGCATAACTA 

AA'iTrr AGAAGATC CTCTGTTG CT AGCATGGTTTATC CTCAATCCAG AAATTGCCTGTGTTGCTG CTGTTCTAAGCCCCAAG CAT A 
G ACTTTTCAT CAT AT TTTAAAAACCAGTTTCACTGAT AAT CCTGAT AAAAT ATCTG CAACTTTT AGACT AAATATGCAATTTT CAA 
TT AAAATTTTGATCT ATTAGGGTCCTAACTG GAAT (2ACTGTT ACCAGGAAGTTTTTTTGTGGTTTGA'i"i"i"i TAACCACTGGTAT C T 
CCCATGT ATTAAAACTTTG ACAGATTATT AT ATAT ACTGTG CTG G CACAT AT ATTCAATCTCT CTTTCAGTG ACAGATTG ACAT AT 

2 5 CACTTCCTTGTTTTCTCCTAATAACTTAGGAGATGAACTAACT 

CCTAAGGAGAAACTTCACAACTTTGTAAAGAATATCCZTGTGAATATGCT 

GAAATTCTCAAT AGCTGGAGTTG C TTTGG TG AAACTAGTAATTGTATT CATTTG CCTCTTT AT ATTAG AACTTTTTTAG AAAACTT 
TTTTTT ATG AGT AAAGGTTTTTTT AG ACTATT AG AT AAATTGT AAACCATG TTT CGAATCAAG CT AC CTG CTGCAGTAATTCTAGA 
ACTATCTATTTTGCTTCATACTATAAGATTTTGGTACACTGCGTCTACTTTTTTTCT CTTG CTTCACTGTGTGAGTGT 

3 0 GACAAACT AAAAACAATTT AG ACCTCAAGCAATGTGG CAC^ CAGTTTCATTGTAATT CAGATGTCTG AAG CTAA 

TCAAAATTCCACGGAGTACClCTTGAAACAATTGCATATATAAGGCTGTTTCTCACTTAA 

GTCTTTTGAAGAACTGTATGACAC^TGCCCAGTTCAGGTC 

GGACCTAGTTTGGGAAGCTTATCTTCTCATTTTCCTTGCCAAGGA 

CTTTGATGCTGAGCAATGTCTCTGC CTTC CCACAGAC TT AG ACAAAG CTTCTTATTTTCTT ACIAAAGAGGT CCT ACTGAGGAAATT 

3 5 TGAATTTTTTTTTTTTTTTTAAACTTAGCAGTACT 

CAAAAATCATGACTG C CAGTAGAAGTG GGT AACTAGATAG ATAGAT AGATGGC CTTTGATTT AGCACTC CAT 
GT AGCAACTT AAGT AAAAGCAACTGGAAAG G CTTGAT AG AAATT CTG AGTTGTTTGAAGT AT AATAGATTATTAGGATAGT AATTT 
CTTTGTGAATTTTACTGTAAACAAGCTTGATTCTAGTGAGGACTTA 
TTATTTAATACTGTTTCTTATACAAATATCATCAAATGCACATCTACCTA 

4 0 TATTTTCACTTTGAGATAAAGCTTGATTTCTTTGCTAGCGATAAAACTTTCTATG 

ATTACTAAACAGTATAACCCCTGATATCTTCTAATTTGGTTGACAG^ 

TGACCATTACATTTCTTCTTTTGTTATCTCTGAGTTCTTTCCAGCTTrCTAATTGT 

AATATGTCTTTCCATAGTCACACAGAAGTGATTTAGACATTT 

CCCCTACCCTCCCCAGCATTTGGTATAAACAAACTAGGGGTTTATATTTCAGTCGTCTAGGTCTATTT^ 

4 5 GGAACTCAGGTTCCTTCTCTCTTAGTTCCTAAGGAGCACATGGCTTCCAACTTTAGGATTATTTCAA 

CAGTGCCAGCCATCATATCCTTAGTCTAGCAGGACGCAGAAAAGGCAAGAC^ 

AAGGTTCTCTGAGCTTCAGCCAGTAATTTTTCAGTCTCTTA^ 

ATGTTTGTTTGTTTTGTTITGCTTTTATTTATTATTA 

CAGAAGGGGAGAATGGAAATGGATGTTCAGGAGGGCCATTTCTGGAATAAAAGAGTGTTTATTrTT'CCTCT 

5 0 ATTAAGTAAATCTCTCTGGAAAGC^TCTGGGTAAGG 

GAAAACTAAAACAG C TTCATTCTT TATCTT CTTAT AACTGTACTT AT CTATTT ATTTATTTGAG ACAGAGTCTT ACT CTGTTG TC C 
AGGCTTGAGTGCAGTGGTGTGATCTOX5CTCATTGTAACCTTCGCCT 
AGCTGGTCCTACAGGCGTGTATCACCATGCCCAGCTAATTTTT^ 
55 GCTTGAACT CTTGACCTCAGGTGATCAGCCCGCCTTGGCCTT CCAAAGTGCTGGGATTACAGGCGTGAGTCACCGCGCC CAGCCTA 

CT AT TCAT ATTTAT AT AAAC AGTCACATTT TTATAACTGG AATAT AG CAGAAATG CCATGTCATT AAAAACAGGTCCCAAGTGATT 
TTATATGACAAAAGAATCTGCACTTTGATTTGTATTTCT^ 

TAACAAG^TGAGTTAATATCTATTGAAACTTAATGTATGTCAATCAGTGTTTTCAGTGCT 
CCTATAAAATCTCATCGAGATAGTTACTGGTGTTACTTCCATTTCACAGATGGACACACTGAAATACAGAGAA 

6 0 TGCAAGGTCAGTCAGCCAGTAAAATGTGGACTTGGAGCCTCGTAATCTGATTTCA 

TGCTGGAAAACATTGCTTCTTAATTTCCATATGC 

AAAAATG AATAT C CAAGACAAATTTAT AGT ACAGGG C CTAAAACAGT AAGT ACT CAGAAG T AAT CACTGTATTTTAAGT CAACAAA 
TGAGTCAAGGAGTG AACCCAAAAT AG CAGAAAG AATG CTGCCAC CAACTTT CAAAG ATGTTGTAT CTGTTGTAAAATAT ATGGC CA 
TATAGGAAAAATTTTAAATGAATCATCTTAGATAACT 

6 5 ATTCAAAAGGTTTAATTGTTTAGAAAATGTTTTTCCACAGAAGCAAGGTGA 

GCC CAGAGGATG CITGAGTT AT AGTTCAGTACAAACAGTAC CTTGGAAAC CCTGGTT ATTAATT AAAC CCAAGAGGCT C CAT AGAA 
GTGTTATCTTATCATAAATCTTGAGAACTCCTAGAGTCACAGACCCCAGCTGCTTC 
TTTAAATTTCAGTCTAGGGTTCCAGAAATTCAGCCTGCCTAGTGCAGGGGAGGTGGCCTTTCCCAAAG 
GAGCCAGTGTAGTTTTCCCCAGCTTTAACTTTGGGCTGCTTCTGTTTCCT 

7 0 GCAGGATCTAAAGACAGGGTATGCAGTGAGGAGCTGGAGTCC7VAAGCT 

AATATGTCTTCCMGTGACCCCCATGTTTCTAAGGACTCA^ 
CTACTGCAGAGCTTAATGAACAGCTTCTCCCCATTCTTGCCCCTCT 
AATACTAGTAGCCATGGGGAAACAGGTAGATCATAAAAGTATCAAACTCAATT^ 
ATCATAGGATATGGTCAAGAGACCGGTTGTAAAAACTCAGATGTGAGGAAGAAAACCATTA 
7 5 AAGTAATATAACTTATGTACTTCCTATGTCAAACTTTAGGGAG 
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GGGAGACTAGTATATTAGGTGGTGGCGGCAATC^TTAGGAGGGG^ 
TTGAGGAAGAGAGAAAGAAATGGCTCTGGTGGTTTATCTTTC 

CAAGTGGGAATAAAAGAACT AGAGGGAATCCCCAGTAAAAG ACTTCACATAG C CACAGGC CAjCCAGACTGTGGCAGCTGTCCCTGA 
AT C^TCATGGGTCAGTTGGTAACX^CCTG CTGTCTAG 
5 ATTTATTAAGCTATATTTATATTTTAGTTTATGTTAATATGAGGTTGAGATC 
CAGTATCTTAAGTTAGAATTTATCTCTCTGCATAAAAATCAGACCT 
GTCCAGGCTGTTCTCAAATAAGGCAAGAAGCATCTCCT^ 
GTCAGAAAAGCTAGGAAGCTTTTCTATCATTT^CAATTT^ 

GATAAATT ACATCTGTAATTCCTTCTT TT CCATATTACTG CATTCAGACGATAATTTGCTTTCAGAT ATCTTTG CT CAT CTAATCG 
1 0 TTCATAGACTGGAAATAAGTAGTAACATCrCC 
AGTGGTGACTATAAGCTTTCGAGCTTCrGAAA 

TAGTCTTTAATTTTAAAATGTTCCAGTAGAACACAAATTTGCTCAGGTAACA 

GAGAAGTTTGTTATGAAATATTTTGGAAACCGCTGCATAGTCAGTGTAGGA 

GTAAAAGACTATAOSTCCTTGTATTAGTCAGAATGAGTACACCACTA^ 
1 5 AGTTTCTATAATTCAGTGAGATAAAACTGAGTCAGATTGATTTTTAGGTAGCACA 

TG AT C CTCATCTATTGATT'AT ATAAACT AAAG AAGCT AAGAACAATT A^ TCAGGAG ATG CTGAACTG 

AGATT CAGTGTAG AAAGTC!TGTCTTCAG AG CCT ATG CTTTTAGT CTTT ATGCT AAGTTTAACTTGTTT AAATAG CAAGATTATGAA 

GCACTATACAGTGACCTCGTATAGACAAAAATATAGTATATTGATTATTAGAGAA^ 

CAAGTATTTGTTAAATC^TTTCAGTTGCCTAAATTTAAGCAACT 
2 0 GAAAGTCACTTCTGAAT AATTGCTTGTTT AGA i"r CATTTGGTGTGGG AAATTT ATAT TAAAA'iT IT AAC T AATATT<wTAACA 

ATACAGAGTCTGAACCT AAAGTC CAGAAGAATTTTAAGTCATGCCG CAGACAGGATGAACAGTATAGCAAATCAGAATAATAGACT 

GTTT GGGGG GGTTGGGGGG AACCCATG AG AATTTC^GG ATTTCAAG ATA 

TGTTTCATTTCATAACTTTTTTTTTGCTT 

AGAACCATGAGACTTTl H l"rrrrrriTTTTTTTTTTTTT 

2 5 CTCAGCTI^ACTGCAACCTCCXSCCTCCTGGGTTCAAGCGATTCT 

ACCATGCCCAGCTAATTTTTGTATTTTTACTAGAGGTGGGGTTTCACCATGTTGGTC^ 
TCCGTCTGCCTTGGTCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCGCCCM 

GATTTT ATTCATCTTACT AGG CATT TATTTATATT ATATTGT TT CCAATAT ATGCC T AGT AAG ACAG AAATATG CTTAAAAACCTT 
TGTAAAT CTTTT ACAAGTTGT ACT ATATT AAGTTAAAAT CCAG ATG AAACC^ CTTTG ACAATG ATTCAT ATATCTTA 

3 0 TTGTATGT CATC CTTGTTTGTG ATT TTGAGTATTACATTTCAGAAATTT^ TTT ATTTTAATTTTT 

TGTTTATTTGTAATAATTATCAATAATGCTGA 

TGATGTATT TAAAAATAG CT CACACTATTCTTTTT AAG ACTTTT GTT AT ATGTAATTG^ 

AATCGTTGATTCTATTTTGTTCATGATATGGCATAAAGAGCATC 

GGCATACTGGGTATTTAGTAGCTATCAGATTGTAATCCT 

3 5 TAATCCCAG CACTTTGGGAGGCCGAGG (?GGGCGG ATCACGAGGTCAGG AGATCGAGACC^ 

TCTCTACTAAAAATACAAAAAAATTAGCCGGGCGTAGTGGCGGGCGCC 
GGCGTGAACCCGGGAGGCX3GAGCTTGCAGTGAGCCX3AGATrc^ 

AAAAAAAAAAAAAAAAAAAAAG AG ATG TAT CT CAAATATTAGAATTG ATCATGTTATTT AAAAT ACT CCATGG G AAATATAATTAT 
TTTTATGGAACAGTTCATTTGGATGAATGGTAGCTTGAACA^ 

4 0 AAACTCTTGCCACGCATTGTGTTTTGGCGATTTTCATGGTTGTAC 

ATT AAAAATG AATTTATGTGGATTCAATG AGAT CAGTTGGGTAAAT C CCTT AATTCAGTGTCATG CT CATGTAAAT ACTC CAT AAA 

TGGTAGCCATTAGTATTGATTGirCTTATCTGAATGCCAGAGTTTAGTCTAAGTTATCTTTCTTCTTATGGGAGG 

AGGT CAG AATAAAG ATTGTTAT AT AG ATT C AAACTTGGTATGTAATT CAAACTTATTTT ATGTT CAAG ACT AAATTGATCATAG GG 

AAAAGTGACTGGTGTTCCTTGTCCITTCATTTCATTTCT 

4 5 ATTGTGTTTTGGTATTTTCTGAGTGTCTAGATTGGCCTGTATGCTCTTTTCTTTGGGATGA 

GC!AAATAAATACTGGCGACAGGAATAATGACAATGTGACT^ 

TCAAACT AGGAT CTT CAG AAT AAT TGCAAAAAGCCACATGAAAAAATG TGCAAGTG AAAAC TTTTTTTTTTTT TTG AG ACGG AGTC 

TCGCCCTTTCACCCAGGCTGGAGTGCAATGGCACGATCTCGGCTCACTGCAACCTCCACCTCCTG 

TCAGCCTCCTGAGTAGCTGGAATTACAGGCGTGCACCACCACACCT 

5 0 CATGTTGGCCAGGCTGGTCTCAAACTCCTGACCTTGTTGTCTC 

ACCGCGCCCGGCGAACATTTTATTTTTTATGGCAAAG 
TTTGATACCATAGTGACTTCCTAGTGACATTGTGCCAATCACTTTAC^ 

T ATCATTTG CTTT CAAGATTTGTG AGGATT CAATGAG ATCAGTT AGAT ATAAATCAT TT AATGTAGT G TCTTAC T C CATGT AAATA 
CTCCATAGATACTAACGGTAGCCGTTTGTATTGTTCTTATCTGAATGACAGAGTTCAGTCT 

5 5 ACTTTG CCT TAGGTCAT AGTGAATGTT ATTTGTCATTATG CACATTG CACTACAGGTCTCTGTGTTGTTTGAAAAAATT AAAAAGG 

ACTTAACCT TGAATG T AT ATAAGTGTG CACAT ACTAGCTT ATTT ATTT ACCTGTT AT CAAGAG AAAAT AAC C C CTTAAC CT CTT AA 
CATATGTATGACTTGACTACTAGCTAAACC^GTAGGTATTAATGGAATAATTCAGAATTTATATTGTCT 

TTGT AAATGCAGTT AAGTATGAATGGAAAAATGGT TGTAGCCATCTG TTAGCAAAT CT AAATT CAT TT AAAACTAAT C CTATTTAG 
CTTTATCCTGGTCTACAATAAATAATTCATACTTGATATTC 

6 0 TAAGACTTTTCTAGAAATAAAATTGTTTATTCAGCAAA 

ATTGCACTATTCITTTCTATTTTTTTATACCTATATAAACTTAATTGGTCAAATATCT 
AGCATCQAAATCCATAATACTCTGTTTATCACAGCTTTTTTT^ 

CTAGCCTGTTGAAATGGTCCGTTTTGCTCCAAGACTTTTTTTTTTTTTTTGAG CTGGAG 
TACAATAGCGTGATCTTGGCTCACCGCAGCCrCCACCTCCCGGGTTGA^ 

6 5 TACAGGCATXSTGCTACCACGCCCGGCTAATTT^ 

TCCAACCTCAGGCGATCCGCCrcTCTCAGCCTCCCAAAGTGCTGAGATTAT^ 

TTTT AAAAAG CATTTGC C TCATTTATG AGTTTGAG ACTT TTAACATCATTAGAAAT AGGAGAG CATGTACTTTT T CCAGCAAGTGT 

AAAGGGGGAAACAAACACCAAAACCCCATCTCCCTGATTTTTTTGGCTTTTCA 

GGGTTTTTCTTGACAGCGTCIX3TAACTGTTCTTGTTATAGTCTAATTCT 

7 0 CCATGCCACTGCATTCTTGTTAAGGTTATTGATTATTGCC 

T AATTGT ATTACTTGATGTTG AAGAAGGATCAAATATCTGTT CAAAGGGG ATG CTTGT AG CCTGTG T AATTGAG TTGG CTATTTCA 
GGCTTTTAACATGTTGTGAAACAATTAGGAGAAAGTACAGGAGGACCTCTC 

CGCAGTTG C CAAAAATGGTG CCGTAAGGCTTT CCTGGAATTAAGTAJGTGAGTTGGTATTTGGTTAG CAGATGCTTTG TAAAAC CT A 
ATATATTCCCACTATATTTCCTTTAAAAATCAAGGGGGGGT 
7 5 CTGAAACTCCCTATTACTGGATATAAGAAAAAATAACAAGCAAGATGTGTTC 
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AGAGATAGAAAATTAAGCTJU\ATTATTAAGAATCTGATGAG 
TTCCATGTAATAACATTTAAGATAAAGCCAGTATT^^ 
GACACTTAATGTAAAAAAAAAAATTAACCTTCAGAATGAG^ 
AGCTCAGTTGGCTCTAATCCATGGAAAAGTTTTTATTTCCCATCAG 
5 TCCZAGCAGACAAGTTTCTACAGATAATAGTGAAAGCX3TGTGAACACACA 
T CCTCTCTCTCTCACACACATATGCGCG CGCACACACC 
ATAATAAAAATAACTGTAATATTAATAAAGTACTCCTTATCAGTTATT^ 

T CAAAATGT ATTTGCTT CAGGATAAATTG AAGAAGTACCACTCTCTCCCTGTATTAAT AC CTTTCAGGGCTTT TGATCTGTAACCT 
CTTGTCACCTCTGGTTTAT CTGGCTTAGCATATG AATTTTGAAG AATG AGGGAGCATAAACAATO 

1 0 CACGTTAAACCTGCGGCCTCAAACTTGTCAGTTCTGAA 

AGAG ATTTTG CCAGTTGG ACCACTTTGCTG CAAAT AC CT ATTAGACC CAAACCTTT CTTTTG CCTAGTGATCAGCTATACTAATAG 
CAAG ATCCACTTAATTCCATGTGAGT CAACAGCTTGCATGG CATG CC CACTGTGTG CAAAT AGCTTCTCAGGTACITTT CCAAGCG 
TCCTG CCTGCACAAAGC^GAAGAG ACTTGGAATAT CAC C CCACTGCT TG CTCT ATCAC CT AGACCTTCATG 

CT CCATCAC CTAGTC CTG CTTGCT CCAT CIACCTAGTTGGAATTTCTAAATCAAGA.G ACATG GGCA 

1 5 GTGCCTC!TGATTAACAACCATTCCTTTAATACTGCTGAGAGGTTCT 
GAAGATGTATTTATCTCCTAAGGATAAAACCATCTGGAAATTT^ 
TTTTTCTTCCTCCTCTTCCTTCTTTTTCT^ 

AAAG ATTAAGAAGATAAAAT AACCTTCT CAC CTGC TT AGT AAT TTCTT^ 

T CCCCATTAAATTT CTGATTCAAAAAATATATATTTTTCT CAAACA 
2 0 AAAAAAAGCTACTTGATGGTACCTTGGTTGTAGTTAGCAGGTCTCCATTTTCT 

TATT AATGACAAAT CAGCGAGTTGATAAAATAATGTGTTA^ CTT ATAAATTT ATGTGGAG TTCTGTGACT ACC 
TG AT ATG ACT TTGTGACTA1' ril' AGTTGTATAAAGAATTTCAAATT ATTT ATT CT 
C^CTGTCACTCAGGCTGGAGTGC!AGTGGCGCAATCrTAG 

ATCTC CTG AGTATCTCGGATTACAGG CATG CACCA.CCATGCCGGG CTAATTTTTGT ATTTTTTG T AT AG ATG AGGTTTCAG CATGT 

2 5 TACCCAGGCTGGCCTTGAAATCCTGAGCrCAAGCGATCCACCCACCT^ 

GTGCCTGGCCTAl"riT 1TGTTTTATT TTTTT AATCTTGAAATAACTC CACAGT T ATCTCCAAGT AAGTCTAAT ATCTT CT CTTTGA 
ATTTCTAACTGATTCTTATTTATAAACCCATAGAATCTGTAA^ 

AT AG ATGTT CAACG AGATGTTCAAAAAGACG ATAAAGGAT ACATTGTAACCT A CT 
TTTAATTGTAGTGGACTTAAATGTATACAGATTTTCATGTACAAC^ 

3 0 ATGTGAATAT AGTT TTG CAAAAT ATT AATGTATGAAAGT AGTCCT AAG ACTG CAAG T ACTGTTGCTTT AG AACTT CATGCATAAAT 

TGGGTGTGTTGTTACAGGATACAGAGAAAATTGATGGTC!AACCTTAC<:CTGTAGTATATTT 
CAATTTTCCAGACrrGTTTAATGGTACAAAGGAAAGGGCTGCrA 

GACTGTTCTAGAGAAAAACAGGTCAC CGTTGGAAGAAAAAAG CAGT CTCACGTTACAAGCATTTTGAAAGTGGTTTGATCATTAAA 
ATGTATTTATCTTACGACAAAACTTAAAAOSGATACTC 

3 5 T CATTACAAGTTTT AATT C ATGGTAG AAT TTTGG AGATT ATATTT ATTG AG CT ATGTTTTTT ATTT CAT TAGTACAAAT AAAACTG 

CATTTTATTTGTTCTAATGATAAAATGTTTTAAGTTTATAAAATT^ 
AATTAAAATAGGACTTGATTGATGGAGAATCATTTGTTTGTGGGTTTA^ 

CAGCAG CAACTGTT CACC CAGAGACATAGTG CTTAGCATG AG CTTGATGGATT CAAACCAC TGGGT TT ATCATTACT ACATAG ATC 
TTTGAAAAGACAGGCTGCAAATCTTGTTTCTAAGCT 

4 0 AGATTCAAAAAAAGACACTCTACAGTCATTCACAGACTGCAA 

ATTATGATTATTCAATATCATGTTAATATTTTVAAACTTGGAATACAACCTACTAACCTTATTT^ 
TCTAGC1ATTTTTGAAGTAGCTTGGAAAAAAATTGAGATTCAGTTGTGGTTCT 
AATCAGTGAGTAGTCAAGTCTTCATATAATCAAAGCTTTTAAAATATGATAACAA 
AAACCTTTTTTTGGGGGGCCTAAGTTTAAAATGTGTTTTTTCTTTG 

4 5 TTAGCATTTAGAGATGATTTTTTCCTTTGTCACITTCCACATTGAACTCTGCTGATTTAGT 

CCTT AGAAATGATGT CATG CAGTT CATT AGTTTTACAGAT AGGG AAC CAAGG AG TG G TG AGGTT AGGGTTT AATC ATAT C CAT AAA 

GATCTTGAGGTCATTAGGACCCTTTAAATGTGACTAGTTA 

GCATCATTGACACACCTGACAGATGAAAACTGCTAT^ 

TTACGGTGGCATTAAAGTTCTATACTGTTTTTTTTTTTTTTTTT^ 

5 0 GTGTATACATTATGCTTATTATAAAATTTACTAATAAAATGACATGGAAGACAATTA^ 

ATTTAAATTCTATATGGCGAATTGATTCTTACAGAATGCCTTTTTG 
GCAATTGACAGAAGAGTTTAATTCrTTCTTTGTGCrrCTTA 

CATG AAAATG AAT AG CT TCACTTTTTGGCATTTCCTTCCTUTTT ATG TGT CAT AG AATACAC T ACATTATTCTTGATTTT ACAGTT 
AATTTGTAAATTGTTTCTAACCCCCACTAGTTTATAAACTCCT^ 

5 5 TTTTGAATATCCT TGGCATTCATTAAATGTTTAGTGAATTGAATGCGATGTTTCTT CCAGACTATGACCAATAATTGTGAGAAGAT 

TTTATGTGTTTTTAAATTCAACTTTAGAGAATTTCTTTTTGTGTAATAGGAC^ 

ATGTTTCTCTG CG GAGT C CAGCATCCTT CATTTGTGATAGG ACT CAT C CAGTTAAATTG CT CAGG CACTAT AATGGACTATTTATG 
TTCTGTTGATGTAAATTTCTAGAACAGGCCTTTCCCTGGACTCATGCCATGTTTTGTATGTTATCTTGT 

6 0 TACACAGTT ATTTTG GTGACAGTATTAACT CTCTTTT CATAAGGGTG CTTTGTT G GAAAAC TATTCAG CT TAACACAG AAT ATTT A 

GCTTTCAAAGGTTGCATAGGGAGAAATGATACTGCC^ 

TTTTTGACTCTACCGAGAATATTGAAAAGAGTTTTGATTAGTATTGGTGTTTATTAC 

AGT AT AATT ATTATTGTTTTTATTAT CAAACT AATAG CACT AACAAGGACATACGGT ATTTGT CAAGGGATT CCTAT ATCT CTGTT 
ACACCTAAATTTCTCTATACTCTAATTTTAGAAAAAAAGTTTTGTCTTCCAACTGTGAATAOT 

6 5 CTAATCTTTAATCTGAAGTAGAAAAGAGAATCCAAGCTTCCAAGAAGTATGTCTCTAGACTCTGC 

TTTTTTCTCTC CAACAATTATTTTAAGAT CTGTATG ATATAAGC TATTTTT AG AAGAACTG AAAAGTTCT AG AGAGT AAAAAAATG 

TTCCAGATTGTGTGTTGGTGCCAGTAGTACTGACAATTAAA 

CATCAGGGGAGTTAAAAATTTAGTACTTCAGGCTCATTAA 

ACCACTTGAGCATTGTGATATGTTACTCATTGAGACTT^ 

7 0 GAGAGTAAAGTTCTGCTAATCCATTATTTTAGTTCTGTCATCTTTTTCTGCAAA 

GTAATAATCTTAAAAATGATACAACTTTACAGAATATCrCCCCTTGCCA 
GTTCATGCAGCATTTAAAAGAGATGAGTGTAGAGGTCTTCCTTCT 

TTTCTAATCTTG CAAGTTGGGAAATTATTAAAATGCTCAAGAGACAGTAGATCAAGCCTGGTGTCCCTTT CTCCATCAAATTTTAA 
AGGATTTTTTTTTCCTCAGTAATTCTGCCTTTGTGTATTTCCAAT^ 
7 5 CAAAAGAAAATGTATTCAGATCTACTTTCAAACTAACTGAAAGAGATAGTGGTA 
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TATGGTTGTTCTTTGTTTTTTAAATCTGGACATATCAGTAATCT 
TTTTAGATACTCTTTTAAAAGCCAGGTTATG^ 
ATTTTTGTACTGCATAAAAATGCAGTGACZAAGAGAAAAACATTAGAA 
TCTAACTCCATGTTCCCTGTCTXKSAAGCnTTATAAGGCCGTG 
5 ATTACTTTATTTGTATTAAATTTAAAATGGAATAGAATGTGGTACATCAC 

CTTG CCCCATCCC CAAGATTAAAAAAACACA^CTATTAGACT AATT AT AG TCATTACAGTTGTAAATT AGGTCATG CATTTCATT 
AAAAATTGACGCAGCTCCCTTTTTGAAAGGCTATGTCACGGA^ 
TGAATTATTCTAATTACGATACCACAAA 
GTTTTGATATGTAACAAGATTGGTATGAGTTArrrrACGTGA 
10 TGAGGTTTTCTCCCCTGCAGCAGTCGACTGAAAAAGTTA^ 
AAATTGACCTAATAAAAGATCTGTGCATCCTACT^ 
TCAAGGCACATTTTGGATTTGTTTTGCATA^ 
T^TAAGCCAGGATACTGTTAAGTAATAGACTGACGGTTTGTGGG 

CAAAAATGGGGTTAT ACTGGTTT CAGTTTGCT AACTGCCTAAG T AAACT TG TGTAGAATTTGCATTTTTTGCTTTGCAATT ACTGG 
1 5 TT ACTTCTCTTT AACATTACATTTG CATTCT ATGTA CAAAATTTATCCAAATGT ACTTACT CA 
TTGGACATTATTTTTAAATGATAAAGGAACT 

TAAAATTCATTTAACT AGACTTATTCATT CAACAAT ATTGTTAAATAAC CTAAAGGAATTTTTTTAAAG AAAT AGTGTAAACAT AC 
AAATATG AAT AATTTGAACTCCCTTTT ATAT AAATATAT CTGT AGTCAC TAACTGTAAATAATT T AAAAAAT AT ATATGTACACAC 
ACGTGCGTCCACACAGACACACACGTCATGCCTTAACAAT^ 
2 0 CATCATAGAGTATACTTACACAAACCTAAA^ 

CAAACGTATACAGCCTGTTAT C CTATTG AAT ACTGTAGGCAGTTGTAACAT AATGTT ATGTATTTGTGT CAAA 
CGTAGAAAAGATATTGTGTTGTGCTATGAGGTTATGATGTCTACAAATGTCACTGGGTGGTA 

CTT ATGGG ACCACCATCAT AT ATGTGGGTC CTTTGTTG ACTGAAGCGT^ CT ATTG ACTATAGTT AGTG AAAAAATCT 
AGTTCT ATG ATTGGTTTTGTTCCCT CCTT T CTTATG AT CACTATTCATTCAACACCTTTTATATGACATGCAGTTTGAAAG CACAG 

2 5 GGT ATG CAAAAATTATCCC CG'rr rTCGGTTG CAGTAT CTIGTTGCTGACCCAGATAACTTCATTTCCTT CAG CTGTGCTTT CATTTT 

CAACTTCTCTAGTTGGAAATAAAT CCTCTTGTGTTGCCCCTTGAGGGAG AAAT CGTTTAGCAGGAAATCACTTACTAATCACAGTT 
CATTAATCACATATAAAGAATACTTTATAAATGT CAACTGTCATTCAT AGAAAATGTATGTTA^ CT TG AGG AATGTCTATG AATTA 
CrTATTCATTTTCATTG C CTIT TCACACATT ATATTCCTTT AGGAG TTT ATT AATTT CAAATATAAATTGACT ATGACTTAT ACAGAG 
T(ZTTGCAAACAAGAGAGATAGCAGTTTTGCACCTT^ 

3 0 CACT CTGCC TGTAT AG CTTGTTGAAGG TT ACAGGCAGTGGAGTG CTGGAAC7TGGCTTATGTCGAGACCTCTTCCCAATTCCATGTT 

CAATGATGTCTTTGTTCATAGCTTAAAATTGGCTATGGTGAGAGC^ 
TCCTAGACAACCACTTATTAAACCTT^AGCAG<ZACTCACTGTTTATAGTGT 

GAATAG CCATGTAGAATTTT ATTT CCAGAT ATTTTGTTGTTTAAAACC CT AAG AACAT CACrCACTATACCAATTGCCAACTGAGT 
AACTGGGAAAGTTACTTAGCTTTTCTGTGCCCAAATGTCCTTATCTGTAAGACGGGG 

3 5 GATGTGAGGTTGAATAAG AAACACATTT ATTTCAG CT AC CTGTT ATCT 

CTCTGTACTTTAATTAATAGATATTTATTGATCCCTTACTAAGTA 
CAGTACCTCCCACATCAAGTTATTAGCCACATTGACCTGTGGCAAA 

CCCCCCCCGCCCGCTTTTTTTTCATTTGTTTTTGTTTGTTTTTTGGTCTTTCTGCCTGGGACTCTTi-r 

TG CAGACCTTTTCCATGTTTATATATGGCTCATACATT CCCTCCTCTAGAAAG CTTCTCCTCACTCTAGACCTGGCCACATAACTG 

4 0 TTTTGCCTCTTTGCTTATGTAGTTTGGAGTATGAATCATA^ 

CGACITACACCrrTTGTGTTCriTAACCTCTTAGTAAGAAATG 

TGGTT AAAGGAATGG AACATGGACTT C ACTCAT AG AGAC CTTACT ATTT GGTTG AAG AG AT AATGCAT ACACATGTG ACATGAAGG 

TAAACAO\AAATATATTGAAGATATCTGTATTTTATTTTTTTTAGTTTATAGTTAATTTT^ 

TAAGTCAATTATTTTCATGCTACAGATATGGACATTTATGTATTTAT^ 

4 5 AAGTAGCAAAGCCCTGTTTGTAAACCIAGGTTTTGTGAC 

CAGATATGAACTCATTCTTTTTTTTTTTTCCCCTGTGATTTC 

AAATGTACATAATATTAGAGAAACAATATAACAACCCCCTCCATATATCTATCAGTCAGCA 
CATCAGCCAACATTTTGCCCTTTTTGTTACCTT^ 

ATGTCACTTATACTTCrTCAGTAGTATCTCrGAQA.GTTAAGAACTTATTTTA 
50 CGACAATTTTATTGTATCATTTGATAGCCACT^ 

TTTTCCAGTTAGCATCTATGGTGTCATTTAGCTTGTTCTTCCATCATCCTTAir 

CTTTAATGTTATTT ATT ATT ATTTGGGGGG AACATTT C CTGGAAGGTT CTGT ATACTTTCTATTTCAT CAT ATCAAGAGG ACAT AA 
TTTTAGTAATCCCTGCTTTTAATGATGCTGCIAATCTATCAG 

ACITGGATCCAATGATTTCACCTCATTGTTGATCATTTTGTAGCCATGATATTTT^ 

5 5 AT AT AATTT CTG CTGCGTTT ATT AT CAAG AATTTTTTT CAT AAAG AT CTTTT C AT AGTAACH" AATGGT CAACTT CAAAATAT AGTT 

TATATCAGAAAAAGGGAAAAAATGCATGATATTTATCAATTTTCAGAATCAGATATTCTCTTCT 

ACACAT AT AATCAAGGATG CCAGTAAATT TTT CT AAAAAATTTTTCC CTTG GAAAAATT ATTT ATT T ATTT ATTT ATTTATTTATT 

TATTTATTTATTTATTTATTTATTGAGACAGAGTTTCACrATT 

AACGTCTGCCTCCrcGGTTCAAGTGATTCTCCTGGCTTAGCCTCCCGAGTAGCT 

6 0 AATTTTTTTGTGTGTATTTTTAGTAGAGATGGGGTTTCATCATATTGACCAGGCrGGT 

CTGCCTTGGCCTCCCGAAGTG CTGGGATTACAGGCGTGAGCCACTG CGCTCGG CTGATAAATTATTTGAATTGCTTCTCCTAATTT 
TTTTTGATTAATATGTTTTGATAACATTGTTCCTATGGTAATATTATCAAAAAAGTTTTC 
TGTGCCTCTGTATGGTTGAATTTATTTACTCAGAAGATATTTATTG^ 
AGGATGTAGATATGAGCCC!TGCTTTTTTCTTTTCTTTCTTTCTTTT 

6 5 AGTGCAGTGGCATGATCTTGGCTCACTGCAACCGCCGCCT 

ATAACAGGCATGGGCCACCATACCCAGCTAATTTTTGTATTTTTGGTAGAGATGGGGTTT 

AACTCCTGGCCTCAAGTGATCTGTCTGCCTTGGCCTCCCAAAGTGCTGGGATTATAGG CTTAAGCCACTGTGACCGGCTG CTTTTT 
TCCTAGAATATATAGTTTGGGTTTCTTTTCCACTCGTGATGTGATTTA 

ACTATAATATTTGGTATCATATACTATATTTTTTTCTATATATATATATGCTACTATATAGTCAT^ 

7 0 GCTTTCTTTGGAGTTCTTGGTCrrTAGTGTCTAAAATAGTC 

AAAATCAGATGTTTCTGTTTAGGACACTGTATGTGAC^ 

AGAATAATGAGAATTCCTCCAAGATTAAAAGTAAATTATAAGATCCTGTGCTATTTTAATC 
ACTTCTATTTTCAGTAAATTAGAGTGAAATTTTAG 

ATTTTGGATTTT AG AAAATG TACAGTTTCT AAAGGCTTGT G AACGTTT TATAT TGATAGAAAATGT AAGTAAGAC TAGTAACACAT 
7 5 T ATT TT CTCCAAAGGATCCTGTGAC TGACACAAAAAAT ATGAT AT AATATTGAAT CTGC CAAAAGATG CACTGTTAATCT AGGGTT 
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ATGTTATGTTGTTATTTTATTAGGTATGTGATACT 
TAAACATCTTCTGAGATGTTTTCTCTAAAGCAGAAACTCAGCTT^ 
► • TGAGATTAGGGTTGAGG CAAATTGTTG ACATATAATTAAAATGT ATTTT AT AACT 

GTATTTATAAGACTTTGCTAACATGCTTTATC 
5 TTTCCTTTGCTGGAGTACCTTATCAGGAAATTO 
TTTAACTTTGAAGGTCAACATAT^ATTTTT^^ 
TAGCTTAATCTGAACAGCACGAAACTTGACAAAAAAGTTTGCT 
CTAGTACCAAGGGAGACGGTAGTAAAATTATAGAAGGAGCTAGTG^ 
AAACTC^CAGAGTAGAATAACAGGAAGATTAGTCACTTCTGGAAG^ 
1 0 AGGAATGTXSGAACTOGAAGTGCTAGGGCTAAAAGAC^G^ 

CTGGTAGCATTTAATATCAAAATTTAAATTTTATGTTAACAA^ 

CCTGTTAAAATAAAAAGACTTATATTTACTCCTTTGTGGAGCTGAATTGG 

AAATACATTTCTGAGGCCAAGGCAGGAGATTTCTTGACCCCAGAA^ 

ACCAAAAAGAAAAGAAAAGGAAAAGAAAAGAAAAAAAATTAGC CAGG CATGG TGGTGAG CACCTGT AGT CACAG CTACTCAGGAGG 
15 CTGAGGTGGGAGGAT TG CATGAGCTTGGG AAGTCAAGGATG CAGTG AGCCATGATCAG CCATTGCACTCCAG C CTGGG CAATGAAA 

CAAGACCCTGfTCTCAAGAAATATACATATATATTTTATCTTATATATATATATCTTA 

CATATATAATATATAATATATATAAGAAATATATATATAATTTCTTTCAAAACCGGTCAAA 

AAAATAAAACCATTGTGGATATCATGGGATATATGTGCTATTGGTTTGAAATAATTT^ 

GAGTACCAATCTCTAAATATTTTAAGGATTATTTOTATGAATTCTCTA 
2 0 AT1TAGCAATCCTTATAACAGAAAACAAGATAGTAAACATTGTGTGTATGTTCATGTG 

TTAATCCCTAGTCTCTACTG ACGTTATTAGGGACATTAGTCGA^ CTGAACACATTTTCAG CCACTCTGAGTGTTCTT 

AGGTATTTT ATTT ATTT AGAGACTGAAT CTGTGTTGTC CTG CCTGGTCT CAAACT TCTGGGTT CAAGCGAT CCri'CTGC CTTACTT 

TCCCAAGTAT CTGGG ACTACAGATGCACAC CACTGCC CAGC TAAAATGTACTTTAAAT AAAG TAG AT AATTCTGGGGTT ATCTCAA 

2 5 GTTATATTAGCATGACACCAGGACAAAAATCTC^ 

GCTTATGCCTATTGAAATAAAAAATTCAGTCTAATTTAGACACCTAAGATTATTATTTGCTATAACGA 
GTGTCACTGAATTCATGGAAATGCAGTACAGAATGCAATACAGTAATCCCCCCTTATCTGC^ 

AC C CATGGTACAACACGATAAGATATTTTG AGACAGAGAAAC CAT ATTCACAT AACTTTTATTACAGT ATAATT 
GTT CTG'IT 1' 1 ATT AT AGTTATTGTTGTTAATCACnTACTGTGC CTAATTTAT AAATT/iAAC TTT ATTGT AG GT ATGTAT AGGAAAA 

3 0 AT AATAGT ATATTT AGGGGTTGTTACTGT CCATGGTTT CAGGGATCCT T ATT C C CCATAGAT AAGG ATT 

ATCT ACTATATACAGT C AGGTAAG C CATT CCCATTGATTTT ATTACAAATTGCAGTGCTTCCATGGCTT^ 
TGAGGTAACTTTTGCAGAAAGTAACCATTTTACGTGAACACCAAATACTGATTA 
TCTCTCCCTTCTCCCTCTCTCCTTCCCltaTCrTACTCCCTCGTTCCTGCTCTTCT 
CCCTACACACAC^CACATACACACATACACACAC^CACCCTGCCCCT 

3 5 AGAAAAAAACCITATCTCAAACACTGAAAGTCCITCC 

(TrTTAGCCAGTTTT ATTTAAAGCAT AAAGTTGTGAGGACAA CAGG AATTTGT AATGTACCATT AG AG CTTTTCACITTT 
AGG ATGCTAG TTCATATTCAGATG ACTTAGT ATTAAAG ATTGATGGGTTGTTTACT^ CT CACTGTAA 

CTTCACTT ATTTTGAC CT AAGT ATG GC CTAGAGTTT ATGCTT ATTAATT CATTT ATTTC TCG GTAAAG C CATGT CCAAGG AATTAA 
AT AAAAGGAAGCGGAC CCAGT AAGG GTTGATG AATAAGAAG CT G TTACC TT ACT CATTT ACATGT ATT ACTTCAAATGTTATTGAT 
40 TGATTGATTGATTGATTGAGATGGAGTTTTTGCTCTGTTGCCCAGACTGGAGTGCAGTGGCATGATCTCA^ 
GAGTTCAAGCGATTCTTCAGTCTCAGCCTCCCAAGTAGCrGGGATTACA 
TTAGTAGAGACAGGGTTTCATTATGTTGGCCAGGCTGGTCTTGAACTCTTG 
TGCTGGGATTACAGGTGTGAGCCACCGCTCCCGGCCTTGATACCT 

G CAGC TGATTG CATTG CCATTG ACAT CAGTGAGAAAT AAAAACTATAG CT ATAATGGTT AAG TTGAGCAG TTGGAAT CTGTATGT C 

4 5 TGTATCTATCTATGTTTATTTAGATGACTGGATAAAAGAGTCATGATTCCTCCTAGTCTATC 

AATCATTTGTTATGATCTATCTTATGTAAAAGCCTGTGGAGTTACTTTGTTTTCC 
GJ^TTTTTGAATAATTTTTGCTTAAATTCAGTGTTA 

CAAG CTTATG TGGGTG CAAT ATATAACATGAC TACCCAAGAAATT CTG C AAAAAAATGT AGTTGT CTAGCAATTAGGT ACAGAG TT 
GTCATCTTCCTTCTATGATAACTTTTGATT^ 

5 0 ACATGG GCACCT AAATAAGTT CTTTTTAGTAAATGCCAC CAAATGGCTG C TTATGG CTAAATTCTGACAGAGGG 

GAGGTG CCTTCACCATAAGAGT ATTTT CTCTTTGAAGTTTTCATGGTTTTGTGTGTGTGTGTGTGTGTGTG 
GTGTGTGTGTGCGCGCGCGCATGCGCACACACTAGTTTATATTAGGTTAGAGTT^ 
AGAGAGTTAGACTTGCTGCCAAACTAAAGAAATAGCTTGGTTTCAGTATTCTTCTGGCTTCTCAGTCT 
GTGCTTGATTTGAGCTGACTGAAATTAACCCAGATGGCCCACTTTGGGTCCTCAAGAGCTAATTA 
5 5 GAATAATCAAGGCCGTACCTTCCTC CAATGTTGGTTTCAGCCCACAC CTTACAGATGAT CGCTGGCrTTCTGAAGCTCTCTCTAAA 

CTCATAATTATTGTTTGGACCCTGCATGTAACTAAA 

TAATC CC CC7yU\ACAGAATGAC^^ AT ACAGTGATGGT AATAATGTT ACT G C CCTG TT AATG CCTC CGAAG ATAAGAT ACACTTGAA 
AAAT AATGTGAAAACTGAATTTGTC CTTGATTTG AAAAATCTG AGAATCACATAA^ TT AATTT CTT AAG CTGGTAAAT A 
TCATTAAGAGAAATGGACACATATAAGATAAGTTTG^ 

GTCAGTAGAGAGTGGGCATAAAACGACAATlTCAGGAGCTGCATTCTCrrGGGTCCAGATT 

AACTTT TGGCAACTTAACATT ATATGCCTC!AGATTTATCAGATATACCCT ATTAAATGG CAATGTG TCTAC CTTGTTAGGTTATT G 
TGACTCTT AAATAAGTT AATACATGTAAAGAACTTCTGGAATTTCAGGGCACACAATGTTACCT 

65 ATTAGTGGTGGAGTTTTAATGAAATAATTTAGAGCTCCAGCTCTGG 
AGTTATGTAACCTAACTAAGCCTCAGTTTC 

TT AAATGG G AT AACACGT ATAAAG CACAGAGAACAGTGCTTGGCACTTAATG AGTCTACAGT GTGTGT TAG C TATTGTTTT ATTAA 
TTCTGTCATGATGAAGTGAACACTGTGATGGAATACTTTTATACTTAGAATCTGCCTATATAGGGT 
AGTGTTATTCTCTGCTGTGTAAAGAACACATGCTTTGGTGTTTATAATTTTTCT 
7 0 TGTTATTTTACTTATCCTTTGTTGAGATATTTAAATTATACAAAAAAACTC 

G G ATAAAT CAGTTAGGT AAATG CATT ATTAATGTTGG AAC C TGCTT AACTCTGTTGT C CAG C CACATTGT AG CCACTGGTT AAGTC 

GAACGCCACTAGAAAGAAAAGCAAATAATCAAAAACATCAAAATGGAAATGAAATATGGAGCAAGGAAGTCAT 

TAAAGGTCTGAGACCTAATATTTTCCCATTGGTTGGAAGTATTGATTATGAAAACATTAGAATTTTCA 

C&AGATGGCAG AT AGAAT ATT ATG AAAGAAATGAC CAAAATTG CAT ATG ACATTTATTTT AT ATGGT AACATTC^IITCTIT CATTA 
7 5 ACAGGAAATTTTAGGGCAATTATTTGCTTATTGGGAATGAGAAGGATGA^^ 



2079 



WO 03/008583 PCT/US01/51291 



AATAAAGTAAACTTTTAAGAGCATGTTTTTCCATAGAAT^ 

GGGAGCAGATTTTCAGT A.TGAT CACAGGAAAACACCTGTTTTGACACTCTGAA CT CTGTAG 

GTGTAAAT AAGAATGCTTTGGCTAG CATC CTGCW^GGAGATGGGTCT AAATGGTCTG CTCCAG CTITCTG CTG CTGCTGCTGCTGCT 
AATATTGCTGGGAAGGAAAAGTGGCTGACCTGGCATATCC^TTACT 
5 GGAACCTTCTGG ATCCTTTTCATGGCACCATGGCAAGAAG AAG CTGTAT CTT AT CTATGGAAGATAAAGCATGGAGTTGG CTAATG 
GATGCTGGTGAGTGAATAAGGC!AATTGGGGAGCCAGTT^ 
CATCCATGGGAAATGGCTTTTGGTATTCTGTACATO 
GTGATTCTGTATATGTTCCATAAAGTTCTCATTTATTCT 
AGTAGATGGAGCATAGTTTTCCATGTAACTTCTTAGTCACTAATOT 
1 0 ATAACTCTGTGGTATATGAAAG CAGTTGTTCOT CCTTTCAATTOT 

TTITAAATTATCITCCCTTTTATGATTTACATCAGA 

AAGAATTTTAAGTTAATCATAGAGATGTAGGTCAAATTTTCTT AGAATTAT^ 
AAGAGCTATGTTTGGGAACTTACAGTGGCATAGTAAAOT 
CAGATTCATTTGACCAACCCCCrrCTCCTCAACT 
1 5 TGTTTATTGAGCACCTACTATGTTGCTAGGCAAACTAGGTG 
GCGATGCTTAACAA.CAGTTATGAAAA.CCITTGTTTAGA 
AAAAATGTAATCTGAAGAGCTTCATAAGTGAATAAACCTACAAT^ 
GGGGTTTGTTTCATTTTTGTTGTTTAGATTGTTAAAAACTGG 

CAATTGATTTTTAATTTTTTATTTAAAGTCAAA CAG C CTCCATTTG GATTCTGCCATAG AGG CTT 

2 0 ACACAATCTTTCTGCATTCAAACCTATCACATTGACATGATTATATTTT^ 
TAGGCACTCTCTCTAAGATGAAAAAGTATTTTTAAAGGAAAATTTAGA 
TAAAATACTAACTGCTreAGGCAGTACTTAACATGTCATAAGG 
TATCAGACAGAAGTAAACCTTGCACAAGCTATTTGATAGCTTT^ 
AGAAATCAATTTTCCCAACACAGGATAGTTTGAACCATTTGTCT 

2 5 ACTAAATACCTTGXSAATAGAAGAAATTAGATATCCGAAGTCAAGAATTACAC^ 

TT AACT AATTACAAGT ATAAAGTCCATGGTT AATTTG CTT CG AGT C TTG AAAAATTG AG AGTT CCT AAATGATTTTGAC CATTTTC 
AT TTTCATTTCAT ACATATT AGGAATT ATT AAAGTC CTGT CAG AGTTTATACTGAACCTCTG ACTTGATTTTATC CAG CT CATCAA 
AGCAAAATTGTC^AGTAGCACTCTTTTGTAGCT 

TGAGTAGTTGAAGAGCTGTGAATGGTGGTTAACAAGCTATTATCATCTCCT^ 

3 0 TCTTAGACAGTGATTAGTCTATCTGACAGTGCATCAAAACTAAATTAGTTGGCCAAGCACGGTGGCTC^ 

CTTTGAGAGGCCAAG<5CAGGTGGGTCATTTGAGGTCAGGAGTTCGAGAACAGCCTGGCCAGCATGGTGAAGGCC 

AT ACAAAAATTAG CTGGGTG TGGTTGTGCACG CCTGTAG T CC CAG CT ACTTGGGAGGCTG AG C CAGGATAATTGCTCAAACC CGGG 

AGGTGGAGGTTGCAGTGAGCCAAGATTGCGCCACTACAATCCAGCCTTCCAGC^ 

AAAAAAAAAGGAAAAGAAAC^CCAACTAACTAAATTAGTTATATTTTCTGTCCTTCCTCTT CTTTATCCTTT T TC f TTTTCCATCT 

3 5 TTTGriCrrCTCATCTCrGCACTGCTTATCTACTT 

CATCTGATCATAACAGAAAGTGATAAGTTTGATAAAATGTATATTGTACITACTAAA 
ATCAGTTGTTTAACCTTCTCTCTGTCTTTCCGTTTTCTCATCTAATTCCCAATAATGTTG 

TCAG CAC C CTGGGATT TTTT TTTTGTATTTG TAAATTAAATATTG CG ATG ACAAGATAT AAT AATT TTATGTCACTTATTAGATTT 
TTAACCATAAATGCCGT AGG CATG CT ATAAT ATGTAT TCCTT AC T TTTCTTTCT AG AACT AAAT CT CCATACC CACTTCATCCGTG 

4 0 TTTTTGGCTTATGTATGGGATGCTACAATGGCCTATCTCCATGTATTTTGTTGCATTTCT 

ATCTTGGTGATTCTTrrCAAGCACTACCTGAGCTCTGTG CCAATTGTTCCTCTTCTCCCAGGGTGTTGTGCTG CGTGGTCATGTCT 

CCACTTCCTTAGCCCTGTCCATTGACAGAACCTTGGGTTCTGTGATGGCT 

CCCCrrcCTGCrACAGTTGAGCACCGTGCTGGGTACCATGTTG 

CTGTGGCTCAGTGATCCTGCTGTCAAAACCCTGAAACTTTCCTAGCCTGGACACTCAGTGGTAGCAGCAGGTGT^ 

4 5 AAGCCCCTAAGACTCTGGGAGGAAGAGAATC^CTGTTTGACATAGACCT 

AT ATGT AAAGGTAAGATTC TG ATTTCTCT AGG AAG CCGTT CTAGAAG CT AAAAACAAAAC C CTGTAG CTATGGATTTAATATAAGT 
TAGTAGTTAAAGAATGTCCTTGGGAAACTCTGTGTTTGATCAAACCAAAATATGAGTCT 

TAGGAATTTTGTGTG AGATT CAC ATT AGTT TTATTTATATT AAATTTGGCATGGAACAG ATTTTT AAAT AT TCA^ 
CGGACTCTGTGAAGCACTTTTTCACAATCTGCATTGTATATCTAGTAAG^ 

5 0 ATAGAAATTATTTTTTCAAATACCGATGTTTTATTTGCCT 

AATT CTTCAAG CTGAAATT CTT AAAT CCAT AATT AATT CAAAAGG AAT TAG AAG AG CAGTT CAAATCTT AG TTG GT AGTG CT AGGG 
ATGCATTACTGTCCTTTGTCAAAAGTTACTGCTTAGTATTCT 
. AAAATTAAAATAAAAAAACTTTTTGGGAAAAAAATTCACTACAT^ 
GGTAAGGGTGTGTGTGTGTATGATGGGTAGGGGAGAGAGATCAGCAGCATCT 

5 5 TGAGTTTGTGACAGCTGTTATTCAGTGCCCCCTTCCTTTTCTTTTTTTTCC 

ATATTAAGACACCTCTTCTAATATGAGATC^TTGTTGGCCTTACTGTC^GTTTAGGTCTrCTTTTTTTC 
T(2AATATTAACTTAATTCCTTAAATGGTTTCCCCTTGTTGAAATTAACXSCrATT 
AATGGCCAAAAATGCAACTTAATTTTTAAAAATGAAGGAA 
AGTGTTTATTAAGGATGCTTTTTATTTTTAGTAAACTTTATTTTTATTGATATT 

6 0 TGTAAGATGAGCAAAAGAACAGATACCCTAAATCTCAAAGAGCTTTTTCTT^ 

GATTCCCCAGTGTTGACATCTATTATTTTCTACCTTGAGAT^ 

TTG CTCAT C CCTGTAATC T CAG CAATTTGGG ACGT CAAG ACCAG AGG ATTG CCTG AGG CTGTAAG ACCAG C CTGGGCAAT ATAGCA 
AG ACATTGT CTCTT Oy^G AT AAAGAAACGAACAACAACAACAGTAAAAAACT CAG G CAC CTGTAG TCCTAGTTA 

CTCAGGAGACTGAGGCAGAAGTATTGCTTGAG^ 

6 5 AGACCTTGTCTCAAAAAGAAAAAGGGGAGATAATATCAACATGCAAAAACT 

CCTCTGTAAGATTGTAATGACAATGACTGAGTTTTTTCCTGCCAAGTTCA 
TGAATTTATAGACTTATTTCTCAGAGTATATTAAAAATAGTGGAAG 

TAG CTACAACTGACAG GAT CTGTTCACCATACAT ACTTC CAAGTTTGGAAACT ATC AG CAGGCACCTG CAACT AAACCCACAAGCG 
TAAATGTTCCAGAGTGTTAGAATCTTATTCGCATAATAC^ 

7 0 ATTCCGTGATGGCTGAGAAGCATCCAGGCCTGCTAACTCACTGAAGAAGAATCCAGCAGAAGGGGACTGGAC^ 

CATAACCACTGAT CCCACCTAAAGCAG CCAGTGCTGCATAGACAAAGC CAAG CCAGTAT AAAGGCAC CTGTGGGC CAG GGTCCAT C 
TGCATTGTTCrCTTCTAATTCTCTCCAGACAGGAGACrGCA 
ATGAGGGGACACAAACTGTACGATGCTTATTTTTTAAAATGAA 
AGTTTATTCCCTCTAGACTTCAAAGTATAACTAAATGAAATGTTTTG 
7 5 AG AAAAGTGAAATTTTCATGC TTTT AT TAACAC!TT ATAGTGAGTTTAT AATTTTTT AAAATGTGTGG CACAGT CT CAAG CT AGTGG 
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TTAGCATT ATTTG CIAAGTTTGTCT ATAGAGTGCATTTTGTCTTTTCT CTTAAATTCCCTTGGG ACAAGAAATGA 

AACAAATATAAGAATGTAAATTATGAATTCTCITTCTTGG^ 

TAGAACACTTAATTGACAAACAGTATGAACTACT^ 

T TOCAGTCATAATTTCTATAAGAATTTCTGC CATCACTC^^ CTCCC 
5 TTCTGTTCTGTTCTGTTTTTGGAGAGC^TGGGAGCrGTCCATAC^ 

TGTCAAATATAAGCCTTTTATGATACGATTATTTCCTTTGrrGTAATAAATTTCT 
AAATTTAAAAATAAAGTGTGTACAGAGC^GCTAAAAGAGGTGGCAAATATAGGGATGAGCC 
AAGGGACroTTTCACTTCAAAAAC!AAAATAT^ 
AATTAAAATGGTGTCTTTAGTTAATGGTAGTAAT 
1 0 TTTCACAAAAACACCTTGAGGTGTAGTTAGCATTATTTCACCT 
GCTGGACACAGGATTCAAACTCAGGAAGTGTGGCTC 
AGTGCCAGTTCTCAAATCAGTCCTAACCCTTTTCATAGCCTGGTACC^ 
TTGGGTTGCCATTCACrrTGCTGGCTATTTGACCTATCCTCTCTCAG 

C CTT CTCAGGGTTTTTATG AAGATTTAATTTT ACGAGACAG A CACAATGGCTAG CAT ATAGAAAATAGC 

15 CAATAAATTATTGATGTTATTATGTAATGT^ 

TTACTCGGAAACT AAACAAAACACACACACACAC^ CTACAAAGCAAAA 

CTCGTG CTCTTTCTGG ATTAT AACACT ACCT CT CAACT CT AACCAT AACTTTTTAG ATGAGT AAATAAGG ATGTCATGTATGTG CT 
* GAAGGGATAAGTAG CATAGTTTAGTGGCTTAGG G CAAACTTT CAAATTGAGGTTT CCATTGTGTGACAGCAGG CAAGTT AAC CCCT 

CTGAGGTTCGTTAACTACACATATTAAACTAAAAACAGTAT CAGTT GTATTGCATT AT CATGGATAAT AAATTTTAAT ACAATTAA 
2 0 AGAG CT CCAATAAATGTTGG CTTTT AGTAATTQATATAG A CTTT AAAAAT ATATT CACAAAATTGGGATGCATAG 

TTAGTGAAATCATAAACATGTAAAGTCCTAGAGATGGATATAATCTTAGTAGA 

CT CTGT CAAAT CCATTT AAC CT CTCTGAAAGGTCAGTGG AG AAAGGGGAAGAAGG CATCAG AGATCAGGACTGACCCAAATATGGG 

GTGGTTGTACCTCC^GAGATGTGGGCAAATTTTCTCAGTCTT 

TAACCCCTGTTAAGCAGATGATGGGATTATCAATCTTC 

2 5 GTCCTGAAGAGAGAGAGAGTTACCTGGACTGAGTTGAATGCCAGAGTTTAAC 

TCTTGGAGCTC!AAGTGTCTGGCCTGCCTTACTAGATTCTGAACACCT^ 

T TAGAACAG TGTCTGGG AAAT AGTGGG CACCCAATG CATGTTTGATAATT AAATAAG AG CAT AAGATCCAGAAACGGTACAGAAGA 

TCTTGTTTAAGGATCCCAAGCGATCAAAGGGGTTAG 

GGTGCCATGGAGCTGAGGGTGCATAACATTTGTAATG 

3 0 TAAATGT AGGAATAAAGTATTCCAAAT TGTTTAAAAAATGT^ C CATTGGAATC C 

ATTTATATTAGTTTTTCATACAGTGAAAAATAATATTTCT 

TTTATAAATGTGTCATTGATATATTCCCTTATAAATAAGTTATTAAATAGC^TTTTATGTTTATAAT 

TATGTATGAACGTTAAGAACTTACATGGATGATGGGGGAAGTAAGA 

TATAACTTTGAGGTTTGTTTTTTAACTATAATTTTACT 

3 5 TCAGAATCTAAGTTAAAAAACATAAAACATAGTAAAAAGAACAAAG^GAATTAAC 

AC!ACAAAGGA.CAAACTCATACTTTTTT^ 

GTGTGT ACTAATGGGAG ATGTAG CAAAGTTTAG TGGTTG C CACAAAGGTGATTTT AGGTAG T C CTAACAGGTATTGCAGTG ATTTG 
GCGTGGTATATCAAAAGCCAGAGACACTCGTACCTTCTATTAAAAC^ 

4 0 CCAGTGCTCCTCTCAGTGTGGAACTTTCTGGTTTCTCAAGAAGAG^ 

ATTTTCGCTTTCGAAAACATACATTGTTGTGAAAGTAAAATATTAAATGTAGAG 

GTAGACTATAGAGTCCATTTGTGCTATTATTATTTTCAGATGTGCCTGTAAAGTGGCTTCTTTTGCTGTTCTTTTC^^ 
CTCTGTATACACAAATGTCCATAGAGATGGAAAGGC^ 

AT ACTAGAGT AGAAATTACGTGACATT AATTCTCTGT AA'IT CTCTAT r l'Cl'Tl' CCTTTCATG GCT ATGT ATG CCTG AAT TGTATGT 

4 5 AGGTCTAGCATACCATTTGGGCATGAAAACGTTGTTTTACTTCACT 

TGTTTTTAATTTATTTTGGCAATTCACTGTGGGAGTTTTAA 
ACCAAAAGTTAAACTCATCGTAAAACCCTTTACA^ 
TCCTGGATGTGTCCTTGGCAGTTTAATCTGCTTTATTTTACT 
ATCTCTCCCTGAGGAGAAITTTGTCATCAGAAATAAAAGTGACAACA 
50 GGTAATCATCACCATTCTTGATTTCCTATGTGGAAATAGGACTTTGATATA 
ACACATGGAACAAGCCAAATAAATATTTTTAAAATA 
GTAAAACATTGATAATACATTAGAGATAATTTGCAGAAACTACTAATTT^ 
CTGGTTTAGCTTTATGGTTAAAAGCTVTTC^CTCTAGAGCCAGATAACACA 
TACCTTGGGTAGGTTACTTGACCTriTTGAGCTTCAGATTTCTAATCTTTATTTAA 

5 5 TTTGAGGATCAAATAAGTTAATAAACCACTTAGAACAGTGTTCAATAGGCT 

TTTTGTGTTATGTTTTGTATCCAGCAGAAGTG4GTTATTTAGCT 

GAAAAAGT ACAACTGT AAGT CAGTG G A&GT TTTT CCT AGT CAAGATG T CAAATG C CAG AGGT CGG AT T CATGTGG CAGGATG GG GT 

CCAAACTCTACAAAAGCTTGGAGATGAATGAGGTGAGACAGCTAAGCTTAAGACAAAAAACA^ 

TGTTTGCAGTAGTATAGTGCCGCTTTGGAAAACTTAGAAG^ 

6 0 TCATAGACTCAGAAATTTCTACCACTATCTGTTTAAATCATCGTT^ 

GATACATTATTTTTAAAAACGCAGTTTTCCTT(CTTX3ATTTTTT 
TAAGTTATTATCTTACAGAATAAATCTGATTACTX3ACTA 
TTACTTATTTGTACCTTATATGACTTATTTTCAGGTTAGAAAAATGTGGTC 
GAAGCCACCATTATCATTTACGTCAAAAATGAAGGTTATCTGTTGCC 

6 5 ATAGTGAGGGTTTCATAAGCTGTTTCAGTGATTTCTGACCTTCGG 

GACATTGTCCCAGTCTGC^rTACOyUlTTGTTTCAAATCACTTTAATTAAAW 

TGT AATTCCAGCACT TTGGGAGG CTGC!AGTGGGAGG AT CG CTTGAGT CCAGAAG GT GGAGAGAAGC CTGGGC^ C 
C TG TCTCTAGAAAAATTTTTTT AAAAATAAGAATAAATT AT CTATATTGTAATTT CAAAGT T TAGG ATT G AG AGATTGAG CT CATG 
CTTTTTAGTGTAATTATAATTTTGGAGCTATGCTGACATATTTGTAAACAAACC 

7 0 CAACTATACTGTATTAAGATTGCTACATGAGAAGAGTTCCCTTTAAAAA 

ATTAGAGAACOiAAAATATTTAAAAATATTTTCCATGAGTTGAA 

ACACTTT AAAAAAATATAT TTTTAT AGAACCTCAAATTGAGTT AAAATTATACTTTTAGT CACTTTTAG GTT AT AGTTTT CATATC 
AAATGTACTTTAAATTTAGGTTTTATGTCTTTTAAGTCTTCAAAT^^ 
AGCATCAGGTCAGTTTACACTGAAAGCTTTCAATCTGTTTGGCATTTT 
7 5 AATCATGTAAGAAGACAGTGTGAAAAAGCACTGGACTTGGAGTCAGTTATTTGG 
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GTT ACTCTGTGCGTGT CATG"i~i"i u rCTCGG AAGCATAC CTGCCGC CATCTTGACT CC TTGGTTGAACAAATCTAATAAAGATAATGC 
ACAGCACAjGTACITTTTGTAAACTGATACTCATCT 

TATACTGTGAAAGTCAATATTTTTATGAATTCTGCTCATGAATCCCACAACTGTATGTTG 
GTAATTAT*rGTTTTTATAAGGATAATTGGTACT 
5 CCTCCAGGATGAGTCATAGAI M rrTAAAAATTAAGATGATATTTACCA 
AAAGAGTTAGATACCTTCCAAAAGAGAAAAGGAAT<n\AATGGG 
GAAAAATGTATTTTTAACAGATACTAGAGAATATAGAAGTTGCTATATCAACAAAA 
CCAATGTGTATTTACTGJATAAATGAGTATGTGCTAGTGGAAATCAATGAG 
AGTAGGTAACACATTATTCTTCCTGTTTATTATTTTTA 
1 0 CCTGGTGAAATTCCTCAT'rr i' CTCTTGTGGTAAT AGG CAGTTGTTATAGTACAGTCATAAT CTTTTGGTTGTAATTTATT ATGTAG 

GTTGAATTACTCTACATTTACAGAGCTTTTGGGAATCAGGAGATGAG 
CTAAAAATAAACATTGGCCCATGTAATCACATCCTAAATA 
TTAGTGTGTTTTCTAGCCCCATTTTCATTTGATTTTGAT^ 

TAG CCGGCAGCAG CAATTG CAGCAATATCCAAAGTGTATTGAATG TCTTTGGTGT AG AAAAAGG CACTGG ACTTGATATGAT CATA 

AGAGCATAATAAGTCTTACTGCCTCCTGGGAACAGAATGCATGGTTACAAAAATAAACr 

ACTTTGGCTGACAGGGAAGCAAGATCAGGTTAGCAAAAATT 

CCCTTGTGCTAAATAGAGTATGTATGTACAGG 

AATTATTTGTATCTATTGCCTTTATCTTAGTGCTGATTTATGGGGCTAGATTT^ 
2 0 GTTTGAGAAAGTACATAAACAATATTGAAAAAACTAAACATCAGTGGTGTT 

TTTTAAAAGAATAGCTGAATGTGACATATCCATGGCATGTTAAAAAATGAAAAAAAA 

ATG CTAGT13T ATAATTTTT CTG AGTTGCACTATAAAAAATTTTT ACTCTTG CAACT AG AGGAAAT CACTCAACTGGAAGTG CTGG A 
T AT AATTTC TG ACTTATCCATT AGT CATCCT AAT AAT AAG AAAGT ATTT AGTT CCTTTT TTTCAT AGCGGT CCATCTTAAACAATA 
AAATTAGATAGCCCACTCAAAGATATTTC^GGAAAATTGAGGTGTTGCTTC 

2 5 CTGTGTGTGTGTGTGTATGTGTCTATTTTAATGCT 

ATTACCTTGAG ATTTATAGTTGGG ATGACTAGGC CTGGAG C ATTG CCCTTTGCT AAGAT AAG AAATAT AGGTGTTAAAATTGATGT 
TTT CTCTTTCTTTTTAGGGAGAACATGTAATTGGTTTGGGAGG 

GATAGTGC TGTGAT AGG AAGAAAATAC C C^VGAG AGAATGTACAGTAAG AAGAAGGAATCTAAAGACAAAC CTTGG CGCATCT CTCT 
ATAACCAGCAG TCAGAAAGGAGGG AAAT CAAAAG AAAGTGTCAG AGAATGAGTAGGTTAAT ACATAG CAGGTAAATG AGG ATGGTG 

3 0 CTTTGCCAAJGG CAAAAG AGAAAAACIATT CAGAAAGTAACTAGTTATT CTAT AGAGAAG ACAAGAAATG AGAAAAAAATCACT TTGG 

CTGTTAAAAGGACAGTAGTAATCTTTGAAAGCAGATGAGTAGAATTGGGAGAGAGGAGGTAGAATTC^ 

AAGAAGATGAGTCACTAGTAAATTGGGAGGTAATGCATGAGGCCTCAACT^ 

AAGAGAAGTGGGAGTCAGCTGAAGGATTCAGCACAGAAGAGGTTGTTGTAGA^ 

GCATGTCTTTT TCACT AAG GGAAGGGGCAGC CGT CAGTG GAGAG T G AAAG AATG AATG ACAAAT CGCT AAAAT AG GGAAATAT CAG 

3 5 AGAAGATGAGAAGGGATTAAGAAAAAGTGAGTTAGAGAGATTGGCCTTGAATGGAAGATTAACCAAAAACTTCCT 

ATGAAGGTGCTTAGCCAGATTCCATTCCCTAATACTACCTGCATTrTGGT 

CACACTCCTTTCTTTACTCTTTTTAAAAATATTTTATTTTAACTTTG^ 

GGACATGTTTTATCCATGCATGTGTTCTAATGTGTTTCG^ 

GCCTATCGATCATAGTACAAGGCTGATGACTAATTAGCTACCTAGGTGAACCTCAAA^ 

4 0 TAATGAATGGGGGAGATTATTATGTATCAGACTTTTTTTTTTCCTGTTTTTGGTTAATA 

AGAGTTCAGAAGTGGAT AT ATCCAAAG C CAG ATAG AAGTTGT AACTATAATT CATTGTAAT AAG TAG AAACATTT AACTGAAAAAA 
ATGAAAAG ACAAAAAT TTGCAAGG CAAATAGTTGTT AAATGCCATGGTTT AT AC CACTATG TCAAACAAATGAG ATTTCATTATGT 
CATTATATTTTGT CTG ATCCAT AC CAT AAG CACAAATCCCTTGATT AGAAT AG TTG GATTG CA'lTTi'CTGATTCTT ACATAG CAAA 
AATAACTCTAAGCAAAAGTTAACTAAATGTTGTCTGCCTCTCATGTAGAAAACATG 

4 5 ATTTTTTCTGTAAGTTATTAGTTTTCGAGTTTTATTAAAGGTGT^ 

TACTTGT ACATAT ATGGTACAACAATTT TIT AAACACCTTTTTATGTGT AAACACTTATAG G AGGGG TTTT AAG GTAGTGG G CTTG 
AAGTCAG ACTGCTAGTGTGTTG AATTCTGG C CT CAGG AAT CACTAG CT ATG CAAT CAGAT ACAAGTT AC T ATAAGCTGTAGTTGTG 
TCATCTGTAACACTGGGTTACTTAAAATTCTTTGCTC 

ACAGGGT CTAG CACATGAT AAAGCCTTATTAAATGTT AG CT ACT AT T ATTAT ATTTTCAG ATTATATTT TGGAGAT TCAAAAGT AC 

5 0 TTAGTAAATGTCCAGCATTGCTCTAAGCATATACTn^ 

TATTTTTCAAAGT ACAG TTAG ACATAG AAACAT T ATGAACTGTCT AC CATTTTTATGGC TTT AG ACT CAACATTATATTTG AAGTG 
AAAAATTGTAGCTATTTTAATTCCTAAATTATGTAAAGCTATAAAATGTGTTATGTTAGCC^GTTGATTTTATTC 
ACTTTGTTTT ATTTCTCTT CAG AAGGAAAAAT AGAAAG AT AAGACAGTAAATT T AGG ATGAG CAAAAT ATGG CCG ATGACAG CTTT 
CATGTATTCATT/TTTTCATGTTTTAAAAATTA^ 

5 5 ACAGACATCI^GTTTTCTTCTAGAGAGTCCCAJGGAAAG catg agaacat cattt aact ACAC TAGAATTG T AAATT AAAACAATTT 

TATCCTGGCTCACGTCT CTAGGGCTGC CTTGAACCACTCTTTAGTGTAG ACTCTC CTGTCCAACCTCCCTGGAGTTTTTAAAACAC 
ATCGCCCTGTCTCACATTTCTGTCTTCACACAATTTTCTTTTCATAAAAATTTTCCC^ 
TTTTTTGGTTGTTGTTGTTTTTAAACTT'ACGACTCATCCTTTTAGACTCAATACCGG^ 
TCCCCTCCATTTCCTAAGCTGTGGGAAATGTCTCCGTTAGTAACCC^ 
60 CTGGGATGCAGCAGAGACACATCATTTGCCCTGCATGTACTGTGGCTAACACT 

GACTGCATGCAAGAATAGCTGTTTTAATTTGGACATTAGGCTAAGTTTTATCCrTTTTAGGGAGGTG 
GATTATGTAGTACGTTATGAAATGAACTTGTTTAAGTCTAT^ 

ACTCTCT T AAAATTAAATCCAGACAGATTTAAACT ATGTGTGT CAG ATATAATTTTTTGTCTTTTAT ATGAGAGTACAAAAGTCAG 
AGTACTCTGTGATCAGCGAACAAAAACACACCrCAGTCTC!AGAAGCTTATCCT 

6 5 ATGGTGTACTCTTTCATCCAACTACTCAGGAATCCAAGCTTACTCTGCCTTCCTCTGA^ 

TGTCAGAGAG AGATGGGAAAAGAACATTGG TTT CCAAT AATCTCTTC TGGGAAAT CAAAC CACCATTTACGCTTAG T GTC TTTTGG 
CTAG AAT AATGTGCCAC CAACTTAATGCAACAGAGG CTGAAAAGGTTG C CTTTTAG AGT AGT CAG G CAG AGG AAAT AGTGCGTAGG 
AATTGACCAAGCCACTAGGTGATTTGTCTCCTTTTTTGTGTTT 

GGATGAATGATGTGAGGTAGAT AC CACT GGGTAAATTCAAGTAT AAAACATAAATTCT AT ACT AATT AT CAAGCC CTTGT CATAT C 

7 0 TTAGGCATATTAATT ATTGAAG AAG AGT CTAT CTC AACAAATCCT^ 

TGGTAGAC^TTTTAAAACATAGTGCACATCAGCATGTAAC^ 

CCCACCCACTGTAGCCTTCCCAGAGTTAAGAGCACCTGGCCTGTGCTACATGGATTGGT 
CAGTATGAGCATAAATTTTATCTTCTTTTTGGTGAGTACTTTGAAAGAAAATTAGGA 
AATTTGGTTGCTACATGATAGAGATTCATTGAAGGAGAAAGAAGTAGACAGTAGACAAAGTTGTT 
7 5 CAGG CATC CAATTGAGGT CTGAACAATAG CTTTTATTCATGTTCTCT CAC CAGTC TGTGTTACATG AAG CTTT AATG AT ATTTGAT 
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GATTGAATTCATTTTTTAAAATAGCTGCA 

GGATT ACHTX1AGCCCAGGAGTTT AAGACAAG CCTGGG CAACAT AG CAAG AC C CTGT CT CTAAAAAT AAT AATAATAAT AACTGAG C 
ATGGTGGCreCCGCCTGTGGTCCCTGCTACGTGGGAGGCTGAGGCA 
GCGTGGTGGTGCCATGGCCCTCCAGCCTGGGTAACAAAGCAAAACCTC 
5 TCATACAGATTAATGAAAGAACTACAAAAGATTTTTACTG CTTAATTCTGTATTTGGGGT CTCTTCATTTTTAAAAATAAT AAAGC 

AATTTGCTTT 

HUMAN SEQUENCE - mRNA 
CTTGCCGGCTTCCTTGCMAGCrCGGTG<^AGG^ 
1 0 GAGG GCGTCCGGTCTG CACGCCCGCCG CGAGGTT ACAATGCTGAACG CATGAGATG GAAGATAC CAACGGGAGGCCGAGGGGATCC 

ACGGCGCCCGCGCGGGCTCCGGCTTCCTCCTGCT 

TTGACAGGCGACCCTGTGGCCCGGGGAAGCGCGGGAGGGCGCCGGCGGAAAGTTGAAGAGCGTTTT^ 
GGAGCTCGACGAGTCCGCCCTGGGCTTCCTGGTGGGGCTGGGCGGGCGGGGGA^ 
GGCGATAAGAGGCTGCACAGCGACATGCAACAGTCTTTTCACTGCAGCTGAATGAG 
1 5 GAGCTGACAAGTTCCATTTTCCGTCGCGGGCATCTTGGAAT 

CCTCTGAAAAAAAAAATGTGAGAGGTTGGTACTAAGAAGTGCCTITCCT 
TCTGCTTTTGCCTTGCTTGCTGCCAGCTAGACTGTG^ 

TTATAATCAAGAGAAAAGCTCTAA.GTATCTGGCATTGCCCTAGGCrcCTTTAGTC 
ATCTTCTGCCAGGAAATCAAGGAGGAAAAAAAAAATCATTTTC 
2 0 TCAATACCGATTGCACCACCAAACTCCATTGCAAATTCAGCTC 

AAATTCGGTGTCGAAGGGTCTGCCACGTTTT(^TGCTTGCATTTTG 

AGG CTGAT ACCAGG C CCTACTTTT AAACGTTCATCT ACTT ACAATCCTAGTATTTCTC TAAAA^ 

G TTTCATGCTGTG AATTTCT AGTG GG AGATCTTTTC CTTG ATATTGACGACACAATTTTCCATGT AC TTTTAAAG C AG GGAGTGGG 
G AAAAGT AT TT TG AGGGG ACATTTTCATCATCAGTTCAG LTIT^TTTTT TGGTTGTTG CTCTTTTTTGGGGGGGTTGGGTTTGTTG 

2 5 GTTTCACTCAAACATTTAACTACCTGTAAAATCT 

GAAGTATGTAGAGAGTTCCAGAGGGGGACTTGCTCA03GCCAGACACGGAATGTAAATTTC 

TGAAAATGGACGAGT AATCGCCTGCTTTGATTCATTGAAAGGCCGTTG CTCCAGGGAGAACTG CAAATATCTTCATCCACCCCCAC 

ATTTAAAAACGCAGTTGGAGATAAATGGACGCAATAACTTGATTCAGCAGAAGAACA^ 

GCCAATGCCATGATGCCrcGTGCCCCATTACAACCCGTC 

3 0 CGCCTTTAATCCCTATCTGGGACCTGTTTCTCCAAGCCTGGTCCCGG 

ATCCGGGTGTCCCTGTACCTGCAGCTGCTGCAGCTGCTGC^ 
CAACGTGGCAATTGCAACCGAGGAGAAAATGATTGTCGGTTTGCTCATCOT 
AGTCACTGTGTGTATGGATTACATCAAAGGGAGATGCTCTCGGGAAAAGTGCAAATACT 
AGATCAAGGCTGCCGAATACCAGGTCAACCAGGCTGCAGCTGCAGAGGC^ 

3 5 GTACTTCCCCCATTACCAAAGAGGCCTGCTCTTGAAAA 

AGAGGCrCTAGCCAACATGCAGTTACAACAGCATACAG 
TGTCCGCAGCAACAAGATCTGCCACAAGTGTTCCCTTCGCTGC^ 
CTGACTAGCGACAAGTATGTTACCCAGATGTAGAATTTTCATGACTAAACAATCATG 
AGAGTAAAGGACGAGGTCATTAGCCATATTGTATATATCGTCAAGCAAC^ 

4 0 TTGCATGTTAACCAG AAG AAAAGACAACATTTT CCGGAAA^ TTG CCT ATACACT TTGTACATTTAATTG ATAT 

TTGTGCTGAGGTGATATTCCTGTCTAAAAGAACAACATTC 

GTTAGTTTCCTATATATTCATGCCATCTTGAAAAGACAGACTATGGTGTAACCATGATTCTATTATGTA 

CAAGAT AT AATTTTTTAAAAATAAGTTT ATTTCTT T CAAGGT TT ACAAAT AACAAAGGTG CACCTTGT ATTT AAAATTG CCAT TAT 
AGATGAGAGCGTGCATGCACAGTCATTTTTGTTTAAGACT 

4 5 TCACAGAGATGAATCTGCCAAGCAAAACCACAACTGTGTATAT^ 

CTCATGAAGTGCCATAGACTGTACA.TCAAATTAGAGTATT ATTTCTT 
TGCTAGTACTAATCAGTCAAAGGGCACGATTCTTTT^ 

CAGTAACAAT AG ACAG CACAACACAAACT CT CTCAATACAGATAAACT CACACAT ACTGG AGAT AT AT AT AT AATAGATATATAT A 
AAATTATTTTAATGCATTGTAGTGTAATATTTATGCATACTATACTGTATAACATG^ CAAAAGGG ATTG C CATT T CTGAGAC 

5 0 ACAGTAACAAAAAAATGAGGAAATT ATTT TGCTTCTATTTAT CTTTGCAAAAAT 

GGTTTCACGTTTGCTTAAATGCITCATCACACTCACATT 
GCATGATAAACCAAAATATGAAAATGGGAAATGTTTAATTAACCTAGTAAT^ 
GATTTTTGTTTTGTTTTGTTTTGTTCAGATTAACTGCTTATAGCCTTAGAAAGCCTTTTACAA 
ATTCAGTTTTTAAGAATGGAATCATCCAAAGGAATTC 

5 5 GTTCAGCAGTTCTAAAAATTGCTX3AAGTAGGGGCCAGCTCACTGGTAGTTATAGTAT 

GAACTTT C CATACTT C CAAGTTTAC TGCAAGTTTTTATGC TTGAGAGAGATG CTTT CTAAT AT AAGACTGATCTGTTG AT1TT ACT 
GAT TGTACTGT AC AT CT ATTAAAG CCTTAGATTATTACAT TACGGGTTG G AACCCAT AC CAATGTAAT TTCAAT CGTGTT AAGAAA 
GTAATGTCTX^CTTCACATGTTATTGTAGTTAGTTACA 

T ACAT TTATTAG AT TTT CATTTTCTATT AACAATTGAAT AC CATTT CAG TT T ATAGAC1T GTTTT ATTAG ATTT TAC CAATG AATT 

6 0 TTTCAAAATAGAAAAAAAAGTAGTTTTTCCTTCATA^ 

TAACTAAATGATTTATATTTTACTrGATTrAATATTACAGTGTAAGAAT 
GAACAAAGATCTTTTATATGGATATCTTATAAATATATAATCATTGCTA^ 

G CAGACAATTG AG T AATATTTTG ATG ATTTATTTTGT TTGT AATT AGTT ATTATAAG AAG AT CTAG AT CCTAG AT ATT AGAAT AAA 
ATTT ATT TT CTACTGTATC CATT TCAAATGTTAAAAT ATTGTTT AAT ATTTTTC CAGG CCTTGTTATAAAT A 

6 5 AG CTGCATAATCAAT AAAT AGAACAAGGGACTTTTTG TTG ATAAT CAAAGTTTACGT AATG AAAATTAT AGCGTGTG 

TGCAAACTCTTGAGGGTTGATTATGCTGCAAT^ 

GTCAAAAGAGAGAAACCTGTATAATAGTAAGATCTTATTTTGAATAAAAACGTCTATAATTACAAGGAGTT^ 

CAATGACAGACTGAGCAAAATTGCTTGCAAAAGTGGCACAGAGTTAGCACTCC^TACCCCT 

TGGACAGCTTGTAGTTTGCCAGGATTTTTTCAGCTGGAAAGATACGCCATCCTTTCAA^ 

7 0 GG CCAAAT CTGCCTGAAG ATCATT ACCAAAAATAG CAGGT ACTT CT ACCATT AAGGTG AAAT CATGGATCAGAT ATTC CTTACATT 

TTT CAAAACTACTGCATGTTTAAAACTT CAACAAAAAAAG AG AGAAAGAACTAT ACTAAGAACATAT ATTATT 
GCCAATTTCAGTGGTTTATTGTTCACAAAAAAATCTTCAAAACAA 

AAACAGCAC!ACTGTAGCTATAGTTGTTATGTGATTGTTTTTTAATTGCTGTAGGATCCTGTTCT 
GCAGTTCAAATTTCATGGTTTTAATTTTCAACTCAGAAGCACT 
7 5 AGTX3TATC CAGCCTTCACTCCAG CTGGTTAAAAA 
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GCTTGGTTTTACTATTATGTAATCACAACTT^ 

TTTATGCTTTWTTTTATTATTACTGC^ 

TTAAGT 



10 



HUMAN SEQUENCE - CODING 
ATGGCTGTTAGTGTCACACCAATTCGGGACACAAAA 

GCCAGACACGGAATGTAAATTTGCACATCCTTC^ CGCCTGCTTTG ATTCATTGA 
AAGGCCGTTGCTCCIAGGGAGAACTGCAAATATCTTCATCCACCCCCA 

TTGATTCAGCAGAAGAACATGG CCATGTTGGCCCAGCAAATG CAACT AGCCAATGCCATGATGCCTGGTG CCCCATTACAACCCGT 
GCCAATGTTTTCAGTTGCACCAAGCTTAGCCACXZAATC 



GCA.CAGAAATTAATGCGAACAGACAGACTTGAGGTATGTCGAGAG 
GTTTGCTCATCCKSCTGACAGCACA^ 

CTCGGGAAAAGTGCAAATACITT(^TCCCCCTGCACATTTGCAAGCCAAG^ 
1 5 GCTGCACAGGCTGCAGCCACCGCIAGCTGCCATG 

AACCAACGGTG C CACCGCAGTCTTT AACACTGGT ATTTT C CAATAC CAACAGGCT CT AGCCAACATGCAG TTACAACAGC AT ACAG 
CATTTCTCCCACCAGTTCCGATGGTGCACG^ 

GCTGCAACAGCCACAG CCAACCAGAT ACCCAT AATATCTG C CG AACATCTGAC TAGC CACAAGTATGTTACCCAGATGTAG 
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Table 102 

MOUSE NOMENCLATURE 
ICSGNM N/A 
Celera* mCG105l6 

5 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG23249 

10 MOUSE SEQUENCE - GENOMIC 

GGOTCCITTGACAACCCCAAAAGGTCATGA 

ATCAATCTTTATTGTGTTCATGGAGGTCAGAGGACAGCTTGTGCAGT^ 
AAGCAGGGAAGTCTCAGCCCTAATCCTTGTTCCTCTGTGCAAACrrCAR 
TCTCCTGACITAAACrCCTAGCAATCAAATAAATAAATATATTC 
1 5 TTGAGGCAGGGTTTCTCTGTGTAGCCCTGGCTGTCCTGGAACT 
TGTCTCTGATGGGATTAAAGACTTGTACCGCCAGGCA^ 

T AT AGAACCAGTT OTAAG ACAGCAAAGGCT ACACAAAGAAAAC CTGTCTCT AAAAAACAAAAG CAAAACAAAAGAATCTAAGGGCC 
TGGG ACAACACAGGTTTCACTCTTCTGACCrrCTGG AAACATACAG AC CTC CG ACT 
TGCTGGGTGGTCTTGGGAAGGTCAGTTACCCGGTATAGTGTTCTGTAGGCCTCCCCTTAATG 
2 0 AGCCCTAAGAAGGAGATGAGATGCTGGTGACAGTGTCAGTGACAACACAATGA^ 



CAGGAGCAGATTGGCTTCTGGAACAGGTAGCAC 

TCCCAAGCAAGGTCTTCTGCTGCTTCTGGCCTTGGGAAGCCCTTGGAGGTCATTC^ 
2 5 CTCAACGTTATGAATCCTT C TTGTACAACATGTTCAAGATCT ATCTAT CTAT CT ATCTATCTATCT ATCTATC T AT CT AT CT ATCT 

ATCTATCTATCTATCTATCTATCGGTATATCT 

TGTG CCTGGGTTTTCAATAT AGACCATATGGGAAT AAGACT C CAGG CTGATTG ATTCACATTCGTT AGT AAGTG CGGTCTAGG AAA 
AGGCAACCTAAGTACAGAGGAAGAACACGCTCTGGGCAGAGGGGACTGTAGGGATCCAAGCAGGAAGTCT^ 

TT ACTG TCAGG AGT CGGGTGAAAGGAAAGGAAGGCATTTAAGG CAGCAAG CAAC CACATGGAGTT ATT C CTGGGCACAACCACAAG 
30 TCATTCAATTTTGGAAGCAGAGAGAAGGGATTTATCA^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNIWNNNNNNNNNNNNNNNNNNNN^ 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNTOJNNNNNNNNNNNNNNNNNNN^ 
50 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNNl^NN^ 

NNNNNNNNNNNNNNNNNIONNNNNNNNNNNNN^ 

NNNNNNNNNIJNNNNNNNNNIJNNNNIJN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNl^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
65 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNl^NNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNIJNNNIJNN^ 
70 NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

GTGAGGAAGAGCTGGGAGAGTTGGAGCAGCAG TC AT AT ATGGTAAGCTG CAGTGGTTACAGT AAG CACAAAGGC CTGAAG CAAGAG 
CTG^CTTAACAGTACACAGAAGGTAGGCATGG CAAAGG^SGGAAGGATGCTAGGTACTCGGGCCAAG CATGCAGGACCTTTGGGCCA 
TGATAAGGATTTGGGACCTTTAGTTTTAGTAATGGGGAAGTATCrrAGACrri^ 
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CCCCAGGAAAAAGGCTGTTTGCTGAGACAAAAGTCAGGAATCAAGACCTCT 
AGTTCMGCAATTTCCATTGCTGTGTCAT^ 

ATTCC^ATTATCTTGGAAAACAAG GT AATTAGAATTATGATAAGTATTCACC!ATCTCAGTT CT CACAATG AAAAACCACAAGGACA 
AAAGTAACTCCAGATAG CT CAGCC CTCCTTGGCACTGGGGTGGGGG CTGACAG CGGATGGAGC CACTAAGTTATCTGAAACTACTG 
5 TTAGTGTCCAGTCGCTTTCCTTTCAAGGTACAACC 

GTGTAGGCATTCTTCTATAGGGATAGAGCAGAAAACAAGATTTG^ 
GCAAGATAAATATGTGAAACACCCTAGAAAAAAGTAAAGCTGGCACT 
GGCTACTTAGGGTCGGGGCCACAAGCTCATTGGAGGAGGAGC^ 
TTTCAGATTCATTGGACATCrAATGGAAATGAA 
1 0 TGTGAATAGTGGGTGTGTGTGCAGAGTGGAAAGAGGAGGAGCTACGTGTC^ 

TCCT CTGCAATTCGAGCTCT ACAAGATG CAAATGTG CACTTT AT AATTTTTTCCCA^ 

G AAGGGAAG C CTGTAGCATCAAAGGAT ACAAACTGGGCTGAGAC C CAGG ATTCTTGTTTGTT ACTTT AACTATGAC CTTGGGGGG AA 
ACCCCTAGATATCTGAGCTAGGAGAAGAGCATTA^ 
15 GGTGAGGACTGCAGAGGAGAAAAGAGCTGTTTCTGAGTCAAACGTrATC^ 

GGAGTGGAGACAGAAGGGGCAAGAGTCCAAGGCTGTATTGAGCCCTGTGTGGTT 
TAACCAAAGAAATGGAGAAATATGTCAGGACTTCAAGCAACACAATGAAAAGCr^ 

GTGTGGCACT CAGGATCAGGGCAGGGGAGCAT1TGCTTTCCATGG ACTGAGGAGG CAAGAGGGTAAGGTTGTGGTACAGTCTTCTG 
GGTCACAAGAGCCTG AACT CT C CTCAAG ATAAGC C CT AGAGTTGG CAT AAT CTGTCAATCATG ATACAAGGCCC CAG AAGGTAG AG 

2 0 AGCTCCAGCAC!AGCGCTGCTAGGTAGTCACGTGGCTAACT 

CCATGAGGCTGGCTACGATCCTCCTTCTTCAGCT/rCCTGACT 
CTTCTGCTAGATTTAAGAGTATCCATACATGCAGC^ 
GCAGATTTTTGAGTTCGAGGCCAG CCTGGTTT ACAGAGTC^ 

CC CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA^^ CCAT ACCTAT CAAT AAAGT 

AATCCCCTAAAAGCCCTAGCTGCATACTTGATGCATATTAGGTTCITGACTCTTG 
CCAAGTTATAGGCCAAGCCCAGTTTTCCCTTCCATATCT 
AAGGACTGGGATTGCTCTTCCCTTTCTCACCCATCCT 
CTCTGATGTGTGAACCCATTGCCTGGGGGACTCCCACTAATCTC 

3 0 CAG CAAGGTCAAGGGTTATG CTT AG AT ACTGACATCGGGT CA'J.T C"ri"i'CAGTAGTAGGAATTTAGACACAACTAAGTTCTAT CACT 

TAAG ATAAGG ACCCTATAAAG GGTGCCAAAGACTG TT TCTTTAATT AACCT TTC C71CITTTGCTT CACTCAGCTTCAAAACC CACT 
AAGTGCAGAATACTAGCAGGGCTAGGCCAGAGGATATTTCTCITGGTTTTGGAACC 
TCTACCrCTTTTGGCTTTTTGATCTTACCTTTTCCCTCTGATGATAATTACCCTCTAA 
AATTAACCTAAGTGCAGACTGCACCTCrcCCTGCCATCTTCAAA 

3 5 AAATTGG C C CAAAT AGGT CAT AT ATGTATACCAT AT AT AG TGC AAG ACTGGTTC CTTAGCATAAGTT AAGAGG CG ATG GCCTGCTG 

TGAAGTCAGTCAGGCTGATACTGGGGAAACTGCAAATGCAAGAATGTAAGTTATGAA 
GTTGAAAATCCACCTTAAATTCCTATTGGGAAGTGCTC^TGTTTTTTAT^ 

TAACCAAGGGCATGCTGGAGAAAAACTCTAC TATTAAC TTAT CCTCTCAGTCAGAAGTi' IT T AT TTTT CCTTTT T AGCTTTGTTGG 
AAAAAT CCATTGACTACAGTGG AGGGAG CTTG AAATT TGAGG AGTCTTT AAAAGCAAATC TGGCAG AAGAGGTTGTCTTT T CAC T A 

4 0 AAAGCAATTAGTAGTTTTCTAATTTCTCAGA 

ACTGTTITAGAAAGCTATTTTCrrCATTGAAGGTTTAAAATAAAA 
AAAGTCTCTCCAGAAGAAAACTTAAAAATACTCCGTCCATC^ 
CAAAACAAACAATCCGTCCCCCCCAAACCAAAACTTTATTATGAACAAT^ 
ATATCCCACAGCTCCAACATCGAAGGACATCCTCAGTTAATTTATTCCCTTCTTTCC 
4 5 ATAAACGTATTGGATCATTTTACCTATCAACACTTTAGTC 

GTAAGC^CGTTCGATAAGGATGACGATAATTCCTTAGTAACATAAAATACTCGGGGAGCGGGGGTGGGGTGGGGGTGTGGGGTGTC 

ATTGCTAACAGGTAGGGGACAGAACTCCGTAGTGGGGACTGAACAA 

CCGGCTGCGAGGAGACCAGCACCCGACTOCTCCAGGTTTCGC^ 

50 CACCTGGAGGGTCCCTGGAGGGGAAACGATTGAC^CAGCTGCCTGC!ACTGTGCCGCGAGCCT 

GCGTGGCTAGGAACTGGGACTGGCTCCGGAGTCGGTGAAGTC(ZAGCAGGCCCTTAGCCAGAACCTATACCGTTCGCCAGCT 
ACCGGGCITTAGACACCAGGACCTTGCGGAAGAGAGAACTACAACT 

GCGTGCGGGGCGCGGCGCCTGCGCGGCGGTTTGAGTAAGAGGCCGTACGATTGGCTGCGGAGTCGGGCGGCCGCGCGGGGCCTGTG 

GGAAGCGGAGTGACGGAGCGAGCGGCTGTTGGAGGAAGGAGGTGGGGGGCCGGGAGCGCAAATGGCGT^ 
55 TGTTCAAACTCCAGCGCTGACCATTCACCGGCGGAGGCGGCGGCGCAGAAGGC^ 

GGCACCGGCGGCGGCCGCCAGGGAGGCCTAGGCCCTGTCCGGCCGGC^ 

NNTGTCGCCCGAGCTTCCTATTTACCGAAGCGGAGC<^ 

CGGCACTGGTCTCTCGCGCGGGGCTCCCX3CGCCCGCCCGCGGGCCGTTG<3GA 

CCCGCCAGGCGCCGAGGGTAAGTGAGGTCTCGGGCGGCTGCCTAGGCCCSGA 
6 0 GCGAGGCTCGTTGCTCACCTGGCTTCTAAGGAAGTTGGGCACCGGA 

CCGTCGTCTCCTCCGCCTCTAGCCGTC^CCGAGAGGCCTTGCTT 

6 5 GTCTTTCCTGGAAGTGGCTAGGTTT CTTTACAGCTTCAGTGTAAGGAGGTTGGTTGGTTGTTTTGTATACCCTTTCCCT 

ATCGCAGTGCACTTCCATCAAAAGTCAGCrcGGCTTTGGGATGAAGCGA^ 

CTTTTCCTTTTTAGTTAGGACTGTTGGTGAAATTTGA 

ATATATATGCTGAATTTGTACATATGT^AAGCTTAGATAATC 

AACAGTAAAAGCAATCACGTAACTCTTGTACTCATATCTCCrGGATTCTGTACATAATAA 

7 0 CTC CC CATTCCTTAATATT AAAAAAAT CACTG AT ATTT CATCAAT ATTTAATGATGTT AAT ACATCAG CGAAT ATTTCTG AGACGT 

GAT CTTTT AAAAAT ACAGTTGT CTT ATT ACITAGC TCA^ CAT AT ATT CATTTGCAGAT CTT AAT CTTG CACAGTGAATG C TTT 
ACT G w lTlTlTA CGGTGAGr mT C3^ 

TAGATGATTGCCATGTATGTGTGCCCAAAACATTCCAACTTTGTTGGAATGTATATATC 
CTTAAAGAAATGACTTTTCTTGCCTITTGGCAGC 
7 5 TGAAGCTTGTTAGTGTTCAGAACTACAAAATGGAAACT^ 
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AAATTATT ATTTTAT ( 




kCATTAGGGTGGTGGTGAGTTGTCTTCTGTVATATTTGCTGAACTAA 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



ATTGGATAGTGATTCCTTAAGTTATAAGAAC 
TTATTCTTATTCTAAGTTTCCAAGTAATGAAGA 

TATTTAAA.CTGCAGAAGGCCAAAGTGTTAAGAAATGGATAATGATTG ATTGTGCT A CACT AT AC CTT ATATTTTTT AAAG 
TGTTTGArrri^TTTTTTGACn'lTTCTTGTAAA^ 

AGGAAAATCAAGGCTATAT ATTT CTATTTTATAAT AAT AAATCATGT ACCTTTGT AAAATGGAGT^ CT CCAACTTCAGAAAGATGT 

TACTCAGCIAATGAGATACAGGAAGTAGAAGAATGGTT^ 

AAATCACTTTAGTCTAAGAATCAGAATCTTTAAGCT^ 

CTTTATGAGGAGCTGTAAAAATGTCAAAGAACTAAGTTO 

TAAAGATTTTGTTTTTGGGTGTTTACACTATTCTGATTA 

AGGTATTG CATTGTGAGTTGGTTAGTAGTAAG CTTTTTTTTTTTAAACATTGT AAGTTTCATATGAAT ATGTCCT^ 
CAAGCCAAAACACCACCAAACCTGATGTATGGAAGTAAAGGATGGAAAACGAACCTC^ 
AGTTTGTGTGCTGAATGTTGAAAACTATATAAGGAAGTCCTAGGGTCrTATTTT^ 
TGATTATATTTTCTTAAATTTCTTTTCACA^ 

AAAGGAGGAAATGGCAGTTCAAAAT/TTGATGAAGTTTTTTCCAACAAA 

TATGGCTAAATTAGGGCAGAAGAGG CCCAATTT CAAACCAGATATT CAAGAAATTCCGAAGAAACCTAAAGTAGAAGAAGAAGATA 
CTG^AGATCC CITTGGTTTTG AT AGTGATGATG AGTCTCT AC CTGTTT CTTCAAAAAATTT AGCC CAGGGTAAGGGTTCATCT 
TCAGAATCTAGTGAGGCTGCn'CAGOTGGAAGAAGTGACTT 
CAGTTTTGCTTCCGACAGATGCTITACITGTGGAGGATACTTTAAT^ 

CAAACAAAAGT AGTTGCACTAAGTTGCTAACTTCAG^ CAAGAAGGC CTTTT TAAAG CT ACTTT 

TGTGATAATGAAATTCTAATTTATTACTTGTGTC 

TTTTTGTGGGTTCTAGAGATAAATGCAGGTTGCTAGGCTTGCGT^ 

AGCCAAGTGCTTGATG<3GCATTGTAGTGCTT<3CT 

AC CATGTG TGT ACnX3GGAACTGAACCTGGGTC CT TTGGAAGAGAAG CCAG TGCT CTT AACTGCTATATTATTT CT ACAG T CCCCTT 
GTAAAAG AACTTAAAACTTT AAAG AGTTGGAGGCCAG CCTGG T CTG CAAAATG AGTCCTGGACAGCT AGGG CTGTTAACAATG AGA 



CATTTATGATTAGTAGAGGATATTGATAAGTTTGTTTAGTGTGTGTGTGTGTATGTATATATATA^ 
ATATATGTATGTATATATATGTATGTTGCCTC 

AGAGTTAGACACCATGTCTGTACTGGGAACTGAACCCAGGTCCTTTGGAAGAG 

CAGTCCTCTTGT AAAAG AG CTT AAAACTTT AAAAGAG CT CAAT CAT ATTT AAAT TTTAATTAAAACAACAT CTT AG AAATTT AGTA 

TATTTTGGGAACAGTCrTTTATACCrTATATTTATTGGATAATC 

TGTTACAGTCTTAATGTAAGATAGATCAAATAATATAATGGAGTGTGACT 

TTTTTCTACTTTTCTCACCTTTGGGTGAGAG^ 

GT TGCCTTG TTCAT ATTT CT CCT AAAG CATTGTT CTT AAGTT AAG C TT AG ATTTTCATAGGAAAACAAGTCAAAAGAAG CCTAAAA 

TTGTTTCCAAATGAGCTETCTAAAATTTATCATATAM 

CAGGTGGATTTCTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTTCC^^ 

CCCCCTCCCCCCAAAAAAAGTTATCATACAGATAAAATTTCTGAGCTCAACT 

TTGCAGAGATTATGTGACTTTCTAAGATCAGCTAGTAGCAAAATTGGA^ 

TACGAG TTCTG CGTGCATTT ATGT AT ATAT AC CAT ATG CATG CTTG CTG ACCAC CAAGG CT AG AAGAGGG CAT CTAATT CCCCTGG 

AACTGGAGTTATAGACAGTTGGGAACTACAATGTGGGTTCT 

CACTGAATGAGATTTATTGATTGCCAAGATGTTGAGCCTACT 

AGATAGTTGTTTTACATAGGAGGTAGGTGGCCTGTAGAAGAAGAGTC 

ATT ATGG GGAATAAT ACTGCAG AATGAGTTTGTTT ATTCATTT T C C Ccr r 1TC ATTG ACTTTCTGAAG ATG G AACCTCTGGCAACT 
CCTATAGGGTAGGAAGCAAACTTGAAGTGTCTTTAATTTTAAT^ 

TATTTTTATAAGCCATGTATAAGTTCATGTATTTATAAGTTGGTTCTGT7U^TAGAGAATGCTTTTGGGCTTTT 

CTTTTTTTTTTTTTTTCATTTTTTGA 

CTTGAACTC^GAAATCCACCTGCCTCTGCCTCCCAA^^ 

GTTGCCTTTTCATCTAAACTTTTGGTTGTTTTTVr 

GTCCTGGAACTCATTCTGTAGACCAGGTTGCCCTCGAACTCAGAAATCTGCCTCCCTCT 

TCGCCACCACTCCCAGTTTGGGATTTCTTGATTTTTGCATTT 

AAAATTTTATCAACAACTTGTTCAAAGTATGTAGATGCACTTGTTC^ 

G ATATTTGTTC AATT ACTGGTTT CTG CAG AGGAATTATTT CACACCAATG AGAAAAG CTT ATCCAAG CTTAAG AGTTTT ATCATTT 
TT TT CTAATAGTACATATTAAGTACATGTTTTCTTCTGTAAAGAATTATGTCAT^ 
AGTACACTGTTGCTGTCTTCAGACACACCAGAAGAGGGCGTCA^ 

ATTTGAATTCAGGACCTTTGGAGGAGC!AGTCAGTGCTCTTACCCGCTGAGCCATCTCACCAGC 

CATCTCTGGGAACCACATAGTAGGAGAGAATGGACTCCTGCAGGTTGTCTTTGGACCT 

TCTGTCCCCACATGTGTGCTTACTCACACAGAAATAAATGAGTAAGTGTAATAATAAGTTTTAAA 

GTAAGTAGATGCTTAATAGATAAGGAAAAGAGTAGAAAGTACTACAAACTATGTCTAAGAACT 

AGTACTCAAAACAAGACCCCAAATTTTCCACTTTCTGAAAAGTAGTGAAAGAACC^ 

CTTGTTGTCTCTCCTTGTGAATGTTGTGGGTTGTTCTGATC 

CCAGGAGCTGTGTGATCTCTCCTGAACCTTATTCCCATTTTAACTGGTCAAT 

AAGGAGG AGG ACTG AAGAAG AG ATGAGGAAG CCAAGATGGACCAG AACCACGTG GCCAG GAG AAAG CACAGG TAGCAAAGGGTCTT 

TTTTTTTTTTTTTTTTTCGAGAC^GGGTTTCTCTGTGTCACCCTGGCTT^ 

CTCAAAAGTTGGCITGCCTCTGCCTCCCIAGGTGCTGGGATTC^^ 

AAAATGACTGAACAGGAGCTAGAGTTCATCTTTTTGAGGTATCAGTATATTAT^ 

TGATTTTTAAAAGACACAGTCTCACTATATAGCCCTAGTTGTTCT 

AATGTTGGGGTTAAAGGAAGCACATCCATGCCTGGCTTAAGATTTATCTTTAATTATG 

TGTGCA.CATGAGTGCAGGTGCCAGAGGCATTGGATCCTTTGGAGCTTGAGTTATAGGCA 

AACTGAACTCAAGTCCTCTCGGAGAGTAATAGTCACCn^ 

TGTG CACATG AGCATT AGAAT ATGG AAGTTAGAGTAC^TTATTTT CAAATGT 
ATTCACTGTCAGGCTTGTATGACAAGCACTT^ 

CTTTGTATATCCTAGACTGGACTTGTGGACACTGTGAACAAGGATGGCCTTGGGCTTCTGATCT 

GTGAAG AT T CAAGGCG CTCT CCACCATAC CTGGGTTTTTTTTGGTGTTGGTTATTG AACCAGAGCC TTTTG TATGCTA^ CAGGCAC 
TATACTACCAATTCCTGCTAGTATTCAAGTTTGGAATCCGGGACTGTGCATTA 
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CACCTTTJUITCTCAGCACTTGGGAGGCA^ 

CCXK^ CT ATACAACGAAACCCTGTCTAGAAAAAGCCTATGTACC CTATGTGGG CAGTGTCTGTAGAGAG CAGAATAGGGTGTTGG A 
TCC C CTCGGACrGGAGTTCTAGTCGATTGTGGGTGCTGGGGCTT CTGGAAG AGTAAC CAGTAC CTCTGAACCATTTGCTT AC CC CA 
AAGATACTTTTAAAAAGGTTGCTTTTGCTTT^ 
5 TATGTAGAGACTTAAATTTCTGCTTG^ 

AACCATAGCAT AGGCTCTTAATTTG CATGGTTGGTTTCATAG CCAAT AAAT 11 i 1 1 1 CX3TTTG CITAAAG CTGTATTGAGTGTG TG 

AGCTTTCTGGGGGGTATTATCAGGAC^TTCrcAGATTTGATTT^ 

ACTTTTGACAGTTTTAAAGTGGGCTGAAGJU3GTGGCT 

1 0 GAAGGATATAATATGAACATATTTTTTTAAAGAAA^ 

AGTTATTTG CCTATAGGTAGTGAGTGTTTGTATAAAA TTATG AAAAGT AG TT AAGAAATCTGTT 

TCTAGTTQIAGCTTTGTGCrrAAGCCTTTTATATTTCATAGCCTT 

ATTCAGAAGTTGGAT AACITGAATTAAATGGAAT AGTTGTG AAACAC C CAAAT TG CT CTTTTCATATT AT AGTCTG GT ATTGTT AT 
ATTCTGGTAGGG ATGAACTTAACAGG ATTTTAGAAT AAAACAGCTGT CAT AAG ATAG AT TTTT AATTTAAAATTTATTTTTTATT T 
1 5 TTTATTTTTGGTTTTTCAAGACAGGGTTTCTGTGTG 

TCAGAAATTTGAGGC CAGCCCTITGGGC CTCCCAAGTGCT G CATGTGT CATCACTG C CCAGTTCATAAAAT AGATTT 

TATCTTGATGTTGAAGTGAATTACAAGTTCTCATT^ 

TGGGTAGTTTGCCTCGCATATACATGTGCACCGTGTGTT^ 

GCTACCATGTGAGTG CTGAG7UVTTGTGCCTGGGTCCTTTAGAGGAACATCCGGTGTTCCTAAC CACTGAGCTATTTCTCCAGCTCC 
2 0 TATTTTTTTGGGGGAXSGGGGCAAAATAAGGCCAGAGTT^ 
ACCTTTTTTTGTGTGTGCTGTATACAAACAAGAGTATGCA^ 
CTGGAGvATTGTATTCAGGGTCCTCTACATCCAGCCACATCCCCAGTTGCCCT 
TCTGTCTQ3AGGCTAGTACAGGCTGTCCTCATATTGAAGAC^ 
ATTATAGCAGGTGCCATCATCCTTTTATTTCTTATCTCTC 

2 5 AGGTCAGACAGGAGTGCAGGAGGCCATTAGCAAGTTTTCC 

AAACATAACTGTCCTCCTGTATGCTTTTTAGGAGGTGTACCATGGACTTGTGCTGCT 

TTTTTT ATT ATAGT ACTTCACAGGACAATTGTG CCC CCCC CC CCCCATATATAT ATT CATGAACA 
AC CACAC CTG GTTTTTGAG AGGGAG AT CAAG G ATGG AG CTT AAAT AAG CAG CATTGATTTG AAAT AAT ACATGTGAG ACT ACGTAT 
AAAAAT ATTTGAACATG AAGT CATTAAAT ATTTGAATTGAGCTGTGAAAAAAG TT CC TATACCTAT AGGGTGGAG AAAAGAAACTA 

3 0 AAAAAGAAAGATAATATGAAACAGCCAG AG CTATTG AAGG CTCAACTT AGTTG AATTGG ATTTGTTG AT ATAAAAG TTAAAT AAAT 

G AGGG TTT<IACAGAATGGTTGTAGTGGC CTGAGGATTG AG AAAATT CACAT ATGTAAGTGTACATGTTTT AAACAGAGGAACTGGG 
ACAGCTGGGAACAGATGG CATG CTCAAATTAG ATAAT ATG AG AAGGGTTTATTT ACAAAGGTAT AGGAG GGAT AT AGGAAAATCA 
GGGAGATGAGGCCTCATCCTTGGTTCTTCAAAGTGAAGCT 

TATAAGGAGCTGAATCTTTCTGTTGAGGACCTAATTATGTTGTCrrCAGGGACAGCAG 

3 5 TAAAGATAAGGGCTGTTTTGTTGATTGGAATGATAGGAAGTTGGT 

TGCTGGTAATGTGGTGGTGGTGGTGGTGGTGATTATAATAATGATAAGAATAATO^ 
CTAAGTGCTGGATTATAGATAGGCTGTTTTGCTCAGTC 

GTTGTTACGCATTGTACATTGAGTTACATCTCTAATCCTCATTTATTTACTATATTATCT^ 

CACTAG AGTATTT CATATG CT AGTTAAAAG AAGATG CT TT AC CT TT CAAAT TTTGATGTAAAGTCGATTGAGGAAATTGGGAGGTT 

4 0 TTT AATTTTT TTG AAAC AGTCTCTAACCAAGG CTGACT TCAGATTC CTCTATAG CAG AAAATG AC CTTG AATTT CTTATC CT CCTA 

TCCCCATCCGGAGCGCTAACATTAAACTCAGGTACCACTCTGCCTGTATTGATTCAGTGCITGAA 
GTGCTAGGCAAGCTATCAACTAAGTTATATCCCCAGCTCCTAGTTCACAAATAATAGTTC^ 
GTTGATATATCCACATGTTCTTTCCACAGTGAAAAGTGTTTTCCTTCAGA^ 
TTCATTTACCAGATTTC!ATATATACTTTCTATTTCT 
45 CTAACCTTAGCTTGTATACTGTGTATGACCCTAAAAACCTAA^ 

GACAGCAAGGTCGGGGCAATCTCTAATTGGAAATTAGTCCAGGTTACATAGTTCCTC 
TTTCAAAAAATACACACAATGTGAGATATAAAATACTGCAATGTGGGG 
TCTTCCAAAGGTCTGGAGTTCAAATCCCAGCAACCACATGGTGGCTC^ 
TGTCTGAAGACAGCTACIAGTGTACTTACATATAATAAATAAATAAA 

5 0 GTGGAT CATGCTTTTAGTACATTAAAGTACTACATTTTAGTAC^ CATTG C T CAGTATTTGT AG 

ATAGATACACTTTGTTCTTTTTAGCAGGCTCA.TAGTTTCCCATT 
TGCAAGTAAATGTTGCATTGAACATCTTTGTAAGTTTTTGAGTACCATGTCACCT 
AAGTAATTTTGGAAGGGCATATATGTTTTATTAAAGTTTTATTTTTATTACTT 
GGTTTGGGTATGTGCATATAAGTACAAGTGTCAAGGAGGCC^ 

5 5 TCACCTGATTTATGTTGGGGTTAGAATTTGG{3TCCTTAG 

TGT ATACAGCTTAAATTTTAATTGATAATGCCACATTTCTTGCTAATAGTA CATCCT CTTTCCCAAACTTGGTATCACCCTTCT 

CCTCTCAAACTTTGTACCTAGCTATGTAAAGGTAGAAAAATAGCTGCTACT 

TCACTTTAGAATATATAGTATCITTTACACATTTT^ 

CTTTCTCAGCCAGCCACTACAGCTrCTCCCrAGAATGACAGTATA 

6 0 T ATTGT AAAG AATGG ATTTGG TTCAATTT CTGAG C AT AAAATTT AAGAT CC T AG AATCTT ATATAGT ATTAT CTTGT AATG AACTA 

AAGGCTGACTGTTGTTTGTTTTGTTTGTTTTTTGGGTTTTCAAGGC^ 
GAAATCCGCCTGCCTCTGCCTCCCGAGTGCTIXK5GATTAAAGGCGTGCCCG^ 

AT AGTGTTTACT AGGTAGATGCAAC CAGGATT AGAGGG TTG AG ACTTTCAGAGAT ACTC CTACCTGAAAC CT CTT AAAGAAGGG AA 
GAGGGAAGCCAGGAAGCAGAGGCCCGCGGATTTCTGAGTTCGAGGCCAGCCTC 

6 5 AT ACAGAGAAAC C CTGT CTT CCAGAAAAACCAAAACCAAAACAACCCC CCC C CCAAAAAAAAAAA^ 

G AACAAG GGCT AAGT TG ATTACTTGTGG CAGAAGAG CCAT CATGAAAAC CATAAAGG ACAGCTTTTTAGAG CGCTT C CAGAT AACT 
GATCATGAG^TTTCCGGGAGGATGATACAGCCTTTCCCTTATGCCTTGTTCATTT^ 

CTT AG GGTTT ATGAGACAG GGTCC CATGT AG C C CAGAAGG CT CTGGAGTTTGT T ATGTTGCCAAGGATAACTATAAACTTC C AATC 
ATTCTGTCTT CAC CT TT CAGGTG C CACT AT AC CCAG CAACT ATT TTCTT CT CCTAATGGATG ATTTGTTTAGAAACAGGACTTCCT 

7 0 GTATAAAATTAAGAATTTGTTAAGTAAAAGTGGTCATTTAATTTCTCTTAGAGAAT 

GCTGGCTTTGAATTCATAATCCTGGTTCATTTGCCCAGCCTTGTGAGGGT^ 

TCTGTATAGCCCTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTC 

TGCTGGGATTAAAGGCTTGOSCTACC^CGCCCGGCTTTAGTGTCTGGTTTT 

7 5 GAAGTTGCAGTCy^AGATGTATT^TTTGAAAGAATAAAATAAAATAAAAAAAAAACCA^ 
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CAACCCCTTTGGTAAATGGATATTCTCTACAAATGGGTTTATCTGTTAGGA 

AATTGTAAGCCGGGTGGGGTGGCGCACGACTTTAATCCCAGCACTCGGGAGGCAGAGGC^ 

CTGGTCTACAAAGTCAGTTCCAGGACAGCCAGGG 

ATTGTAATATGTAATTTTGGATAAGATGGTGATAACCTTTCAAAAGCAA^ 
5 AG6ATAGGACACAATAAAATGCTGTGGAGACAGCATTAAAATGTGTCCTTG 

TTOGCTTTTCTGTAAAACTATCTACCGTTTTAGAAGTTTAAAAAATGTACTTG 
ATTTGTGGATTGTTTGCTGTTACTGTATACTAT 
CTCTTCCTTCTTACAAGTTTACAAAGATAGGCTACAGCAGCAGTTA 
TACAACACACATAGGAGAAATGAGGGCAATAAACTTAAAATAGTAAACT 

1 o TG rrrrrG . ri . crrrr , r r r rr r n 

GG CCTCGAACTCAGAAATCC^CCTGCCTCTGCCTCCa^ CTAG AT CTGTTT 
TT AAATTAGAATAATGTTT ACACCT AAAACAG A CCTG CTATT ATGCACATGAAG 

CAGGAAGATTACTAGTTTGAGGCTATCTTGGGTTCAAAAGGGAACCCTGTT^ 

TCTTTAATTCCAG CAGAGGCAGGTGGATCTATATGATTTTAGGGCCAGCATGATTTATAGCAAG T CAGTG CT ACA 

1 5 TAGTGAAAACCTTTCTCAAAATGAAAACATACTACAACCCCATTAACTACCCCCCTCC 
CTATAAAAATGAGTTGAAGATTGGAGAGGTGGTTCCCAGCATCATGGAACA 

GGTGTAGTAT AGCTTAGT ACATCTT AG ACATGTTAGGT AAGTG CTCT AACT AG AG CACTC CTCTCTTTTCTTTTT 

CTTTTAAATCAGTGCCTTAAGT^TCCCAGGATTGTAATTAACTTT 

TT AT ATTT AAACTCATTGAGTAG CATG&AAAATT AT ATAGTGAATTT AT C CATAT AAAACATTGCTTTTGG CTGTATTTG CAAGTG 

2 0 TGTTCTTTAAGACTTGTAAATAAGGAGAGTGTTGTGGCATAT 

TGTAAGTTTGAGGCCAGTCTGGTCTACATCATGAGTTCC^ 

TTTATTTGTAAATTGTTTTTCTTAGATTTAGATGTA'r'ri u rT CTTGAGG CAATATTTTATTTGG CAT AG AGT AAATATTTGC CAGTG 
CTT ACCCATTAATTAATGTTCAGGGAATCCATGAT ATT AGTT TGAGTCAT ATTGCTG CCTIT T ACTTGGAGGGTAAAAAAGTAATT 
ACTTAGCCGGGCAGTGGTAGCGCATGCCTTTAATCrCAGCACT^ 

2 5 GGTCTACAGAGTGAGTTTCAGGACAGCCAGGGCTACACAGAGAAACCCT 

TACTTCTG ATGT TAAAAAATTTAAGGCAACAAAAATGAAATTCTCACATATAAAGTAGTCATTTT 
ATTCATAAATAGGCACTATACACTACATAGGTACCAGATCCACTCTTAAGAAAT^ 
AATAACTAGAAAAGCTTGTGGTAGGCAGGTTTCTAGACAAAATTAACATACro 
CCCCTTAAAGGCATCTGACTCAATGTTAAATATGTTTGAGTT 

3 0 AATGTTATCCTCAGTTGATTCAGTTTCrrAA 

GGACAGG CTACT AG AAG ACATGG CAG C TGT ACCTACTGACAGTAGG AGAGTCT AATGTGT AAACTTTTTTG 

TTATTTTATACTGGGGCTGGTGAGATGGCTCAGTTGGTAAGAGCACCCGACTGCTC^^ 

ACCIACATGGTGGCTCACAACCATCTGTAACAAGATCTGACGCCCTCTTCTGGAGTGTCT 

AGTAAAT AATTAAATT AAAAAAAAAAAAAAAAAAAAGATT ATTTT AT ACTACTGTTCAGAT AG CC CTG CTATTTGTTAATTTG CAT 

3 5 AGTGATGACATTTATGTTTTTAACCGGACATATT 

AGTCACCACTTTCACAAAAATGCTGAAGATAGTACTAAGAAACCCAATGC^ 

AACT AAAGAAAC!AAATGATACITGGAACTCC CAGT CIGG AAAAAGAAC^^ CATCTG AAAGTTGT CCAGT CAAAG GATCTG 

TAAGAACTGGTTTATATGAATGGGATAATGATTTTGAAGATATCA^ 

TTGGAGATGAAAGACGAGGATTTAAAAAATCX3GATTGGAGGATTGGAA 

4 0 TGTTCTTAGGCCAAGCAACTGTAGGACGTACTGTAGGGCCAATAAAGCGAGATCCTCACAGGGAGC^ 

TGGATGGCACCAGTCAGTCCTTAGCGAAAGCAAAC^GTGAATCA^ 

TGTGGGACC^TTTTCGAGGAACAGTTGGACGGACTAGAGATTACACTGTTTTACATCCATCTTGCT 

CATCCAGGATACTATGGAACGGAGTATGGATGAGTTCACCGGATCCACT 

AGGCAGATATTGCAACCTCCAAGACOVCTACTAGATTTCGACCTAGTAATAOT 

4 5 GGTTTTGAAGATCATGATGAGACAGGAGGTGATGAAGGGGGTTCTGGAAG 

TCTCAGCGAAAGTGAAGATGATGiATGATGACGACTGTCAAGTGGAAAGAAAGAAAGACAAAA 
CTTCCCAGCAGCCTCCTCCTGAAAGCAGCGACAATTCCCAGGATAGTCAGTCT^^ 

T ATATAT AAT AAAAAAT AAT ATGT ATTT AATAT ATAAAATTT TCT CAGGCTTT AAT TAAAGGG TAGG CTCTTTCCT AAAT ATT CC A 
TTTTAGTTTGAAATATATAAAATTTATTTTTAATTACATGT^ 

5 0 TCCAAACCCAATCCAAATTG CATG TAAGT AAG CTCT C CCAC TGAAAT AT AAC C CT AGG CTAGCATATGGT AATTT ATGTGAT AATT 

TT ATACTATTGAG T CITTGGGACAG AG AT ACCAAACA.TCTTGT TTTAGTTTT GAAAAT CCAAATG T AATACAGACATAAG G ATTG A 
T TTTTTTAG^ ACATTTAG AACTGTACTGATAACAGATGGTTTTATG AG GG AAATACT CAAAAG AAATTAATTAAACCCTTTGG AAG 
TAGCACAAAACHTG AAAAAAATTG AAACAT TCTCAAAGCC T CAAATTG TGGT CTCAT AGGATTGG AAAGT ACTAGT AGGT AAG GAT 
TCTCTCAGTTCTCTCAATTCTCTCTCTCTTTGTCTCTCTCTTTCTCTCTCTTTTTGGTTT^ 

5 5 CTGGCTGTCCTGGAACTCACTCTGTAGACCAGGCTGGCCTCAAACTCAGAAATC03C 

AAGGTGTGCACCACCACTGCTC^CAGTATTCTCTTTTAATAATAAAATTTGTCTCAAG^ 

AGTGGTTGAG TC TCTAGTG AGAG CT AGGTCTATTTTTGT AT CTTGG AG CAATAGAACTTAAATTCTGGAGTATGAAAC G TTAAGAC 
TCACTTTGAGAAATGAAGT^GATATTGAATAACATTTGAAAACTATAA 
TTGATATAACGTGTGAACCAGCAGTCATTTTTAGAAGATGAGGTTCTCTTCCT^ 
60 AG AAGTAATCT AAT C CTT CAGTAG T TT TT ATGTGAAG ACTCACA.GGTCATTG AACT AC^TTGTACTAGCCAAGGATAG CT ATCTTG 
GTTGTTTCTTTGCCTCCTGACTCAGCCTCCCAAGTGCTAGGATTAGGACT 

TTCAGTTCCAGAGAATCTGATGCCTTACCATCTCTGCAGTCTCCCCAGGCAGGCTCA ' 
GTGTATTCAGATGCTGATTTCTATGCCTAGACATCTGCAGTCTGGACAATGGC^ 

6 5 AAAGTTGTTTCCC^TCACCTCTGATTGGGTAATAAAGAGCTGATCAGCTAGTTAGCTGCT 

CAGTCCTGGAAAGAGACTAGAGAGGAGAGAGGGAGAAGATGTGGAGAGACCATGTGGATAAACC^ 

TCTTGAGAGAGCAGCTATGAGGACAAACATGGACTAGAGTGAGCCAAGGCAAGACTCAGATGTAGGTAAAGGACCACATAGCT 
AAGTATACAGCATAACCAGGTTAGATAGAATAGCTCTGAACCTGCCCAGCTTAGTGCTTGAAGCT 
TCTCTGTCITTTATTCAGGAGCTAAAATGGGCTAGAGTGGGCCCT 
70 AAAG AATTAG TT ATG CAAAACACTCAT ATATAAATCTTT AAAA CCAGTGA 
ATTCCAGGC CAG CTAATTAT AAG CACCCATTCT CAGT AAAAG AG AAAATGTTTG ATT T1T1TGCTGATAGT ACAAAAAT ACAACT A 
GAAAAGATATGTAATAGGTTTTGAAGTAAGAGAATAGGGTTAGAGTACTTCATCAACCACTACTATTTT^ 
CTCCAACATTAATGTTTTTAAAAAACCTTGTGTTTTCTGATAACTTGAAACACTGGAAAGCT 

AAAACTTGG ATTTTACAGAGG ACTTGC CTGGTGTGC CTGAGAGTGTGAAGAAG CC CATAAGT AAACAAGGAGATAAAT CCAAGG AA 

7 5 AATACCAGAAAGATTTTTAGTGGCCCCAAACGGGTAAGTAAAGCATTAACACT 
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GTGTGTGTATGTATACATATATTTATCATTAGAGG^ 
TGTTTATCTCAGGATATCTGTAAATCTTTA^ 

TCATTCTAAAAAATGIAATCATATGTCTAAATATGGCATTAAAAATA^ 

ACCGT CTGTTTTT CCAGAGGTCCTGAATTCAATTCCCAGCAACCA CCCT 
5 CTTCTGGGGTGTCTGAAGACAGCGACAGTGTACrrCACTTAC^ 

AAT ACTGGTTOTTTAAACTGACATGAGGCTGGTTGGGAG CTGAGACTTTTTG CTTTTCTGTCTTGTAATAG TATTCTGCCACATGT 
CACTAGTCTAGAAAAAGACCAGAATTACAAAATACAGTTTCTACTAAAT^ 

GTAAGTCAGGGATCTTCTGTGTATCTGT CiTTTTGAC 1T1TTCCAGTTCTTTTAGC TATATACCTACAGTCATAC CATTTGTCAAG 

CTTG CTCAG TAACTGGGAACTX3 TTTTGCATT CCATTTC CTCT ATCAATTTCCCTCATGTCTTGG 
1 0 CCAACATTTACTACTAGACAAAAATTACTATAGGCTTGTT 

• GAATTTAGGACCAACCITGTCTAT ATAGTGAATGAGTTGAAGGCCA^ TGAGAT CCTGTTGT AAAAAAAGAAAGTTCCC 

ACAATAGTTAT AGAGTAGT CATGCTTTGATCTTCATTTCT AT AATGAC CACTG AGTTAG AAACTTGTTG CACATGCTTTCTAGTCA 

TTAGTGAGTCTACATGTTTGTGTATTCTGTGTGTGTGTATGTG 

NNNNNNNATATATATATATATATATATATATAT^ 
1 5 ATCTTCCTCTATTGTTCCCTACCTTAT1TTTTGAGACAGGATTTCACT 

CCAG CAAAGC CCAGGGAATCTTCCTGTTTCTACTTTTCTAGAGT ACAGG CATGTGTTACTGTGCCCAGTTTTTAT AGAAGTG CTGG 

AATTGAACTGAGGTGGTCATGCCTGTATGGAAGTGCTTTACTGACTGAGCTGTCTCC 

TTGCTAATTGAATCTTTTTGCCCTTTTT^ 

ATTAAAAGAACTAACATATGAGCCTGCTGTCTCACTCCTGGGCATATACCCAGAGAACTCC^ 
2 0 TCTCAAGTTCATTGCTG CTTTTATTGT AG CAGAGAAT TTGTAATG AG CATAGTTGTCCATCAAAAT ATGATTGAAT ATTTG GTGTG 

TGTATOTATGTAATAGAATACTATTCAGTTGTAAAGAAAACTATO 
CATAATATTAGCCTAGGCTATCCACTCTCAGAAAGAAAAAAACCACATTCTCTAA 

ACAATGTGGGCACATAGTATAACATGTAAGAAAGAAAAAGAATGCTAAATGTTAGGGGATGAGAAAAGACTG 
TATGTATAAAAGAG CACTTCCC TG AGT TGAAT AATGTTTGGGGCTACTrTTTT CTGTGTTTTATATTTTT^ CAATGTT 

2 5 TTGTATAATGTAAAATTTAGTTAAAAAAGGAATC^ 

CTTCCTCTTTGCCAAGTTTAAAATTTTTAATTATTTT^ 
CCAAATTTAATATAGGTTTTCAAAACTGTTTTTCACTCAGGGTTTTATA 
GTTTGTAGATAGGGTTTTGGTTCTCTCTGTGTAACAGTCCTGGAACTTGCTTT^ 
CTGACTCCCCAGTGCTGGGACTAAAAATAAGTGCCACCACTACCTAACTTGAG 

3 0 TGGTCCACCITCATTTTTATGGATTGCAGCATATAATACCT 

TTTGTTCTTGAACTTGCAGCTAATAGTTTTGTACTTGCTAATTTO 
TCTTCCTTTTATTTCTGTAAAAGCTCACTGGGTATCTTCAAAGTAATTCT 
GTGGTTTTTGATATATGTTAGTTTTCAGAGCATTATTTATCAACAGGTGTGGGGGG^ 
TTTTTTTGTTTTTTTGAGACAGGATTTCTCTGTGTAGACCAGGCT 
3 5 CTGCCCTGCTCATGACTTC^tTTACTTACTG 

TTAGATTTGCTCAAATGTATATGTGTTTAGGGTTGGTTAATTTACATTCTTTTT 

GAGTACACTGTAGCTGTATAGATAGATGGTTGTGAGCCTTCATGTGGTTGTTGGGAATTGACTTTTTTGT 

TAAGTACACTGTAGCHX1ACTTCAAATGCATCAGAAGAGGGCGTCAGATCTCA 
40 GGATTTGAACTCAGGACCATNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNN^^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

TG AT AAACT AATGATTTAACT ATTTGT ATGAAAAATT ATTTT CAAACCAAAAAAG TAG AAG AAT AACAAT ACTAAAC CTTTATTTT 
5 0 TGTAAATCTCTGTAGAGGCAGGGTTTAAAAAAAAAAAAAAGACACTTGAAGGCTTGGTGTGGTAGCCATGCATT 
TGGGGATGCAGGGGTAGGTAAATCGCTGAGTTTGAGGCCAGCCAGGACTACCT 
CTTTGTATTCAGTATATTTATTGTGCTATATTGGTAGAAATATTTGAAG 

AG GT AAG CTTTTTGAG ATGT ATTTT ATATTTGGTT CAGTACACCAAAACTT AAATTTT AGTTTTT TAAAGATT A^ 
AGCCTTTCAGACTGTTTACATTAGAATATTGGTTCTCCTTACATTTTGA 

5 5 TTTTATCCATTTGGAAAATAOTAGCCACCCTGAGTTGTGCAAACTTTCTAAGG 

ATGTTAATATCAAGCCAAAAAATTTTGTCAGTACTAGGAAGCAGTCAA^ 
TTTTGCTAGAGTTGATATTTTTTTCrrCATTGGCATCAAA 

ACAGAAATATTTTTGCCAGATACCAAAGTCTTAATCAGGATGTTTGTCAGTCTTTGAAGAG 

AT ACAG CT CAAAAAGTTGTTGTATGCAAAAGTATTT AATGCACAT T TG C CACACAG AAC ATTG AG ATTTAAT AAATTGAT AATTCT 
60 TCTTAGGCTTTATATGGAGCTCATGGATATGGTTTGTGGCAGTX3ATGAGAAGTGGTATAATTT 
AGACAACATCAAACTGGCTTCAGCACCATCCACAGGAATTTCAACACT 
AGTTTTGAATTTGTAGTCTCCTGAAAGAGTTTGATAAATCTC 
TACTTTATTCTGTTTTCAAAATTTTTATTGTGAAAACTTCAGATCCAAA 
ATTTTTAGCTAGTATTT^CTATCATGTATTCTTCCTTTCrATATAACAAGT^ 

6 5 ATATGTATATATATAGTGTAGTATTAAAATAATTTCATTGTTA 

TGTAAGGCTTTCTTTGTGTAGCTCTGTCCTGGAA 

TGCTGGGATTAAAAGTGTGTGCCACCACTCCCAGTCTAAGCGCGTCCCTCTCTCTCTCTCTCTC 

TCTCTCTCTCTCTCTCTGTCTCAAATTTAGTGATTTATTTAATGTATGTTGAGTATACT 

TTTTTTTTTTTTTTTTGGTTTTTCGAGACAGGGTTTCTCTGTAT^ 

7 0 CTOIAACTCAGAAATCCACCTGCTTTCTGCCTCCCGAGTGCTGGGATTAAAGGC 

CTTAAGAAAAAAAATTTCAGGCTGGAGAGATGGCTCAGTAGTTAAGAGCAOT 
AGCAACCACATGGTGGCTCACAACCATCTGTAATGGGATCTGATGCCCTCT^ 
GTACATGAGATAAAGAAATAAATCAATCTTAAAAAAAAGAAAAAAGAAAAAAAAACTTC 
AAGAAAATAAATGTATTAAAAATAATTCTTAATGCTGATCATATTGTTTTCATCTAAAGTCC 
7 5 CTTTTAACTAAGACTCTTGTAGGAGTTTTGTATGTTTTGTTTTTATATAGA 
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CCCCCCCCC CTTTTT IU'GTGGTrTrTGGTVJ.Tf T CGAGACAGGGTTTCTCTGTGTAG CC CTTGG CTGT CCTGG AACTCALrrri'Li'rAG 
TTTTGTGTTTTGCTTTGTGTGTGTGTGT^ 

TT ATTAGGAAG CTGTGAT ACTTG TIT Tl' AGGGCTGTTCCACATGTAG AAGTTATCTAATTAl"lTr LTTGATATTAT CTGACTTG CT 
5 CTTCACTATTGGCTTAGAAAGCTGTACATTAATAAATTCTG 

ATTTTTGATAATTGCTCATGTTTCCTTATNNN^ 

NNNNNNNNNNNNNNNNNNNNNWNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNIJNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
15 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNKNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNIJNNNNNNNNNNNN^ 

NNNNNNNNNHNNNNNIJNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNlflNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNIJNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
45 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
50 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN 
60 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 




65 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNHNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

70 NNNNNNNNNNNNNNNNNNNNNN 

NNNNNNNNNNNNNNGGATTTTTAGGAATTAGACTTGAAC^ 
TGAGCTGGGCAGTGGTGGAGCATGCCTCCCGCACTTGGG^ 
GAGTGAGTTCCAGAACAGCCAGGGCTAC^CACAGAGAGAAAC^ 
AAAAAAAAAGGAATTGATATTTGGAAATTGTGGTGTAAATTCTGGTGAA^ 

7 5 CTGCTTCCCACTTTTTGCAGCTGTCGTGCCTGTGCT^ 
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ACCCCTGTCTGTTGC T AAGG CAGlX/ri"!"!"!' ATAATAATT AGCCAAACTTACCTT CAATGCTTGGAAGGTTGCATTTTT CTTGG AAT 

GTTGGCTACCTTGATTTGCAATTAAGATTGCTTCTGGGTCTTTGAA 

ATTTCTACCACTACTTTTCAGTGTTCCTTTGTAGTTTAAA^ 

TTTTGGACrrTAATTTTTTTCTTTCCTTCT^ 
5 TCTTTATAAGTCAAGAGTACATATGGCTTTCTAG^ 

AATTCTTTTTTGTTTTCTGAATTAGTTTATATAGGTTC 

TACTCCCACATCTCACCCAAGAACAAATGTTCTTC 

TTTTGTCCAGTGTATTTATTTAGTGAGTGTTGCTTTTCTATTTGA 

GTGATTTCTAAAAACTAGACACCrGTGATGTTAATG 
1 0 TGAGGAGTGTAAAACTGCAAAGTCATGGCTGGCTGGCTGTATT^ 

TCTGGCTATGGAAG AAAGTCAGTTCAGTTGG CX'TrClTCCTTTGTCTTTAGGTAATGTATGCTGC TCTGTGTG CAAGCTGATCAGG 

CATACCAGAAGAAATTTAAAAAAAAAGTCTTTAAAGCTCACTACAATGTTTOT 

CTTTCTCTTATAGGAATAATATTTGAACACTAAACTAGGCAT^ 

AAAGTTAAAATATTTTTTCTATATAATCCTTTTCTTCATTGTCTTATTTA 
15 NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNm 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNKNNNNNN^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNN^TNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNM 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNIWCCTG^ 
3 0 CTGCCCAGTAGGTAAGCACITTTCTGGGCAACTATAAGGATTAGACrrT^ 

TAGTGCCTATAACCCCCTAGCACTGTGGAGTTAGAATTTGGGTAC^ 

AGCTCCCGGTTCAGTAAAAGACCTTGTCTCCTGTAAGATGGAGCTATAGAGAACACA 

GCACACACAGGCACACACACTGCACATATACTCACGTTATTC^ 

CAATGGTGTAAAGTCAAACrTGTACACAAATTACTTGGGATTAGTTCTGTCT 

3 5 ATTTGGAAT ATAAGT AGGTTT CTTGGTT CTT CTTTG CTTT AAATTTTAATTTTTCC CTTTGTTTC CTTAGTG AATGATATAA 

ATAACATT AACAT TT TTCCT AAAGT AAAAAJCTATAAAAAAG TTAAG ACCTGTTCCAC CCA 
ATTTCCTACCACTTGTAGGGAACGCACACGCACATGCACACACG 

NNNNNNNNNNNNNNN^ 
40 NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNl^ 
NNN ^ 

CCCTCCCTCCCTCCCTCCCTCCCTTCCTTCCTTCCITCCCCTCT 
TCTTTCTTTCTGTAAAATTTTAGCCAG^CA^ 

4 5 GAGTTCGAGGACTGCCTGGTCTACAAAGTGAGTTCCAGGACIAGCCAAGGCTATAC^GAGAAACCC^ 

ACAACAACAAACAAACAAACAAACAAAAAACAACA^ 

TTGTGTAATTTTTGCTTTTCTTGTTCACTGGGTAGCCTAAGACTTGTAGAGCTCTTTCT 
AAGGTGATAGCC^CrACTGCCTGGCrGTTTGAATTTGGCGCATATA^ 
GTGTGCTTTTACTTTAGTAATTTATGTGTATGAGTAT^ 
50 GTTGAGGCCAAAAGAGAGCATAGGATCTCCTGGAACTAGAGTTGC^^ 

GTGGGTCTTTAGGAAGAGCAGCCAGTATAATCrrTTACCrATGAGCTTTCTTTCCAC^ 

CAGAGTGATCCTAACCTCAAAGGAATGTTATTTAAAATTATTATCTTGAGCTTT^ 

CATTTAAGTGGGTAGCATATTTTTCTTTGA 

TTTCTTCCCCTTCTCTTTTTTTCTGTTGCTAGTAAGGCACTATACCT 

5 5 TTATTCAGATGGACTTATGCTG<3TAGAGTTACATTACTTTGTTCATGA 

CACTGTTTCTGTCATCACTGTAGTCTTGAGTTGATTTTGTAAAAACACAAAATCATACTA 
CTCCCCCAACATGCATACTTTTAGAGGCCTACTCAGTGGAGAAGAGCACAATGGTCTTATTCTTTGCT 

CT TAAGGG AAAAAAAAAAAAGATTT AT TTTTATG TGAATTTGTGT TTGTTAG AT CAC CTGG ACCTGG AATTACAG ACAGTTGTG AG 
CTGCCAAGTGGGTGTTGGGAGTTGAACCCAGTTCCTCTAGAAGAGCAGCCAGTTTTCT 

6 0 TCTTTCATTTATAATTGTAATTCAGTGAAT 

TTTT CATTG ACAGT C ACT CATGT ATAATTGAC CCACAT AGATTGAT C1ATGGGCITATTT TTT ATTTTTTGGTTTTTTT CG AGATAG 

GGTTTCTCTGTATAGCCCTGGCTGACCAGGAACTCACTCTGTAGN^^ 

NNNNNNNNNNNNNNNNNNNNN^ 

CG TAGTAAATTTCTG CATGTAAAATTTTTGTTTGTTG CATATATT CCTGGAGAAGTAAATAAGAATGAACATAGG ATGTTG T AAGG 
65 TGGCTGTTTTGAAACATGCTTACTTGGTTCCAGCTCTTGAGA^ 

TTCTGTAGA1TAGAGAGCAAACAGAAAGCAGAAATTGTCTAACAAGAAGTACATTAAGAA 
CCTTTCATACCAGCACTTGGGAGGTAGAGGC^ 

GTTCCAGGAAGCCAGGGCTACACAGAGAAACCCTGTCTCGAAAAGCCAAAATAAATAAATAAATAAATAA 

AAG AAAAGTGATT AATGTGCTATATAT CTACTTTTTAAACT AAAAAAAG ATGG AAGT CC TG GTAGTC CAGT ATAAGTG ATCTAATT 

7 0 AAG ACTCAT CTAG AG GGGG CTGGAGAGG TGG CTT AG CAGTTAAGAG CG CAGACTGCTCTT C CAAAGGT CCCAAGTTCAAATC CCAG 

CAACCACATGGTGGCTTGCAACCATCTGTAACAAAAAATCTGATGCCCTCTTCTGGAGTGTC 

ATAT AATAAATAAATAAATATT AAAAAAAAAAAAGACTCAT C CAG ATTT CT ACATGG CATTT ATGATC CT ATTTTTCTTGAAAAGT 
TGACTGCTCTTTTTAAAACTATTTTTAGTTACTCTACZAAAATCACTTGTA 

GGTGTTT ACATACACAATTCATATTGGAGCAAGTCGAATTOTT^ CAT T ATT AGGATG CTATT ATGAG CTCAGGGG TCAGT TCA 
75 CTGTTCTCTCCTTCrrCTCCTTCCCrATCTAACACTC^TATGTTATTTCTTTAGATGCATTTA 
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AATAAGCC CTGCAACTGGCCAGAGAGAT AATGGCTCAGTAGTTTCAT CGTGG CT CACAACCATTCATAACTCCAGTT CAGAGT ATC 
TAG CACCXjTlU"l"ITTG CCCTCCA.TGAGCACTAGGCATC CACATCAT ATGT ACACAT ACATAAAGG CAAATACTAATACACAT AAGT 
AAATCTAAAAATTTTAAAGCATTATGATTT^ 
TTAAAATGGCAAGCATCTAAAACTTTACAAGTCTAGGCT 
5 GACAGAATACCGTACAGOVAGAAGCAGTCCATGGAGGAAG^ 
GACAAGTCCTAGCAGCAGGAGTTGAGGCAGCTTGCTAGACT 
TACTTTCTCCTTTTCATGCAGTCCTACACTCC^ 

GTGCCTCACTG CTATGT CTGGTGGCri'G'lTr CTT AG ATGATTCTAGATC CTGTC^U^GTTGGTXjG CATTAACCATCACACAGAATAC 
AAAATTTTTTTGCTTTGGTTTTGAGAAAGTTCATTTAGCCCTGG 
1 0 G CTC CTCTAT CTCCTGAGTGTTAAG AT^ 

TAGTGCCGGCATTCCCTTTCCTGGTCTGAOTCCCTGTATGACGGA^ 

T ACTAAAATAG CTATAGTACTTAAAAT AATAATTAGTGTTTTGC C TG AT CTT ACAGTG TTTTAGTTAAGAATCTTGTATTTCGGGC 

TGGTG AG ATGG CTCAGTGGTTAAGAGCACCCGACTGCT CTTC CGAAG GT CCGGAGTTCAAATCCCAGCAAC CACATGGTGGCTCAC 

AACCATCTGTAACAAGATCTGACTCCACTCTTCTGGAGTGTC^ 
15 AATCTAAAAAAAAAAGAAAAAGAAAATTCTAGCTGGAATTAAAAAAAAAAAAGAA 

GAGAATTAGCCTTTTTCTTAACCCAAAGATTATTACCTAGAGTTAGTCCCAGCT 

ACACATATGGTTCACATGTCTATAAGAAGGCTAAACACCAACTCATAA 

TTTGCACTTTGAAATACTTTTCTCTGTCTTTGACT 

TGC^CTACCAGCCATACITACACACAGGA 
2 0 GTTAGAAGTTGCTCAAATAAAG GGTTT AATG ATGAACAAGGATAAAAC TTGT ATGGTG CTTTTCTGAGAT AACACGAATAAAG AGG 

CTAGTTTTT AG AATTTGTT AAAT CT ATTGTGTG CACATGTGACC C!AGAAAACCTG ATCAAGTCATTGCATGAGTT CAT AAATT ATT 

T AGGTATAAT AATGAAT CATTAATT AATGT AAATCTAGTTT ATAAG ATT AATGGGAGGGGCTGG AGAGATGG CT CAGCTGTTAAGA 

GCACTGACTGCTCTTCTGAAGGTCCTGAGTTCAAATC 

CTCTTCTGGAGTGTCTGAAGACAGCTACAGTGTATTTACATATAATAAATAAATCT^ 
2 5 GG C CATT CTTG AAGTAGGTTTTTGTGCAAGTACTTATTTTGT AT CAT AAAAAGG AGGAAG ATT ACCATCATAAACAGAAG 

CATTAGTCAGATCAGAGACTTGAGGGTTAGAGTGAGGACAGTTAGTGTCACAT^ 

TCATTClTriTTGAAATCAGTTTCCTTTTGAAAAT^ 

NNNNNNNNNNNNNNNNNNNNNNl^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNWOT 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
35 NNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 
- AGACCTGGAAGAGCAGTGAGTGCTCrrTAACCACTGAGCCATCrCT 

TCCAGATGCAGTTTTTTATGTGGGTTAAGTCTGAGACTTAACTCC^^ 

TATTTCTTTATCCCCAAGGAAGTACTTTATAACAAACAAAAA 
4 0 T AG ACTC CCATTAAGTGATGATT TGTTATTAGTCTAAATAATTTATGAATTT ATG CAATGACCTTATTAGGTTTTCTrGGGT 

GTATAGTGTAGATGAAACATAAACTCTCTGTTTTTATGTTCCTCTTAGAA 

AAGTCTTTTTTTTCTTCTGGTTAATTGGTTTCACA 

T AGG CATGG T AATAGCACTCACTGTTTTTGGT CT AAG AATGTGTGTTTAT G CATT TCAGTTATTTT ATAT AAATG AG ACCTTTTTG 
TAAGGGATACAAAGTGAAATGGAAGCAGTTGATTTTTTTTTTCT^ 

4 5 AGG CATTGG AACCAT CCAG ACTCGG AAGAATTG C CTGGAC CAC CAAT AG CAAAACCTCAG CGTGTCACA.GT AAGT AAAT ATTTTTT 

CTTT CTCTT AAAATAAATT AG AAATTAATCTAGAT AGCTATTATCTT AAT TTTTG TTCT TACAAGAT TATAATTG AAAGCTTT ACA 
AATTAAAACTTTGTAGTAGCTCAAGGCCAATGTAGGTTTTATACT 

AGAG C T T TTAAT CATGACTCATGT TTT AT AGAAT TT ATAGT ATTT AGGG CIT AAGAAAAATT ATTCCTT AT ATG AAGTGTTAGTAC 
TGATATGTGTGTGTGGTATTATGTATGTTTTTGTTTTGT^ 

5 0 CTGGCTGTCCTGGAACTCACTTTGTAGACCAGGCTGGCTTCGAACTCAGAAATCCGCCTGCCACTGCCT 

AAGGCGTGTGCCACCACACCCX3G<nTTATGTGTGTATT 
TTATGAGTAATTTCTTCTAGCATGTGGGTTCTGTCAATCAAACT 

CCTTCAAGGTCAAGGTCAATTTTAATGACAAGGTCTCATTAAATTTGGCTTATTTAGCT 
GTCTTTATACCATAAACTGCTGTGGTGCTGTCTGTCrGTCTGCCT 

5 5 TCTTTCTTTCTTTCTTTCnTTCTTTCGTTCTT^ 

C C AAACC CT CAT ATCCTTATG CTTGCATGGCAG ACACTTT AC TT ACTCTCAG CTT ATTG ATCCTG AG AATTT AT AAT AAATCT AGG 
CATGATGGCCAATACCTGTAATTGTAGTACTTGGAAGGCTGAAGT^ 

AC CTCACTCTTT CTC CCAAAGCT ATATAT AGACT AT ATTC T ACAACTTGTG CT CTGC 

CTTTTTAAGAATTATTTTGTTTTTATTTTATATGTAAGAGTGTTTTG 

6 0 TTGGTTTTG GTTGTT TTTGTTTTTGTTT'rTTT'IGG AGACAGG ATTT CT CTGTG GT ATC TTTGG CTGTCCTGGAACTT ATT CTGTAG 

ACCAGGCTGGTCTTGAACTTAAAGAGCATTCTAAGTGCTGGGATTAAAGGTGTTCTTCACTTCT 
' TCTTAAAGTGACTTTCATTCCTTAATTTGATTTACATAGAGCTAAA^ 

TTCTTGCTAGGTATG CTACCCACAATCTAGGCAACTAGGAGATGAAGCCCTTGGTCATCTTGAAGTACCACAGGA^ CTTA 

6 5 CATCCTCAGCTCTAAGAATGGAGGAGCTCTTTTGGTTTTTTGACAAGGTTTCT 

GTAGACClAGGCrGGCCTCGAACTCAGAAATCCGCCTGCCTCTGCCT 

G C T AGAATAGAATTCTTAAT AAACACTAT AATGTTTTTT CTT AAGT^ CAT AT ATACT AGT AAT TATTTG 
TGATTCTGACACACGCTTTGTTCTTT^ 

TGTCGTCGTTTTTTGAAGCCCTGTCTTAACTGCTTATCAATGCTACrTATGCATGC^ 

7 0 TG GAAG (^AAAG AAATGAT AGGAAATGTG AAGTTTTT AAAAAACCAAAAGT ATTT CTCAA CAAAAT 

TTAAGTCTTAGTAGTGCATGGAATTTAAATTATTTCTTAGGAACTTTCAGCAGCCTCCCTCTTCC 

ACTTTTTTAAAG CTAAAT TTGTCACATAG ACAAGATGTTACCTTTT AAAA CTT AG AACAACT AATTTAATAC CT TT AT 
TAAT AGGTG AGG CTGT CT TCAAAGGAAC CAAATCAAAAAG ATGATGGAGTTTTT AAGG CTC CTG CACCAC CACT CAAAGTGAT AAA 
AACTGTGACAATACCTACTCAGCCCTACCAAGAAATAGT^ 
7 5 TACTGTTTCACTGCACGTGGATAGATTTCTATGTGTAACAATTAGCCTTTTGTCT^ 
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TTTGAAC CAC7U\ACX3TACCTTATTCTAC'1T GTC CAATACTAGTGATTCAAAAACATTTTT TlTGTi'GCCATCGAATG AGTGATTTT 
ATAGGCATTTTATTAAATATAATGAGCCTATAG ATTTAAATATC CITATGTTTTTCTTGG CAGTT 

ATATACTGTTGTTCAGCACGTGAAACACTTCAATGATGTC 
TGTTAAGTGGCTTAAAGAGTACTCAGCCTCTAAACACACG^ 
5 GCTATGT<^TTAAGTTTTTCAACTGGATGGGTTTAAATTT^ 
CATATGCTTTTCAGTTTCCAGATGCTACAAGATT^ 
ATTTAAGCCACTGACTGATAGATTGATTTGAGTA^ 
TTGAGTTGGTTGTATTATCTATGATCAGTTGAAATATA^ 

GACTATGAAAAGGAACTAGAATTGGTGATTGGAGAAATACACTACAGAAACTGTC 
1 0 TAATCTTAAGC CTTGGTACATGTTTTAAAATTTTGG'r TTGCCCGTGG I^TTTATAITITGI'G TTGTCAATA r r i"i AAG CCTTCT ACT 

TCTTTCTTGTTTGGAATTATGGAAACTACAG CT AATGTCTTAGCAGTTTTT AGCTGC CTT AAC^TAGT CTGTTTCCTTCT 
ATCAGCTCAGCTACTAAATGTGCCATGCCCAGTT^ 

GG ATGATTCCCAG CATCATCAGGTAGGTATTTTTTAAGTTTCTG GGGTTGAGAAGTATTATT AAAAAAATT ACAGTTTTTGT CTAT 

ATCAGAAAATACTTTCACTAGACATTCATATA^ 
1 5 ATACATTTTTGCTTTATATATCrCATTTTCTTGAAGACGTTTT^ 

AGTTAGCATATTATACTGAATATTTTATACTATAAAATGATTTCCTTTATTC^ 

TAGAAAAGGGTTTTGAATTACAGCCATGGGGCTGGAGAGATGGCT 

TTCAATTCCCAGCAArCACCTGGTGGCTCACAACCATCTC 

TAGAACACTCATATAC^TTAAATAAATATAAATCAATCTCTTTA 
2 0 GAT CCGGAAAAAAGCTG CATT CTCAAAAGTAATAAATGGT AAAG ATT AACGAAC CAAAGTTAATTCTTAATGT CACCATAAGTT AG 

AAAAAAATAGTCATTACOVAACCTGACAGTGATATATTGAGGTGG^ 

GGCACAAAATTAATCTAATACGAATTCTTCTGA 

ACTTGACACAGAATAATGAGGACAATAAGGCACJ\AGGAATGTGTTACT 
CTTTTTCTTAGTGACTTCTTAAGGTCrTAACTGTGACATTCrrAA 
2 5 AAAAGATAATT AAG CAAT AAAGAT C CTGATTTGGTTTTTAC CCATGTAAGGGAG CTCACAAC CACCTGTGACTCT AGCTCCAAGTT 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNN^^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNl^ 
35 NNNTOJNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^^ 

AGTGAAACTGTGTCTCAAGAACTCCAACTACACACCTAACAA^ 

CTGATTTGGTCTCCATAGCGAGTTCCAGGCCATCCGGACTATGTTGTGAGACCCTGTCTTTAAAAA 
4 0 TTTTTTAAAAATCCAGCCTTTAAGTCCAGCACTGAGGA 

TAAAT AGTGGACAGG ACCAC CAG AG CTCTGTCTTGAAAAC CTTAAAAAAAAAAAAAGGGTATGCT TGTTT CTTTTT AAAAGTTAGT 
TTATGAGTGTTTTGCTTGCTTGTGTGTCTGGTGCTTGCAAGAGGCCAATAGAGTT 

TGCTTGGAATAGAACTGGGGTCCCCTGGAAGAACAGCTAATGCCCACTCTTTCTGCTGAGTCATCTCCTCAGCCTC 
TTATTTCAGTCCCCTGTATTCTGATGGAAGGCTTA 

4 5 CATATTTGTT ACTTTTTTTTTTT AAAGAAAACAAAAOT TGCAAAAGGA 

AATATTTATAAATTACATAATTCATAAAAATC 

GATAGAGAGG AAAG CACAACrTTTAAATCAGAGTTTT ATTTATCAAAT AG CTATG TTTC CCT AGGCT ATT CAGTAAAT 

CAAGCATTATCAGGGTCTCTATCAAAACCATCTATAGGAATATTAGACAA 

AAGTTGTTAGTAATTGCTTTCATGAATATTTCTCT 

5 0 TATTGAGT AG AG ACCGT TTGAACATGGATCTTG ATAGGG C CAGC CT AG AT CTC ATG ATTCGGCTTTTGG AGTTGGAACAAGATGCC 

TCTT CAGCTAAG CTACTG AATG AAAAAG ACATG AACAAGATCAAAGAAAAG ATCCGAAGACTC TG TG AAACTG TGCACAACAAG CA 
TCTTGATCTAGAAAACATAACGGTGGGTTGTAAACCTTTTATTTTTAAGTAATATATATTCTGTTTCAAA 
GAAACTTCTCTCTTGAAGTGTTTAAGTTTGGGTTGGGTTTGGGGGACAGATAAAAACTTCCTTTAAGTCTT 
CTAGATTGCAAGGGATTAGCAGGCTGAAGGATAACTAACTAAATC^CTGCCTTCAGCCCTGCTGAT^ 

5 5 AACTT CCTTTATTATATCAAGAATTAGATTACTCTCTGTTGAGAAG TGGCTTTAGG CTTT TTT CTCCACTTCTCTG CT CATTTTTT 

ACTTTCGTAAAGTTGGCTCITTTTTAATCATACTTTGCCTTAAACTTTATACTATAGTGTGGCTC^ 

TTCTCATGTTTACTATGTTCTGAAACrATA 

TGAACATTTAGGGCCACTGACGTTCCCATACT 

60 GAGTTTAAGGTCAGCCTGGTCTACAAAGTGAGTCCAAGAC^GCCAGGGCTGA 

AACAAAAAGTAG ATT ACTTTT AAATGAATTTT TTTCATGAAAGCAAG C TGT AT AG TTAAATCACCGTTT CT TTAGTGTACACTGAT 
TAAAATT AG TTTATT CAC CTATAAAAGC!AC!AT ACTGTTCT ATTT TTGACTT ACTG G AATCTT AAT AAAGAAAAATAAATCCT AAAA 
TTAATGCATCAGAATT ATT CTAACC TTG TGGTGG CG CACACCTTTAATCCCAG CATT CTGGAGAG AGAGG CAGG CGG AT CT CTGAG 
TTTGAGGTAAGCCTGGTCTACAGAGTGAGTTCCAGGACTGTCAGGGATACAC^ 

6 5 AAAATTATTTTACAC C T CAGT AT TCATTAATGGTATTTCT G AATTTTAGACTGGT CATTTAGCTATGGAGACATTG CTGTC C CTCA 

CTTCCAAACGAGCAGGAGATTGGTTTAAAGAAGAGCTCCGACTTCrGGGT 
TACATX3CATACATACTCTAGATAAAGGTCTGTGTATGTAC^ 

ACAGG CAGGTGGATCTTTTG AGTTTG AGG CCAG C CT AGTCTACAAAAC CAGTT CTAGGACAG CCAGGGCTACATAGAAGAGAAATA 
CTGTCTTGTAAGCACAAAATTGGAGACACrrGTGATCAAGGCAATTTAT 

7 0 AGTCTAGAATGAGCATGGCAGTAAGCAGGCAAGTGCTGGA£ 

TTTAATCCCAGCACTCGGGAGGCAGAGGCAAGGGGATTTC 

TAGGGCTT AT ACAGAGAAAC C CTG T CTCAAAAAACAAAACAAAACAAAACAAAACAAAA 
CCGAGGAGGGAGATGACTGGGAATGTCCTGAGTGCTTTGAAACCTTA 

AT CTCCTAAT CCTTC CTAAACAGTCACACAG CTATTGACCAAGCCTCCAGAT ACATG AT C CTGTGGGGT CCATT TT CATTCAAAC C 
7 5 ATTGT ACTTAACTAG CATCAGGCAT AAAATT ATGTTCCCATTCCAG C ATAT ACGT ATTCAT AG CATGTAT GTAG GT AACGTTTGGT 
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GTTAG C TAAGAAATAATTATTGAACTTTCATAT ACTAACTTTTGTCT ATTTC 

AGTAGAGATGATGAGGACGAAGAGAAACTAGTAGCCTCATTATGGGGAGCAGAGAGATC 

GGGCAGCTGGACGGGGAGTTATCTCTACTTGAGATTGTATGACCACrTGATC 

ATCTGAATTAAAGTTGTAAGGATTCTTTT CAAAACCAACTGT CAGTAAATGGGCATTTT AT CATTTT AAGAATGTATTGTTTTGCT 
5 GTATATCCTtitfVTAAATACTTGTGATCCT^ 

TGTCTCAAACCAAAAGCTTTTATTTGTATATTAT^ 
TATAGTTTTCTTAATACAGAAATCTGTTTTTGTTAAAACACGAA 
AGTTAGTACGCTTTTGTTTGTTTAGTTTTTAAG^ 
TTGGGTGAC^GTACGTATGTTTTGGTCTTTGATTTTAA^ 
1 0 TTTAGAATTTATAATTGCmATTAAG 

CTCCTTGTAGGGTCCTGTTTACTAGGTTATACATTATTTCTrTATTTAA 

ATTCTCAACACITGGAAGGCGGGTGAGTTTCrGTGAGTTAAAGAT 

TGTAGAGAGATCATATCTCAAATAAATAAACAGACATTAAAAAGTAAAATATT^ 

CIXJT AT ATTGGAGGTTTTCTGCATTTCAAAGT A TATG AT AA 
1 5 ATT AGT AG'iT AGTTGGGATCGCACTG TGTAACTTTAGCT CACGTGTAGTTCACTATGTATAC CAAGATGTTTTTG AACTCACA 
GATGAGCCTGCCTCATGAGTGTTGAAATTAAAGATGTGTGCTACATTCTGGTT 
GGTTGTACAGGTATTTCTCTGCATCAGGGCTTCATAA^ 
CCAATATATTGGCAATACTAAATAGGTACArAATACAAAAAGTTTTAAAAT^ 

TTTTAGTTTTGGTTGGTTGGTTTGTTAGTTTGTTTTTTC C CT AGT ACTTG C TATG 

20 TAGACCAACCTTGCCTCAGACTCAGGGATCCACCTATCTCTC 
ATTAAACAGTAATTTTTAAAGTCAGTCATT'TTGAC^ 
TAAGTCTTTGTTATATTTAATCAAGTGACTCATTTT^ 
GTGTTTCTTAGCTOTTTXjAGCAGTGTTTGTTGGTGTGTGACATGACAAC^ 
TAAGTGAATTCATTAAATGAGCCATGGGTGATAAGCATTTCTAACAACATTTTAG 

2 5 TGGTTCATTAAAAATGAGAAATTTTCTATACTGCCTGCTTTTT 

TTTAAGAAATTGTGCCCTTAGATGACAAGATCTGTTTCIATCTTTTT 
AAATGAAAATTATGGATTGTTAG AGTCATTTTATAT CCATCCCTTOT 
TAGCCTATAAAGATTCACAACTCATTATTTCATCAGCTAAGTAAGTTTTATAAACT 
TGTATTCTATTTTTTCTTAAAATGTTTTGTCTGTTGAGTTTTGTGAATATGTACATTA 

3 0 TGAGGT AAAATGGTTTGGTGAGAATTGTAC^T CCTTT ATGTTAG CT AT AAT C CAGGAC^AATGACTTGTT ACAAGGTGTTTGTG GG 

CAGGGCAAGAAGCCTGGGTACTAGGATGGTTTTCATCACTAAACAAGTAGATATATCT 
TACAGTGTATATATAATACATGCAAAGATGATGTTTGTTAAT^ 

AATCAAATTCTTTAAGCCTACTAGAACATTGCTTAGTATATTGTCAATGGTAAATATTTTA 
ACTCCATTTCTGCCACGTATTTATATCTCATTTTCCAGATCT 

CTGTCCTAGTTCTCTGGCCTTTATCTTAGCCATTATGCCACCGGGATTTAACTTTCT 
AAGTTGCAATTCTATATGACCCTTGGCTGTCTTTGTAAATTTCCTTATACAAG 
TTGTGTATTAACCTITTATTTGGAGAGTTTCAAATTAA^ 
TAATACATGGGTGAACCTTGTTTCTGTCTATAACCTCATCTTTTGT 

4 0 T AATGT AATTTTGTTAATTTTAT AG CACATT ATTAT CCTTAACATAAG CAATT CTTC CTTAT CAAGG AACTTC CTTGAAGAT T AAA 

ACCTATGCTTATACCAGCCAAGTGGTCTGTCAGTCCTAGTCCCTGGTTrTTTGTTTGT 

TTCGTGATTTTAAAGTTOTTTTCTTATGCITCT^ 

TCTTGC^TGGGGTTTTCTTGTGAAATGTTTGTAGAAGTGG 

4 5 CTT^CAAGAAAACACTTATAGATACTTAGTAATATTTAGCACAGACAGTTTAGTTACTAA 

CCT CAAACATTAAAGAAG CAAATTTTTTAGGTTTGACAGTAAAAG AACATCTTTCTAAC CAAG AACTGT AAAAT ATTAGGAACCTT 

ACAGGATTTCCTCATTGCATATTAAAAATAAGCTAGGAATAGATTAAATATGTAGAATTT 

AATTCATGAAAAATAAACTGACAACGCCAGTTTACTTGATCTGTGTTAGACTTGAAAA 

TGCCTTGGCAG CTG GAG CGATGACT CAG CAGGT AAAGAAACTT GT AGT AT AG ACTTGAC CAG AAAC CATG CAAAG CTTTAAGACGT 

5 0 GCATCTACTCTACAAAGTTGTCTTCrrGACCTCCATATGGGTGATATGGAATCTTTGCTTACA 

AATAATCAGAATTTACAGATAGACTTGTGTTAGCATTCTACTTCCITGTTAGTGTGTGTCAG 

G C CAGAAGACC TGTT AATGTT CCTCAATTAAAAATAATG ATTTG TCT T AACAT^ C 
GAGTAACTTTTTTTTTTTAAAGATTTATTTATTTATTATATGTAAGTACACTC 
AGATCTCATTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGATTTGAACTCCGGACCT^ 
55 CCCACTGAGCCATCTCACCAGCCCCTCGAGTGACTTTTATA^ 
AACAACTAACTGGAACCGAGAACTGGCATTATTTCTTCAG^ 

ATCTGTAAAATGTGACAGCTCCTGTGAGTAACACGTTTGTTTTGTGAGCCCCCCCCCCCCATTGTTGTG 
GAGACAGGATGTTATTCTTTTGTCCCAGCTAGTA 

GGGACTGTAAA(^TG ATGCACAAACATGGAATGAACTTACAAC CTT AACTG TCTT TAG AGAGAAAT TGT AATGATCAAGTATACTA 

6 0 AAATTT TG CAG TT AG CAAAATT ATTAG CATGTGTGGTACATTAACATTGTAACTTTAAAAACAGG AAAACAAAACAAAC TTAGTAT 

TTTTTTAGG CC CCAGTTAATAAATTATACCATAATTATTTACATTTTATTTTCTCTTAAG AGCAT 

TTCAGCAGTACAAC CGTGCTGAGAACAG CATCTGTGTAG CAG ACAGT AACCCT CTGCCTTAC CAGAATGTAACT AACCAT GTGGGC 

AAAGCAGTGGAGGACTGCATGAGGGCTATAATTGGAG 

GCATTTGCAGATAAGOTATTCTGTTTTTCATGTACTCATTTACAATAT^ 

6 5 ACTTTTAGAAC TG CCT AGT AAATTAGTCTTTGAATATCAAAACT CTG TATTTCATTAAAGAAAT AGTT ATTTTAAAATAAAAATTT 

TAGATGAATACTTCGTAGCCGAGTATAGTGTTTACACCTATAATCCTAACTTGGGGCAAGGCAAGGGAGGCT 

AAAGTCAGTCTGAACTACGAAGTGAGTCAAGCAGGATTAGAGTAAAACTA^ 

AAAATCATTITAATGTGATTATTAGTGTTATCAGG 

AAG AAG GACTCATAGGCACAG CGATGAACTCTCTTCTTCAGG TTCCAA^ CT AC CTCAGGAG CAG AGATTTGATATTCGAGTG 

7 0 CTGGTAAGTTCAAGTGTGGTCACTTTGTAATTCTA^ 

CrGTCATTTATTATAATCTGTTTTGTTTTGGTGGATCCTTATGAA 
ATAGAAGTATAAATGAAATGGTGGTTGATAACTCAGGGAGCTAAGTGGCT 

ACAGTGTCTGTCTTACATAGTCGTTGCAAATGGAAAAATGCTGCTO 
7 5 AAGTT<ZTTGTGTGT13ATGATTATATTTTTAAATGTTTTAT^ 
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TCGACACTGCCTTGTCAACATG<M 

CCGGACAAGTTCATGCTGTTCAAGCTTTAGTGCAGGTGAGCAGC^ 

GTGCACTTGAAGTCAGCTGAGCTCCTCAGCCTGAC^ 

AGTTTTGAGAAATGTTCTTTTGTTTTGT 
5 ATGTCTC CATTTGAT AC1TGGAT AATT CT CATTAGTAACTTGGT ACT 

GTGTTGAACAT CATAGCAACGAAAAGAAAAAGTCATTQAAGAGAAGGGATTTGGTA^ CTTGTTTGTATGGATACCTGATAG 

ATAGTGAACTATGAATAGTCITCCTTACACACTTCCTCC^ 

TATTTATTGTTTATATTTTAGCriTTTTATTGGTCTTCCGGT 

ATTTTAGAAATGX^TAGGGATTTTTAAATTTGCTTTTGTT^ 
1 0 AAAGATAATTAATGGAGACTTTTAATCITGGAATTAC 

GGTGGCGCACGCCITTAATCCCAGCACTCGGGAGGCAGAGGCAGGra 

TTCCAGGACAGCCAAGACTACACAGAGAAACCCTGTCTTGAAAAACCAAAAC^^ 

CTTCAGTTTTGGTTTTCAGTAAATGTAGGACAAAAGCT^ 

TGTTAGTACTTGATGTCTACTTTTTTTTTTTT^ 
1 5 TTGTACA.TGTGTTTCTTTAATT ACAATTGT AAGTTT ATM 

GTAGTATCTTTTTAATG ATTTT CAAAAATAATTTTTATCAG CTATTT CTCGAACGAG AG AG AGCAG CACAATTGGCAGAAAGTAAA 

ACAGATGAATTGATTAAAGATGCTCCTACCACTCAGCATGATAAGAGTGGAGA^ 

ATC^VACTGAAAAGACTGATGGTGCAGAGGAGAAGCAGAAGAAGGAG^ 

CCTGAAGTAGTAATAGAGTCACAGTGAGAATAATGGTTTTTGAGTTCTT^ 
2 0 AATG CCTTT ATAGTTTGT CTAGGTAAGCTTT GTGTGT CAGTGAG AGAT CACTT 

TTCTTGTTTGGTGCTTCT CCTTT ACTCTGTACAG AGCAGCAG CT AAGTTGG GCATGAG AGTG CACAG CTT AATT CTAGCATGAGGT 

AGGCTGAGGCAGAGGAATGCCACTTGGAGGTCAGCCTGGGTTTTTTCAATGAGACT 

TGAGGTTCTTTT AT CTG CCTCAC CTTAAACTGAGGCGTTCCAT CAAAATTT ACAGGGAGTAGTTCTC AG AGCTT ACTAGG CAAAGA 
GTAATAACATCCTAGCTAGAGGCAATAAATATCCTTACTAGTGAATTTATTGA 

2 5 ATATATGTACATTCTGCATAATAACTTTTTGTAACTAAACCTCTGAGATTTTGGGGGTTTCTTGTTTGA 

CTCTATATAGCTCTGGCTGTCTTAGAACTTGTTCTGTAGACCTGTCTG^ 
TTAAGGCCTGTGCTACCACTGCCCCATAAGGCTTTGAAATTTAATATATTCAACACTT^ 

TCTTTCTGTT CGGCTATTCCTTCC CTAAGTT ClG ATACTCX3GTCTCT T ACTTGAAGAC CTGTGGT TCAATCTAT C TTATCAACTT A 
TTTCAGTTGGATGCATTTTATGGCATTTGTTAGAACTATC 

3 0 GCATATGACTG<!ATATAGGCACTTCATAGTCATT^ 

TTCACTACTACAGTGATATTAATTGTTTTCCAGTGTGGTAGCACCAGCCCATC^ 

TTCAGG ACT ACCTTG AAGG ATATTTTT CTTAGTCATT 'ITATTTGATG AAGAAC TTTlTUTTT CATTT ACAGCTCTCTAAAAACTG A 

TTGTTTGTAACATCTrTGTTTTTTGGTCTTTTTGTGTATTTGAAGA 

ATTGGTCTTTTATTATGAAATTGAAAGAATTCTTTATAAATT^ 

3 5 AACTGTCTTCCACTTTGTrATTGGTGACCTC 

ATTCTTTTTTTTTT TT AATTTGAGATACTGAATTGTACATG CATCATTTTTAGTAGCTTTTAAGGTGATAACAGTATAATTGATGT 
T ATGGG GTCTATAT CAG CTT TGCACTGT AATAACT ACTT ATTT AACTGTTTTCM CATGC TGGCAAACACATGGAGGA 

CTTGTATCTTTGAAGATTTTTTGCAACIAGCTTCATGCCATTAATTAA 

4 0 G CTTTTT ATTTCAG AAG AGCGG CTGACTGTTTAG ACAGTTTT ATTTATTTC CCTG T AG AAGAAAATGTCAAG AATATT TCAAT AAC 

TT CC T AGTC4GTTTCCTTT AAAC'iT '1"1T AAAAAT AGAATGTG AACAT CTTTATTTCCTT TG CATT ACAGGCTTTTAT AGAAAGAT CG 

GAGATGTAGTG^CTTGTTTGTTGGTCTTTATGATTTTTTAGAACTGA 

GCAAGCTGGGGGAGGGGGCCTCAGGATGTAACTC^^ 

TGCCACGTAAACTGGTTATGACAGTTATGCCTAC^CCCAGCATTTGAGAGATGAAGGTAGTTC 

4 5 CCACCTGGGGCTAGAGACCTGGTCTCACAAAAACTTAATTAGATACTT^ 

AAT AGTTTGGGATGTTAAAGACTT T AAAATAT TTCTTG ACCTTTTTAAGG ATACATTC TAAAG AAAG AATGAT^ 
GGGAAGTTCTTGGTATGGATGATTAAGTCCCTGCTGTTACGTGTAGTTGATGTAAAAGAAGGGAGGX3TTTA 

TGGCCAAGTCAAATTTGG CTCATTTTAGTGGCGCAGTGGAGTAACTTGT CCTGCACAGGTTTTATTGTCCTTTTTCTTGGTTAAAA 
GCACTCAGCATGGGCTGGAGATTCGGGCGCAGCTCTTAAAAGCACTTGATGCT 
50 CATGTCAAGTGX3CTTATAACCATTTATGAGATCAGCTCC/^ 

CAAACC CACATG CG CACATAT ATACACG C AT AT ACACAT AT AT ACACAT^ 
TAAAAGCACTTGTC^TGGTCATAAGTGATCATACTTAAGGTCCAATCT 
TTTTTAAGACATCATCCTTGGACTTCTTAACCATCT 
CTTATGTTCATCAGGAATTATCCCTGkGAAATATGTGTCTAT^ 

5 5 CAGTCTGTTGCCATTGGAAAAAACAAATTTG AACAT 

GGAATATGTGCCACAGGTAGTAGTAGGTAAGTGAAAGGTCAGTAGT^ 

ACAG TCATGTTT AAGTGT AT TGATAATAG ATCTTGAAGT TGTATT AG TCTGATGTGTG ATCATT ATG GATAATCTGT AGCT TTTGT 
GTCCCTAAAAAGGGGTTTCTGAATTTGTTGACAGTATTC 

TTTTGTCAAT C CTTATAATT AGAAAACT AGATATTAC CACAAGT ACAAAGTGAATTGG CTTT CC CATTGTGG AAGTGGAAGAGATA 

6 0 AAGCACACATATAATTACGGGCAGTAAGCTGTCCCTGTTCTGTGGCCCCATGCTTAG^ 

TGTTTACCTACCTTGACTACCAATTTGCACTCACT 
CCTCTCTGTAlTCrrTTGACrCTAGCACAGTTTCACAAGATCT 
TGGATTGACTTGTCACTTTAAGAACATACAGTTACGCCTGCTTCT 
GCTTTTCAAATAGTCCATGCTATATTTTTCCTCTTTGAACCT 

6 5 ATGTCT AGTTCT ATAT C CAAGTAAAGAGG AATTTGAACTAATG AAGTG CTT TCATT CACATCATGGG AG CTATT T ATTATTGTAG A 

GACA.TACTTTTTTACTG AG CAGT CATT CAGAGTTAC CTCAACCATTCTTTT CTCCAC CATT ATG G ATGTTT ACATGTGATGATGAT 

TGGACTACTTTTGTAGATC&TGGTTATTTTAGCAGGCCTTTAAATTT^ 

TCTTTTTATTGCTGTAAAAAGATGATGACGAAAAG 

ATGGGAAGTTGCCAGAAACTCCTAG^3AATCCTGCrrTACT 

7 0 GTTGTOTGCCCCATCAGTTATCAATGAAGAAAACACCCCATAGACTTGTGTATAGGTC^TCTGATO 

TCTTTTTCTCAAGGTGACTCTAGTTTGTGGCAAGTO 

CITGTGCCTGGAGGC(IAGAAGGGTATTAGATCCAATGGAACTC 

GGAATCAAACCGANNNNNNNNNNN^ 
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AAGATTTAGTTAATTTTTGTCTTCTAATTTACCTTTGTAGCATC 
TGTAGAGTCTGTCTATATATTTAATTAACTGTGCAGTTTATTTCAAACU" 
TTCTTTGTACCTTTGGCTTTGG CTAGCITTGTTGTTTCCCCTTTGTTTCTg 
ATGTTTCreCACAATAGGTAAGTGAATGTGGACTGGAAATGACGTATGAAl'l^^ 
5 GCAQAGCCAACAG CATC^UVGTGAGTTACAGTATCATTTTTTACCAG CTTGACTGTTTTTAAT CTGTTT CTTCAGAT CAATGTAACT 

ACAGTAAGGGAATATCTTCCZAGAAtSGAGATTTCTCCAT^ 
AAGTAACETTTATTGTATATTGCAGTTTA^ 
TCCACATGTTTCAGCCATTGGACCAGTGGGACCTGGCTAGTTT^ 
ATAATCCCATAGTGTTGTTCTTCCCTGTGGCAAAATAGTTAACATAAGCAGCI^ 
1 0 TTTTTGTGTGTGTGTTTTTTTTTCTGTTTGTTTTO 
CCCTGTAGACCAJGGCTGGCCTCXsAACTQ^GA 

CCAGGCTAGTTCTAAAATTTTT AGTCATGTGTGGCAAGAG CAG CGTGGAAGAG CAGAGTGGC CAGCTAATAGAGAATGCCTGTGTA 
TCTGCTTTCTCCTTTTATTCCAGCTGTGCTTCTAGGCTGTGGGATGATGCCACC 
TATTACTAA.GGCAATGATTTAATTTCAGTGTACTCTTTACTACGTAGACTT^ 
1 5 TAACTCT AATTT CTCTCCTTC CCCTCTT CT CT AGAGTGATTGAATATTTGGA 

ACATTGCTAGCTGCTTTACCTTTGCTTCAGGTGCT 
TGAAGATATACCAAGAACATTCATCAGTATCATTCATTCGTGTTTG^ 

GTTTGC T AAATG ACAGTTACT ACATCAATCTG CAGCT ATCAAAAATGAG GG AAA^ T CAGG CTGTTAACAATCC CATGCAGTAT 
TTAAATACACTTACTTGGCAGAGTCTATACCTGCCCCT 
20 GCTGTTAGTGCAACTGCTGTGTATGTTGAGCCACT 

GTTCTGAGGACATTCTAGACAACAGCTTAGATCCTTTT^ 

AAAG CTTTT AATAATTTTGTGAATTTTT TG GTTGTTGTT CC CTGAACTACT 

AAGGGATTAATAGTATATTTTTTTATTCTTGATTAGGTTTGGGTGTTGAACTATTTTTCA 

GT CAT ACCATT ATGTGTCATAG CTAT AGG CACAAGAAAACAAAT AG CAGTTTGAG GGAATATTATGTTATAT AAG ATAATG TG CC C 

2 5 TCTTAAAGGATTAAGCAAAAATAGACAAACCC^ 

GGGACGTTCTCTAGATATATTTTCTAATGTTCAGTTCAGTAAATATAATAAGGAAGCTAA 

AGAT AGCGTGT AT ATTCTT CTATAGAGTG ACAGGAT CAATTG CAT AAG CG CAAAG CCTT AAAATTG CTGGTGT AG AG AAG AC CCTT 
TTTT ATT CAGATTCT ATGTTCATAGAG CAATTGTTT AGAAAATGGTTGT GGAACACACAGAACATG GTTT ATAGGGTAAATTT CAT 
AGATTGTTTTTTCTTTTTTTTTTTTTCTTTTTTTTTTTTG^ 

3 0 TGGCCTGAGC^CAGGCTTTCCAGAGATGTTTCATTAAT 

TGTAAAACTTAAAGC CAT AAAAATATT AGTTTGGTGTGTATTGG GG AAAAT AG C!T AAAAGTCT AACTTTG ACT CATTTTAGACTTT 

GTTATTTCCTTGTATAAAGTAACAAACCGGGGCTCTTGT^ 

GTCTTCCTATTTTTGATAATGCAGATTGTTGGGAAATCTA 

GCCCAACCCCCTACCCGTCCGTCACCTTTTAGGAACACAGTATGCCA^ 

3 5 TACCAAACTAATTTAGTGAATTTTTTTTTAAACATACTGGTAAATGA 

TTAAAATACATTGATATTTATTAATCATTGGATTTGGGGACAGATTTTTGACrGTC^ 
TTGGATGAGATCAATCAGGGCGAATTACATTTTTCTGTTACACTGGTGATCATTC 

TT TGTTTTGTTT TTAAAATACT AACTAATTGT CAAGCACTGATG AG ATG CAGG TTTGG TG GGGTGGGCTG TGAGG AAG ACCTCACC 
TTACCAACTGCAGTGCGTCTCTGTGTTTTAGCCTCAGA 

4 0 TTTTAAGGTAACCTTTTTTCCATTGTAAATACTTCTGTAAATACT^ 

AATCCTATTTTTATTTTATACAGTATTTCTCAGCTGTGATCTTTGGAGCAAAAGCCAACGGCAGGAAAAA^ 

TCATGAAGTATGTCTTTGGGTTTTTGTAAATAATTTTAACTCAAATAAAATTGATAC^ 

GTCTATTGACCTGTTTTGGAGGAAGAAGTCTCAGAACTTGTTAAGGCCAGCAC^ 

CTTAAATATTTTATATACTTGTAAGTACTGGTCrTGAATGGATG 
45 GAGTGTATAGTTAAATCCTTAGAATGCCCTCTATGTTACTTGAAAACTTTTGGCTTTCTGAATATGACGGTAAATTTCTTTG 

TGAGGTTTGATAGCCATTTTTTAAGCTTTTAACTTTTTTTTTTCTTGTTAGCCTTAAAG^ 

AACCTTGCCTTACTTGATTTAGTCGCAAGAGTAGATTACAGGATTTTCCCCTGAAI^ 

ACACAAGTGTAAAGCCACCTGTGAGAGTGGACAAATGGTC 

TTCCCAGCTCTGGTAGTTGCTGTTAAGTTGGAGCAAATTCTTTTTTTTTT^ 
50 TGTCCTCAGTCACACCAGAAGAGGGCTTTGGATCCCATTACAGAT 

AT CT CTGGAAGAGCAGT CAGTGTT CCTAAC CAATGACACATCT CTC CAG C CCTG AAGCAAATTC TTAAAACTAT AATTTAC CCAG T 

TT CCTT CATTGAAAAATGG G AG ACAGT ACAG AC CTAC CTGATGGTTGTAT AGATTAAATATGTGTG AAACATT ATT AT AATGCTC C 

AAATTTAATGTTTATTAGTAAGTGCTAGTTTTTACAGT1TTTTTTTTTTCCT 

TGGAAATGAAATAGAGTGCTTGTGACTATTAAGGGCATTATATGTAACACTGGTGCCT 

5 5 TTTG ACAC CAGAT T CTC CAAATT ACTATTT CCAAACTTGGC CTTGTT AATCTCTT CAT GATG AAT C ATAAAGCAGAGGG AAATTGA 

GCAAACTTTAATGCITTCTAAAATGTTTCAGGTCTTTTTC 

TCTGTTGATGGCTTCCAGATGGAGATATAATTAAGTACCCTATGTAGAATGTCTTTGAACAAACTCTTGA 

TCTTTCTCTCrCTGACrrTCTGTATATGTGTAGTTTGAAT 

GGTTGTAC&TCTCTAAGAATTTGCTGGCTAAAATCATACTAC^^ 

6 0 GACTGTTATAGTGAATGTAGACACTGAACATTCCATTGTTAAGGCAGAGATTTTGAGATTGA 

CACGGTTACAGGATTTGTTGATTTAGTAGTTTGTGGTGAAGGTGGGTA^ 
AGAGCTTGAGTTGCTAGAGTAGTCAAAGCAGAACACT^ 

G CTTGTGTGAGTT TT C CTCTGG CT CT C1ACAAAAGTAAAACATATCAG AACAGGAAAAGT AATTC TGTCTCTGCAAGAG AAAT AGCC 
AAAGATGTCCCCTAAATGTCGTGGTGACTAATTGTATTAACTCAAAACTAGTTA 

6 5 TGAATTTAAATGACACATTAAGCCGGGCAGTGGTGGTGCACACCTT^ 

AGTGAGTTCCAGGAGAGCCAGGGCTACACAGAGAAACCCTGTCTCAAAAA 

GAAAAAG T AAAAATACACAATGCAAGTG CTAATGAAGTATTGATG T AAG ACATAATTT AGGG CT G AGT AATGT ATTTCTCCAAG AG 
CATTTTG T AATCACG AGGTGTGGGGTGTATACAGAACACAACAT CCT AAT ACCACAGAAGTG TTTAT AAG TATGTGAT AAT AACAA 
TCATATCAGTAGGTAAATTCACAACCATAGTTTCAGATTTTAATACAACTTCAGCAAGTAA 

7 0 AC CATG AC CAAGG CAACTCTT ACAAGGACAGCATTT AATTGG AG CTGGGTT ACA^ CATT AATGC CAAAG CAT 

GGCACATCCAGGCAGACATGGAACAGGAGCTGAGAGTTCTGCATCTTCACCTGAAGGCTGCT 

TT AGAAGG AG ACAG AGG CACACTT CCAACAAGGCTACACTTAAG AG TG C CAATGAG CCAGG CATATT CAAACCACCACATT CT ACA 
CATGAGTCT AAGGGGG CT ATAC CT AGT CAT AACATAAAAAATATATT CAGTC CAG CTTTCAAAGTCC C C ACAAGTCTAT AACAGTC 
TCAACAATCTTTAAACGTTCAAAGTCTCTTCTGAGATTCATC 
7 5 ACTCCACACTCTGCATCTCCATGGCTGATGTCAATTTG^ 
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GGTCTGGTTCC^CTACCTGTTAACAGCTTTC^ 
TTCTTGTTTCAACATCTGGGAACTACGCATGATCT^ 
CTGCTGCTGTGCTTGGTGGTCATTCCATGGCCCTGGCATCTTC 
CTCTCATAGGCTCTCTTCATGGTCCTCAACTTC^ 
5 TACCAATGGCCTTCCATGGCCTCTCATAGTGCCAAGACTCAGCTG 
GGGTGACTCTI7lCATGTTTACAAGTCCAGCrGCAGC^ 
GAACACACTTCCTAGAAGATTTTCATCTCAGTGATACT 

NNNNNNNNNNNNNNNNOT 

1 0 TTTG ACCATGAACACGGAGATGTGCATGG CTGTGTCT CCTGAATG CTGGAATG AAAGGTGTGTAC CACCACTC CTGG ATTTTTTT C 

TCTACTTGGAACTTGCTCTATACCAGGCTAACCTTAAACTCAGAGATCTGCT 

ATGTTTGGACCTAGGCTTAGCTGAGTGGGATCTTGTCCCAAGATGACCACTCC 

ATCACCATTTTACTTTTTGGTGCCCCrrTTTTTACrrTGAACCATACA 

AAAACACTCI^CACCAGACTAAACCAGAGAACAAAGCCTCTGCTGGAC^ 
1 5 CT CTTAACCTTAG CCTTGAGAGAAGG CAAAAAGCAGTCACATTCCTCAACAAAATAC CACAAAAACAGT CTTCAGG C CACATACTG 

AAATTCTTCACTTGAAGCCACTTGGGTCAGGTCTGCACAATTCAACT 

AG CC CTACTT AAAG CATTCCTC TG CTTTCCAAATC CGAAGTCT CCAAATT C ACATTCTTCCAAAG AAAAGAATG AT CAGG CC TATG 
GCAGTAATACCCAAGTCCCTGGTACCAACTTCTGT 

AAGG C CAACATTTACTTGGG GCTGGCT TACAGGTTCAG AGGTTC AATT CATTAT C AGTG CAGGAACATG CATGG CAGGGTCCAGG C 

2 0 AGGCGTGGTACAGGAGATGCTGAGAGTTCTGCATCT^ 

GGTCTCATTGCCCACCTCTACAGTGGCACACTTCCCTTTATTTAGCAAAGCCA 
TGCCACTCGCTGGGCCAAGCATATTTAAACCACTATAAATAGCTTAGTTTTGTTATTTT 
AT ACCTATAATTCTAG CTACT CAGGAGGCT ATGTCAGGCCAATTACGTAAGTTTAGGAGTTAAAAGGC CTC 
GTCTCAATAAGTATGTAATTAAATGTGAAGTTACAGAAAAGTTGAGTGGATGAGTTG 
25 TGAACAT AGATTTTTGTATTGCATATG CAAATTT CACCAAGAT AATCTG AGAGCT CATAAAGGAGACTT CTATG CATTTCAG ATGA 
CTGAAGG CT ACAG AGAATATAC CTGTGAATCAGCAAT CAAAAATCAGAACAGG CCTGG AG AGATGGCT CAG CAGTT AAG AACACTG 
ACTGTTCTTCCAAAGGTCCTGAGTTCAAATCTCAGCAACCACATGGTGGCTCACAACCATCCGTAATC 

TGGTGTGTGAAAAAACAG CTATG ATGTACTTACATGT AATAAATAAATCTTCAAAAAAAGT CAAAACAAG CAACAATTG CAAACCA 
T ATTCAGATATTAAGCAG CATGGG CCAGTG AGTAAGT AAAAGCAAGAAAAG AT CGTGCCAATTAAAATACAATACATTGGAGGGTT 

3 0 TTTTTTTTTTTACTTTGTTTATTGCCACATTAAAAGCAGTACTTAA 

AGTAATTTAAAACCCAGTGTAAGTTTGAAGAAAGAAAAATTAGATTAGAAATCAG 
TTAAAAAAAAAAGTTGAGCCCTTAAAACCAAAAACTGCCAGTGTTGGGAAT 
AAAGAAATAAGATGTTACAAATAAGCCAAAAATCTTTACAACTAAAATTTTAGAATTAATTGAG 
CAGGCCAGATTCTGTGGTATGTATGTGTAAG<^TATTTGGGAGGTGGATGCAAAAG 

3 5 GATAGCTACTTGAGACACTGTCACACAAAAATTGTGTC 

AGGCAACATGTCCCCAAAGGTACTAGTTCCACAGAAATGGTTTCCA 

TATAAATACATCCAATGAAGTCAAAAGAAGG CACAAGT CTTAAG AAT ACAGTCAGCTGACT GAAATGAAGTTGAT AGGAGTT ATTAA 
TGCAG AAG T CAATGTCG AG CTAG AAAG ATTGAAG AATTGCTTAGAAATAGACAGT AAATT AAAAGGTT TTAGGGAG TCACAGAT AGA 
ATGGATCAAAGATTGGTTCAGTGTTGGGGATAGAAGGCAAGGTAGATGAAATGTTCAGACAAAGATAAAG 

4 0 TGGGGCACTATCTAAAGACCAAACCTATAAATTCTGGACCTGGGAA 

CAAGAACACAATTGTGATTGAAG TTT CCT T AAAC CT AAAG AG ATG C CCGATCAAGT ACAGAAAG CATAACACCAAAT AGAT AAG AA 
CAGAAGAAACCTTCCTGTATCATAGCTAAAACACTAAAATACACA^ 

TATAAAGGCACACCCCTTAGAGTAGCACCGTACCGCrCrCAATAGAAAGGTTGGAGGGCTTGAG 
GCACAACTGTCAACCCAGACTACTATACCTAGCAGAACTATCATAACT^ 

4 5 AGG AATTCATGATT ACAAAGCAACTCT ACAAAAGGTC CTTGAAGT C C T CTAGACTG AAG AG AAAG AT AAACACATCATGGAAG CT A 

CAAGAAAAACCAAACAAAACAGCAAATGAGG CAAATGGTTAGAAAGTAATATT AAG AGAT AGATTGTGTGG CGCACGCCTTTAATT 
CCAGCACTCGGGAGGCAGAGGGAGG<?GGATTTCT 

CACAGAGAAACCCTGTCTCGAAAAATTAAAAAAAAAAAAAAAAGATTGGTATA 
TTTATCTCCIAAGAAATGTACCTCACCATCAGATGCAACTTTAGTATAAA^ 

5 0 GGTAACAAAAATTGACTTCAAAATTAGAACCTAACTGAGGTCACTTTAGTGTOT 

GCTAGCCTGGAACTCCTGTTTCCCCTTCCCGTCTCACGGGAGTC 
TGGGTTCTGGGGAGCAACCTTCAGTTGTAAGGCTTGAA(2ATCAAGCAT 
GTTAAAAAAAATATTGATGGAAACA.TCAGGTAGGTGGGATACAGCAAAATGGGGAAGCCT 
CAGTCTTATAATTTGAGGTGATGGAAGCCAGGGGTCTGTCTGTACATTACACATTAATT^ 

5 5 ACACACAGAG AT TGG CTTTAAATAGCTATTTT TAGG AAAT AAGGTCT CT CAGAATAATTTGG CT CCAACC TGTAACATT CCAACTC 

CCCACAATCATGGACATTCTAAATAGTCAATCAGATGTGTAGAATGTTTAACACAGA 

T CAGTT CACAAATCT CAT ATG AG AAGAAAGAAGCTAAAGTGCTGG CAG AAGAAACTTT C C T CTGAGAGG ACAGGGTTAG C CAGGGC 
CAGAATCCATGGCAGG CTGCAAACTGTCAGTTTGGGAGAGTAGGCCTAAGTTTCTGTTTT CC^ CAAA 
TCGATTCTAGGAAAACCCACCCCTCACXaGCCAGCCAATCAGGAAG 

6 0 ATCAGGAGCTGTAGAAGCTACCCCACCACCTUIGTATAATTAGCAAN^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNN^ 
NNNNNNJJNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN^ 
65 CTATCAGTTTGTTTTCTAGTTTCTTATAAGATGCAGTGAGGTTATGAATCTTAATGTCCCAGA 

AACTGTCGGATTTGGAGGATCATGAAGTGTCTGACCATGTATGTTTACCATGTATGTTCCTGGCACT 

GAAAGATGAACAGTGTACGTTGCTCGAGGTTGCTAGTTG CCAG CAGGACCTCTGAGTTGAGCACrGGTGCrGAGGGAAGAATGCTA 

ACATCATCTTAGGGTGTTCCTTCATCTTAGTTTTGGTATGGTGGTGAAGCTGATCCCTGGGAGC^ 

TGACTGACAGTTGGATTGTCATAGAACTTGAGTGATCTGCTCT^ 

7 0 AG GTGTACATCG AGGATTTC C CTTGACTG GTTGTTTCAAAAACAGATTGTTCACTGGGCT^ATAG CATAG T GT ACTGG CTAGTTTT 

GTGTCAACTTGACACAGCTGGAGAAATCACAGAGAAAGGAGCTTC^ 
TTCTCAATTAGTGATCAAGGGGGAAAGGCCCCTTGTGGGTGGTGCCATCT 
CTGAGCAAGCCAGAGGAGGCAAGCCAGTAAAGAACATCCCTCCATGGCCTCTGCATCAGCT 
CAGTCCTGACTTCCTTTGGTGATGAA(^GCAGCATGGAAGTGTAAGCCGAATAAACCCrrTTC 
7 5 GATGTTTGTGCAGG AATAG AAAC C CTGACT AAGACACATAGC CTT C TCAT GTCATCTACAAAGACCAAAAATT AGAGACAGAGAGT 
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GGCAGGATGTTGGCCTACCAGGAGGAAAAGAACCTGTT 
ACGCTATGTCAAATCAGGTGTTTCTGACTAGCCAGGGTTCTATCT 
ATGCACAGGCCATGCTTC^TCCATGTA^ 
AGAAACTGCTTGAAACACTITTGGTTCACATTAGAGTCTACTC 
5 TG CTCTTATAGAAACAAT ACTTATACAAAACAAAACCTTTTAC CATTTT ACAGT ACAACATTCTGAAGTTAGT AT^ 

GTAATATGCTAGAATCTTTGATTTTAAAAATTGTGGGTTGGG CTGGTG AAATGG CTCAGCGGTT AAGAG CACTGACTG CTCTTCAA 
GAGGT C CTGAGTTCAATTCCXIAG CAACCACZATGGTGGCTCACAACCAT CTGTAATGAGATCTGACAGCC CTCTTCTGGAGTATCTA 
AAGACAGCTAGAGTGT ACTCACATATAATTAAAAAA^ CCCAGTGCCACATAGAACTGGAC 
ATGGTGTTACACACCTGTAATCATAGCTAG CACCTGGGAGGCAGAGGTCAGATGAACA 

10 

MOUSE SEQUENCE - mRNA 

GGGGCCGGGAGCGCAAATGGCGTTGAGATGGTTCAGGGCCCTGT^ 
CAGAAGGCGGCGGCGGCCCAGCGGGGGCACAGCCCCTCGCCTCTCTCGGTG^ 
CGCGCCCGCCCGCGGGCCGTTGGGAG03GGAG 
1 5 ACAGGTGT CAAAATGACATCCAGATTTGGAAAAACTTACAGT AGGAAAG GAGG AAATGG CAGTTCAAAATTTGATGAAGTr.LT TTC 

CAACAAACGGACTACTCTTAGTACAAAATGGGGTGAGACCACATTTATGGCT 
ATATTCAAGAAATTCCGAAGAAACCTAAAGTAGAAGAAGAAGATACTGGAGATC 
CCTGTTTCTTCAAAAAATTTAGCCCAGGGTAAGGGTTCATCTTACTCAGAATCTAGTG 

TGTAT TTGAAGCTAATAG CAAATGTAGTCATGTGGTGGGTG AAG ACAGTTTTGC TTC CGACAG ATG CTTACTTGTGGAGGATACTT 
2 0 TAATTG GGAAAG AGAAG AGCAT AAGTAGAATTC CAGAAGACAACG CAAACAAAAGT AGTTGCACT AAGT TG CTAACTT CAG ATAAA 

GTGGAGAATT TT AGTGAAGAACATGAAAAAAAT AGTCACCACTTT CACAAAAATG CTG AAGATAGTACT AAGAAACCCAATG CAG A 
AACCG CAGTGG CTTCTGAATATAAAG CTGATGAAACT AAAGAAACAAATGATACTTGG AACT C CCAGT CTGG AAAAAGAACAGAGT 
CTCCATCTGAAAGTTG TCGAGTCAAAGG ATCTTGTAAG AACTGG TTTATATG AATGGG^ 
GACTGTATTTTAAGTTTGGATAATGAGTCTCTTTTGGAGATGAAAGACG 

2 5 AAATGAAACCTTTGAAGAAGATATCATACAAAGTGTTCTTAGGCCAAGCAACTGT AGGACGTAC CCAATAAAG CGAGAT 

CCTCACAGGGAG CAT CAAAT TTTGATAAG CTAATGGATGG CACCAGTCAGT CCTT AGCCAAAG CAAACAGTG AATCAAGT AAAGAT 
GGCCTGAATCAGGCAAAGAAAGGTAGTGCAAGTTGTGGGACCAGTTTTCGAGGAACAG 

ACATCCAT CTTG CTTGT CAGTGTGT AATGTT ACCAT CCAGG AT ACT AT GGAACGG AGTATGGATG AGTTCAC CG CATCCACTCCTG 
CAGATTTAGGAGAGGCTGG<:CGGCTCAGAAAAAAGGC^ATATTG 

3 0 AAATCCAAAAAGGATGTTAAACTTGAATTTTTTGGTTTTGAAGATCATGATGAGA 

T AATT ACAAAATTAAATATTTTGGCITTGACGAT CTCAG CGAAAGTG AAGATGATGATG ATGACG AC TGTCAAGTGGAAAGAAAGA 
AAGACAAAAAAAGAACTAAAACAGCTCCATCACCTTCCCAGCAGCCTC^ 
AGTACTAATAATGCAGAAAACITGGATTTTACAGAGGACTTGCCT 
AGATAAATCCAAGGAAAATACCAGAAAGATTTTTAGTGGCCCCAAACGGTCA 

3 5 ACCATCCAGACTCGGAAGAATTGCCTGGACCACCAATAGCAAAAC 

CAAAAAGATGATGGAGTTTTTAAGGCTCCTGCACCACCACTaVAAGTGAT 

AAT AGTT ACTGCACTGAAATGCAGAAAAG AAGACAAAG AATT ATAT ACTGTTGTT CAG CACGTGAAACACTT CAATG ATGTGGTGG 

AATTTGGTGAAAATCAAGAGTTCACTGATGACATTGAATACTTGTTAAGT 

CTTAGTGTTATCAGCTTAGCTACTAAATGTGCCATGCCCAGTTTTCGGA 

4 0 TAAAACTTTGGATGATTCCCAGCATCATCAGAATCTGTCCCTCTGTACAGCT^ 

ACATGGATCTTGATAGGGCCAGCCTAGATCTCATGATTCGGCTTTTGGAGT^ 
GAAAAAGACATGAACAAGATCAAAGAAAAGATCCGAAGACTCTGTGAAACTGT^ 
GACTGGTCATTTAGCTATGGAGACATTGCTGTCCCTCACTTCCAAACGAGCAGGAG^ 
GTGGTCTGGATCATATTGTAGATAAAGTAAAAGAGTGTGTGGATCATTTAAGTAGAGATGATGAGGA 

4 5 T CATTATGGG GAGCAG AG AG ATGTTT ACG AG TTTTAG AGAGTGTAACAG TGGA.T AATCGA.G AGAATCAAAGCTACTTG AT AG CCT A 

TAAAGAT TCACAACTCATTATTTCAT CAG CTAAAG CATT AC AG CATTGTGAAG AC CTGATTCAG CAG TACAAC CGTG CTG AG AACA 
G CATCTGTGTAG CAG ACAGT AACCCTCTG CCTTACCAG AATG T AAC T AAC CATGT GGG CAAAG CAG TGG AGGACTGCATG AGGGCT 
AT AAT TGGAGTATTG CT CAATT TAACTAATGATAATGAGTGG G GCAGCACAAAG ACAGG AGAACAAG AAGG ACTCAT AGG CACA 
GATGAACTGTGTTCTTCAGGTTCCAAAGTACCTACCTCAGGAGCAGAGATTTGATATTCGAGTGCTGG^ 

5 0 ACCTGGTGGAGTATAGTGCCCGGAATCGACACTGCCTTGT 

GAAGGCGATCATAGTTTAAGGCTAGCCGGACAAGTTCATGCTGTTCAAGCTTTAGTGCAGCTATTTCTC 
ACAATTGGCAGAAAGTAAAACAGATGAATTGATTAAAGATGCTCCTACCACTCAGCATGATAAGAGTGGA 

GTGGAGAAATACAGTGGGTATCAACTGAAAAGACTGATGCCCTTC AG CATG CTGGC AAACACATGGAGGATTGCATCGTAGCCTCC 
TACACAGCCCTGCTTCTTGG GT GTCTCTGCCAG GAAAGTC CAATCAATG T AACT ACAGT AAGGGAATATCTTC CAGAAGGAGATTT 

5 5 CTCCATAATGACAGAGATGCTTAAAAAGTTCTTAAGCTTCATGAATCTTACGTC^ 

CTAGAGTGATTGAATATTTGGAACATTGCTAGCTGCTTTACCTTTGCTTCAGGTGCTTGGTAATG CTGAAGCTATC CTTAGACAAA 
GAAAAGACATGAAAGAAGTCCITGAAGATATACCAAGAACATTCATC^ 
ATTTCTTCATCATGCATTCTGCGTTTGCTAAATGACAGTTACTACATCAATCTGCAGCTAT 
TGTTAACAATCCCATGCAGTATTTAAATACACTTACTTGGCAGAGTCTATACCTGC 

6 0 TTTAAAGAGGGCAATTTGTGCTGCTGTTACTGCAACTGCT 

CTTAGCT ACTGAGGTAT TGAATGTT C TG AGGACATTCTAGACAACAG CTT AGATC CTTTTTCAGG CT CATTTGCTTTTG CTTTTTG 
TTG AATG ATT CGAATCX5TAAATAAAG CTTTTAAT AAT TTTGTGAATTT 

ATG AG ATGG AATC CAAAGACACAAGGGATT AATAGT ATATTTT TTT ATT CTTG ATTAGGTTTGGGTGTTG AACT ATTTTTCACTTC 
TGAGACCATGACCATATTCAATGTC^TACCATTATG 

6 5 TGTTATATAAGAT AATGTG C CCTGTTAAAGGAT T AAG CAAAAATAGACAAACC CAGGGT AGTTT ACAC TTAATGCTGGGG AG G CT C 

TTGAAACATTGTTACGTTTTGAGGGACGTTCTCTAGATATATTTTCTAATGTTCAGTTCAGTAAATATGGT 

TTTTTT ATTAATTGG C CTGAGG ACAGG CTTT C CAGAGATGTTTCATT AATACTTTTAATTAC C TTTCAGAT AGTT AGATCATGTTT 
CTTCATTGGATTTGTAAAACTTAAAGCCATAAAAATATTAGTTTGGTGTGTATTGGGGAAAATAGCTA 

AT'rriAGACTTTGTTATTT C CTTGTAT AAAGTAACAAACCGG GG CT CTTG T AT CCGTGCC AG CTGTAATGTTTTT AAATGCAGTG G 

7 0 CTGCCTTCTATTGTCTTCCTATTTTTGATAATGGAGA 

TTCCTCCTACCCGCCCAACCCCCTACCCGTCCGTCAC 
ATTTGCCCTGAGTACCAAACTAATTTAGTGAATTTTTTTTTAAAC^ 

T AATT AATTGT CTT AAAATACATTGATATTTATTAATCATTGG ATT TGGG GACAGATTTTTGACTGTGGCAG ATGGT AG GGAACAG 
AACAGT AAACAG T TGGATGAG AT CAATCAG GGCX5AATTACATTTTTCTGT TACACTGGTG ATCATTTGAGAATT GATTT ACCT CAG 
7 5 TGTTTAACAGTTTTTGTTTTGTTTTTAAAATACTAACTA^ 
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GAAGACCTCACCTTACCAACTG CAGTGCGTCTCTG TGTTTTAG CCTCAGAGAACT CT CATTCT AGGGT ACTTG AGGCTGACTTGCA 
AAGTGACTAAAGTTTTAAGGTAACCTTTTTTCCATTGTAAATACTT 

TTTTCTGAATCCAATCCTATTTTTAl-r ItATACAGTATTTCT CAG CTGTGATCTTTGGAG CAAAAGCCAACGGCAGGAAAAAATAG 
TTTGTACCAGTTT CATGAAGT ATGT CTTTGGGTTTTTGTAAAT AATTTTAACT CAAATAAAATTGATACTTT AAATAC^ 
5 T 

MOUSE SBQUENCE - CODING 
ATGGTTCAGGGCCCTGTTCAAACTCCAGCGCTGAC 

10 GGGAGAGGCGGAGGCGGCCCGAGGCCAAAGCACCCGCCAGGCGCCGAG^ 

GGAAAAACTTACAGTAGGAAAGGAGGAAATGGCAGTTCAAAATTTGATGAA^ 

ATGGGGTGAGACCA(^TTTATGGCTAAATTAGGGCA.GAAGAGG€CCAATTTCAAA 

AAGTAGAAGAAGAAGATACTGGAGATCCCTTTGGTTTTGA^ 

GGTAAGGGTT CAT CTTACT CAGAAT CTAGTGAGG CTGCTCAGCTGG AAG AAGTCACTTCTGT ATTTGAAG 
1 5 TCATGTGGTGGGTGAAGACAGTTTTGCTTCCXSACAGATGC^ 

GAATT C CAGAAG ACAACG CAAACAAAAGTAGTTG CACTAAGTTG CTAACTTCAGAT AAAGTG G AG AATTTT AGTGAAGAA 
AAAAAT AGTCAC CACTTTCACAAAAATGCTG AAG AT AGT ACT AAGAAAC CCAATG CAGAAAC CG CAGTGG CTTCTG AATATAAAGC 
TGATGAAACTAAAGAAACAAATGATACTTGGAACTCCCAGTCTGGAAAAAGA^ 

G AT CTGTAAGAAC TGGTTT ATATGAATGGGATAATG ATTTTGAAGAT AT CAGGTCACAAG ACTGT ATTTT AAGTTTGGATAATGAG 
2 0 TCTCTTTTGGAGATGAAAGACGAGGATTTAAAAAATCGGATTGGAGGA 
ACAAAGTGTTCTTAGGCCAAGCAACTGTAGGACGTACT^^ 

AGCT AATGGATGGCACCAGTCAGT C CTT AGCCAAAGCAAACAGTGAAT CAAGT AAAGATGGC CTGAAT CAGG CAAAG AAAGGTAGT 

GCAAGTTGTGGGACCAGTTTTCGAGGAA(^GTTGGACGGACTAGAGATTA^ 

TGTTACCATCCAGGATACTATGGAACXKiAGTATGGATGAGT^ 

2 5 GAAAAAAG GCAGATATTGCAACCT C CAAGACCACT ACT AGATTTCGACCTAGTAAT ACTAAATCCA 

TTTTTTGGTTTTGAAGATCATCATGAGAGAGGAGGTGATGAAGGGGGTTCTGGA^ 

tgacgatctcagcgaaagtgaagatgatgatgatga(x;actgtcaagtggaaaga 
catcaccttcccagcagcctcctcot1aaagcagcgacaattcccaggatagtca 
tttacagaggacttgcctggtgtgcctgagagtgtgaagaagcccataagtaaacaagg 

3 0 gatttttagtggccccaaacggtcacctacaaaagctgtata 

gaccac caat agcaaaac ctcagcgtgtc acagtg agg ctgtctt caaaggaac caaat caaaaagatgatggagtttttaaggct 
cctgcaccaccactcaaagtgataaaaactgtgacaatacctactcagcccta^ 

agaag acaaagaattatatactgttgttcag cacgtgaaacactt caatg atgtggtgg aatttg gtgaaaat caagag ttcactg 

ATG ACATTGAATACTTGTTAAG TGG CTT AAAGAGT ACT CAG CC T CT AAACACACGTTG C CTTAJ3TGTTATCAG CTT AGCT ACT AAA 

3 5 TGTGCCATGCCCAGTTTTCX^ATGCATCTGAGGGCACATGGGATGGTTGC^ 

TCAGAATCTGTCCCTCTGTACAGCTGCTCTCATGTACATATTGAGT^ 

ATC TCATG ATT CGGC TTTTGG AGTTGGAAC AAGATGCCTCTT CAG CT AAGCTAC TG AATGAAAAAGACATGAACAAGATCAAAG AA 

AAGATCCGAAGACTCTGTGAAACTGTGCACAACAAGCATCTTCtATCT 

GCTGTCCCTCACTTCCAAACGAGCAGGAGATTGGTTTAAAGAAGAGCTCCGACTTCTGGGT 

4 0 TAAAAGAGTCTGTGGATCATTTAAGTAGAGATGATGAGGACGAAGAGAAACTAGTAGCCTCATTATC 

CGAGTTTTAGAG AGTG TAACAGTG CATAATC C AGAG AATCAAAGCT ACT TGAT AGC CT ATAAAG ATTCACAACT CATTATTTCATC 
AGCTAAAGCATTACAGCATTGTGAAGACCTGATTCAGCAGTA 

TG C CTTACCAGAATGT AACTAAC CATGTGGG CAAAG CAGTGGAGGACTG CATG AGGGCTATAATTGGAGT ATTG CT CAATTT AACT 
AATGATAATGAGTGGGGCAGCACAAAGACAGGAGAACAAGAAGGACTCATAGGCAC^GCGATGAACTGTGTTCT 
45 GTACCTACCTCAGGAGCAGAGATTTGATATTCGAGTGCTGGGATTGGGTCTACTC^ 

GG ACAAGTTCATGCTGTTCAAGCTTTAGTG CAGCTATTTCT CGAACG AGAGAGAG CAG CACAATTGGCAG AAAGT AAAACAGATGA 
ATTG ATTAAAG ATGCTCCTAC C ACT CAG CATG ATAAG AGTG GAG AG TGG CAAGAAACAAGTGG AG AAAT ACAGTG GGTAT CAACTG 
AAAAGACTGATGCCCTTCAGCATGCTGGCAAACACATGGAGGATTGCATCGTAGCCTCCTACACAGCCCT 

5 0 TGCCAGGAAAGTCCAATGAATGTAACTACAGTAAGGGAATATCTTCCAG 

GTTCTTAAGCTTCATGAATCTTACGTGTGCTCTTGGAACAA 
GCTAG 

HUMAN SEQUENCE - GENOMIC 

5 5 AGAACAATGG GCTGTTGCAGG AAAACAAGT AACATGT AGGAAAG ACAA^ C CTT AGGACAACAGTTGGACATAGGAT AGTTT 

ATAACAATGTCTGTTTAGGTGTGGTGCCAAATTCTCTCCAGCCACAAGA 

ATTTATGACAACTG GAAGTTATGACAATTGAGTT CTTTGGGGGAGGAT CTAGTTTTAGGCAG ACAAGGGAGTT CAGGAGGAAAAAA 

AAGCTATTCTCAAATGTTTCCAGATTAAAATATATGTTTTGTTTGTTTGAGACAGGGTCTTG 

GTGTAAjGATCATGGCTCACTGCAGTCTCGAACTCCTGGGCTCAAGCGATCCCCCC^ 

6 0 GTG CAT ACCACCACATCTCATTGATTTTT AAATTTTTGTAG^ CTCAACAT ATTGC C CAGG CTGGTCT AGAG CT CCTAG 

GCTCAAGTGATCCTCCTGCCTCCGCCACCCAAAGTGCTGGGATTACAAGCATGAGCC^ 
AGTAGCATATTCTGGACGCCTTTGCCATGAACAGATACAAATATCTGAACATTTACrT^ 

CAATACGAAATATTTT AGCACAAAAATCAAAG GAG CGT AT AACAAGTTGAGAGTAT AAGACAAATG AAACTTAAAATTTTTTGTT^ 
TTTAATATTC^TACAACAGCAAAGAAACCTTATATCATGAATGTTTTTTCAG 

6 5 TGGCCGGGCGTGGTGCCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAA^ 

TCCTGACCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCTGGACATGG 
TCCX3GAGGCTGAGTCAGGAGAATCACTTGAATCO?GGAGGCAGAG^CTGCAGTGAGCCAG 

CAGCAGAG CAAGACT CGGT CT C CAAAAAAAAAAAAAAAAC CAT CCTGG CTAACACGGTGAAACCCCGT CT C CACTAAAAATACAAA 
AAAAAATTAGCCAGGTGTGATGG CGGG CGCCTGT ACTCGGGAGG CTGAGGCAGGAGAATGG CATG AAC CCAGGAGG CGGAGCTTG C 

7 0 TGTGAGCAGAGATAG CGCCACTG CACTC CCGCC TGGGTGAAAGAG CG AGACTC CGT C TCAAAAAAAAAAAAAAAAAGACTGATATG 

ATTATG TTGGG AGGAGAGAACATATT CAAAT ACTGTTGGCTGGAAT ATGAATGGTT ACGAAGATTTTG CAGGGTG G CTTGGCACT T 
TCCAT CCAATCTG CATGG CTT CTCTT ATAGTAAATGCATTTCTAAGAATTCATT^ CAT AAAATAAAAACAGTCTCAAAGGAT ATGT 
ACAAACACCACAG CATATTTG CAATG CAAAAGAAAAAAATGG AAATAATCTAATT ATTCATT AACACGAAA 

AAGGT ATAATCAGAAAATGGAATAACTG C TTAACT AC T ATAAAG AATGAGATGATTAAG G CT GCG CACAAC ACCTCACACCTGT AA 
7 5 TCTCAGCACTTTGGGAGGCCTAGGTGGGAGGATCACT 
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TCCACACACACAAAAATTTTTTAATTAGC 

TTGCTTGAGCCCAGGAGTTGGATGCTGC^ 

TCAAAAAAAAAAAAAAAGAATGAGGAAATTCAAAT^^ 

GATAAGGATGTATCGTATGACCCCATTTTAGGAAATGGGGACAGAAAAA^ 

TCTATACTTTCAACAGTGGCAATCTTGGGGCACAGGTG^ 

AATTTCAACCACAAATATGTGTAACTTCTOTTAACAG^ 

TTTCACCATGTGTATTAGCATATTAAAAGTTCTGAGAA 

TGATG CAGCAAAG AG CCAACTATGAGGCTTGGACCACTG AGGAG 

TTTTTTTGAGACAGAGTCTTACT 

TCAAGTGATTCTCCTGTCTTAGCCTCCTGAGTACCT 

AGATGGGGTTTCACCATGTTGGTCAGGCTGGTCTCGAGTTCCTGACCTCGTGATC 
TACAGGCGTGAGCCACAGCACCTGGCCAAGGATTTATACTAGAT^ 
CAAAGGCAGTACTGTCCAATATGACGTGTAATTTTAA^ 
TTTTGTITTTTTAACTGAGACAAGGTCTCACCACGTTACCCAGGCT 

TGGGATT ACAGGCATGAGCCACCATG CC CGG CCAAAAT CAGTTTTAAT ATATTTCTTT AAT C CAATGT ATCCCT AACAG TATCAT C 

TCACTATATAACCAATATTTTAAAAGCATAATGAGATATTTTACATCTTTTGGTACTATm 

TTAGACTAGCCACAGTTCAAGTCTTCAACAGCCAC 

AACC CAAACCTG AATCGGATATTT AGGTGACTTAGGCTGAT AAAATAGT AGGAAAGG AAGCAAAG CAG AAAAGG CTG ACTTGG CCA 
GAGCAGGAGACCAGTCCAAGATATTCTGGATTTTCTATTGTA^ 

TGAC CAAAGCAGTACTTCAGAAAG ATT AACAGTTTTTG CT AAAAT AT ACACGT ACAACTT CAACATG CAATTTTTCCCC CACATTT 

TTATTACATAAAAGATTATAAAATGAAGGAAGAATAATCTGTCACCCAGCGATTA 

CTTCTGAACAGTTACCACTGAAATATTTTTATTTCATG^ 

AAATTTAACAGCCTTTACTTTAATATTTTCTAACAAGAGOT 

CACTCAAGAC CCT AT ACTCTTGAAAT ACATT ACAAAG CATTTT AATTACT AGT AAGAAAACTAC CTACAAATCTTAAT ATACATAA 

AACAGTAT CAAAACT ATCCAAGTCAGC CAGATGTGGTGGCTCACG CCTGTAAT CCCAG CACTTTG GG AGGCTGAGGCGGG AGGATC 

ACG AGG T CAGGAG ATTG AGAC CATC CTGG CT AACACGGTGAAACCC CGTCTCT ACT AAAAAAT AG AAAAAATT AGCTAGG C^ 

GGCGGGCTCCTGTGGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATGGCGTGAA^ 

ATGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTCTCAAAAAAAAA 

ACAGATTTCTCACCTTCAAAATTGAGAGTCTATCTTCTCAACA 

TCAAAAAAGGAGGTAAGAAAGCAGGATACGGGAAGGGCACCGCAGGCATAA 

TGATTCTT CAG CCAAAAGTCAGAAGTCCAGTG ATGT CATTAGGAC CAAAAGCATTATATG AC CTTCAATTT CGG C AGCT AACTGTG 
CTG CAAGTCAAGAT ATTTTCTTT CTTTTT CTTTTTTT AAG CAGAAAAATT AT CTTAAAGTTCACCATG TTC CTT CT ATTTTTT AAG 
AT AAAGAAACG AAG ATT CGGATTG GGTTAGTGTGG CATCCGTGATTGAAT AGGTAC T CAGT AGTAAAACCAGT ATCAGACATTATT 
T ATAAAAAATT AGAAGC TGAAACTTAATCATT C CG CTG CTTCTCTATGGCTTTCT ATACTT CCCAAAT T ACCCACAAT AGATGT AT 
TGCCTTTAAAATTTTTAAAAAGCTGCAAAAACTG 

CAGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGTTCGAGATC^ 
TAAAAATACAAAAATTAGCCAGGCGTGGTGGCTCACGCTTGTGACCC 

CGGGAGGTAGAGGTTGCAGTGAGCCAAGATCACGCCACTG(^CTCCAGCCTGGGCGACAGAGTGAGACT 

ATAAATACTACTGAATACTTCCAAAAACAGTCTCCTTGTCCCTTCTTGCCTAAGGTCTCACTCTGCG 

GCCTTCCAAAGTACCCGCATCCTGAACAGCCCAGCTTCCCTTATCAGTCTTGTA^ 

ACATAACTACAACTAGATAATTATTTAATGTCTATCTTCTCTACCCAGGCCTTAAGTT^ 

TGGCTCACACCTGTAATCCCAGAAC!TTTGGGAGGCCAAGGCGGAAGGATCGCCT 

AGGCAAAACCCTGTCTCTACTAAAAATACAAAAAGGAGCCAGGCGTATTGGCGGGCG 

AGCCAGGATAATCACTTGAACCCAGGAGG CGGAGGTTG CAGTGAGCC CAGATCGCACCACTG CACTCCAGCCTGGGAAACATAGCA 

AGACTCCGTCTCAAAAAGAAAGAAAAAAAATTCTCAATGCTGTTTTAGGCTCTTTGCTAGACC^ 

GTCTATAACCTATAACCTGTAATTCTATAACCTATAACCTA 

AAAAGTGAAGAACAC^AAAAGACAACCTCAATTAATTATATCATTTGAGCACTTAGG 

TATCATGGGGGTTAGTGGGGOU3AAGCTTTTCCCTCCTCTAAGCTAACTTGAGTTAGGGCT 

GAGAGCCACCCAATTCCCACTGTTGGAATATAAAATCTATACAGCAATGACTCCCAAATC 

AAC C CCAG AC CCAT AT AGACAACT GC CAAGTGG AAATCCCT ACTTGGCTGTTT CAGATAT CT CAAACTCCATGTG AT C CAATACT A 
TTTT CTT CTACAAAC CAG CTCT CC CACCT CATTTTT CAT ATCTG T CACCCACAT CT AC C CTGCTTT CCAAGCT AG AAATCCAGG AG 
TTGTTCTTG C CAATCCCATTCCTTCTCTCATCCACGCT AGTTATCGCCT 
TAACCCATTTATGCCTGAGGCTGCAATTTTTTGAATm 

CAOSTGCTTAGCGTTCCAATAGTGGAACACTAGGCATAAATGGGTTTAATCCGGGCTAGGCC^GAAAGACCT 

ACATCTTCAGTCTTCCTTTCTTCAAAATGATTTT^ 

CTATCTCTAGK^ATACTCCTGTCCTCCTTCATA 

ACTTTAACAGTATAACAAATCTGTGCCATAAGAAAGGAAGGCTCATCAGAAATATAAAACT 

ATCAGAGTAAATAAAAG AT ACT CAAAGAT AATTTAC TCATCT ATT CAATGT CT ACT CATTAGTT AT AT ATT AAACTGC CCAT ACGA 
ACTGGCTTAGGGACCACAGGGGCCACCCTGAGATGGAGTCTACTGATC^TCTGATCT 

AAAAACACT ACTGGGG ATTTC CAGGCACAT C CAAG AG CTTTGAT T AGGGAG AGAAACACT ATT CATCAATTCTAAGATGCAT C CAT 

TTTTCATATTTTAGCATCTCCGGAATTGGGGTGC^CCTTACAATCAATGGCTTCTTACAA 

AACACTACTGTGTATCCTACAATGGACATAACTCAATGCAAGTCAACATTTAA^ 

TATGGACCTTGTGATGCTAAAAAGACATCCTAAAGCCTAAAAACTAGTAAAAGGTGGGAAAGTAGAAAAGGAAAAGAAATGGTTCC 

TACTTCAAAATTTAGTGTGATAGGAAACCCATTTTCCTTrCTCTCCATGGTTCGCTGCTC 

AGTTCAACAGCTACTGTTAGGCTAGACAGGAGCCTTTCCCCATTTATTCCTTTTAATATGCAAGAAAAGTCCG 

CTG CAAACT AGT AGTGAAACTAAT ATTTGTATAAAAACTG CT ATAAAGGC CAGC CACAGTGC CTCACCTGTGTAACC CCAGCACAG 

GGAGGCTGAGGCGAGCGGATCAACTGAGGTCAGGAGTTTAAGACCAGCCTGGCCAAC^ 

CAAAAATTAGCCAGGCGTGGTCCCAGCTACCTGGGAGGCTGAGGCAGGAGAATCCCTTGAACCCAGGA 

CTGAG CT CACGCCATTGCACT CCAGC CCAGG CGACAGTG CG AGACT CTGT ATCAA 

AGAAAGAAAGAAAAAACAATAAGGTATTAAAAAAAAAAAAAAAAAAAAACAGGTAAATCTGAGGC^^ 

ATTTAATGGAAGACCTAGGAAACCTCTGAGTGGCGAAGTGCTAAGAACT 

TAGAT AAAGGAGAAAAGTGAAGGCAAG AAAATCATAACATTAGT CCAGGTG GG ATG AG AACT TG ATTT AGAGT T AGGGGGTG GG GG 
TG G AGT CGCC^3GGTGGTG GGGGGAGAACTACGTGAAAGCCAAAT AACACT AGATTT GAAAATAAATT C CTGAAAAAAATGTGAAAC 
ATTAAAATT(ZATATGAGAATCAGAAAATCGACACAAACCAACAGCTATTAAAGAAATTG 

AAAAG AT CT AGGC CT AAATGGTCACATAGGGGAGGTCTT C CAAACTTTCAAGGAACAG AT AATCCCAGT CT AAAATG AT CCTGAAA 
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AAGAGAAAAAGAAGGAAAGCTGCCCAACTCGTTTC 

AAAAGGTTTGGGTTTGGTATGACATCTVACAAAAATAAAGA^ 

AGGTACAGATATGTAGAAAGCZAACTX3AATAAAAAAGA 

AAAGAATTGAGAATTATTCCAACAGAAAGGTCTGAA 

AACTTGGAGAACGCATATTAAAGAGGAAAGAAGAAACAG^ 

AAGGCCCCTCTTCATCTAGTTAAT^^ 

TTTCCCAGATTCTCCCACCA.CXIA.CAAGCACCAG 

AATTATATTTAACTAGAGACTTTA^ 

TACTGCCITGGACCCTGTCTTAGAATACAGC^ 

AAATCTTACAGAAAACTCAAGGAGAGGACTTAGAAAAAGOT 

C CAG AT AACAAGGGGTTAAG AAATGAGTGG CT AG CCAC CAACC^GGCAAAGGCAGGAAAAG CAG ACT ACACTTGG CAATCTCTAGA 
G ATG GATTGTG AATGACTAAATTTT AATTTCAATGTTGCGCT AGTTCG CAATTAACAAC CATGTGTT ACTTTT ATTACGAACAAGC 
TTTTTAAAATGTTAATAAGATTTGAAAAGGAAAAGCAG<^ , 
AACTGAAAAAAT CTTAAGAACATTTGTACTTTAAAGAACC CAAAT ACGACAGAGAAAATAACT CACG CTCCACTAAGCAGG AAAAT 
TAGAAGGAAAGGAAAAAAACCCCAAGGTGTTAACTAATAAGAAAGACAAAAAAAAAAAA 
AAGAAGCCAGTAACAGAAGGAATTTAATGAAGGAGCTCCTGCTAAGTGCCT 

AAGT AGAAAAAGTTTAAAGTGGG CT CCACGGAAAATG GAGGAAAAATT AAACAAG AACTG AGT CTGGTTCT ACTAAAAACGCATTC 
T ACIATTTCAAT CGGC CTTTCAT ACACTCAC C CCATTCCT AGAT CAG AGAATTGCCCATATTTT AG CTT CAAATTT AAAAATCT AAA 
TAATTCTGGTTCACATTCTTTATTTCA 

ACAAGT AAAT ACAATAAAGTAGTTGTTTT CAACTCCAG CAACAAAAT AGCG AT AATTTCAAGT AAGAGT AACTGAAAAATGTTC CT 
AAACGTAAAAGTGTTTGCAAACAGTTTTCGTTTTTTTTCTTCCTA^ 
GCTCCCITAGACAAGTTAGTTAACCrCTTTGCGACTCGGTTTCTTCCCCT 
AAGGATTAAATGACGTAAC^CAGGTGCCAAGTTCATACTATGTAOTCT 

G AGTTTGAC CAAATTAAAACAT CTGAGAG CATG AAAGAGTAC CACGCTCAAACG C CAGACCCT AACACACAG CATATTATTTTTCT 
TCCCTGACGCAAGCTCATTTTTTTTTCCTTG 

AG AAAGTTTT CCAAAACTGAAGT ACGG AAT CACCAGG C CACGG AGCG G AGGCCGC G CAGGACAACAGGTG AAG CAGCTC CG GGAG C 

TCGGGGGCCAGGGCCCCITCCCGCX3GGGACTCGGGGGCGGGCCGGCTGCTCCTCT 

CTCCCCTGTTCCTCCCCACAGCGTCCGGAGCCAAATCACAAATATCAGCCTCGCCGCTCCGGGG 

ATTTAAGGATTCCAGAAGTOTCCCCGGAGCGAC^ 

AAAGAGGGGCCGGCGGGGGACGCGGCGCGGGGTGCCGTGGGTOTCIGGCrGGGCGGTCCCXa 
AGCTGCCGGGGCCGGGCCGGGC^CCCCTGGCTCCXSGCCACCGCGCGCCCAGCrrGCCTCTGCA 

TCCCCGCTGa^C^CCCCTCCGG^ 

GGAGCGGGTGGGCGCGGCGAGCCGCCG^GGCGAAGTGGGGCCCCGAATC^ 

CGCCCCTCGATCGGGCCGTGCGATCTTCGCCGCGCGCGGACGGACGCGCCGAGCCTAAGCCT^ 

CCTCCTCGCTCCTGCCTCCrCCCTTCTCOT^ 

GACTCTTGCCAGGAGGGGAGGGGGCGGCXrCGGGTGCGGGGGCCTGGGGG 

CACTGGGAAGGCCCCCGCCCGCGCCGCGCGC^CCCGACCACGGGCGCCTGCCCCGCCCCGCCCCGGACACGAAAAGAGCCGAGCGC 

ACGGCGGAACCGCCCGAAGAGTTCGCGGAGCCCGGCX^ 

GCTGGTGCGGGCGGGAGTCGGGGCTGCTCCTCCGCGCGGGCCCGC^ 

TAGCTTCCCGCAGACGCCGTAGGTAGCACGCGATGGAACGCGGCGGAGGGGAGCGTGGACACGGCTGCCGGGCGCT 

GCGCCCTGGACGCCCGCTCCGGCCTTCX3GGGCCGGATATCCCCACTCTCGCGGGCTCTTTCGCGTGG 

CTCACACTGCGGTGAACGCCAGTCAGCTGGCGTGAAATAAGGTCGCrrGGGAAGGCACC^ 

CGTGGTAGTTGCGACAAAGAAAAATATAAGTCACATTGCATTT 

CTTGCCAGCGTGCCGAGGCGCAGGCAGGGTGATGGAGGCTCCCCXjGC 

TCGGGAGCGCCAGGCCCGGCX5CGCCCGGGCTCCTGTGACCTGCGAG 

CCCAACAGGACTGAGGTCAGCCAGCTATAAATACGCCTTCCAGCTCCACCGCGGGCTCACG 

GCTCATTCCGTCCATTTGGAAAATGCAAATAATTGTCACCACCTCTTTTCTTCC^ 

AAAAGTATTCGATGACCCGTATGTGAGGAAAGAGGAGATAGGCCCCTTTTTCACCAAATGTGACTTTAACCTCCTTAOT 
GAAATGT ATCAAG TTT AAGTCACTTAACAAAAGATGTT CTTTTAAAGGTG CATTTTAAAAAATCACTTTT AAAACTGGGT AAACTG 
GGTAAAAAGAAAGGGGGAAGGAAAACAATGTGCCAGGCATTTG 
GTATTCCCTTATATTCTC^TGTAAATGTTTGAATTCCCGTGTATrrTGTTT 

TTCTTCTTTTACCCGCCTTTGCCCTTAATGCTGCAGATAGCCTCTGTCATTCTGAATTATTTTTCAATT 

CTTTTTTTCTGGGCTCTTCAGCTGTCTCAGGTT^^ 
ATTACTTGCATCAACTAAAATTGGTCACTTTTCTTTCTCAAGTTT 
TTITTTGTTCl T r TT TTTTTGAGACGGAGTCTCGCTCTC 
CCGCCTCCTGGGTTCACGCCATTCrCCTGCCTCAGCCTCCTGAGC^ 

TTTGTATTTTTAGTAC^GACGGGGTGTCACCGTGTTAATAGGTGTTCTCGATCTCCrGACCTCGTGATCCTCCC 
CAAAGTGCTGGGATTACAAGCGTGAGCCACITGCGCCCAGCCCCAGCTTTAGTTCT 
ATTCACATTGATTGGTCAAAGCTCTCATTGACGTATGCTGAGCAATGTC 
(rTTTATAGTCTAGTTGGATTTTTAGACATATGAATTGGTCTATAAAAT 

TT CTG AAATGT ACAAATG AGT ATTCCCTGG CT TGTGTTATCT GC CAC TGTAGAAATGAG AGCAAAATTG AAG CAT CAAATTTT AAG 

TCGTGATATGTTTGCTTCTCTGCAAAAGACTCTCCTTTTCAAATCTTCA^ 

ACCCAGGCTTCAATCCTCAGCGCCCTGGCTACCAGCAGCTAGTGGGGCATGCTGT 

CAGCCCGTTTCTCCGGTCTGTCTGCCCAGGGAGACATCTGGAC^ 

CCACAOTAACATTCCGTAATGTGTTGGTATTCCT^ 

AGGTCACAAGAGAGGTAGTTGTTACATGCCCAAAGCAGCATTCATTAATTACA 
CCCTCTGCTCCACTAGAATATAAACTCCATGAGCAAAAGACrTTGTCAGTCTCTCTTTTT 
CCAGGCTGGAGTGCAGCX3GCATAGTCATAGTTCACTGTAACCT 
GTAGCTGGGACTACAGGCCAGCTTGACTTTGTC^TCTATTCTCAACTCTCT 

TTAAATGAGAGAAGAGAAATAAGCATACATATCCTCTAGGCATTTCT CAC CAAGACCCTTTTAGTGTTAGAGGGTGG 

CATG GCT ATCT TGGAAAG AG TGAAGGTTGGCTGGGAAAG CTGTGGATCACAT C CTC TTCAGTGAAATG GG ACT TCTG CCTTCAG G G 
ATCCAAG GTG AAT AG ATT AAT CCAAG CT AAT CCATT CAAAGTCATTTT CATGTTGT AAAGAT CTTAT AAG GCAAAAG G CAGCACGA 
AGTCTCTGGGT CTGATGAT AAGTTAGAGG CCAG TAAGGAGG CACGAGTG AAGGAATGTCATTGGTAG GGTGACCCTTT AGGT^ 
TTCAAACCCAATATTAAATGGGTAGGTGTTAGGGAGAATA 
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STTAGTAGTACCCCATCATGGCTTAG&AG 
ATATAATTCAGTGCACAAAATAGAGGAAGTTTAAAAAC 
ACTlX^rAAACAACTTGCTCCTTCTCC 

ATGATGGAGGATATTGCTGCTCTCACTTGGTCCTACCTAGTCCGTG^ 

TACCATAGAACCTGATCTTTCCTAGGCCTTGAGGCAGGAAGAATTATC^ 

GCCTCGCCCCTTTCTAAGAACTGAC^GCCCTTCTCrTO 

CTTGTTGTTTGGACTTAGCTG 

GCTCTTTGTTTCTAGGCTCCCTGTATCTGTA 

TCCCCACTGCGCTCCTG<^CTACCACCTGTTGTTTGCTCACTACT 



CACTGCCC TG CTTTTCCCTATC CTCACAG CACCTGGTTTATTGCACTTGGATTAl'l"!'!'!"!' AACC^GATG AAAACTAT AT ATAATAT 
ATATGCTCTGGACATACACTAAGTACATATTAAAGGTGTATTATATC 
AAGCZACTTTGGGAGGCCAAGGTGGGTGGATCATTTGAGGTC^ 
ACTAAAAATACAAAAATTAGCTGGATGTGGCAGCAGGCACCTGT^ 
1 5 CGCAGTGAGCCCAGATCTCGCCATTGCACTCCAGCCCGGGCAACAGA 

TTTTTTTCTG GCT AAAAATT AAT CCAAGTGT AATAGGGGTGTGTGTGT ATATAAT CTATTG AATAT AATTCAAATAGATTT CTAGT 
CTGAACTTTGTGTGTGTGTGTAGGTGTTTGTGTACATAT 

TGTGCATATG CGTATGTGTGCACACAT ATGTGCGCATATG CGT ATGTGCGCACACG TGCGT AT ATG CGTAT GTG CACACATGTG CA 
TATATGCGTATGTGCGCACGCAT ATGTG CACATGTG T ATG C ATGCAT ATGTGTGCATATGT AAGT ATG CATGT ACACATATGTG CA 
2 0 TATATGTAAGTATACATGTACATATGTTTGTATGTATACACACAAAAAACCCATAC^ 
GATTCTTCrrACTATCCCTAAACACTGATTCTACTCAAGAATTCCTTTG 
TATGATTACCTTGGGAATGACTAGTAAAAGGAAGGCAAGGTGCGA 
TGGATCAGGAGTGCAGATTCATGAACACCATACTGTGCCT^ 
TGTGGTGTCACCAGTGTGACCAGGCATACTAGTTTACCCATCGGATATGTTGAAA 

2 5 AGAGAATGGGAACTGTCTCATTCTCACTCCATTGTATCCAGTGTGCTGCTT 

ATGAATGACATACTTAATT TTGT TCT CT AC TGGTAGTTTGC CCTGGCGCATTTC'rTTT T ATT TCTAT T AG CCAAGG CAG GGTT AG G 
TGTTTTG TTTTG TTTTGTTTTTCCATATTTTTGAGGATTAATGAT AAAG CACTTAT CTTGTTGGAATGAG AGTCTACTAACAAATC 
CTGTACATCAT AATAACAG AACT AT CATCTAACAACTT GTT CT CAATAAATAAACTTTAAAACAACTTC CTTT ATT ATTTCAAAGG 
CATACATAGTGATATATATAACCAGTAACCAATACTTGTCAGTGTTTATTTT 

3 0 TACCTTAAAGCCCCTCTTCCTCCGATCCTGAATGCTGCTAGCTCT 

GTTGTTTAGATTTCTATTGTGAAACACCAGTCCCATTAAATCTGAAAACACTGCTATAGAGTAA 

TACTTTCTG AAGAATGGACCATATT (^TTCT CCATTTAATGC CATTCTCCTCCCT ATACTGCCTG AGTAC CACCTCTTC CAAGAAG 

TCATCTTTGACTACTCCATCCTCATCTCCTTAGGCTTTTCTCTCAATGGT 

TATGTCTTTCTrCACAAATAAGTTTTGATACCT 

3 5 TACCAAAATACACGTAGTATCAAAAAAGCAGTC^ 

GTACCACTGTCGCTATGGTCACGGTACTGAGCACCAGGGTTGCTAGGTTGCACAACTTTAAGGGGG 
ACACGAATGGTGCTCCCTGGAATTATGCATTGCAACATCTGGACGTGATCCT 

GGGG AGG CTGGT C CCTGGCTTG CTT AAAGAACC CG CTTTTTTATTGTGT AACAT T AT AT ACCTTT CAGGAC ATTTG ACAAGATATA 

4 0 TTTAAGCATTTGTT AATTTTGT ATT ATTAAATATG TGTGTT TAT AAAG AAAT AGATT AT CCTG AAATAAAG AG AAAACGTTGG CAG 

TGTTCTGTGGTGGCCTAATGGATGGTTATTATTTTCTTCCATAAGCTTTATCTCCCCCACAATGGACAATGTTATACGT 
TTTTTTTTTTTTTTTTTTTTTGAGAGTCTGCT 
CTCCCGAGTTGAAGTGATTCTCCTGCCTGAGCCTCCTGAGTAGCT 
ATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGATCTTGAAC^ 
4 5 AGTGCTGAGATTACAGGCGTGAGCCACTGTACCCGGCCCAATGTTACT 

TCAGCACCAAAAAAAAAAAAGAGTAGATATTATTTAAAAGAACTGGTTGATTG 
TTTGTCAGAATGAAGTCAGTTAGAATCCCATATAAGATATGTAATTAAAGTTTTTATTT 

GTAAAAAG AGTTATTCT AGG C CAGG CACAGTGG CTCACCCCTGT AAT C CTAG CACTTTGG GAGG C CAAG G CAGG CAGAT CG CTTGA 
GTCO\GGAGTTCCAGACCAGCCGGGGNNNNNNNNK^ 

50 

NNNNNNNNNNNNIWNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
55 NNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNOT 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNI^ 



60 

NNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

65 NNNNNNNNNNNNNNNNNNNNNNNNl^^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNIOINNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

70 NNNNNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNN*^^ 

75 
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NNNNNNNNNNNNNNbTNNNN^ 
NNNNNKNNKNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNKNNNNNNNNNN^ 
5 NNNNNNNNNNNNNNNNNNNNNbl^^ 
NNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNIWNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNl^^ 
10 NNNNNNNNNNNNNNNNNNNNNNNNN^ 




15 NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
20 NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNM 

NNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
25 NNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNF^^ 

NNNNNNNNNNNNNNNNNNNNNNNNN^ 
30 NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^^ 
35 NNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNKNNNNN^ 

NNNNmJNNNNNNNNNNNN^^ 
40 CGCTTGAACCCGGGAGGTGGAGGTTGCAGTGA 

AAAAAAG AAAAAAAAAAAG AAAAAGGTT AT AGAGCATC TCG TC CAG GG CATCTT TTTTTTTTT T TAATAAT AATAACTTAAAACAG 

AGCaTGATTTTTAAAGCTAAAATATTAGATTGTAAGAGAAATTTTACAATTGCAGAAAGTTC 

TTTG CTTGTAGGTCTGCTTTATATAAAGTTGAAGAAACGTTTATTG CCTTATGATAGTGAAATTATGCATAACTTAATTATCTCCT 
CAACCAGACTATGAGCTCTGTGAGGCAAGGATTGTGCCTCAT 

4 5 GCCTTGCTCAAAGTAAAGGCTCAATGAATGTGGAACTAAACTCAACTCTGTCTTCCTCC 

CACAAAGGAAAGGACTTCACATCAAGTGCCTAGTGGAGAGAGGTATTTTGG 

TT ACCTG GT AG AGTTGTGAAGTT CAATGAG AAAAG AT ACAT AAAGTG CCAAGCACTGTGC^ CAT AG AAGGTGCATG CACACA 

CATTTAAATCATTCCAAGCAACTAAAAAAGGACCTGGTATC^ 

5 0 ATCTTTCCTCCTAAACTCGCAACCTCATATTAC 

GCTT AT AGGG AT AATTCAACTAG AAAGTGGAGAGAGGTAAC CGCTT CTT AGG AGG AGGG AG CTG CAG AG C CATTTCCAGCCTTG C C 

TCACTCAGTAATCTGATTTCCCCTCCCAATTCC^GGGCATGTCTTGGCCCTGGTTCCAGGAACAGTCCTC 

TCCTAACATAGAGCATGAACCAGGGAATCAGGATCCCAAAATGGCTG 

CCAAGAAACTAATGCAGCTTAAACTTCAAGGCTCCTCCCTCACTTCCAAAGACCCCCATGG^ 

5 5 AGGTTGAGAGGGAAAGCTAGTTTGCAATCAGGGAACTTTT^ 

CTGCAATGGTGCAATCTCACCTTACTGCAACCTCTGCCTCCTGGGTCCAA^ 
TTACAGGCATGCACCACCACACCTGGCTAATTTTTGTATTCTTAGTAGAGATGGGGT^ 
CTCCTGACCTCAGGTGATCCACCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGT 
CTGTTATGTAAGCTTTTCTGATAACTTGCCTAAGTAGGCTTTAGAAGAAAGTGACCTAGAGTTCTGGGACT 

6 0 GATTTTGTCCTCATTCTAAATGAATACTCCATTTGGTGTTTGCAGGTACTGTAACCCCTTTT 

TCTTTTTTTTTTGAGACAGAGTCTCACT 

CAGGTTCGGGCAATTCTCCTGGCTCAGCCTCCCGAGTAGCTGCAATTACAGGCGCGTGCCACCACGCCTGGCT 

TTAGTAGAGACGGGGTTTCACCATGTTGGTCAGGCTGGTCTCGAACTCCTGACTTCATGATCCACCCACCTTGGCCTCCCAAAGTG 

CTGGGATTACAGGCGTGAGCCGCCGGCGTCCTGCCAGCTTTTTTTTTTTTTTTTTT^ 

6 5 TAGAGTGCAGTGGTGTGATCACAGCTCACTGCAGCTTC 

GGGACCACAGTCGCATGCTGCCACGCCCAGCTAATTTTAAAT^ 

GGTGGAGTCTCATTCTGTCACCCAGGTTGGACTGTAGTAGCACGATCTTGGCTCACTGCAA 

TCTCCTATCTCAGCCTCCTGAGTAGCTGAGATTACAGGCACACACCACTACACTTAG^ 

GTTTCACCATATTGGTCAGGCTGGTCTTGGGCTCCTGACCTCAGGTGATCCACCCACCTCAGCCTCCC^ 

7 0 AAAGATGGCCCCACAAAACACTGGATTCCAGGCTGAGTCAGGCCCTTCrrCCAATCTGTATCTACTTTTCTACAA 

CTGCAG CTCACCAG AAAGTCACAAGT AGGATGAGAGGGGCTCCG CAGGGATG TGG AGGTGACCACATGCAAG ATAGAATTG CCATC 
ACCACTGGCATTCCTTCTACCCCAGG CACACATTGGAT G CCTTGCAGTGAT CCTT CTTCCACAAATGCTGGGATT CTCCTG CATTC 
C(IAGTGCTTCTATTTGAAATAGTTTTCAGGATACCCTGAATTCAGGATAATACCCATCCrrG 

TTGAGATTCTTTTGGTTGCAAGTGACAG CAACCGAATTT AAACAAGCAGAAG GAAAAAGG CX1&GAACT AGACTTTGAACCAG AGTT 
7 5 AGATAAAGGGAATAACTGATTTGCAAATGGAAAGGTAT/rCTGTAAGAATTTACTTAATAAAATATTTGGGG 
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GTTTGTGAAAAACAGCACITTGTTCT 
TTTGAGATGGAGrrTTGCTCTTGTTGCCCAGGCrt^ 

AGTGATTCTCCTGCCTCAGC CTCCCAAGTAG CTGGGATTACAGG CATGCGCCAC CACGCC CG^GCTAATTTTGTATTTTAGTAG AGA 
T03GGTrTCACTATGTTGGCCAGGCTAATCTCGAACrCCCGAC^ 
5 ACAGG CATGAGCCAC CGAGC CCGACCCAAATGGTT ATTTCTATG TAAACAGACAGTTTTTAAATAATGTGAAAATACAGGAGTGTT 

TTGGAAATACCGAGGTGGGAACACCGGCCCGCTGTGGCX^^ 
AGGCTGGAGAGACACAGCGGCCTGGCATATGGGACITCT 

TAT AAGTTGTGCAGGTCCATT AGATGGTCAGATTT ATTTTTGG AGAGATCCTG AATGGGAA GG GGTG GAT ACGGAGCA 

AAATGGAGACAAGGAGATCAGTTAGGAAGCTGCTGAGATGATG CAGAGG AGAGATGTTTTTTAAAGGCAGG AGT C CTGAGGATGAG 
10 AGGAGAAAACAAATTCAAGACAGGTTCAGGAAGTACAT^ 
ACTAGGATGATCCCAGATATTTAGTTTCAATGACCGG 

TGGCT AGG AGGATGGAGAACTAAGTCAGTGG CAGCCCTGGAATT CCT CT ATGGAAGAGATT CAGTCAG AAGGCATGTGGT ATTTGT 
CTTGAAATTATAGTTACAATTAT AAGCATTCT GATGG CGG ATTAT TTG 

AAC^CCTITCTCAATAGAAGTTTAATCACCTTGTT^ 
1 5 TTACCAGCCAAATAATAAACTGAAGACCTCGTCTCGCATTATATTTTCT 
GTTTATTTCTGTTCTACTCCAAATGACTTCATTTCTAGA 
GGGTAAAATCCTCTCATAJGCAAITiTTTTT^ 
ATT L T1"1T T CGAGGTAATCTGTTTCTAGGAA^ 

TCAGACCirGAAAGAGCTGTGCAATTACAAATAGAAAAOlCATCTTATTGT^ 
2 0 AATGTCACATCCTGAATTTGGCATTGGATGTTGTATTAGTTGAGATTC 

TCAAGAATAAAGGGAAGGCCAGGTGCXK3TGGCTCATGCCTGTAATCCCAGCACTTT 
TCAGGAGTTCGAGATTAGCCTCGCCAACATGGT^ 
GCTOTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGTA 
ACTGTACTCCAGCCTGGGTCACAGAGTGAGACTCTGTCTOVAAAAAAAAAA 

2 5 AGCAAAAGGAAAAGATAAGTGTGGAGCCTTGGGAATGACAAGATCCAGAAGCC^ 

TCAACTCTTTGCTTTCCAGTGCCIX3TTGGCTT(^^ 

GTTGGACTCATTCTTCCAGCTCAGCCACCAGTAAGAGAAGAGA 

ATTCTGATTGTTCTGGCrTGGTTC^CAAACCCACTTCTTG 

ATAGACGCACATACAGATGGCCAACCATCCAGCCCGTGAAGTAGGTGGGCTTGGGTGGCT 

3 0 GCAAAAT CTGGACACATG AG CCTG AGG CT CAGAGGACAGCACTGAACTGAAGACATTCTG TCCG ATGGCT CAGAAAGTCA 

GCCTTGAQ^AAATTAATGGCAGGATTTATTCCACAAGCTCA^ 

ATGTGTGTTAATAGAGGCTT AACTAGG AT CAGACAGTGAAGTTC AT AGTTATAAAAT AAATTGTTTTT ACAAGA^ CAGG CCAGG 

CGCAGTGGCTCATGCCTATAATCCCAGCACTTTGGTAGGCCGP^^ 

CCAACATGGCXSAAACACTGTCTITTCTAAAAATACAAAA^ 

3 5 GGCTGAGGCACTAGAATCACTTGAACCCGGAAGGCGGAGGTT^ 

GAGTAAGACTCTGTTTCAAAAAAAATTTTTTTAATAAAATAAATAAAAATAAATAA 
GTGCAGTGGCTTACTCCTGTAATCCCAGCACTCTGGGAGGCCMGGCAGGTAGATT 
CCAACATGGCGAAACCTGTCTCTACTAAAAATACAAAAATTAGCTC 
GTTGAGGG&GGAGAATCGCTTGAATGTGGGAGGTGGAGGTTGCA^ 

4 0 AGTGAGACTTTGTCTCAAAAAAAAAAGTCAAAAAAAAAAAAAGTAAACTCCACAAGAGCAGA 

TTGTGTTGTCAGTGTCCAGAGCAGTGCCTGTTCATGTCACTAGTAGGTGOT 

AAG CAAACAATATT T ATT CT GTC C ACAAATCC T TAGG GGGGCAGAAATTATGTACAAGTTAATTAT ACCAAAT T CT AATTTTTGGT 

4 5 GTCTG ATTCAGGTGCTGAAAAT CAAAT ATGGT ATGT ACAACTT C CCTTT ACTCTT CAGT AATTT CT AGAAACAG AAT AC CTTGAAA 

A71GAATGAGTTTGATTTTGTTAAACATAGTACCCTCAGAAGGGTCTTCATTATTAAAAA 
ACTCTGAACTTATTTGAAAACAAATGATGTAAGAGGAAATATGGCAACT 

AAATTAATTGGAAAT ATTTT AC CG CGAAGT AGCT AAGTTAACAA^ CG AGATGAG GTCTTTGGTTTAT 
T ATTTG C CTGGT AATT CC CTTOTGT ATG CAGAA(^TT CGTC C CTTGT CT GTTT AAACATGGTGATT ACACTG CTA 

5 0 TTGAGCATCAGGGCAAGCAAGTTGTGTTTATTCCCACAGAATGTTTTGTTTTGT 

AGCTACTTCTGATGTGACATGAAAAATTTTACACAGAAAAAT^ 

TAGAATTCACACAG GC AAAAAAAAAAACACATG CAAAATATTT^ C TTT AACAGG CT ACAT TC CACTTCTGAAATGTT 

TCCAAAAACAAAACAAAAAATCCCAAAGCCAGAATACTGAGTTAAAAAGT^ 

CTGCAGATATAAGCTGCAGATATAAGTCACTCCATTGC^ 
55 NNNNNNNNNNNNNNNNNNNNNNM^^ 

NNNNNNNNNNNNNNNNNNNNNN^^ 

NNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNW 

NNNNNNNNNNNNNNNNN^ 
60 NNNNNNNNNNNNNNNNN>ftINN 

NNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNKNNNNNNNNNNN^ 

NNTTTGTCTTTTTGGCAAATAGGAGTCCCTCAGAATCTAGACTTGCTCTGAAATGTAGAAAAATTG 
GAAACaSTCCTTTGAGATATTTTACITTTCTTOT 

6 5 GTGTGTGTGTGTGTGTGTGTGTAAACCCTTTAAAAAGAGATTTTGGAAACTGAATTCTGGGAACGT^ 

CAAGACC^CCCCTGATAATTGAATAGATAGTTTTTGGAGA 

TGCATATTTGAACTTTCCAAAAATAACAGAATGCTCCAAGGTACTTGTCTGCAGCT 

TTTTACTCTTGCCTGCTGCTTTGTATTCACCrrTCTCACAGAT 

TC CTGATGCT CC CTGGGG GAG ACTGGTGG CTT CX3TGCTTGGAAAAAATAG GGGCTT CCTG CAG CTAGGGACATAGCT AGCTGCTTC 

7 0 CTTCTCCCTCATGGAGAAGAGACAGGTAAGATGGCAGTCCCCT^ 

AT AAAACATTTCAAACATG CAAAGAGAAAAATCCAAAACAGGAGTGAGGACAG AATAGAG TGAGTTGGAAG^ CCTAAATATCG 
TGCTCAAGTGAGAAACATCCACGCAAGTGCCTGCTTCAATG 

GCG ACTACACTGTTTCCAAC CACAGAAT CTAAGCTG CCTTCTTT ATCACCAAACAG CTTTAAG ATG CTTAC CT AAC TCAAACTATC 
ACT AGT AT AAAGT AGTTTG AAGG C CCCTCT ACTTCAAT AAAT ACTT AAAT AAT AGCT AGAT CAACTCAG CT CAT ACATTTTCAAAA 
7 5 TATTGGCCAAAACAGCAAATGCATATATATCAGTGCATCTGTCCCTTCCAGCAGTTAACACAATT 
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TTTAAAGGAACATTTGAAAAGGATGA 
GTAAGTTTCCTCTTCCCCACCCAGAGTTTGGGCATGTGAG 
TTTTCCACAGGCCACCAAGCCTATTTGA^ 
ATGCACACGACTGGACACTCATGCTAACGAACCCTTAAA^ 
5 ACACACCACAGAGCAAAACTGGAAAAACCX^ 

TGTTTCAGACTGAGAGAGGGTGGCTGACAGGGCCATCTTCT 
CAATTGCCCMTGGCTCCCAGAAGACATCTCTTTGGACGTCC^ 
ATGCGCAGAGGTAGCTCCATGACGCTAGTAATGGAATAACACTTCCTT^ 
CTCTCAGCTATGACTCTAAAAATACX:CTGCAGCGAGTGCTGT 
10 CAGAATG CTTCAGATTCTCCACAGCrGCTCTAAGCTCTGTC CTC CT CTGGTGACCAAAGCAAGTGT ATCG CTTTGGGAAAG CTGCT 

TGTTGCTTTTCACTTTTGAGTTCTTGGAATTCTGCT 
TTGAGTCTTCATTTGGCTCCTAGTCTGAACTATC 

GGCTCAGTCG<ITTCXTCTCTCTCCCCATTCCCCTCAGAAGAAACATCAG CCTC CCGAATCCTTTGTTGCCAGCAGCAGGAGCAGCT 
CCGGGCCTCGTCAGGACAGGCGGCCTrcGCC^ 
15 AGTAGAAOSGCTGCCCCTCCAGATTCACATGGCAGAC^ 

G CAGATTTCAATCACTITTTTAAGAAAACG CCCTGGCCAGG CG TGGTGG CT CAGTC CTGTAATC CCAGCAC TTTGGAAGG CCAAGG 
CAGG CAGATCACTTG AGGTCAGGAGT CCAAGACCAGC CTGG CCAACATGGCG AAACTCCAT CT CTACTAAAAATACAAAAATTAGC 
CAGGGGCATGTTGGCATGTGCCTGTAATCCCAGCTACTCGGGAGGTTGAGG 

CAGTG AG CTGTGATTGTG CCACTG CACTC CAG CCTGGGCAACATAG CAAG ACTGTCT AAAG CAAACAAACAAACAAACAAACAGCC 
2 0 TTGCCCCATGCATTCCAGAGAAAAGGCAACAGCCTATTTCTCCCACTATGAATTCT 
CTATATAAAAACTAAGATTCCTGAAGAACAGAATAA 
GAAGCTGACCAATTCATGCTATCTAAAATAGCATGA^ 

TCTAAAG AAAAT CATTAGTAAACTAACAAACACATCAG AGAAGATTGTACT CACTAAT AATTAAAGAAAAAG CAAATT AAAACTCC 
CCTGACTCCAGGCGCAGTAGCTGATGCCTGTAATCCCTA 

2 5 AGACCAGCCTGGGCAACATAGTGAGC CT CTG T CTCTACCAAAAAAAAAAAAAAAGAAA^G AAAG AAAG AAAAGAAAATTAGTTAA 

AAAAATTACAAGAAAAACTGCCCTGAGATACCACrAGACACATATTAAT 

TGTCACCTTTAACCT AAT ATGCCTGT AG CCAAAAAGCTTAAAAAAAAAAAAAGTATTCAAT CAAAAATCAG CAATG CCAAAAAGG A 
ACAAGGCCTACTCGTGCACTGCTAGAGACATGTGAGTGAGCTCAGTGTTCCATAGGGAATCTGGC 

AGATGTG ACTACCCTGTAACCCAGT AATG C CACGCTAG AG AATTTT ATC CTGCAGAAG CAACT CTG CAGAACAACAAAACAACCTG 

3 0 TGCTCAACT AG AAACTGAAAACT AGTAAAATTTOT CT CAATAGGAAG ATGATT CAGT AAATTTT CATGGTGTATTTTTCCAATGG A 

ATGCC^CCATTTAAAAAGAGAGATGGGCCAGGCATGGTCG 
AGTAACGCTCCt3CCX3AGCCCNNNNNNNNNNNNNNNN^ 



3 5 TGTCACCCAGGCTGGAGTGC^GTGGCGCGATCTCAGCTC^^ 

TCCCTAGTAGCTGG^ATTACAGGCGCCCACCACCAT^ 

GGCCAGGCTGGTTTCAAACTCCTGACCCCAAGCAATCCACCTGC^ 

CGCCCAGCCCAAGAGATGAAATTCTATATCCCCIATGATCACA^ 

ATT AAT ATTTGTGGGG ACTAC CATG G AC CAAG CAGGATGCCAG ACACTCT AT AT ACAG CAG CT CATTT AATT CT CATG AAAACCTT 

4 0 ATAAGGTAGGTAAGCCTTTCTCCCACTT'CACAGCTGAAAAAACTC 

AGTAAGTTGTAGGCTGGGAATCTCTGTCCATGTCTGTAAGACTCCAGTTCTATGCTAATTTTTCCCCTACCCCATAGAAGA 
ACTGTGTTTACTTGAAATTGATCTGGGTTCATGGGTTTGTACAGATGTAGCCCTCTGAGTTTGCT 
GGCCCTGGGTTGGCAATATGCCAGTAGGTCAGAGTCTGCCAGGAAATCACATTGGCTGGCATC 
TGAAGATCCCTGCTGGGGGGCAGAATTGGGAGCCATCTCCAGGA^ 

4 5 GTATCCCCCATGGAGACAGAGAAATCACCCCCTTGAGTTGA 

AAAAGGGCACCACGCTAGCCCCATGTGCAAGGCTCrGATGGTCCAGTTGGGGACGTTATACCT 

AGG CCT CAAG CTGCTGGTGACAATTTAAAG ACTATGCAAAAAG ATGGACATCCAAGAT CTAACTGGTGAG AATG GG CTTTAGGG AA 
GACAGAAAGGCATCTCTGGAATGAGTCAGCCTGTCAATGACTGCAGCTGTGGGTGGGAG 
50 TTGCTTGTGAGAACHTTGGGATG AGTCAGGCTGGGGAG AGAAGG TGT C C AGGG ACTG AAGACCAACAAAT GAG AAAG GGACAGAAC 
CTCAAAGATGG CTCATAGGTGAGATGGGGCTGAGGGTGGGAACCCT CATAAGGAGTTGTGCCCAGTGTCTTCTTGGTGCAAGGTGG 
CTTCATAGTGGAGGAGGCCTTCTGCCCACCCMCACAGGGGGCAAGAGGGATGGTC 
CAGGGCCrCGCTATAAAAGGGTAGCGACCATGGACAGTGCTGACTCCAAGTCCCAGC^ 
CAGCAGCCCCTAGGTCTGCCTACAAAAGTTATATTACTGCCCAT^ 

5 5 AGCAAAAGAG GGATGGGTAAG ACATTTTT CCT CCACT AAGGTCGGAC C CTTTCCATTGGAACT CAAGTGT CG ACG CCAG ACAGAAG 

CTTAACTTTCACCCTGACCCACAGTGCACTTCTCAGGCTCCCAGCCTTCACT 
TCTACTTTTGAATTGCTCACATCTGTTGCTTTGTTCACCrrTTCAAGATTCT 

TCTTGTACTGCCCAAATGCCAGGAATATTTGGTAAACTGAGTTAGTCTAAAGAATGTGCTATCCTGAGATCT 
CAGCACATCAGCTGGGAGAGCTACCCCAGATCTTT 

6 0 ATTTGGAAATATGAAGTGATAGAAAGAGGACCAAAAAAAAAAAAATCCCTCAAGTCCCTCAAA^ 

CCTAATGTTAAGACGATGTTCCGGCTGGGTGTGGTG 

GAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTCAAACCCCGTCTCTACT 

GGCGCCTATAATCCCAGCTACTCAGGAGCCTGAGGCAGAGAATTGCTTGAA 

ACCACTGCACTCCAGCCTGGCGACAGAGCAAGACTTTGTCTCAAAA 

6 5 GTT GACAGAG CAGTGAGTAT CATGCAGGCTG GTTTGGAGGGAGACCTAACGATCTCCATGTTG ATC CTCT AAAGGGCAGTG CTATG 

GGCCTCTCACAGGCTGGACTTCACTCTCrGGAACTTATTTGTAATATCAGCCT 

TCTCTCTTGGACTTTTGCTGTTGTGACTCAG CTGAAATTGCGAGGCTTGCAGAAG CAGACCCACACTCACCTTTG CCACGATTCCC 
TCTGCATCTGACACAGACCTCTCATCAGCCTGCCCA^ 

CTTG G ATCCTTC CAGAAGTTGGGT AAC CAG ATGTG CAG GCAGTG CC TCAGGACTAG ACAAGCCATGG CTTGAATGTA^ 

7 0 GACCCCTGTGCTGTGCCTGACACAGGGGAGCTATGACTCTGCAGG^ 

TTGGAGG03ATGTATTTAAAACACCCTAAAAGACAACTAGAGAATCT 
TTGTATTCCACAAAATTCACGTGCAAGAAGGCAGAGTT^ 
GGGAAACTCCATCT CAAAAAAAAAATAAAAAAAAAAAGG AGAC CC^ 
ACACACACAGAGGGGAGACAGGCTTGTCACAATCGAGGCAGG^^ 
75 TAGCAACCGCTAGTAGCTCGGAGAGAGACTATGGCCTCCTGTC^ 
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ATTCATTTCTGTTGTTTGAGCCATCAGTTT 
TCCCACAGATGGCGGGGTATGGAAGCGAAGTGTGGAACCAAAA^ 
AAGTCTATAAAAATTATTCAAGTAAAAACTGTA 
AAATAGTAGAGGTGTCTTAATTTTCCCAAATCAGCCAGGCGTGGT^ 
5 GGTCAATCACCTGAGGTCAGGAGTTTGAGAC 

AGGCATGGTGGCATGTGCCTGTAGTCCCAGCTACT 

TGAACCGAGATCATGCCACTGTACTCCAGCCTGGGCAACAAGAGCAGAAA 

TCGAGATTAGGCAATGACAGATTGACATTAGGGGGCTTTCTCAGGGATGA^^ 

TAGGCTCIACCATCATGTGCCCACCCCAACCCAAAA 
1 0 CAGAAATAAGTAGCACACAGTTAATAGTTAATC CAGGG CTTATCTC CTGG CAG CTATATATCAGG AAAGGAC CACTT AGGAAAATA 

AAATGGTACTCAGTCTTGCTTTCCAAAGGTGCrrACCAAGAGAAACCCTAT 

TGTTTGGGGTAACAATTTGGAGGGGAAAACCTTCAAGTCACTCCAT 

CCTCTCAAAGCCAAGCATGA.TCTGCTGCTTTTGCCCGCT 

CACTCTGTTGTCCTTAACGAAATGGAAAGTAACAAAGCGAC^ 
1 5 GATGTATCGAGGCATCATGGTGTTTGCCAGCTTTTAAAAATGATC^ 

TCATCTGGCTCCCACTTCCTCCCTCCTGCCTCTTCTCAGAAAGCCA^ 

GTCGTGCCAAGTGTGGCCCAGGGCTTCGATAAACTTGTCGCCAGCCTCCACGGGGAAA 

GATTGATGTAGTCTGAAATG AAAAGCAGAGTG AAGACC CCGCACCCAGTGGGGGCAGC 

ATCAGTGG GCAGAAGCAGGGGG TGAAAT CTAATTGGTTGTTTGT CATCTCACAG ATGAAATACT CAGTTACATGACTGATAGTTTT 
2 0 CCATTTTCCAATCATTGCATTGATGTTGTTm 

ATTGTCATCAACCTTCTGCAGGGAGATTGTCTCCTCTC 

CAATCCACATTT CCAAACAT CTATGCTTAGTTTGGAAGAATG TGGGTC CAACGTATTTAAG AGTTATAG AATTTTCTTTC C 

CATTCTTCTGTCATGGGGCATTTGTGTTGACAACAAGTTTTTACTACTATAAACAGT 

CCTCCTCCTTCCCTACCCCCAACCAAGGAGATAATC 

2 5 ATGGATGGTGTTATTTACAAGTATGTCAATGTC^ 

TCAATATCCAAAACATATATAGGGAATGAATGCATTAACCAATCAATCAGTTCA 

TCAG AAATTGAACT ATTATT AT CTTTTTGTTTTGTTTTGTGGAT G AGAAAATTGAGACT CAAAG AGGTGTU^GGC CC^GC CAGCAT 
CCCACAGCTGAAGCAC^GTGGGTCAGGTTCTGTGCrrCCCT 

C CACACTGACTGTAGAACATCT C TAGATTATTCTAGAAAAGCTCCAT CCQ3TTTACAAAG CCTT CAT CCCTCCTGAGAAGTGTACG 

3 0 CATTC CT C CATAACT CTT CCACAGTTGG CTTT'IT CTT AAGTG ATTTC CTGTT CTT CTTGTGG AT AGT ATGT CCATTGATT GT CTT A 

GACCACGTCTTACTGAGTGCCAGTGTGCCATGGGGGTTACAAAAT^ 

CTATT AAGAAATGTTCAAAC CTGGGGCAAGTGGCGT AACTT CTCAAGG CCT CAACCTTCTT C ATCTATAT AATGG GAAT AATTATTG 
T AC CGAACT CATGGGGTTTTGAGTG GATT AAAT AAGATAATGTAAGTG AAGT ATTTGG CACAAACACTCAAAAATGTT GG CTCTT A 
TTTTGAAAGTCTTCATAACTGATGCAAAGAGGGAAAGG^ 

3 5 CTCTGTGAAGAGATAATATTCCCTATTTATTTATCTATTTATTTTAT^ 

GACGGAGTCTCACTCTGTCACCAGGATGGAGTGCAGTGGCATGATGTCGGCTC^ 
CTCCTGCCTCAGCCTCCCAAGTAGCTGGGAGGGCAGGCACGCGCCACCACACCCAGCTATATU ^ 

GCTTCACCATGTTGGCCAGGATGGTCTCGATCTCTrGACCCTC CCTCCCAAAGTTCTGGGATTACAGGC 
GTGAGCCACCGCGCCCAGCCTATTTATTTTTTGAGGTGGACTCTC^CTCT^ 

4 0 CACTGCAACCTCTGCCTC CCAGGGTTCAAGCAGTTCTCCCACCT CCACCACG 

CCTGGCTAATTTTTGTATTTTCAGTAGAGACZGGGGTT 

CCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCGCGCCCAGCCTAGTATTCCAATTTTAAAGGGGAGG 

GACTGGAGAGGTTCCATCACAGAAGTCATAAGGAGTAGCCCCAGTCCTGATC 

GCTGTCTCCAGCTCCTTATGAGC!ATGGCATTG^ 

4 5 CCTTTTGTGCTTAATTGTAGCTTGTCATGCAAATGCTCTT^ 

TTAGTAGGCAGAAATGCATTGCCTTGCCAGAGCTAAGAAAACATCGTTCTC 
ACTCTGTGCTCTGCAAGAACTTACTACTATCrGGTATGGGCTGACCCGTCTCCACCTG^ 

T CTTT CTCAT CACT CCCAGGTGGTCTCATGGTGC CCTGGACTTT CTCTT ACAC CTCCATG C CT CTGTG CAAACCAGGC C CTGG GCT 
TGAAAAACCATCCCCACTTTCTGTACCTGAGAGAATCTCrrcCAATCCT 

5 0 ACATACCCCAACCAAGAGTGATGAGATGGCCTTGCTGAAGGCTT^ 

TTCATCCATCATCTCCTTTTTTTTTTTTTTTTTTTTTT^ 

TGGTCACAGTTCACTGCAACCTCAACTTCCTGGG CTCAAACCATCCTCCCAC CTTCAGGCTCCTGAGCAGCTGGGACCACAGGTGTG 

GGGCTCCAAGGGGTCCrrCCTGCCTTGGCCTCCTAAAGTGCTATGATTATCAGGCGTGAGCCACCAAGCTGGG 

5 5 CTTACAATAAACT AT AAAAATT AAATTAAATGG TTCAAACAACCTAATGTAAATAGTG CAT TG CCTT CTG CTTACTGGCTTATTCA 

CAGACTAAGCTATTAAAGTGGGCCTGAATTGGCCA.GGTGCGGTGGCTCACGCCTGTAATCCCAG 
TAGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTGA 

TCAAGATAGCATGCCACTGTACTCCAGCCTGGGTGACAGAGCAAGACTCT 

6 0 CACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGG 

AACATGGTGAAACCCTGTCTCTACCAAAAATAOVAAA^ 

GCTGAGGCAGGAGAATTGCATGAACCCCAGAGGCGGAGCTTGCAGTGAGCCAAG^ 
AGCGAGACTCCGTCCAGCCaSTGGTGAAGACAAAACAAAAO^AG^ 

T CATTTTTCTGTTTT AAAT TTG ATT C CAT ATG ATT TGGAG AATTAT CG CTTTGTTT TAAAATTTGGT AG CATGGG T CTGATACTAA 

6 5 CACATTTTTGTTTAAGTTGGTATTTTT 

AGTCTGTGCATGGGCCTTTTCCACAAGGAGTATAATCTAGCTCTTCATATATTTATTCCTTC 
CTAATTTTCTTTGACTCTCTCTTGTCCGAAGCTGAATTACTACCCTCACATTTAATGGATTTTATTGCT 
GTCCATCAAATTG CAGGTGCTCTGTGTTGAAAAGGGGCTTGGTACTTTTCTAGAGCAATG 
GGGTTTGTTGAAACACAGCGTGCCAGGCrCCCCTCCAGAGTTTCTGATTCACC^ 

7 0 TAACAAGTTCTCAGGTGATGTTGCTGCTGGTGTTGGGACCACACTTTGAGAAA 

GGGTAGX3GTX3GGGCACCCTACGAAATGCTGCTTCT 

TTCCCTAGGTGACCATGATGCCAGGTTTGGGAACTGCTCGGGTGCAGGTTCATCT 
TATATGCAGATGTCAGCTTATCTTTGCAGCCAGCTGAAGCATTTTTCAGCCTAGACCCT^ 
ACAGGACTCCCTTCTGTGTCTTCTTCCAAATTGTTGATTA 
7 5 TAG CT ACT ATTTTTT AAAAAACAGAAAATAACAAGTGT TTG CAAGGCTG CGGAGAAATTG GAAC CTTT ATT CTCTGTT AATGG GTT 
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TCTAACACAGTGCAGCCGTTATGGAAAACA^ 

CT CATGCCTGTAATCCCAGCACTTCAGGAGGCCAAGGC^^ 

G CAAGACCTCATCTCT7VCAAAAAATGATGAAAATTAGCTAGGCATGGTAG CGGATGTCTAAAG TTCCATCTACAGG CTGAGGTGGG 
AGGATCAACTGGGTCTCTCCAGTTACCTAGG^ 
5 CCTCGATCTTCTTACTCAAAACATCATTTTTGA 
TATCATTCTTTGAAAAGCCTCAG 

AAGTTOTATAAGCATAACAGCTCrrrCTTTAACTACCGATCAATATCLATATGCT 

CCCTTTTACAGATATAGAAACTGAGG CACAGAGTAACTAAGTGACTTGTTTCAGGT CATTCAGTTAACrTCAGAGCTGGGAAGGGC 

ATTCACTAGTACAAGAACCCAAGAamnvrm^^^ 
1 0 AGTGCAACGGaVGTATCTCAGCTCACTGCAACCTCTGCCTCCCG 

ATTACAGACGCC CX5CCATCATG CTCGGCT AATTTTCX5TATTT TTGT AGAGATC TGGT CTTGAA 

CrCCTGACCTCAGGTGATTCGCCTGCCTTGGCCTCT 

ATTTTCTAATTTACAGTCAGGGGAAAGAAGTTTTATTTTG1T 

TTAAAGTAAACTAACTTACTGTGTTTCTCTTTTTGCCCTCAAATTT 
15 TCAGCATGCCTGCTTTTCCCGCCTCTGATTTTCTGGTTCrACT 

CAAATC CTTTCTGAAGGGGGAAG CCrCTGAATAAATGAGTGACTGACTTCCAC CTATCTAATGAAGTGCTTTGTCT CTTGAATTTA 

GAATAATTTACTTAGCATTTTAAGATAACATATAGCATTTTGG 

TCTGCIAGGAACATCAAAGTCTGATAAATATCACAGGGG^^ 

TTAATCCTCACCACAGCTCTGTAAGGCAGGTGTTTACAGACGAGG^ 
2 0 CAAAGCAGCCCAAAGTGGAACCAGGAATAGAATGAAGGACAAAGTCCACA^ 

AATCAAAATAAC CTGAAAG GTTTCCCAG TGGTCATAAGATGCCAACTTGTGAT CTACATTTTTTTTTTTTTTTGAGAT ACASGATC 

T TACTCTGTTG C CTAGG CTGAATACAGTGGTG CTAT CACAGGTT ACTG CAGCCT CGAACT C CTGG ACTCAAG CGAC CCTC CCACTT 

CAGCCTCCTGCGTAGCTGAGACTACAGGCACGCGCCACCACACCTGGCTAA 

GTrGCTGGTGTCAAACTCCTGGGCTCAAGCAATTCTTCTG 

2 5 T AAC CAGATCCACATATTT GATG CATAAAATAAGC T ATAG ACT CTGCAC CTCACTCTC CC CAAAG CTGCAACCATATGGG CAGGGG 

AGATAGTTAACTTCCTTTGTGTGATATTAGCATGATTCTGCC^TTAAAAGCAAAAACAAAG 
GGGTGGTCGGATGTGAATGGATTAGGCAGGAGTTAGTTCCCTGTGCCTGCCCCTTTCr 
ATCGAAGTGTTCCTACC1TTCTCGCAGTAGGGCTCCCCGTCT 
ACAGACGAAGCAGGTGGTGTGCCATGTCTGTCTOVAGGCATGC^ 

3 0 GAGAGGGAGCGGGCTTGACTACCCCACCAGGCTCCAGCCACTGCCCGAGCATCCTACCAGGGAA 

CAGGCCTTGGTCACTCTGAAGAAAGACCCAGGGGTGTCAGAGAAGGGGTT 
TTACATAAACrTGTCAGGCTTAATCCTCCCATTAAACCTGGGAGGAGGTACTGCTTATTACT 
. GTTAAGGCCACATTCTGGTGAGTGGTAAGGTCAGGCTGTGAAGCAACCACTTGCTTTGCAGAA 
GTGGAACAGGCACACAATCATCrCTTGOVT 

3 5 AATGGAGTTGCTTTCTCTGTTACCAGCAACTTCTTGGCTTCCTAATGGGATC 

GGGTCAG AGGAAATGGAGG CCTCCCACTTACCCCCATAATTTTGGTGTTG CACTTGGCACACAGCGGGGCAAAGAATTGCTCATAA 
CATCGCTCACAGTAAACGTTGTTCTGCTCTTCCACAAAGCACACATCT 

AGGGTGCCAAGAACGGCCCATGGCTACCAGAAATGGGCCCCTGGGGAAGCACAGAATGGCACCCAGTA^ 

GG AATT AGAT ATTTG AAGAAC CCCAACACTGTTT AT ACTTTT AC C CAATAACT CTGACTTTTGTGACTGTTCTGATG AAGT AATGT 

4 0 GAAATTCAGAGAAAGCTCCATGCAGAAAGGTGTTCATTGTTGC^ 

CCAATAGGGGACTGGGTAAACATATTATAGAATTCAGTTGACCACTGATAATGATAGTAAGAAATGCAA 
GCTAACATTTATTAAGTGATTACTCTGTGTCAGGCTCTAT^ 

TCCTACGCA.CTAGGTACTATT ATAAT ACC CATTTTT AGAT AGGGGAATT G AGG CAC!ACAGAGTTTG AACAACTTG TTTAAGG C CAT 
AGGG CT AGT AAGTGGT AG AC CAGG AT ACAAACTTGT ATT GT CT AGTT C CAGGTTAG CAGTCAG AACAGTGGGACT ATGAAGACATT 

4 5 TTTTTATATTCTAATTTTTACTTTTCTGTGTTTTCCAATTCTTCA 

TTTTCACXJTATATAAGCTCACACCACACATGCAC^ 
TGAAATCACTTGTTTTCATTTTAGTCT^ 

AACCCTAAAGGAAAACTGGGTCAAAACCMCACTGTCTAACAAAA^ 

TCAACCTTAGCCCAAGGGGGAGTTCCTGGCACCAGGTATCCTGCTGGAGCTTCACCCACATACCTCATGCCATTCCTCC 

5 0 CTGGGAG AGTGGGTGTCACACAG GTATGGCAG CT AAGGCTC^G AGAGGTAAG ACGAG ATGG CAAG CCAGAAGGACCCGAGGCTTTC 

TCCCTACACCTGCCTCCTACCCCCATCTCTCCCAACC^ 
CCAGTGCIAGGGGCACAGCTGGACCATACCXiGATGACATTGTTG 

CAGCCCTCTGGACGGTCCCCCTGGCAAGTGGTCGCACCTGAGGACCCGCTGGGGTGTAGGCTC 

TTGCTGATGGAGGTGGTGCTCTTGCCCGGGGCAAACTTCTGGG AGAAGCTATCATCTGTCA CAGG CTC 

5 5 CGCAAGGCCCCCGCTGTAGGCCACCGAGGGTGCAGGGTTATAGTTGGGGGCAGGTGAGGGGGTATAGGCTGGTG 

NNNNNNNNNNNNNNNNNN^AAACGAGCTG^^ 
GTCAACAACCAGGGTACACGATCAGGCACACCCTGAATGAC^ 

AGACGGAGAGACTTGCTCCAAGGACCGCAAGAAGGAGACCCGACAACGCGTGGGAATCGGCCA^ 
AAGTGTATGCACCTAAGCACCCGCTGGAGTGAAGGTTGGGCCTCAC^ 
60 TTTGCCCGGGGGCAAAGCATTCGGGGAGGAGAGCTGATCC 

CCGCTGTAGGCC^CCGGAGGGTGCAGGGTTATAGTTGGGGGCAGAGTAGCGGGGTATAGGCT 
GGGACAGGTGAGGGGGTGTAGGCAGGGGCAGGGGAGGGGTGTAGGCAGGGGCAGGGGAGGGG 

GCCCCTGGGGACGGGCTGTAGGTAGATGCAGGCACTGCAAAAGCCAAAAAAAACCCAAAGAAAGGTGTGTCAGGAATT 
CCAGTTCCCCTCACTCCTAG C C!AGAG AAATGCAAAAGGTGACTTT AACATTAGAG CAGTACAATTT CAGTTTTTTTGGGAGACTTG 

6 5 TT CCTGAACTT CTTGATGG CC CAG ATTTG AATTCTCAGCAG<; GTGTT CACAGAGCTAGGGAGCAAG CACAG CAG AAC C CTCG ACAG 

GGCTGG AAAAGGGACATGG CTATAAATATTTGCAGCTGTGCAGGGTGGGG CTC CGCCAGGCCTTGACTGCCCAAGGCCTGCATAAG 

ACGCAGAGTGGGGGTGGGGGTACCTGGGCGGGGAGAGGTGGGAGTGGGTCACCTCGCCTTT^ 

TCTGACCAGTGATTTGACATCTGAAATACCTCCCTTGGCIATA 

GTGACCTGGT AG C TGGGTTTT CAG GTTCAGGTTTCAGGTTT AG CTACAATT Au titl: ri G AAAAACAG CTTGG CTTGATT TGTATT 
70 GGGGGGCAATTCCGTCTTGGTTTGATTAGGGTGGGGGATCITGAA^ 

ACAGCCTAGAAGTAGCTGCTCACTTGGCCCGTGGCTGCCTTGGGGGACCCCTGT^ 

GTGTCTGAGTGACTGTCTAAAGATTGCCCAGGCAGTCAGAGC^ 

CAGCTCCCTGAGAGCAGAAAAACGTTTTGGGCTC^CAGCACAGAAAAAGCA 

75 AAGCCTCACCACATTTCAGTCTGCTCCTCAGCAAAGAGGCCrCATT 
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CATTCCTGAGCTCAGTATGGGCAGGATGGACTGCATT^ 
CAGCCTCAACCTGGAGAGGAAGGTTTTCCTT^ 

GACCGCCTGGGGCAGCCTGTGGCTCCCACTCTGC^CCTTCCACTCCCAGGTGACC^ 
CTGTGCTGGGGGTGAGTAGGGACC7^3GTAGATAGGATCCATCCA 
5 CCTGGAGCCATCCTGCCAGTGAGGCCTTCAAAAG^ 

GGCTGAGGTGGGCAGATCATGAGGTCAGGAGATCGJVGACCATCCTGGCT 

AAAATTAGCTGGGTGTGGTGGCGGGTGCCTGTAGTCCCAGCTACTCAG 

GAGCTTGCAGTGACCCX3AGATCX3TACCACTG<^ 

AGATAATACCTTCTCCATAAGGAATCTTCCTCCCCTTTAGGGGTAACG 
1 0 TATTCCCTCTTTACACT TGAGGT CACTGAGGCTCAGACAGAG CTAGT AAGTGGGAAT ACATCTCCAAATCCATTCATTCTATCATC 

CCAAGCTGCCrCTGTAAAGAATATGAGTCCAATTTCTAGAAAATCTCA^ 

TGGGCAAGTTCCTTAACCrCTCTGGGCCTCAATTTCTTCA^ 

GAGG ATTAAGGGAAGGTAAGAAATAT AAAGAGCGTAGCCT AGTACCTGG C AT AGTG TGTCACT AAATGTGAGTG AGG ATGGCAAGG 
1 5 ATGGGGGCAACCTGGGTGTCAGGGTCAGATGTACTGTCAGCCTGTCT 

GAC TGGTAG CG CCT AGGGTTGGGGGAGAGGAGGG AAGGG AAATG G AACAT CCTGAAGT CAGAT AAC CTGAGACTGAGGTCTATTCA 
GTGG TGGTGGAATG CCTGACACCCTGCCTG CCCCACAGGG CATGCTGGG GAC CTGCTG GGGTGACATAGTGAATGGATTTTGT AAA 
TGTGTACAGCCCAAATTTGAGGTATTGTCACCCTCCCTAACAACT 

GGAACACCAC CAGGGAC AGAG AGCAGGGAGAGGTGATCTAG CACAACIT CAG AGCAG CTTCAT C CCTCTGTTAT ATAT AAGGGGGT 
2 0 TCTGTGCAACATTTTCTTTCCTTTCTTTGTG^ 

CTGTAATCCCAGCACTTTGGAAGACTGAGGCCGGTGGATCACTTGAGGTTAGGM 
CCCGTCTCTACTAAAACTTCTAGTAGCCGGGCATGCTG 

CAAAAAAT AAAAAGATTAAG ACCAG AAG CCCT AATCCTGCGGTG TCCAAT ACAT AG TGCTACGTGGCCGATGAGCAGCTGAAATGT 

2 5 GGCTAGTC CGACCAAGAAGTGCCGTTAGTG TAAAACACACAC CAGAT TTTG AAGG CACATTAATAATTATTTT ATATTG ATTATAT 

GTTG AAATT AT AT ATAT ATAT AT AT ATTTTTGAG ACGGAG C CTTGCTCTGT CGCC CAAGTTGGAGTGCAGTGG CG CAATCTCGG CT 
TCACAGCCACCTCTGCCTCCTGGGTTCAAGTGATTCTCCTGCCATAGC^^ 
CCCGGCTAATTTTTTGTACTTTTTTAGTAGAGACGGG^ 
CCCGCCTCGGCCTCCCAAAGTGCTGGGATTACTGGCGTGAGCC^ 

3 0 AT AT AT T AAAATG AATTTAACTTGCTTT CTTTTATTTTTTAA^ G AAATTT CTATTGCGTGGTGCTGCCTTCA 

CCAAAATGACTGCTTCCACATGCC^GGCCCTTTACGTGCCTTAT^ 

ATTTTACAGATGAGGAACTGAGACTTGGAGAGCITAAATGGCTGCCTCCAAGT 

CTCAGGCTGCCAGACTCAACAGCAGATGCCCGCITTAGAAACTCCCCAAGCAG 

3 5 GGCCTTACAGACTCCCGGATCCACCTCCTTTTCTTTT 

CCCGATTTGACGGCrTCCTTGCTTGGGGCTAACCCACCCAGAGAAGGAACCTCC 

TCCCCCGTCGQIACCCCGACTCCGTCGGGGGGCGAGCAGGCAGTGGCGCTTCGCAGCCCTCAGCTTCTGCATCC 
CGCCGTCACG CCAGG AG AG ACCTGATAT C C CAT CTCG CAG C CC CGGGGCTG TTG CT ATTTTTG CGTGACGGAATTAAG CCAGT CTA 
AATTAGAAGGTG AT ATTTCATC TCT AATTGG CAAGTT CAAAAGT CCC CTAT CGCACTCGGCGGG CT CGG GGATATGTAAT T AAACA 
40 AGGTGATTTOTGACTCACXSCCGGGCCGGGCCCGGCrcrrCCGC^GCGCCTGGCCGC 
CCGGCCCGCGCGCAGGGTCTCCGCGCCGCCGCOSCCCGCCCCCTCCCG^ 
CGCCAGCGATAAGATCCGGCCTGCGAACGCCGCCCTCGGCCCGGAGGAG^ 

GGGTCCCGCGCGGGCTCCCCGCCCTCTCTGACGCCCGGATCCGAAGAGCCTGGAGGGAAGGGGAGC 
ACGCCCCACTTCATACGGGGCCGCCTGCCAGCCGCCCCCGGACT 
45 GCCGACTCCAGCGCGCGCCCX5CCCGGCCGCCTCTTTCCCCCTAG 

GGGCCCGCGGCTCCGGCCCGTCTCGCTCGCCTGGGCCCCAGCCCGGGAAGGAGGAGGAAGGAGACGCAGGGCGGGCT 
CCCCCXSGAGCCCTGACGCCGCCTGGCGGGCTGG^ 

GAGACC^AAATGTCCTCCTTGTGCCCCGGCAAGGCCTCGCCTCCACCCGCCGCTTCT 
TGTGTTCCCAGTCTCTGCAGGGGCAGCCTCTTCCCACT^ 
5 0 GAGTTTTCACCTCAGAACCGTCTATCCTTGCCCCTTCTCTGTAGGGAC 
GTGCTTCATTCCATCCCCAGAACACAAGGTGTCCATGAACAGCGCA^ 

GGCCGTGGCACCGGCCTCTGTCTCGGGCAAAAGCCGCTTCGACCGCCCACACTCAGCCCrGCTC 

GCGCCCCGAAGCAAGGCCTGCCCTGACTCAGTG CCTGTCT ' 
CCCTGCGCCTGGATAGGACCTGCTCCCGAGTCTAGGCTCCTTATGTGCTGTTCTGTACC 

5 5 CTGGCATGACACAGAGAAGATGGAGTCTCATTCTGTCGCCCAGGC 

CTCCCGGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCH^ 

GTAGAGATGGGGTTTCX5CCATGTTGGCCCCGGCX3GTCTCAAACT^ 

TGGGATTCCAAGTGTGAGCCACCATGCCCGGTCCGGAGAAGACTTCTCTC 

TC TTGC TCAC CTGTCTC CCCCAGACCAG ATGCAGGCT CCATAGGATTCTG ACCCACAG CT AGGGGACGG AGGG CGGAGCAT CATT A 

6 0 GCAAGGCCTTGCTTCCCATTCTTCTGGGGATGTGCCCCCAGAGCCACTTCTTGGCT 

TCGACAGCCTCCCCTCCnXSCTCTGCCTCTTACCTGGCTGGTAGAC^^ 
GTGCAGGGGCGGCGGGGCCCTCACTGTAGCTGGTGTGGGTGGCAGGTGCTC 

TGGGGCCTGGAACCAAG CGGGGAGGGGTTAC^CCCIAGAAGGACIAGCCCCC^CTGGGGCCTGCAGTCAAGG CAGGCTGGCTCCCAGG 
TCTTCAGAAGCCTCTGTGAGGACACCTGACAGAGACATTTCTGGGGAACTGCCAGAAACT 

6 5 G AGGGGAGG AG G ATG TT CCTTGGGTG G CTAGTGCCAC CG CCTTTGGAGAG AGGG AG AAAAG C CACCAGGAGGACACCGGG AAGACC 

CTTCACTCT CATG GGGAG AGACTGTCCAGGTGGGACTGTGGAAGAGCG CATAGGT AGG GGAGAGGAG AGAGATGG CGCTGGTCAG A 
ACTACCTACCCCCACGACCTTGTGGGGCACCTCAGCCCCTGACCACCTGGCCTCTT 
GAACAGCAAAGTCTCAGGTTTCTGGGCACAGATGCTCCA^ 
GGCACCCCCCTCTCCCCCAACACACACCTCTTCTGCATAGAGTTC 

7 0 GGGTCACCTTTCCCCATGTGTGGCCCAGGGACAATGGGGT 

GTCTGTGCGTCGTGGCCTCTCCTCCCAGAGAGCTGAAGGCTCTGAGAACTCCTGGA^ 
AGTGTTCCCCAATTGTATTTGTTTGTAAAATGCTTTTCTTCATTTAAAACTAATGA^ 
AGATCTCAAAGGGGACCACATGCCCAC^CCCAGGGCAAATGG^TTATTGCTAGAGGTGCTGGA^ 
CCACATGGCTTTCTGCCTTCCTTCTGTCTGTTCCTCAAGGACCTGGGGC 
7 5 GTGCCTTCCTCTCTGGGGCTCTGAGAAGGGACTGGCCCTGGCATGAGAACCCACCACC^ 
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GAGGGGCTGTCCCAGACAAGGCGGAGATAAAATGAGCCTCAGGGAGCTGC^ 
CAAGCTGCCGAGTTTCTX3GCTGAGTGAAGCTATTGTCTCCCCTCCC 
GACCAGOVGAGTAGGGGTCTCCATGGAAGAGGAAGAGACAG 
GTGCAGTGGTCTTAGCTCGACTCCATCATTGGA^ 
5 TGGGAAAGGGAG CAGAGGCTTATGGCAT (XIATTGAAAATTCTGAAG C CATGCTCCCATGAAAGGCTGGGGAGTC CATTTGCCTCCC 

ATGGGCTCCCCCACCTCCCGCTCCACCCCAGCCCATGTC 
CTGCTCa^COZACTCCCCGCTGG^ 

AAAGGGAAGGAAGGGAGGACTCCCCCTGTGGGGGGCATCCCAA 
CCrAAGCCTGG^CTGGTGCCTGCCTTTGAGCAGGGTTGGGGCC^ 
1 0 AGCCCCTCACCATTCATATTGGGAGGCTTCTTCCCACTCA 

GGTAGCAAGGCCTTTATATGAAGACxATTCTGGTATAAAGAGGGGAA 
TCCCTGGAGGGCCAGGCTCCTCCTTCAGGCTTGAGTTGGGCTGG^ 

GGATGGGGG CAGCAGACAGG CCTGCAAGCCGGTTTGCTCACATCAATCCCACCTCAGGCTGCACCATGGACCT 
CAAGAGGCATGTCCTCCCCAGCAAACCGTGGGGGCCAGGGTGGGAGGC 
' 15 CTACTCTTGGAAGCCCAGACAGAGAGCTGGCCCCATCCAGCCTGATGCGCCACATGGCCACTG 
TCTCTGCCTTCGCTCTTCTCCCCTGGCCTTTGGTCTCTGCTTGCTGGGTGTCT 

GT CCTTCAT ATGTATTAGGTGTTTTACAGACATTTTTT CTATCATAA^ CT AT AT AATAAGTGTTAGGTAAACATCTTAACAGATG 
AGGAAACTGAGGTTCAGAGAGCTTCAGTCACTTTTTCAAGGTTACACAC 
GGTGCCTCTGGCGTTAACATGGCTGAACTCTCTGAGGAAATCTGACCT 
2 0 AGCCTTCCTTGATTTCTCCAGCCCACAGGGA 

CCTTTTGTTATCTAACITTGTGTGTATGAGTGTGTGTATGTGTCT 

TGTGTCAGAGAGAGACACTGTTACCTCTCTCACCGGTGTATGTGT^ 

ACCCCTGTAATTCACCCTCAGTTTCCCCTTCACACTCTGC^ 

CTACCTTGGACACAGCTGGCCACAGGTAGACCCAGAGTGGGGCGTTTCCTGCCTCCCTTCCCCAGGG 

2 5 CCCAGTTACCTTGGGCTGTCGGCAGGACTTGAAGCAGGGG CTGTGGTGGAGGCGGAGGCGATGG CAGTGTGGGCAGCAGCTGTGGC 

CAGGGGTGGCGAAGCCGCACTGGGA^AGGCAGCAGCAGGTGGCTGGGCAGAAG 

ACACCGG CG CATG CTCAAT AG GGGTG CTGATG AGGCGGGACCAAAAAGACAACAGGTC^ C CTGTGAGG CCC 

CTCTGCATGGGGGCAGGTGGCTGCAGCAGAGGCA 

GTGTGTCTTCCAGCTCAGGGAAGGAGCTCCTCCAGGATCCCAGGAGCAGAGGCTCAGACCACAGGAGGGTTCCAGT 

3 0 ACTTTTTCAATCCTGGAGGAATTTCCCACCCCCA^ 

GGTAGAAAGGCCGATGGAATTATCCACCa^GGTTCCrCCCTGATCCT 

CCTTGGG AAGG AATGAT ACAG CAAAAAAAG CAGAGTGAAG AT ATTTGAG TGTCAAGGGCTT AGGCT CTGGAGTTG TG AGAACCTAG 
GCTTTGGGCTCGGAAGGACTCAGGGCACCAACAAGACAGATC 

AGTGGGCACTGTGGCAAGGGGAITCAAGAATAGAAGCCCCTGTGCCCCTTTTGAAGGTC 

GGATTTGCCATGCACCCAAGAGTGACAAGAACATGGTCCTGAG CACCTG CAG AT CTGAAAG ACCTTGT AACTGGG AAAGAGAAAGG 

CGACCACACTGCCAGTGTGGACCAGACGGGTTGGGGATGTCTCAGCAAATGCCA^ 

GCCCCCACACCCCCACCCTGAGTGCCTTGGCATTCTGTAAGCACCTCTTCCCTAGAACCACTCT 

CTGGGTTGCTAACTGTAAAATGGGGTGGGCAG AG AAT AAAAATTAAATT AT AG G AAAG CAAAGAAATG CAC ATT CTTT ACTCTTTG 

ACCTCATGCTATGTGC CAGGAATGATG T CTTT ACTT C CT CACAG CAG C C CT CTG ATG TTGGT ATCATGGACT CTT ATTTAATAGAT 
G AGGAAATGGTAACAAAGAGAATTCAG TTG CT CAAGGTCCT ACAACTT AT CAG TAAG AAT AG CAGGAT CTGATTG AG GGATGTTGT 
GATTTGAATCCrTTAACCGGTACCTCATGGAGGAAGAAATAAACCTCTTGTTGTGATCAGCCAAGTAGACTCTTT 
GCACTATTCTTATTGGAAATAATTAGAATAACAATACT 

4 5 CGTGGTGGCTTATGCTTATAATCCCAGCACT 

CCAACGTGGAGAAACCTTGTCTCTACTAAGAAACAAAATTAGGCCGGGC^ CATTTTGGGAG 

GCCGAGGTGGGCGGATCAtTGAGGTCAGGAGATCGAGACCATCCTGGCTAACACAGTGAAACCTCAT 

CAAAATCAGCCAGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTTGGGGGCTGAGGCGGGAGA^ 

GAGCTTGCAGTGAGCCGAGATCACGCCAATTCACTC CAGCCTGGGCGACAG AGCGAG ACT CCGTCTCAAAAAAAAAAAAAAAAAAA 
50 ATTTAGCTGGGCGTGGTGGCGCATGCCTGTAATCCCAGCTACTCGGGGGGCTGAACCCTTGAACCCGGGAGG 

AGC CGAGAT CATGCATGC CATTG CACTCC AG C CTGG G CAACAAGAG CG AAATT CCAT CTT AAAAAAAAAAAAAGAAATTACC CT 
TACTCAATTTTATCATTTAAACCTCCCTGAGCCATTGGGATTGTGA 

TGTGTCTCTAGTAAAACATCACAAAGCTGCAAAATGGCAAAAAAGGGCTTGGGGCCTCACCACT 
GTGTGCTTGAAGCCCCTGGGGATCCTGTTAAAATGAAGTTGAT 

5 5 ITTCTTTCnrrrTTTTTTTTTTCT 

GGCTCACTGCAACCTCTGCCTCCCAGGTTCATACAATTCTCCT 

ATGCCTGGCTAATTTTTCTATTT TTAGTAG AGT TG GGG TTTCAC CATGTTGG C CAG ACTG GT CT CGAACTCCTG ACCTCAGGTG AT 
CCACCTGCrTTGGCCTCCCAAAGTGCTGGGATTACAGG CGTGAGCCACTG CACCCGG CCAAGAGTCCACTTTTCTAACAGGCTTCC 
AGGTGATGCTGATGCTGCTGGTCTGTATACAGGCCAC^^ 

6 0 CGGGCCCTCAGCTATTTGCAAAAGCATTACTGAATTACTTTCCTTCATCTGGTGAGGTGTA 

GACAGGGTCACGGAAGAATGGAGCTG^AGAAGTGGCAAGGTTGAGGTTGACTGCCCCCATGG 

TCTTGGGCAGCCCAGCTTGCAGGCCTGCAGCCGTGCCTGGC 

AAGCCATGCCGGTGATGTGACTGCCTCAGAGAAGTAGCTGQ^TCTCTGTAGCCACGCCACT 

6 5 TCAGTGTGGGCAGGTATGGCCTGGGACGGCrCAGGCTGACACAGAACACT 

ACTTCCAGTAGGCTGCTGGGAAATGAGGACACTCCACAGGAC^ 

TGAATAGGAGGTTGGCTTTGCCTGTCTCCTTGGCTCCAATTCACTGTTAGGCCAGCTTACCTGCTGOT 

AAATGAACCCTGGATTCCAGGGCATGAAGTTGGAGCTGGATGTGGGCCCTAG 

GATCACCACCAGGTATACCTGGGTATGCACGTAAGGGAJGCCTGCT 

7 0 T CTAAAG ACAC CAAGAAG CCAGGGCTAGGGAAG GC AGAGGAG G CATCAAAGGGTATCTTG AAGG ATCT TGG G CTCTTATTCCAGGT 

CAGTTCCTGCCTGAAGTTCCACAAACCCTCTGA^ 

CTGGCTTCATTCCTAAGGACAGTGCTGGTGAGATCACCCCATCCTGATCTAAGGTCTACA^ 
AGTTTAAACZATCTAAGCAAAAACCAAATCAAACCAAACTAGGCCC^ 
AACACAGCCTGACCATCGTCCCCTTTTCAGCTTATCTCTTCCCTC^ 
75 CCTGCTCTTCCATGGTGGCCCTTGGTGGTGGGAATGGTGGGGTTTGGTTGATGCTGTTATTGATAA 
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AGGATGGGAATGGTGGTGGTGATGCTGGGTCTTGGGGTTGTTC 
TGATTGGTATTGACAGTGCTCAAGGGGTGGTTGCCACTC 
AATCTCTGTTGATTGGGTGATGATAGAATGCGft^ 
CC^CTTGCTGCCTGGTGACTATGGCTCAGGGGAACCCrGAGCT 
5 CACATTGCCATTGTGOTCTAGATGAAGAACACCTCATCA^ 

TTCAGAATGCATTCAAAATCCCATAGCCTGGGAGTCTAGATGC^ 

AACTAAAATGATTAACrTGGGGTTCTGGGACAC CACTTCTCATCAGCACCTCTGCTTGTTCTG CTTAGAGT ATCT GTCCAGCCTTT 

TGTATCTGCTTCCACCCAAATTCTAACCATTCCTGATGATAAAATACACTGACCr 

CTTGGAGGGATGCTCCCTGTCrCrrGGACCATACATGATCAAC 
1 0 AGCACCCATAGTAGTCAGGCACCTCCACAAATGCCCACAGACACCGGAT^ 

TTTACATGGAAC CCAGC CreGGGCCCCCXIAGGGCACTACATGAAGGGGTCTGGGG T AGG 

CTCCAAGAGAGCTCCGACAGGCTGGGGTCTGGGAGCAG 

CCTCAGGATACCCATTGCCCAGGACCTCrrCrrCTATCTGGCAG 

AAACATCAGCCAAATATGAGTGTGTTCCAGGAGGCCTAGATGAAG^ 
1 5 GGAGGCAGGATGGGGTGGGAGGCCAGAATAGGTTGGCAGGGGGCTTTAGGGAGCAT^ 

TCAAACACC^CGCTTCCTGGTGGGATTGCGAAGGTTCCT 

AG CTCTGGCT AGC C CAG C CACAGAAAAGG CAGGACAGC CTTGAGGGCTGGGC CT CAGCATCTGCTT CTAATGTAAGTG CCAC CACT 
CTCATGTCCTCAGCCTGTGATTTCACACAATCAGGGACATGGAATTG^ 
GGTCAGGCCTGGGTCAGGTGAATGGGGTGCTAGCrrAGTGGCT 
2 0 TTGGCTGCTGCTGCCCITCCTAGCCCrTC 

AGGTGGAAGCCTTCCCAGCCCTCTTGCCTCCTGCCACTTGAGCAAGAATCrrc 

CGAGCCCTGGCCCCCTGAGTTCCCTTGTGAACTGCTTCTC^ 

ATCGAGCATAAGTAGCTCCTTCCTGTGAGAACAGCTTC 

GTTGC^GCAGGGTGTGGAAGGAGAGGCAGGGGGCATTCGAGTATTCCCATATGCACTGTCCT 

2 5 GGCCTC&GCACAGATTCCCACCAGCCATCCTCCCGCCGTGGG 

CCCTCACAGCCCCTCTrTGGGTGGGACACACACAGGCACAGCATGACTC 

GAACAAGCATG CTT ATATGT ACAGAAGAG AGAACT ATG GAG CCAG C^C ACACGGT ACAG AAAGGAGGGTGCATG C CCCCACCTCAC 
CACTGCCAGGAGGGGTCTCTGCCTCATTACCCTGGGGC 

3 0 GAGGGCAGCGAGGGAGCCGGGCAGCGCCAGGGATCAGCCTCTTCCCCAGACCCTGATCT 

AGGCCAGGAGCTGAGGATGGTGGGAAACTGAGCAGCGGCCCATCCATTTCCCTTGGTTCTGGGG 
TGCCCTCTCTGGTCTACAGCTAAGGCCCTGTGC^ 

GGGGAGGTTTGGTGGGT ACAGAGCTGCAGGGGGCTGGGCCCCAGCAT CAGCTCTGGGGTGGGCAGAGGACAGAAATGCTG CGGTTG 
GCCATCAGAGGTGCCCCCAGCAGAGTTATACATTGTC1AAGCATCACCGTG 

3 5 GAGGGAGTCCAGCCTGTTGGGGTGGGGGGATTCCAGGCAGCCrTTTAGCTTTTA^ 

CCAGTTGCGGAATTTGGCAAAACGTGGGCTGTTACGTTCCGT^ 

AGAGAGAAACAGAGAC^GAGACAGAAAGAGAGAGAAGAGAGGGAGAATGAAAGGGGTTAGGGAAT 
AACAGGTGGGGCAGAGGGCATGGGAGGGAGGGAACGGGCTTGGAGAAGCTATGCAATC 

CCTGGGTTAGGGGAAGACCGAGAGTCTTGGGGACCAGGATGGAGGGACTGAAGTCCCTGCCTTGAGTGAGTTGG 
40 . AGGGACCTGGCTAATGGCCCTGAGGGCAGACTGAGCAAAGAGGGCCTCCCT 

GCTGTGAGCTCrCAGGATTGAACCAAACAGACAGGTGCACCTCGCATGATCTCTGTCC 

CCTATCCCCCGTGATGAGGCTTGGTGGGCAGACCTGCCCCTTCCCTGTGCTGC^ 

GAAGATTTTGCACAGAGCCCCTGCCCACCCCTGCTCGCCAGGCCTGAGCA 

ACAAGGTCACTGTCACAGATACAGGGCCAGGACTTCTGGCCACTTCCCACCAGTGGGGCCTCTTC 

4 5 AGCTGAGGCCACAGTACAGAGCAGGCAAGCAGCAGTCAGAAA 

CCACAGGCGTCAGAATCCAGGCCTTCCAGGCTGGAAGGGCCGATCACATCOT^ 
CCCATCCAGCCACAAACACAGTATGTTTAACCTGTACGGTGCTTCCAACT 

ATG CTT C TTAGCAATAAAAGGG AAT AGATGAACTAT TGGGACACT C AACCACTTGG GTG ACC CC CAGGAATTATG CTGATT AAAAA 
ATGGCCAATCTTAAATGGTACCATGCTGTATGATTCCATT^ 

5 0 ATCACAGATTAGTGGTTGCCAGAGGTTAAGGAGGTGGGTGGGTGCTGGAGGGGAGTG^TATG^CTATAAAAAGGCGGCATGGGGC 

ATCCTGGGTGAAGGGAAGTTCTGTGTTTTGCCTGTATCAGTGTCAGTATCCTCGTTGTGATAT^ 
TACCATTGGGAGAAGCTGGGTAAAGGGAACTAGGGATCTCTATTATTTCTTATAACTGCACT 
AAAGTTTAATTAAGAAGTTGGCTGGGC^TGGTGGCTCATGCCTGTAATCCC^ 
GTCAGGAGTTTGAGACTAGCCTGGCCAATATGGTGAAACCCCXJTCTCTACTAAAAATACAAAAGTT 
55 GCTTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCCGGGAGGCGGAGGTTGCMTGAACTGAG 
CACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCC^^ 

GAAGTTGGCCGGGCACGX5CGGCTCACGCCTGTAATCCCAGGACTTTGGGAGACCGAGGCTGGTGGATCACT 

AAGACCAGCCTGGCQAATATGGTGAAACCCCATCTCTACTAACAATACAAAA 

AGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAGCCTGGGTGGTGGAGGTTGCAGTAAG 

6 0 GCCTTG GAGACAGAGCAAGACTCTGT CTCAAAAAAAAAAAAAAAAAAAAAAAAGTC 

AACTTTAAGTCTTTGAAAAGAAGA1X3TGGCAGTGCTGGC CTTCTATTTCTG CACAGTAACAGTCATCGGGGCTGGCCCCCATGTCC 

CCACACTCACAGAGGTCCTCGGTTCTGCACCTGCTTTACCTGGCAGTCTCTGTC 

CTCTGTCTAAGACTGTTTGACAAAAGGGACCCACTXaCTAACATATAAC^ 

T ACTAAGAGGAAG CTG AAG C C CAGAG AGGTCACATGAATG ACCTAAAG C CAC ACAG AAACT CAG GG GCAGAATC ACAATCT AAGTG 

6 5 CCAGCGTCITCTTCCCTTGCCACATACCTGCCTATCTGAGGTTGGGGTAAAGGGG 

GGGCTCCTTGAGTCCTGCTGACAGCACTACAAACAGTCCTGACCATCCCGTAGACACTACCGAGGAC^ 

CACAGGCAGTTTGCCTGTAAATCGTCAAGTCCTAGC^ 

GACAGGGACAGTCCTGGAGAGTCCAATCTAGTGAGCATGCTCTACCT^ 

C CAC^AGAGG AAG GCAGAGG AG AGAGGAAAAGAGACAGG AGAG CAAGAG CAAGAAAG ACG ACCT CC CAG GGGCTGCTCAGGCAG CG 

7 0 GCACCTGGGGCAACATCCCTGGTGCTCCAGTTGTGGTTTTCAA^ 

AGAGAGGTAATTTTTTCATTGCTTTCTTACAAG^3GGCCCATTGGCGGGGGCACCC^ 
GTAAGCGAGGTGGCTGCAGGGGGGGCAAAACTGAGAGACTGCGCAGCAGAGGGAGG 
AAGAGGGGGAGCCAGTGAGCCCTGGC^TTTTATTGGAAGAAGCTTAATGGCTGCATGATA 
ATGATCTGGGATGGAAAGGGCTATGTGATG(2ACCTTCAGCTCCTCrAGGGACT 
7 5 GAAAGATGGGGAGATGCAGAGCTCTTTACCCAGGO^GGCAT 
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TGGGTTACATGCTGGGCTACATCCTGGATAGTTCCC^ 
CACCCCTATCCCCAATGTGGCTGCCCAATGTGGA 
AAGCTGTGGCGATGACTCTGAGATTAGAGGCTCAGTTTGTGGGATGGA^ 
CACAGCGTCACCTATACCTACTAGAGTAGTTCCCATGTGGAAGTAGGCAGAAG^ 
5 AGGTGCCTGGTCCTTCCAGGGTGGGAAGTTCATAGCTGATGCCTGGCCT 
GGAAATTAGACTCTGTCTTGGGCTAGCCAGCAGGTTGGCACT 
ATCAGGAGCCAGGACCCGJVGAGGAGAAGAG 
AGGTGAAGGGAAAGTTTGCAAAGTCTTCT^ 

G CCTGACACT ACT CACTCAAC^TTCCCGCTAACTTT CT CACCACACAGCAC CAGGAAGGAGAAAGCACCTTTGCATGGGCATAAGA 
1 0 CTGTGTAGGTGTCACCATG CACAGGATTGGCAAACC CAGGCTCTGCTGGAGTC^ 

CTGGGGAAACTCAGCCTTCAAGAGAGACCCTGAGAATC^ 
CCAACAAGATGGGCCAGGAGAGTCTAGGGGrc 

AGCAAGACCCCTGACCAGCCTGGACCAGGGTGGGACAAGGCATGTTCCCTTCTGCT 

CAGTCATGACCAGTGATCACACACACTGCTTTAGGCTTCA^ 
1 5 GTCCATGGGATCACTGCTTGGGATTCAGTGACATGGGTGGCTCTGCCCACGAGGAT 

GAGAGGAGCCCCTCGAGACCTGGGCAGAGGCTC 

TGATGGGGTGAGGGAAGTTGCCTGGGACTGAGGGAGGAAGCTAAC 

ATGAAACCCTGCCGTAGCTGGGGGAGAGGCTCCGAAAGCTGCCTTTTGCT 

GTGAAAGGACTACTCTCCCACCCTTCAATTACCCT 
2 0 GAAGGCCAAGGATGAGGCCTGGGGACCACCTTCAACACTGCCT 

TCCAGGGGAAAGAAAAGACAAAAGTTTGCTGCAATGCACTT^ 

CAAGCTGAGCGTGACTGCCTTGCATATCGGAGX3GAAGCAGAACATTAAGG 

TCAGAGGAGGGAX3AAAAAAGGCCCCCTGCTGGCCTGTTGCTGGAAGGAACCCGG 

TGATTTAAAGACrrriTTTATCAAATGTTTGGGAACATCTGCATTG 

2 5 AAGACAAGCATGGGAGCAGCACG CAGGG ACAG CGGAGCACAGCGGG AG GGGAGAGCTGAG AG CTCTGGCACTT AAAGCAG CTGGAG 

CTTAACTTTTGAGCTAAATCAAACATCACT^ 

GGACTGCATGCCATCCTTAGCAGGCCTGGGAGGACCC CACCTTAGG CTTGGATGCCTGGCCAAAAAGTGGGAGCAACTTGAGTTGG 

CAAAAGTGTTGGTTCAAAAATAAAATCCACAA 

TGGGCCTGCTGGTTAGCAAGTACAGAAAGCTTGGC^ 

3 0 CAGGATGC^CTGGCCCAAGGCAGACAGTGCCGGGAAGAAC^^ 

GAGCCCTACTCCTTACCATGGAAGGCCCCAGCGGGT^ 

TGAGCACCTGACGGGCCACAGTGGTAGGTTGGTGGCTC 

CTTGGCAOIAACCCATCCTCATGTCCAGGGCTTGTTC^ 

CTCAATGTGGGTCTGGACAGGTGAGTCCAGGGGTCAGGCGTTCACGTCCAGCACAGCCCCCT 

3 5 CCTTCGGAATCATAGGGGGACAGGCCCTGAGATGAGTGGGTGGGCAAGTTGGCCAGCCGGGTGTCCTCACAGG 

ATTCTCAACAGCCCAGCAAAGGCCAAA.GCAGCCCTGCCCCCTGCCCAGGGATG 

GCCTGGGCCTGGAGGACAGAGACTCGGGTCCAGGTGCC 

CTTTGCAGGCCAGGCTTTCCAAATCCATGAACTT^ 

GGGCTAGAGGGCAAGGCCACAGAGCCTGAGTCCAGGCACTTACCTTGACCTTCGCAGAGCT 

4 0 GTAGAAAGGAGCTCGTCAGGGGACTCACGGGAGACACAGCAAGGAAAGCTC 

GCCAGGCCCCACACGGTGACTGCCCCCnTGCCT 

GTAGCCTGAGAGGGCCTCATCCCAGCCCTGGGCAATC^TCT^ 

GCATTGGAGCAGAAGGCTGCTGTCACCGGAraGCCTGC 

CTGGCCC ACCTTCAAG^^CAGGTAGCAGGACTCC CTGGCTT CTGCAGTGCTGGGCCCAGGTCC CTGGTGGCCCCTCAGCATCTCCC 
45 TCCTCTGCAGCCACCCTGAGAGCCTGCACT 

TTGGAGGAACGGCGTGCCCACTCOTCAGCCTCATCTTCTGGCTTGTTOT 

GTCTACGGCAAGGTCCTTAATAGGGAGGCTCCTGTAATGCAGAGAGAGGCCGTGGGCGAGGGCT 
CTGCTGGGTCCCTGCTCCATGCCCATTGGTGGCCrc 

GATATGGGCATGACCTGGGTTCTCGGTGCTCAGATTGGTCG CAGTGAAGCAGTGACT CCCCTCCCTCATGAGAGACAGGAG<3CTCA 

5 0 GATCCCTCAGGTCTGCCCTTCCCCAGTGCTAA^ 

GAAACAGAACAGGTCTTCITTGACCrCCGTGAGaVGGCCGTGAGTTTG 
ATTCCTCAGGATCCATCTCCTGCCCTCCTCACACTTTTGGATCCAAACAAGGA 
TGGGGGCCATGGGCTCAGTGTGAGTAGGCIGACTCT 
TGTAGGGACAGGGGATTTCTGGCCTGGGGAAGTTGGGAGTGGGGCT 

5 5 CCTTTGTGGAGGACCAGGTGACCTTCCCTTAGGCT 

ATCCTACATCAGAGCCAGAAJ3GGACCCTAGAGCCCCTCGGGTCCAGTGCCTCCCAGCCTGAA 
CAAAAGCCTTACAGCCCACTCACCCTGGCCGGGGGACCAGAGGACCAAGCCG 
AGGTGAAGCCCACAACAGTAACAGACTGCTGGCCCAGCCCT 
CTAGCTCCAGGTTACAGCAGATAAGGGCCCCATTGGGTGAGA^ 
60 TCCCTGCAGTCATAGGGCTCAGGAAGGCTACTTCTGCCCAGTGGGAGGCCCTGGACCTATGCCTGGAAAGAT^ 
TCTTCTGCCTGCCTCTCCCTTTTCCCCA^ 

CAAGGAGTGTGACAGGGAACTGACCTTGTGTCCCAGGACTGGTCCCTGTGT 

AAAGGTGATCGAGGTTGGGTGGAGCTCCCCrGGCTTCXSCTGAAGGCC CTGCCCTGGCCCAGAGAG CT ATAGGTACTGGCAG CCTGT 
GGTTGGGATCGGTTGGGAAAGACCAGCAGAGGCTTCTAGG^ 

6 5 GGTGGGATCTTGCTGTTGCTGGGACAGGCCATTTCTGGC 

CAGGAAGG CACAACCAGGTCTCCCrrGGTCCTGGCTCCTGAC CCACTGGGAGGGGAGGGAAGGAGGG 

TGAGGGAAGGTGTGGATGCAAG^CCCTTGAGCCTCTCTACAGAGCTTTG CAATCCAGAGAGCAGGTGG AAGGTGTTTGCTG CTGTT 
TCCCTGGAATGTGCCCAGGAGGAGCTCCCCrGTGCCTTTCTTGGCTC 
ACCCCGTGCAAAGACCAGGGACTCTTGTTGCTGCT^ 
70 CAAGGCTCAGGGAGAATATATGCAGGGAAGATGGAACAGGTCCTG CAGT CCTTGGCCCTTGGCCTGAGTCCACTCTCTGrG ACCCAC 

TTTCAGCTTGTTCTTCTTGCCACAGAAAACACTC 
CCCCAGTCTGATTCATCTTTCCCCTCCCAT^ 

GAGTGGCCTCCAAGAGGTTACCTTGTCACCTTCTAGCCCTGAATAATGGCCCAGTGTGCATTC 
CAGTGGAGATAGGAAAGGCGGGGGGAGCTACCCCAAA^ 

7 5 TGAGTGTCACTACTCTGTGATTTCTGTGACCACAGTGACATTATTTGGA 
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CAAGGGATCTCGTTTGGCTGACAGOT^ 
CACCAAACC^CACACAaMCCCAC^C 
CCTCCCAAGCAGAGGAAAACCCAGCCG 

GGTCAGAAACGAGTT ATGGGAATGGGGTGGGTGAAGGAGGAAG AAAAM 
5 TTAACAGCAGAAATGGGAGCACGTTATGGCCAAGA 

GGAAGAGGAGAAGAGGACCXTTTCCAGGCATACCAGGGGAGGAATAGGGAGGGGATGGGA 
CCTTOJrcAAGTCrCGATTAATCAACAGTGCATCAATTC 

CCCAACGTGCAG AAAACAGAAAGTTAGTGAAGAGCATTT CTTTCATTATTCTCCATTG AACTTC CCTTTGGCC^ 
TTAAGAGCACACCAAGGACACA.CGTG ACAAAGAAATGTT AACAGGAA^ CTGTTAGCTCTTGGG CTT AGGCAGC CTTGTTTTTC 

CAGAATTAGGCGAAGAGCAGCAGCCCTGAG^ 

AGAGGAAGT ACTG CATCTCCAAAAGGG CCCCAGGCTCCTCAGTTCCAGCAGAGC C C^GAGAG CC CACACTGTTCCACCCTTGGGGC 
TGGGGAGCCTGCCCTTTTATAACCrGCATGCATATCATGGCCACAAGGGCT 
GGACTTTTTAATCCCTCCATCCCTCTTATTACATCGGTTTTTCTG^ 
15 CTTGAATTG CTAGGGG ATCCTG AC CCAGGGAT AAGATGTTG C CCAGACTGATAAT CAG CCAACACAGTGGT ATTCCCAAGT AG AAA 

ACAGTGAGGAATTATGTGTATGTGTGTAAAAAAGAAACAACAGA^ 
ACACACACACACACACXIAGGGTCGGGGGAGGGA^ 

GGAGACACAGAGACACGCAGAGAGAGGTCTGTCTGCATGAATGTTTGCATTGCTA^ 
ACTCTTTCTTTACCAACCCTCAAATTCCCCAGCCATAGCT 
2 0 TGAGGCCC^GCTTCCTAGGGCAAAAAGAAGGGGATTGCTCCCATCTGT 

AGACCCCTTCCCAAACCAGAAACCCAGCCAGTGTATTTGAATCAATTCTCATTGCrrc 



TCATAGAGCCTGGGAAAATGCAGAGGCTGTGAAAAGCACITTGAGGAGCCAC 
TGACCATGGAGAGGGTCTGAAGAGAGGATACCCACGTTCCAGAAG 

2 5 AGGAGGTCCAACGTGAGGGAAGAAAGCTGGTCCCGAT^ 

TGGAGGAGGGAGGCGCTTACCCACTGAAGTCACTGCCTTGGGCTTC 
TACATGCTGATGGGCGTGTTGTACntSCGCATGGATGATGGTGGCCTTGCCGCC 
CAGGGCCGAGTCCTTCAGGGCACTGGGGTTGAAGCGTTCCTGG^^ 
GGGAGGAGAAAAGGGCAACAGGTGGCGGTTACATGGGTGGCCAA 

3 0 AGCCGAGGCCCTGAACCCCCATAGGCTACZACAAGTCTCCAGATGGTGGGTGGTCTGGGTGT^ 

CCCCAAAGCCCAGTGCAGCAGAAGGTTTTCTTTTTTCT^ 
G<3AGTGCAGTGGTGTGATCTCCACTCACTCCAGTCTCAAC^ 

AGGACC^CAGGCATATGCCACTATTTTTTTTTTTTCTGTAAAGACAGAGTCTCACGATGTTGCCCAGGTTG 
CCTCAAGCTG^TCCTTCTGCCTTGGCCTCTCACAGTGTTGGGA 

3 5 TGCCC^TCTTGGCTTTCCCAGATCCCCTGCAGTGCCTGGCTCAGCCTC 

AGAGATCTCCCACTACACACAGCAGGGGAAGCGCGGGCATGGGGCrrcGGGGTAGGGG 
AATGCCCCAAAGGCGAAAAGCGGCTCCAXrrGGCACCCGCAGGGATTCT 
CACCCCCTTGGTGGCCCAGGGCTCTGATCTTCCAGACGCTTCAGGGAATGTC 
CCCCCTCAAGTCCAGTTTGCTGAGTAG 

4 0 CAGACAAGATCCAGTGCAGCCCTTCIXSAGGGGTCGCCAGGACA 

GTGTCCCCCTCCCCCTGCACCCGCCACCGTCACCAACCCCAGGCACATCTTAAGCAGCTGCTGC^ 

T AGAACCATCGG ATGG AG CCC C CACTC C CACCCCCAAATGCTTCT ACTCCAG CT TCACACATG CAGG CT CACC CAG GGGT ACAAG T 

GTACCCCACTCATACATATGGACATACATGTACA.^ 

GGCTGGC!ACGCLAA.GCGCACACACATGCrrGTCCTTTGATAC^ 

4 5 AGAAGACAGGGTGGTCA.GAGAGAGGCTCCACCAGGGGCAGTTTGAGCGCCGGGGCCA 

C CAGCCCAAAC CGCAATG GGTGAG GAG AGCAAGGAGTGAGG AGGGAGAACT CAGAG C CGGGAATCACAG CTACTGGGACCCTCGGT 

TTGGTTCAGAGACTGACACAGC^UVTCTTTCCCTCT 

ATCTTC^GGGGCACTCGTACCGCATACATGAATCCCCC^ 

CAGGACCAGCTGTGGCTGCTATGGAAAAGGCTGAGGCCCAGGGGCCTCCCTC 

ACCTGACCTCCTCTCCCTCCCAGAAGAGACGCAGGCAGCTCT^ 
AGGGGCCACTCCTAAAGGCCCCTTGGGCCAGCAATGAGGO^ATGGAG 
CCGAGGCAGCACAGTGAGAGGCCCCITCCTCTCACTCTGACCTGAGAGTGAGCC 
GTCTOTCCTGTGTTTCTTTAAATGCAAATTCCCT 

5 5 AGTGTGGAAGGTGGGGGGCTCTGTCCCCAAACTCTCCCACCrrCTO 

CCCGGCCAGCCTGG CCCTGGTCATTTGCAGCCAAACCCTGGGAAG CCAGGCTCAAGTCAGTGGTCCCCGGT CAGGGAGCAGCAC 

AGTGGCCCTGCTCrrGCCCCAGGCCCGCCACTGCCTCAGAACTGATAAAATTCCCTCT 

GGAGCTATATTAGGAAAGGGCCAGCACGGAGCTCCATTCCTTCTCCATTTTTGGCCAGTGCT^ 

CCTGACCAGCCCAGGGCCCCCTGCAGAGGGCAGTGTTGGCCTGGCGGGGTGGGGGCTGC 
60 TTCTTTGACATTTATTCCTGCCTCCTGCT 

GAGCCAGTGCCTGGGTCTTGGGGAAGATGAGCCAAAGATGTAAGACCTGCCCAGGGGCCCTTC 

GCCCTTCAGTGCCAGCCTCAGAAGGGAGGAGAAGAAGGGGGCT 

CTCTCTCCTCCTTCCAAGAGGATACGAGGTGCCAAAGAATTGCACTGCAAGT^ 

TTTGGAGCCAGACAGGACAGAGTTCAAGT^ 
65 CCTGTTTCCTAATTCTAATTCTTGAGATGGTAGTGCCTACCCCG^ 

CTCAGCATGGGAGAAGTGTGCCATGGATAAGCACGGGGATTAGCCCTTGCT 

CCTCAGCCCAGCAGAGGTTGCCCTAGGCCAGCCCAGGTCAGAGGCTCCAGCCCT 

GCTTTTACTTTGGATGAAGCACCCACTGCACATCCTCCCCTAGCCC^ 

CACCAAGAAAT AATTTACACATTCATCAGGGTCTGG GCAGCC CCTG CACCGACAGCAGAGTAC CTT AGG CAC T ATG GG AAG CAGAG 
7 0 GTGGGCCTGGCCC CTCACTTCCCTGAGCAGGCAG CAAGTCCCTGTCTCAATGCAAAGTGGAGT CAGGCAGACCTGCATGCCAGTTC 

TGTCTGCCACTTACTAGCTGTGTGACCTTGGGCAAATTACTT^ 

CATGCCCACCTCTGGAGTTGCAGAGAGAGTTGTTTTAGAGAGTGTTGCTGAGAGATAAGGCATGTG 
CCAGAGTGCATGGTCAGTATGGGGAGACATTATCATCATTGCAJGGTGGGTCTCCTGGGAGTCCCAG 
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TGTGGTGCCAGAAATCAGCTTCACTGAGGAXrCTAGAAAG^ 
ATGGGGCAC3U3GTGCCTTCTGCTGCAAGGGOUUVGGCAAGGGCATC 
TGGGTGGCTCCAGGCAGTGATGATACCAGGTGTCATGCAGTQAGTGGGTG^ 
TTTATCCTCCCAATAACCCTGAGACATG^ 
5 CTTGACTCAGCCTGCCCTGCCTGGAGAGGGCTATGCCAAT^ 

GGCTCAGAGCTAACACAGGCCCAGTGTGGGGACTCCGTTCCITTGC 

CTGTGCTCCCAGAGTCCCGCCAGCTAGACCTrAGGGGTGAGGCTGCTGTTCT 

CCTCGCCACCCCACCCTGGCCAAGGTCACTCXriX3CAAACCCGCTCCCTGCACGC 

10 GCCTCCGAGCTCCCAGAGGGGTGGGGACCTGTGCATA 

TGAAAAAGTGTTGGAAGTCAGAATCTCCTGAA.TTCAGGACTGGCTCGG 
TCTCTGAGCC 

AGGTCTCTGCCGAGCACCGAGGACCCAGGTGGCCCAGGCCACCTGTGGAGAGCTGTA^ 
1 5 ACACCCGAGGCCTTCCCTCGGAGGCTCATCTGGTACATCTGAGCCATCTCCCT 
GTTATATTGCCTCGGCTGGCTCGAGCCCGGCAGG^ 
GGCTCGCCCGCGGAGGGCCAGGGTa^GAGGCCTCX^C 

CTGAGCTCCGGGGTGCCTGGGGTGCCTGGGCTGGCCCTCGCCTCAGGGCTGGGGCTG 
AGCGGGGTCCTGGCACAGGGCAGGCCATGCAGGGGAGAGAGAAGAG^ 
2 0 ACCTGAAAGAGATGTGTTAGCGCGAGCCCCGGA^ 

AGGAGCAGGCAGGCACATGGCrrCCTGGCCTGGCGTGAGGCTGGCZ^ 

CAAGCTT CCTGCCCCCAGGG CCCGAGGAAGTGGGTGTAGGGGTCAGGGGATGGCCCGTTCTGTGTCTGTTCCCCCTGCCCCTGAGA 

AGGTGGTGGGGAGGTGAGGAGGGAAGCTGAGGAATTGGGTAGCTCACATT 

2 5 GCAGCCCTCCACCCACTTCCCC^GTCTGTGACCTACCCCA 

TCTGCCCCATTCGACAGGTGGGGTCACTATCAGCTCACATAG 
CGGGGCCTTGGGAAGGGCCTAGGCAGTGGTGAGGTCTGGAGCCAGGGCTTC 
CGTCCTAAGGGCTACAGGGGAATGCACTTGGGAGGGGCAGGTGGGGAAGTGGAAACCAGAGGGA 
GCCCCTCCACACCTGGAGGTCCTCGTCAAGCTGGGCCT 

3 0 CCTGGACGCTCGTCCCCTGCCTGGCCAGGAGGGTGACCC 

CACGAGGCATAGCGGAGGTGGCTTGCACCCAGGCAGGCATTCATTCCCACATCCAGTGT 
AGGGGCTGCCTCACTTCCCCAGGGTCAGCTGCAGCAGGGGCAGGGATC 

AGCCAAAGACCAGGCAGGCTGGCCAGGGCCAGCACTCCAGGAACCAGGGCTGAGTGGACCCCGCCGCCIACAG 
CTGGTGAGGGATCACCGGCAGAGGGGTCTGGACTGGAGGTGCTGTCGTGGAGATC 

AGTCAGCTCTCAGAGATAGCTGAGTCAGAGGGCCCCAGATCCACTGTCCACCCCATGGGCTGGATCT 
CCGACGAGGATGGAGAAATCATGGCCCCrTGGGCATTTTGGTCAGAATCCATGGGCCCCT 
AAGTGGAAGTAGGTAAGACCTGCCTTCCAGGACCAATGGCCCACGTACGGGCAGCACCAGGACCAC 
AGTGAGGCAGTAATGTCTGCCTCTAGGTGCTTGACAGCGACTTGGGCCTGTTCATTCCTCT 
40 GCACAGTTTTGTAGCCCAGGGCGGGGCCCACTCAATTGCTA 
CCTCCGCCCCCTGCTCTGGAGAGCTCCCACCTACTTC^ 
CITCCAGGTGGGTCATGGTGTCTGTGTTGACGCCGTCAATGGCC^^ 

CTG CCTGGTGTGATCTGATTGGATAGTGGAGGTGGGGGGTGAGCATAAGGAAGGTGGTCCTGAGGAAAGG CCAGTCCTTGGGGAAG 
AGTCACCCGGGAGTATTCAACAGAAGCCX3TCATTGTGTTTTTCATT 

4 5 CTTTCCGGCACATGACTACCACATTGCTTAGAGCTTTGTGT^ 

GAAGCAATGTATG CTCCTTCCAGGTCTGCCTCTAGCAAC TGACTGAATGC 
AG GG CATGG AGAG GAGG AG AG ATGATTAAGGAGCACTGTATTGGACGTTG CATGAATG AG AAACTTTTGTTGTTCTGAG C CATTAA 
GG TTTGGAGGCTGTTGGACATGGTGGCTCATGCCTATAATCCCAGC^ 
AGTTTGAGGCCAGCCCAGGCAAAGGAGCCCAGACCCTGTTTCTACAA 

5 0 TAGTTCTAGCTACTCAGGAGGCAGGTGAGAGGGTCACrCCAGCCTAGGAGTTTGAGGTTACA^ 

AACTCCAGCGTGGGTGACAGAGGGAGACTCTGTCTC1TAAAAATTTGCAGGCATCTGCTG 
CAGCACTTTGGGAGGCCAAGGTGGGTGGATCACAAGGTCAGGAGTTCGAGATCAGCCTGGCCAATATGGTG^ 
TAAAAATACAAAAATTAGCCATGCCTGGTGGTGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTT 
CCAGGAGGCAGAGGTTGCCGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGGGAGACTC 
5 5 AAAAAAAAGGGTGGCCAGGTGTGGTGGCTCATGCCTGTAATC 
CCTGGCCAACATGGTGAAACCCCGTCTGTACCAAAATTACAA^ 
ACTCGGGAGGCTGAAGCAGGAGAATCGCITGAACTCITGATTGAACCCAGGA^ 
GCACTCC^GCCTGGGTGAGAGCGAGACTCTGTCTAAAAACAAACAAACAAACAAAAAACTG 

CT CTT AT AAC C CC TG CACCCTG CAAAG CT ATGGAGCACTTTTC AG AC TC CAGG AGAAAAG GGAG ATG CAG CAGCTAAG ACACAGCA 
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GCAGATGTGGAGAGAGTGAGGTAGCCGACAACCCTGCCTGGATGAGCAAACTC^GCTGGGAAATGGGGCAGGGCTTAGGGTTCT 



CAGTCTGGAGGCCCCATCTTCAGTGTAACCACACAGAATGCCACTGCAACTCCATCCCGGG 
CCACACCTCCTGGGACATTACTTGGGATGTCATCGTGGCT 

6 5 CCCCGGCCTCCAGTAATAGCACACTGAATGCTTTGGTGGCCCTGCCCACAGTAATATCACA 

AATAATGTCACCTAAAAAGGCTTGATGGCTGGCCCCTAGTGACATTAGCAGCCCCACCCCTAAAGATACCTCATGGAATG^ 
ACTCTGGCTGTACCCTCTGTGACATCACATGGAATGGGCTC 

CCATACCATGTTGGAGGTCCCCAACCCCCATGACATCATGAGGAGTGCCATGCAGGCCT 
GAGGCTCCGGACCCCACCCCCTACAAGCTGTGGCTGGAAGTGTT^ 

7 0 GGGCTTCTCTCGGCACATCCCAGTCACCri^CCTGGCrrATAACCCAAGGACCAAGC 

GGCCAAGGCCCGAGAACCATCCTAGAAGGAGCCCAGAGAGGCTCTGGGGACAGTCCCTTGCCCCAGGTGCCAAGCCTGGGTGGGCC 
TCTGGCCAGGCTCTGGGCTTACAGACATGGGTGGAAGCTCCTGCCAGTCCCCTCACAGGGAAGCCCCT^ 
GCAGTCACCAGTCACCAGTGTCCAGCCCATTCCCAGATGGGCTCTGCAGATGCAGCCCAGGAACTG 
ATTACGGGTCACGGAAACACTGCCCTACACCACGTGGGTGATGACGGCAAGACCAGGACAGCCCTCT 
7 5 CATTTCAGCCCTAGGGGCTGTGGTGCCCTCTTTTTTTTT^ 
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CCCAGGCTCGAGTGCAJVTGGC^^ 

ACAGTAGCTGAGACCACAAGCGCATGCCACCATGCCCGGCTAATTTTTG 
GGCTGGTCTTCAACTCCTGACCTCAAGTGATTGGCCTGCCTCA 
AGCCTGTGGTGCCCTCTTAAAATACTGGCCACCTTGTGCCCACAGCG 
5 ACAATATAGAAACAGACGTCTGGACTGCCAGGAAGCCACAGCATCTGCTTTGC 
CATCITAGAGCTACTTGAGATTTC^ 
GTATIGCTGCCTTGAGTGTAGAGGTGAACAAA^ 
GGACTCCTGGGGGCAAGTTGAGCTGCCCAGCCAGGCCTGTOT^ 
TTTTATTATAGACATTTTTGCAGATTTCrAGGCAGGA 
1 0 TGTACCATAACATTTCACTCGOTTGCTTTATCACACTCCATCCCT 
GAAAGTTGCAGGTACCAGCATACTTCTTTCCAG^ 
CACAG(^^ACACCCCCACAGCAAGGCTCTCrCACGGC^GGGCACCC^ 

GGGCTGGGTCATTGCAGACCCCACGCAGAAAGGCTGTGGCCTCGGTCACAGAGGAGATGCTGACAG 
AGAAGATCATGATGACAGCATGAGCAAGGGCAACAGAATGATATCCAAGAGAGGCTGGG 
1 5 CTATCTGACTCTCCTCCCTCTGGAGCTGCCTC 
GGTCTCCTCACTTCCCCTCACCTAGACCT^ 

AGCTCAGCCTCCTGGGGTCTGGTCAGTAGCCAAGCCTCTGACCAG^ 

CCTGCAGCTGTGTCTGCTTGTCAGGGCTCAGCTGGCCTGGCCTTTGGCTGCCCTGAGT 

TGGGGGTGCTGATATCCTCCTCTAGGGAGGTGCITCAGGC^ 

AGTGCAGTCCAGGCACGATGCTTCCACXSTCrGTCriTGTCTGACTCCATCCT 
GCACCCAACTCCTTGGAGCTGTGGTTTTTTCTTTTTAAT^ 

TTCATTCCCACTTCACATTTCAGTCTTATCCCGCCAATGGGCCCCTCTAAACCCTG 
GCCTCAGGGTGGGGCACCGAGCTCCTGTCTCCAAAGCTCCCAAGA 

2 5 CCAGGAGCCCTTCCGCCTTGCTGGGCCAAGGTCTTAGGGCACTC 

AGTGCTGGGXaGCTGGGGGCGGGGAGGCAGCAAAGGAGGAGCACAGGTGCCAGT^ 
CAAGCCCTCCAGTGTCAAGAGAATGACAATGCCTTTCrrCGGGATTGTTC 

GGGTGCAGAGCTGAGAGAGTTTCAGGAAAACTCCAAATCCAGGAGGCACAGTGCTGACCCCATCTGCT 
GCTGGCTGGCTACATCCCCGGGCTCCACAGGAAACAAGATCCACAGCTGGCAGAGGGTGGAGC 

3 0 CAGTTCTGAGCCAGCCTGCTGTCCCTAGAGGGGCCTGGGTGTTGGGGG 

AGG CACCAGTGAGGCTTCAGACGCACCCATGCCATCGGCTCCTATTTCCCC^ 
CAGACGTTTCTGGCCATCATTCACTCCCGGTGTTCA 
GTGGTGGGTGGAGGCTGGGGCAGGTGGCAGATAAGTAGGAAGG^ 
OTGCCTGGGTTGGGGCAGGTGGGACAAAGTGCATTTTCATTGTATGATT 

3 5 TGGCCTCCCCTGGGGCTTTGCTCATGCTCACCCTCTCACTGGAACCTrCTGTCT 

CCGATAAATAAGCCGGAGTCCCCTCAATGCTCAAAAAGCCTCTCCCTGCCTGCCTGGGTGGCT 
CTTTCTGCACCTCCAGACTCCTCCAAGAAGTGACCCACA 

TCTGCCTTTGGCCCCATTATTCCACGCATATCAATGACCTTCTGTGTCCAAAGCCACGGGTCACTGCCCC 
ACCTCCCCTGGCTTTCCTCAAGTGTCTCTTGAGTCTCZTCCTGCCTCTCTTCCTCCTGTCCA 

4 0 GGTGTGGCCAATAAAATGCCCTCATGGGCACTGGCTCT(^^ 

GCAGAGGTGGAGCCrTGGTCCCAAAGGGGTGCCAAAGCTGTGTTTGGGGGTGGGAGGGCTCAGTG 

GG GGCAG C CTCTGGATG AGAGGGGAC CATGCAG AAAG GGAAGG AG AC TCAGGGAATTGGGG AG CTAG CC C CCAG CAGAGCCTGGCC 

CAGCCTTTCTCAAACAGCCCTGCAGTCCTGACACTGGOT 

TGCCCTTGCACGGTGTCCTGGCC(^GGGGCCTCCAAGGGTCCAATTCA 

4 5 GGATCCCTCACAGAGTTAGATGCAGTTCTGAGCAGCCCGGAGGACCTGGCCTGGGGGACCTCT 

GCCTCTGAGCCTCTGGCCXJTTTATCTTCCCCTGT^ 

TTGCCTG ACCTTCAGTG CGGCTAT AAACAGCACGG CCTGTC^GCACTGCTAAG AGCTGCCCAAGAAACAGCACTGCGTGCTCC^ 
GGCCCCGAAATTCCAGGAATTGCCGGCACAGCTACCCC^ 

CTCAGCATGTTCCCACCACAGCTGGGAGAGTCTGATAAATACTGGAACCXSAAACCC^ 
50 AGGTGAGCCTGCAGGCGGGGTGGCTGAGACCCTAGGAAGAGAGGGTTTCCAGTAGACAGGCAGGCCCCTCOT 
CCCCCGACTCCACCAGTCTCACAACCAGTCCCAGTAGACAACAGGTC 
GGTGTGGAAGGCATCATTTTATTTGTTTATTTATTTATTTATTTTTAAG^ 
GTGCGATCTC^GCTCACTGCAACCTCTGCCTCCTGGGTTCAAGCTGTTCTCCTGCCT 
ATGTACCACCACACCTGGCTAATTTTTGTATTTTTAGTGGAGACAGGGTTTCACCATGTTG^ 

5 5 CTCAAGTGGTCTGCCCCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGGCACrc 

GGTGAGAAAAACTG AGGCCG AGCAGG TAAACAG ACTTCTG AGGTG^ CAG AGCCTAACAAAG CAT ACTCCTC 
TTCCCCAAGGCGGAGGCATGAGAGGCATGAAGAGGTGAGACCCCAGACC1A 

TGCAGGCCCAGCCACACCTGTCCAGGATAAATAAAGGCATTTCCCCG CCACCCTCCTTTCCCAGTCTCCC^CTTGAACTC CCTTCT 
60 CTGTGGGTCTGCAAGCAGAGACCCTATGAAGGGATGCCTrTGA 

GTGCCTCTCTATTAGTCCCAGTTAGAGGGGTGAGCAAGCC^ 

CATCCCTGTCCTGTGAG CAGACTCGAGCCTCAGTTIX3CCAGGCTCTGAAGCGAGGTAACGATCCCAGCCTCGCCTGC CACC 
CCACAGGGTCACACACTCTTGCCTCTGGTGCCCTATGGCACAGGCTCA 

6 5 CTCTGGCCATGCCTCCCCCGTAGGGGGATGTCCCTGGTGCAGA^ 

GATGGCAGGGCTGTGTCCACAAGGCCTGCCCT^ 

AATTTCTCAAGAGGCTCCGAGCCCTGGGGAAGGCT 

CTAAAGGAAGCCGAGAGGCCATATCATGCCCATCCCCCGGTGCC^ 

ATTGGATCCAGCACATCCCTGCATACCCCCrCCCCAGAGTAAGGACATTCCTCCTGCTATCTAGCTTAAGGCTT 
70 TTAAGCCCCTGGCTGGCCGGGGCC^GGTGGCACATAGACCCTGGGTGAGGTCATTGGCT^ 

CAATTTGCACCCATAGGG ATAGGG CAGGGAG CACCAG CAGGAGG CC T CGAG ATGGTG GAGCTGGGGG AG ATGTG GG CTCAGT C CCA 
GGCAGGGGAGGCCACAGGGCCCATGAGAGAGCACX3CAGAGGCT 
GCTGTGC<3AAAGGGCAGTGGGAAATCCACTGGGGCTCACATGCT 
CCCAGGACCCTGTGGTCTCCAGGGCAGTAGATCTGGAGGGGTGGAAT 

7 5 GAGGTCAGGGCCTGTTCCCTGCACCAGAGCCAGGCCAGCACCCCTCCTGTAAG 
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CGGC^TCCCTGGGTGGGGCTTCTACTTCTGGGGTCCCCTCAAGTACCCGTG^ 
TTGGCTCCTGACCCGCTGCCCAGGTTCCCr^ 
CCTTCCTCC^GCCCCCAGGCCCAAGGO^TCCTT^ 
AGATACCCCTGCCTACCTACCACCCTCCAGTGACAGACrrGGGGGCCT 
5 CTCAGCCACCAGGCAGGGTTGTCCTTCCCCTGTATGCCTCACAGTO 
GCCTTGCTGGGGAGTCTCTTTGTTTGACAGAAACTGACGGAGTGCCT 
CTGGAGTGCTTCCTCAGGAGCATGGCTAACTTTCTC 

C CCTGATGTTCTGTGTCATCTGCATC!AGGC CATGCTGCTCCCCAT CACAGACACTGCAGGATAGGGGGCCATCTCACCAGGCC C CA 
GCACTTGTCGGGAACAAGACCTGAGACAGACAAAGCAGCCATGGTGAATGGG 
1 0 TGTCTCAAGTTTGGGATGGAGCGGGTGCCCAGTACTCAGTCCM 

GGAAATGCTCAAGGCTTGGGGGCACCCTGAGCCCACTCTAACAGTAAAGGGA 

GCCTGCCTG CCTGCCTGCCTGCCTGCCTGAAAACTCAGCATT1TCTTCTCT CTACCACCTGCTCATGATGTCTCA 
CATGCATAAOVGGCCGGCXSCGCTCCAGTGAGCAAACAGCTGTGGCTGAGACCAGAGG^ 
1 5 CTCTGCTTTCTGAGAGGCTCAGAAAGGACAGACAGAC^ 

AGGCTGTGGCAGGGTGCACTCACCCGGGAGATAGTGAGGGGCATGTTGAAGTCCTTGCCCCCCT 
GGGCCCAGTCAGGGTCACACTGTAA 

HUMAN SEQUENCE - mRNA 
2 0 TAATAGTTTTTTGTTTTGTTTTGTTTTTCTGCCT 

TGTATTCTATG G ACCCGC CCTGAATTCTT TC1TGCACAAGATCCAAG AAC CCTCTCTTGGG AAGG AGGTGGGGGC CGGGAGCGCAA 
ATGGCGTTGLAGATGGTTCAGGGCCCTGrrCAAACTCC^GCACTGACCATTCACCG 

GCTCTCCCGOSCGAGGGTTCCGCGCCCGCCra^ 

2 5 GGCTCCGAGGAGAATATGAAACTGGTGTCAAAATGACATCCAGATTTGGGAAAACAT^ 

AAATTCGATG AAGT CT TTTCCAACAAACG GACTACCCTT AGCACAAAATGG GG AGAGACCACATTTATGG CT AAATT AGGG CAG AA 
GAGGCCCAATTTCAAACCAGATATCCAAGAAATTCCGAAGAAACCTAAAGTGGAAGAAGAAAGT^ 

AT AGTGATGATGAG T CTCTAC CAGTTT CTTCAAAGAAT TTAG (2CCAGGTTAAGTGTTCCT CTTATT CAG AAT CT AGTGAAG CTG CT 
CZAGTTGGAAGAGGTCACTTCAGTACTTGAAGCrAATAGC!AAAATTAG 

3 0 CTTCCCTTTGGAGGACACTTTACTTGGGAAAGAAAAGAGC^ 

AATTAAT AAC TTC!AGATAAAGTGG AG AATTTTC!ATGAAGAACATGAAAAGAATAGT CAC CATATTCACAAAAAT GCTGATGACAGT 
ACTAAGAAACCCAATGC^AAACTACAGTGGCTTCnSAAATCA^ 

AG AATCACCATCAG AAAT AT CT C C^TCIAAGGGATCTGTTAGAACTGGTTTGTTTGAATGGGATAATGATT TTGAAG ATAT CAG AT 
CAGAAGACTGTATTTTAAGTTTGGATAGTGATCCCCTTTTGGAGATGAAGGATGACGATT 

3 5 GAAGCCATTGAGGAAGATATTGTACAAAGTGTTCTTAGGCCAACCAACT 

CCAAGGAGGATCAAATTTTGATAAGCTGATG 

TGAATCAGGCAAAGAAAGGGGGTGTAAGTTGTGGGACCAGTTTTAGAGGGACAGTTGGA 
CCATCTTGCTTGTCAGTTTGTAATGTTACCATACAGGATACTATGGAACGCAG 

TTTGGGAG AAGCTGCT CGT CTCAGAAAAAAGG CAGATATTG CAACTT CT AAGACTACTACTAGATTT CGACCTAG TAAT ACTAAAT 

4 0 CCAAAAAGGATGTTAAACTTGAATTTTTTGGTTTTGAAGATCATGAGACAGGAGGT^ 

AAAATTAAGTATTTTGGCTTTGATGATCTCAGTGAAAGCGAAGATGATGAAGATGATGACTGTCA^ 
AAAAAGAACTAAAACAGCTCCATC^CCCTCCTTGCAGCCTCCCCCAGAAAGCAAT^ 
ACAATCCAGAAAACTTGGATTTTACAGAGGACTTGCCTGGTGTGCCTGAAAG 
TCAAAGGAAAATACCAGAAAGATTTTTAGTGGCCCCAAACGGTCA 

4 5 AGATTCAG AAGAACTG CCTGGG CCAC CAGTAGT AAAAC CT CAGAGTGT CACAGTG AG GCTGT CTT CAAAGG AAC CAAATCAAAAAG 

ATGATGGAGTTCTTAAGGCTCCTGCACCACCATCCAAAGTGATAAAAACTGTGACAATACCT 
ACTGCACTGAAATGCAGACGAGAAGACAAAGAATTATATACTGTTG 
TGAAAATCAAGAGTTCACTCATGACATTGAGTACTTGTT 
TTATTAGCTTGGCTACTAAATGTGCCATGCCCACTCTTCGAATC(ZACCT 
50 TTGGATGAITCCCAGCACCATCAGAATCTGTCCCTCTGTACAGCTGCCCTCATGTATA 
TCTTGATAGAGCTAGCTTAGATCTAATGATTCGACTTTTGGAACT 

ACATGAACAAAATTAAAG AAAAAATC CGAAGG CTC TGTGAAACTGTAC^ CAT CTTGATCTAG AAAATATAACG ACTGGG 

CAT TTAG CT ATGG AGACATT ATT ATC C CCT ACTTCT AAACGAGCAGGAG ACTG GTTTAAAGAAG AACT C CGG CTTTTGGGTGGTCT 
GGATC^T ATTGT AG ATAAAGTAAAAGAATGTGTGG ATCIATT TAAGTAGAGATG AGG ATG AAGAGAAACTGGT AGC CT CACT ATGGG 

5 5 GAG CAGAGAGATG TTT ACGAGTTTT AG AAAGTGTAACTGTG CATAAT C CCG AAAAT CAAAG CTACTTG AT AG CATAT AAAG ATTCC 

CAACTT AT TGTTT CAT CAGCT AAAGCATT ACAG CATTGTGAAGAACTGATTCAG CAGTACAAC CGTGCTGAGGACAG CATATG CTT 
AGCTGACAGTAAG CCTCTG CCT CACCAG AATGTAACTAAC CATGT AGG CAAAG CAGTGGAGG ACTG CATG AGGGC CATCAT CGGGG 
TGT TGCTTAATTT AACTAATGATAATG AGTGGGG CAG CAC CAAAACAGGAG AG CAGGACG GT CT CAT AGG CACAG CG C TGAACTGT 
GTGCTTC^GGTTCCAAAGTACCTACCTCIAGGAGCAGAGATTTGATATTCGA 
60 GTATAGTGCTCGGAATCGGCACTGTCTTGTCy^CATGGAAACATCGTGCTCTTTTGA 
ATAGTTTAAGGATAGGTGGACAAGTTCATGCTGTCCAGGCTTTAGTGCAGCTATTCCT 

G AAACTAAAACAG ATG AGTTGATCAAAGATGCTCCCAC CACTCAG CATG AT AAGAGTGG AGAGTGG CAAG AAACAAG TGGAGAAAT 

ACAGTGGGTGTCAACTGAAAAGACTGATGGTACAGAAGAGAAACATAAGAAGGAGGAGGAGGATGAAGAACT 

CCCTTCAGCATGCCGGCAAACACATGGAGGATTGCATC 

6 5 CCAATCAATGTAACCACTGTGCGGGAATATCTGCCAGAAGG 

TATGAATCTCACTGTGCTGTTGGAACAACTGGCCAGAAATCT 

CCTTTGCTTCAGGTGCTCGGTAATGCTGGAGCTATCCTTAGAC^ 

AGAACATTCATCAGTATCATTCGTGTTTGGATTTTTAAGGCCAC^ 

T ACTACATCAATC TGCAACTAT CAAAAATGAGGGGAAAAG GTTCAGG CTGTTAACAATT C CATGCAGTATTTAAATACATTTACTT 

7 0 TGGCAGAGTTTATACCCTCCCCTTGTTTTCTTGCTTTATTCT^ 

GCTGTGTATGCTGAGCCACTGTTGTCATGCCAGCCAGGTGCAAAGGCAGCCTAGCTACTGAGGTAGC 
TAGACAACAGCTTAGTTCCTTTTTCAGGCTC^TTTGCTTTTGCTTCT 
TTTTGTGAATTTTTTGGTTGTTGTTCCCTGAACTACTGTCTA 
ATATTTTTTTATTCTTGATTAC^TCTGGGTTATTGAACTATTT^ 
7 5 GTGTCATAGCTATAGGCACAAGAAAAACAACAGTTTG?yGAGAATATTA 
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TAGTCAAACCCAGGGTAGTTTACACT 

TTT CTAATGTTCAGTACAATAAATAT AAGGAAGCTAAAACAC CAATGTGGAATTCCTGTTT CCAGATAA(^TGTA3ATTCTTCTAT 
AGAGTGACAGGATCAATTG CAT AAGCGCAAAG CCTT AAATTGCTGGTTTAG AGAAGACC CriTl'lTCATTCAGATTC'rr'rG'rTCST 
AGAGCAGTTATTTGAAAAACAGTTATGGAACACAAAACATTTTA^ 
5 GTTCACACCACTGGTTATALT TI T U ' I T^ 

CTTTTAAATACCTTTCAGGTAGTTACATCATGTTTCr^ 
ATTGGGGAAAATAGCTAAAAGTCTAATTTTTACCCATTTAGACT^ 
TCAGTGCCAGCTGTAATGTTTTTAAATGCAGTGGCTGCCTTCTATTGTCTO 
TAAGGAAGTAACTGATTCCAGGOVAATTGTTTTCrrCCTTCT 
10 CCAGTGTAAanXSGGAACCATTGAGATTGTATTTGCCCrGAGTA^ 

AATGAT ACC CATAAAATTAAATTAGTC ATATTTTATTTTAA CATTGGATTr AGGG AAA 

GGGACAGATTTTTGGTGAACCTGACTTGTGGCAGATGCT 

ATTTTTCTGTTACACTGGTAATCATTTGAAAATTC 



1 5 TTGTGTGTTTTTAACCCTCAGAGAACTCTCCATTTT 

ATTGTAAAT ATTTCTGTAAAT ACTAC CAATTGGAAATTAGAACAGT AG AGTACTTTT CTTGAATCCAATC CT 
AGTATTTCTCAGCTGTGATCTTrGGAGCAAAAGC 

TTTTGTAAATAATTTTAACTGAAATAAAATTG CTACTTTCAATACACAT 

20 HUMAN SEQUENCE - CODING 

ATGGTTCAGGGCCCTGTTCAAACTCCAGCACTGACCAT^ 

CTGCCCCCCCAACCGGGCTTCCTATTTACCGAAAGCAGAGTCCCTCGCCTCTCTCGGCT 
CGAGGGTTCCGCGCCCGCCCGCGGGCCGTAGGGAGCGGGAGAGGCGGAGGC^ 

GAAT ATG AAACTGGTGTCIAAAATGACAT CCAG ATTTGG GAAAAC AT ACAGT AGGAAAGG TGG AAATGG CAGTTCAAAATTCGATG A 

2 5 AGTCTTTTCCAACAAACGGACTACCCTTAGCACAAAATGGGGA 

TCAAACCAG ATAT CCAAGAAATTC CGAAGAAACCT AAAGTGG AAGAAGAAAGT ACT 
GAGTCTCTACCAGTTTCTTCAAAGAATTTAGCCGAGGTTAAGTGTTCCTCTTATTC^ 

GGTCACTTCAGT ACTTGAAGCTAAT AG CAAAATT AGT CATGTGGT CG TTG AAGACACTG TCGn u l C YGAT AAATG CTTCC CTTl'GG 
AGGACACTTTACTTGGGAAAG AAAAG AG CACAAACCGAATTGTAGAAGATGATG CAAGCATAAGTAGCTGTAATAAATTAATAACT 

3 0 TCAGATAAAG TGG AG AATTTT C ATG AAG AACATGAAAAG AAT AGTCAC CATATTCACAAAAATG C TGATGACAGT ACTAAGAAACC 

CAATGCAGAAACTACAGTGGCTT CTG AAATCAAGG AAACAAATG ATACTTGGAACTCC CAG TTTGGGAAAAGG CCAG AATCACCAT 

ATTTTAAGTTTGGATAGTGATCCCCITTTGGAGATGAAGGATC 

GGAAGATATTGTACAAAGTGTTCTTAGGCCAACCAACTGTAGGACGTACTGTAGGGCGAATAAAA 

3 5 CAAATTTTGATAAG CTGATGGACGG CACC^GTCAGGCCITAGCCAAAGCAAACAGTGAATCGAGTAA^ 

AAGAAAGGGGGTGTAAGTTGTGGGACCAGTTTTAGAGGGACAGTTGGACGGACTA^ 
GTCAGTTTGTAATGTTACCATACAGGATACTATGGAACGCAGGATGGATC 
CTGGTCGTCTCAGAAAAAAGGCAGATATTGCAACTTCTAAGACTACTAGT 
GTTAAACTTGAATTTTTTGGTTTTGAAGATCATGAGACAGGAGGTGATGAAGGAGGTTCTGGA 

4 0 TTTTGGCTTTGATGATCTCAGTGAAAGCGAAGATGATGAAGATGATGACTGTCAAGTA 

AAACAGCTCCATC^CCCTCCTTGCAGCCTCCCCCAGA 

AACTTGGATTTTACAGAGGACTTGCCTGGTGTGCCrGAAAGTGTGAAGAAGCCCATAAA 
TACCAGAAAGATTTTTAGTGGCCCCAAACGGTCACCGACAAAAGCTGTATATAATC 

AACTGCCTGGGCCACCAGTAGTAAAACCTCAGAGTGTCACAGTGAGGCTGTCTTCAAAGGAACCAAATCAAAAAGATGATGG 

4 5 TTTAAGGCTCCTGCACCACCZATCCAAAGTGATAAAAACTGTGACA^ 

ATGGAGACGAGAAGAGAAAGAATTATATACTCTTGTTCAG 

AGTTCACTCATGACATTGAGTACTTXSTTAAGTGGCTTAAAG^ 

GCTACTAAATGTGCCATGCCCAGTTTTCGAATGCACCTGAGAGCACATG^ 

CCAGCIACCATCAGAATCTGTCCCTCTGTAGAGCTGCCCTCATGTATATACTGAGTAGAGATCGTTT 

5 0 CTAG CTTAGATCTAATGATTCGACTTTTGGAACTGGAACAAGATGCTTCATCAGCCAAG CTACTGAATGAAAAAGACATGAACAAA 

ATTAAAGAAAAAATCCGAAGGCTCTGTGAAACTGTACACAACAAGCATCTTG^ 

GGAGACATT ATTATCCCT T AC TTC T AAACG AGCAGG AGACTGGT TT AAAG AAG AACT C CGG CTTT TGGGTGGT CTG G ATCAT ATTG 

T AG ATAAAGTAAAAGAATG TG TGG ATCATTTAAGT AG AG ATG AGGATG AAG AG AAACTGGT AG C C TCACT ATG GGGAGCAGAG AG A 

TGTTTACGAGTTTTAGAAAGTGTAACTGTGCATAATCCCGAAAATCAAAGCT 
5 5 TTCATC!AGCTAAAGCATTACAGCATTGTGAAGAACTGAT^ 

AGCCTCTGCCTCACCAGAATGTAACTAACCATGTAGGCAAAGCAGTGGAGGAC^ 

TTAACTAATGATAATGAGTGGGGCAGCACCAAAACAGGAGAGCAGGACGGTCTCATAGGC^ 

TCCAAAGTACCTACCTCAGGAGCAGAGATTTGATATTCGAGTGCTC 

GGAATCGGCACTGTCTTGTCAAGATGGAAACATCGTGCTCTTTTGATTCTTCCATCT 
60 AT AGGT GG ACAAGT T CATG CTGTCCAGG CT TT AGTGCAGCTATTC CTTGAG CG AG AG CG GG CAG C CCAG CTAG CAGAAAGTAAAAC 

AGATGAGTTGATCAAAGATGCTCCCACCACTCAGCATGATAAGAGTGGAGAGTGGCAAGAAACA^ 

CAACTGAAAAGACTGATGGTACAGAAGAGAAACATAAGAAGGAGGAGGAGGA^ 

GCCGGCAAAC^CATGGAGGATTGCATTGTGGCCTCCTACACG^ 

AACCACTGTGCGGGAATATCTGCCAGAAGX3AGACTTTTCAATAA 
65 CTGTGCTGTTGGAACAACTGGCCAGAAATCTATCTCTAGAGTGA 
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Table 103 

MOUSE NOMENCLATURE 
ICSGNM Rorc 
Celera mCG5011 

5 

HUMAN NOMENCLATURE 
HGNC ' RORC 

Celera hCG16918 

10 MOUSE SEQUENCE - GENOMIC 

TCTAACACTGAAGTGGGTGGAACATCCTT AG CAATAGGAAGTCTAAATACTTAG C CATACAAG GCCTC CTTCTGAAAAT CATTTTA 

AGATT ATTT CTTAAGACG T ATTTTATGTGAATGTTTTGCCTGTGTGTATGTATGTATGTATGTGCAC CACGTGCATG CCTGGTGC CT 

GCAGAGGTCAGAAGAGGGTGTTGGATCCCTTGGAACreGAGTTGTGGATGAGTGTG 

TTCTGCTGGACCAACAAGTGCTCTTAACCATTGAGCCATCT 
1 5 CCAGTTTCCAGACTTCACCTGTTCTTTTACCT 

AGACAGCTAAGTGTTTCCTCTCCGCAGCAGCATCGATTCGCAT^ 

CGGGAGGCTGAACTTGGACCITCTCCGCCTCGGTGT^ 

GGTACGGACCGCAGAGGCCrcGGCGCTGCAGGCGCACCCCAGCCT 

ACCCTAATGTCCTCACCCCCXSACCXJCATCCCACAGTTCT 

2 0 GACCGCAACCCGGGCGGCCGGAACCTCCCGGTGGCCTGCTCGCTC 

GAGCCCGCACACGCGTCGCCGCGAGTCCTTGTTCCACGGGCCGCGC^ 

ACGTCTCGGCCCCCGACCTCCGCCTCTGCCGGGCCCCAGAC^GCGAC^^ 

CACACGCCCAGGCCGCAGTCCCTGTCCCCCGACGAGGCCA 

CTTC CACCTG G ACTTCCTCTGCTGCCAACTG CX3GCCG ACCAAG GACAGCGTG CTGCG CCTGGGG CCC CGCGGCGGGCAGCTG CG CC 
25 TGTCCACCGAGTACCAGGCGGGG CCCGGGCGGCTGAGGCTGCG CCTGGTGAG CGCCGAGGGGCTGCCTCGGCCGCGGAC CCGCCCC 

CGGGAGCGGTGG CGGCGGCTGCTGCGTGATTCTG<!GGCTGCAGCCGCGCTGTTAGG CCTGGAGCTCAGCTGGAGCCGGGTGGTCCA 
GGGCAGCTGCAACCCTATCTTCAACGAAGACTTCTTCTTCGAGGGGCT 

AAGTGCTGGACAGGGGCGOSGGGCTGCGCAGGGACGTGCrGCTGGGGGAATGTGAGACGCCCCTCATCGCCCTGCTGCCCCC^ 
GCTGGAGGTCTAGGCCCTGGGTCCTCCCTGGCACCTACTCATCTCAGCCTGTAGACTGATAGA 

3 0 TCCACTGGGTCTGCAGACTTCATCCTTGCCACCTGCCC 

ATGCTTTCCAGTGGGCACCAAAAACTCTAGGCTTTG 
AATCCCTGGAAGTGCTTGTTAAACATGGAGAGCCTAGAGTCCAT 

CTAGGTGCCTGTCTTAGCTTTATTTCCGTTGTTGTGATAAACTATCTCCCCATCACACACAC^ 
ACACACACACACAAAAGCAGCITTTATGGAGAAAGGGT^ 

3 5 ATTG C CAGGACTTGAAG CAG CT AGTCACTTCCACAGTG AGGAG CAGGGAGAG AG ACAATACAAGCTG GAGAG CTGGTT CAGTGGTT 

AAATGCAAAGGACTCAAGTTCAGTTCCCAGCACCCACATCAGGCAGCTCATA^ 
TGCCCTCTTGTGGCTTATAAAGGTO^TGGGTGGGTACGTGCATATGAGCGTGCXSTGTGTGTG 
TGTGTAATGCCCIAGGG^IAGTGGCACATGCCTTTAATCTCAGCACTCTAGATG 
GTTCAAGGACAGTCTGGTCTAAAGAGTGAGTTAGGATAACCAAGGCAAC^ 

4 0 GAGGTTAAGAGCACTGGCTGCTGCTCTTCCTGAGTTCAATTCCC^^ 

ATCCCCTTCTGGCATGCAGGCACACATGTAAGCAGAATGCTGTATACAC^ 
AAACGTTGTCTGGAAAAAAGAACAAAATAATAACAGAGCTTGGTGGCTC^ 
TAGTTCTC!TGAAAATTGGAAGCTAGTTTGATCTACAAAGCAAGTTCCCG 

AACAAAACAAAAAATTAAAGATCAGAGAGAGAAATGAACATAATGTGTGCATGCATGCTTCTAA 

4 5 CACAGTCCAGGACCCAAAG<ICAGG<3GAATGGAACCACCTAC 

CCCACTGACATACCCACAGGCCAACATGATCTAGACTTTTCTCT^ 
AACCATCAGAGGTCCCTCGAGTCTAACAAAACCCTGC^ 

GCCACCCTCTCCTAAAGGTCCCTGTGTCTGAAGAGAAGAAACTGGCCTTTCCTA^ 
TGAAATTTTTGAACCTTAAGAGTGAGCCAGGCGAGCACAG^ 

5 0 GTCCATACACCTGTGCACCTGTGCAGGAGCAGGAAGAATGGCTGGAGTCT^AG 

ATTT AAT CCCAG CACTCAGCGG CAGAGACAAGAAG ATTT CTGAGTTCAAGT CT ACAG AG CAAGTTCGAAGAC^ CC T A 

CAGGGGCTGGGGGGCCAGGGGAGAAGCAC!AGAATAGTGCTTGGGT 

AG AGG AT ACT ATCAAGGGAGTGAGTG CTGTT ATCAGGG ATG CCAGAAAAGG CT ACT C CAG C CAG AAGATGAGGGTGAAAGACAGTG 
GAATAAAAGTGGGTCATCTTGTGTTTACCTCCTACCTG AAAGCCCTTGACGT CAGCA^TGACCTTCTCTCTGCCTCTT CCTT CCTG 

5 5 GGAACTTGTTCCTCCTGCCAGACTTCGAAGAGGGATAGAGAAAGGCAGGTACACAGCAGGCCCTAGACCACT^ 

AGCCTTTCTCACTTCCCATGGTGCCAATTGTCCCCGTATAGGACCTGCTTCTTCTTAACAAAAATCT 

TACAC^CACACACACACACAC^C^CACA^ 

CCACCAATCCCCAGGCCAAAGAAACCTTGCTCCAGTTGTC^ 

CCTGATCAATGGGTAGGATGGACAGCTTCAAACATTAGTTCCTCCTGGCAACTGCCTCT 

6 0 ATGACCTTCGCCCTTAGACAGATAATAAAGAAGAAAACTAGGTGAC^TGACTCAAGGTACTTGTCACGAAGTCTGACAGTCTTGAG 

TTTAATCCTAGAGATCCACTTGGTGGAAGGAGACCAATACACAC^TATAATGTAACTTCTAAAATCrGTATTA 
TTTCTTTTTCCCGGGATAATAGCTCTTCAGTCACTTCACAACCTGG(^ 
ACTCCCCAAGCTCTATCTGTGCTGTGCTGTGCTGTGCTCTGTC 
CAATGC!AACTTGGAAGTATCTGGAAGGATGTCTATAGGTC^ 

6 5 GG ATGCCAGGCIACTGGGGCCAGCAGG CTTGAAAACAAACG CAGGAATCAAGTGAGTCAGAAAATGAACAAATATCAGTAAAGAACT 

AG CTGG CGATG GGCCGTAG AGAAAAGTAAAGTAAG AAAGGG GAGG GAGGCAAAATTACAGGAATGGAAAAGAG AGGG AGGG C CTG A 
AATACTTTTTTAGCTGCCATGTGTACITTGTACCAGGCCTTGCACTTTACCCGC 

CATT ACCAGTT CACAAAGG CAGAAAGAG ATC CGAGAGGTT AAAT AAACTTGCCTG AGGATTGG L"i M ri'GGCTGTG AGG AAGG CTT CT 
AACAGCGACTCTATACCAACATCCTCTGC CACAACCCAGCCCAGGAATTTGGGGAGTCTATGAGATAGAAAAC ACCTCCCCC CCAA 

7 0 AAAAATGTTCTGTCACATTCCCAGCCCCCGGGGCTCAGACAGGTACA 

TGAAGCCTAAG AACCCATC CTGTGTG CAAGG AGCTGAGG C CTCTAAG T AC CGCCATTAG CACAGTACTGCCACCAACG CAGGTCAG 
CACCATGGTTCTCCCCCTTTTTAG CTGTGACAGGATTCAGGGCGCATGGGTGACACCCAGCAAACCACAOT 
GTCCCCCAAACCAGACCTCCCAATT/TTGTTCCTCGTAATTGTTTTTO 
CTACCCAGCTGAGGATGGCAGTGAACTCCTCCTGCTCTTCCAGCCTCTAC^ 
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GTGTCACGTTATGAGGTGCTGTAG<7ITGAAT 

TCCCACCTCAGTGTTAATTGGGAAATGAGAGCTATAAGCAGT^ 

CKSGGGCTCrACATAGGTACC^CGGGGATGC^ 

AAGATTTCATCCACTCCCCAAGTCATTCrGGGGAGTTGAACT 
5 G CAGTAGAGAGGCTCCAGTGAAAGATGGCGCCCTCTGGGCCAAGCCAGCOT CCAGAGGTTTCCTTTTT 

ACCTGCreCCTTGGGTCCTCCCTGGAAGGACAGGACTGTGCTG 

GAAAGAAGAGTTGGTGACTGCTCCCCATTTCCAAGCCXSCCGCCGCC^ 

CCTGGGCCTGATAAGAGGACTGGGCACGTGGGGTAGAGTGATTCTCTG 

AGTTTGCTAGGCCCGGCATAGTCTCACAAGAGAGAGCTATATCT^ 
1 0 AGCATTTGGAAGCTGATTATGGGATGGATCCCrGCATATGGCAATCACT 

CTCTGTAACTCCTTCT ATGGGTGTTTTGTAGAAGTGGATGAT CACAGTCAGC TATTGGATGGGTCACAGGG CCCC CAATGGAGGAG 

CTAGAGAAATTACCCAAGGAGCTAAAGGGAACTGCAACCCTATAGGTGGAATA^ 

GTCTTTAGCTGCATATGTATCAAAAGATGGCCTAGTTGGCCATCA 

CCAGTACAGGGGAACGCCAGGGCCAAAAAGGGGGAGTGGGTGGGTAGGGGATTGG^ 
1 5 CATTG AAAATGT AAACGAGGAAAATAC CTAAT AAAAAAAAAAAGAAAAAAAAG AATGT ATTTG AG G AGGAAAT AGCCC CTGCTCTG 

CTTTAGAAAGCACTGTTCCTACAGAAAAGAGCCTTC 

AAGAACAC CCAGGGGG CAT AATCACCACAACCT ACT CTG AGCTTT ACCAGCC CG GT AAATCCCAAGGTCACACTGGATG ACAG CTG 

TACACTCACCCTATGGCCACTGTGGGAGGAACAAATAGra 

TGGAAGCACAGAGAACTCTCTGGTGGCAGTATCAGTCTGAATAGOK^ 

2 0 GGGCCAAAAG CTGGGTAGGATGATGCCTGTAATC CCAG CCCTC^GAAGGCTGAAGGAACT^GATAGCCATGAATGTTCAAGGCCAT 

CITGGACT ACTGAGTGAG ATCATCTT AGCAACACGT AGG AATG AAG CTG AAATG AAAGAG TAGCTTAGAAG AAAG C 
CACTCTCAAG CCAGGACCCGGTCGCTTCCTGAGCCCT<^CAGTCAT^ 
TX3GCATCTGAAAGCATCCCTAGGGGCCTCCACCCTGTGCCT 

CACCAAGAGAAAAGGACCTGGTCAGCGATGCAGGTCTGTAACCCCAGCACACCCAAGGCAATOT 
25 . GAAAATGAGAGGAGGGGCGGGGTTCAGCAGTTAAGAGCACAGGCTGCT 
CAGTGGCTCACAGTCATGTATGACTCCAGTTCCAGGGCACCTC 

TGCACAAATAAACATG CAGG CAAAACACC CATACTATT AAAAAAAGATAAGTG AAAAGAAG AAAGAAGAACAATAACT AAGGGAG A 
TGAAAG CGAG CTGGT AAGGCACAGAAAACTG AGCAGTGATGGG CAGATAGGAG CACAGAAG AG CAGGAGGGGAT AGCTTCTGGGAT A 
GAGGCAGGCCATTAAGGTCCCCATCCTTTGAGTGCCCTAACCTTTC 

3 0 ATCACTTTG CCT CTTTT CAAAG C T CTC CAG ATTGGCATTGTGTTTTG AGAAGAACAGTATACATGGGC CTGTTT ACCAC CAAAGAC 

CGTGAAGTCAGAGATGACCTAGTCAGTGGAGTACTGCCACAAACAC^ 
TTTTTGTTTTTTAAGATTTATTTATTTTATGTATATGAGCACACTATCACTCTCT 
ATTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGAATTGAACTCAGGACCTCTGGAAGAGCA 
GCCATCTCTCCTACCCAAGGGTGAAGGTTTAATCCCCCAGAACCCA^ 

3 5 CTTGTAGTCTCAACAATGGGGTCGTAGAAACCCTAGAAA 

GGAGACCCTGTCrCAAAAAAACAAAACAAAACAAAACAAAAAGGTAAAAGGTA^ 

T CCACAG ACATGCACACT CACACGTGT ACACACACCCAC CT ATACAG ACACACACA.T AT ATGTAT ACATAT AAAGGACTT AGGACT 

AGTCAAAGCCTCAAAGGCCCTCAAAA.GTCACCCTACACCCCTACACT 

ATCTTTGTCTCTGACTCTTAGGAGACTCAGCTCCCTAGAGCACTGGGTGOT 

4 0 ACGGTGGAGAATGGAATGCTGGG AGCGAAGGGACZAGCTGCCTGCCCTCTCCACGCTAGGTG CCACGTG CACCAGGTGGAGGGAGTG 

GGCXSAGTCACXAGGCCCTGGCGTGGCCGGCTCCTGCCCTGCTGTTTACCAGCTGGAGAAAGC^ 

GCTCCCAGCCTGG CCCCTAG CCAGCGGTTGTTGGGTAAGCAGGAAAGCCAGGCACAGCTAAT CAAACTCTGGGCAAAGGACATGCC 
GAACTGAGGCATCCAATATTCACAGAAGTCTTGGCTCT^ 

GGGTCGTGGGGGAGGGGGAAGGGAGAACCGGGAGGAGGGGGGAGGATTCCCTCCTCTCAACTTC 

4 5 TTTTGCTCTCCCTCCAAC^CCCCTC^CCTCTCAATTTGCT 

AT CT AG AAGC CTGGAAAATC CAGAGGTGGGCAG CATGGAGTT AGAG AG AGGCAAAGCAAG ACATGTGCTGG GGG AAC C CAAG CTG C 

CACTCTTCCTGCCTCCCTTTCTTGCTATTCTGTCACCTGTC 
AATTGCCCTTGTGAATTCCTGGCAAGTCTTTTCTATTCTGGGCAG^ 

5 0 GGGTTATCATGTGGGATTCCAAAAAACAAACAAAAAAATTCTGATTTACCTACAAGGCCTGG 

ATTAAAAAATACTAGCTCCCTTTCACCTGTTTTTCCT 
GCCAGGACACTCTCCCAGCC^CTCCTTTCCCCrrGCCrGCrGACGG 

CCTGCTCCCTGCCCTC CTGGGCAGCCAGGGCAGCAAGGACGG CACCAAGGGAGCTACCCCATGG ACAGGGCCCCACAGAGACACCA 
CCX^ACATCTCX3GGGTAAGAGACCCTAAAGCCCTGCAGGCAGGTGGAAGGGACT 

5 5 AGAGACTGGG AAAGAAAGAACTGGGGAGAAGAGAGGTGAG CAG AGGGG AGATGGACTGGCAGACAGAG CAAAACGGTG AG ATAGGA 

CCGAGTCTTTACATAACTGTAGGGGGGGGACAGGAAGAGGGCCT^^ 
GGAAGAGACIAGGTTGGAGTGGGGGCAGGAGGGAGGAGCTGTGTTAGGGAGGCTTG 

GGTGTGAGCAGGCCCCTCCCTGCCTGCCACAGAATCCTGGGCAGCAGCGGCTGGAAGGGTGACTCTCCGCCTTCCTATTCATTCAC 
AAGTGGCTGCCGTTCTGGCGCAGGGGCTGGGGGTGGGGGACCTGGGAGAGAAGATGC^ 

6 0 GCAGTC CCACGCTTTCCTTCAAGCTGGTCCGAG CZACGGAGTTTTATCTTTGCTGTGTGAGGCACCTGTGGAG CACAGAAAGGAACT 



CTAGAGAGAGTGAACGACAGAAGTGCCAGCTTCTAGGGCCACTAGAGAAAGGCTCOSGGGTa 

GGGAGTGTCCCCATGCAAGACTGCTCTGAATTGCAAAGTTGGAGGCTGGACAAGGGTGCCA 

CAGACAGGATTAGCAGGGAGGTGGGCATGGCATGAAGCCACACCCTCTCTC 

6 5 T ACCAG GC C C CATTTACT ATCACGG CAATGAAATGCAGGAAG CCTGG CTGCAG AGTGCATCTC CTGG C!AG GAAGG AG 

GGACAGGACATGTGTG CCAGGCTTGGACAG CAGAAGGCAAAAAGT AACAG AAAGG AATCAAAGG TT AGGG ACATTACAGACAGGGC 
TAATGGGCTTTCTGGAACAGAAGACTGGCCCTAGGCACCCTGGATATAGGAAATGGGTTGC 

TCT AGGTGTT TC C CAAAT ACTGACTGCATGCTGATTTCATCC CAGATATGTAAACAG CCCTGTAACT ATG CAG T AAC CATGACT CT 

cccctca.gctgctgctcccctggacagca.gagcatctctgtaaacct 

7 0 ccagacccatccacgagcaaagcagcaaggcaaagggacct 

gagggtctatctgcagctcggccctccrtgtcttcctcttcgcc^ 
ctcttgctttcaggcacagataagcaaccaaaagactgtattccct 
ctagagggttcitgctgttccccagag<:aacaagctgaacaagacagacagaggct 

ggatg atgg catacctagggcagttgt ctgttacgt caagtgc cagg aaacaatg cagagtgctggc tgagtaaac c ccagggcag 

7 5 GG G ACAGGG CATG GGAGG CTGCCCCAG AGGAAAAG AG AG ACAT CTGAGAT CTAGCTAAATGGGGG AGGGAGGGTAGAGTGTGACAT 
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GAGGAAGTCCTGGAGGCAGAGAG 

TGCCAAGGGGCCXAGCTTCTCAGAGGCCAGGTAGTGCTG^ 
GCTTTTCCCCTAACATCTTCAACTATTCACT^ 

AGT AGGGGTTTGGGC CAAG AAGGTGGCATGGGGTATGG CAAAAG CATT CAGTT CAG GAGGCATGAGTGAAAGAACACCCGGGACTG 

AAAAAGAAAAAAAAAAGAAGCCCAAGGGGTGCCAAGTTACAAGCATAGAA 

TGGACAGAGGCTCAGCTGCTGTGTCTGACC^GGGAGG^ 

TGGGAGCGTtTCCCAGGGCTTTGTGGTGGC 

TGGAGAGAGGCTCTGCAGAGCTAGAGAGCAGGAAAGCCTTGTCT 
1 0 AGTGTACGAGGAAGAGACCAGAGTTCCAGAAGGGCTTCCCAATGGACACrrTGCAAGTAAA 

CCCAATCCAGCCACATAGAGAGAAGCTATTCCCCTACAAAATGTGCCCGCTCCACCAA 

TCAGCACCTTGGGGGCACTCTGGAAAAGCCACAAGCATGGTCATTAA^ 

TCTTCCATCACATACrrCATTGGTTGCCCACCAATCTGTCCCCACTAC^ 

CCCTGAGCTTGGGGCTCCATCCTCCTGGGGGGOTCAGAGC 
15 AGGGCCCCACGGAAACGCTGAGCAGGGCCCTGGGAGGAA 

ACACTITGGGGGTGTAGAGGGGGACTTTTGGAGAGTGGAAACTGGGAGA 

ACCCTTACCATGTGTCCTCCTGAGGATCCAG^ 

TTACTAACCAGAAAGATTGAAGACCTCTCAAGGAGGTTGAGCTCCTTG^ 

ATAAAAACTGCCTCCCAAGTGAAATAACCTCAAGAAAGGGGTCCAAGCTGAGAAACCCGGGGGCA 
2 0 GAATGACTAAGGGGCCAGAAGAGGAACCTCCAAGGCTGAGGTACCCCACAGAGCAGATGG 
ATCTCACACCAACAAAACCTGAAGAGAAACTCAAGTG^ 
AGATACCCTTCTATCCCTCCAGTGGATCTGAGGTCACT 

G AGCAGGTT ACTTAATCTCT CTGTGT CTCACCTT C CTTGT CTTTGGAATGGT AGTTATCACATAAAGTATTTT ATGGGGATT ATG A 
AGTCCACACAGAGCTGAATCATTCCCAACACTGAATTAAGCACTGCTC 

2 5 CAG ACT AG CAGT CTTGTTTCCAAAGGCCrrTTCTG AAGGACCAACC CATCCTACAGATCCTACAGAGGGTATTTG CCTGATGAT ACT 

GCCGCTATCGTTTCCTCAGCCCAGGATAGCATCATGTGTAGGAGGAATTTC 
TGGTTATGGCTGTTGACAGTCCACAGGGTCTCAGTGC 
CCTACCCCTGATGTCIACTTGGTGAAGAGGGCCrrcGGGTACrCAG^ 
AAGCTCCCCAGCTAGAACTCCCCTGGCGAGTGGAAACAGCTTT^ 

3 0 TGTGATTGGCTGCCCCGCCCCTCACACCCTGCCCCAGACCC^GATTGGCC^CGTG^ 

GGGGGTTGGGGGGCTGTCACITGGCCACCTGTGTGGAGCAGAGCT 
AGATCTAAGGGCTGAGGCACCCGCTGAGAGGGCTTCACCCCACCT 
AGGACAG<3GAGCCAAGTTCTCAGTCATGAGAAGTAAGTGAA 
TAGGCCCGCTGAATAGGAGTGTATCTTGGAAACCGTGCCT 

3 5 GGGTGGAGTCCCAAGAGAGTTACTCTTCCCTGGCAGCTTCCTTGGCAGCIAATC 

TTCTGAAGTTCTGGGAAGACTGAGCATGC CCAAGGGTCCAACCCTTTGATTTACAAAGAGAC CAAGACCCAAGAGGGGAGTGCATC 
ACTGAAGAGCTCACCATTAGGGCCTGGGAGCrGGTGGAGTCTGGTAGAC^TTAGGACCCACAG 
AGAGTTTATCTTTGTGCCTCCAACTGCAGGCACrrGCrGTACCCT 
ATCAGAACTCATCATCTCCCTGATTCCCTAACCTTAAACACACAC^ 

4 0 CAGCTAGGGACTGCTCTGTGTGTGCTATGTGTATATTTGTGTGTGTGTTTGCATATGCTT 

GAAGAGATGGGAGCAGAAAACCTTGGTCTCACTCCTTTCCCTGCTTGAAACC^^ 

CCTTTAGGCTTCCAGCTGAGCAATGTCGGGGATC CGATCCT CCCTGC CAAAGTCTACAG 

TGAGTGAGGAAAAGGGAAGCCTGTTACCCTGAGGCCTGAGGAAGCCCCAGCTTTAACCT 

GAGXSCCATACTGAGTCCTGGAAAGGCCCATGGACrGAAAGGCTAGTGAGCAGGTT 

4 5 ATCTCTGGTCTCCAGCCACAAAATAAATGCCAAGCATGGGAGTCrCTGAGTTCTAAGAGC 

CCTAATGGGTGAAAGAGAATAGACCCCTATAGTATACAAGGTCAAGTTGGGATTCC^ 
TTGGTGAGAOU^GAGGACCAGGAGAAGAACCAAAAGTCCTAAAATGGM 

CAG CAAAAG CTG GCAGAT CG ATTTCACAGAAACAG C C CCAGGT CGG CCGAAG CAGAGGGCAACAGCCCCAATAG TATC TGAG GGAT 
ACACTTGGGTAGCrCTGCGTTTTATGCACACATGCACGCG 

5 0 AT ATCCAAGTTTTCTGAC CC CTTTGAATT CCAGTT AC CAC CT ACT TGGGAGAGGTCAG AG CAG AGAGGGTTTAGGACCATG ACACA 

CCAAAGAGTGAGGCCAGAGAAACAAGGAATAGATGGAATGTAGGAAATAAGCGATGGGTTTGAGGG 

ATGGAGCACTACCAGGAGAGCCATAG AGT AAGAAGT C CAG AG ACG AAGGGCACAGG T CATAAAGTACAGACACG GGAGGTG GGAAG 
GGAACATGGCAGACTGGAGGGTGTCTCAGGAAGAAACAAAGCCCCTTCGACC 
CATCTCACTGTTAGGAGCTAGTCAGTGTGGAATC^GTAGACAGGAA^ 
55 ATACAGC(^TGATTATGTTGAGCAACGGGCCACAAAGAGCT 

GTGTGTATGTACATTCGCTAGCTGATTCCTTCCTTAGCAA.TGCTGGAAATTGAACTCAT^ 

CACCCACCGAGCCTTCTCATGGGCCTTTTATTTGGTTTTGTTTGTGACA 

GAGAGTGACCGTGAATTCCTGATCTGCCTGCCrCTGCCTCC^ 

GCCTTGTTCCTTCCTTGTATCAATCACTCCTGCATCTTTCTCTTTTCCTTTCTTCCCATTCCT 

6 0 GCTCAAGATTCTAGGATGTGAAGATCTTCAATCTGTTCCATCGCAATGAATATGTC 

CACCCACCCCTCCCTCCCCTGCCTCTGTTGCTGACTCAAACCAACMGGACA 

GACAGCCTGGGGTCTCACTCCTGCCCCGTGATTATTTC^ 

CTGAGACTCCAAGGCTCAAAGAGTGGAGTTCTGTTTC 

TGAGTCTG CGTCCCTCCCAGAGGCAGAGCCAGAGCTCTt^AGG CAGGCAGGCA 

6 5 CTGGTGATTCATCCCCTTCCCCCATCCCCGGCT^ 

GGGTGGTGGGGGCAGCAAGAGGGGGTTTCGAGTGGCTGGCTGGCTG^SCT^ 
CCACCTGAGTTTTAGAGTCACCAGTTTTTCAGTTCTGATATCAGGTATGTC^ 
ACATGCCCAGCAGGTAAATCAGTGGTTCCTGTCCCTGTGGTGGCACCCACCACCTC^ 
ACTAAGGAATATGGGAGGCTGATGACAGGTTGTGGCACATAGGGGACCAC 

7 0 CCTAAGCTGAGAGATGCATTTTCTTTCTCATCCAGCCATTCACT 

TACCCTCAAGGAGCTGTGGACTTCCAGAGGAGCCCTtsGGTCAGGAGCT 

GAAGTAGATAAAGGGTGCATGGGCCAGCCAGGACCACTGAGGGAAG CCAGG CCCAAGGAATGGTATCTGTCTGTCGGTTTATCTAT 

GGTTTATTCCTCACCTAGCCTATCATTACCCTTGTTTCACA^ 
7 5 TAGTATACTT AG GTATCT CAT CCT CCAAGGGGATACAGCTTAATG ATTG AGGACT CTG ACTCCAGTTACCAAAAC CTGAATGT AAA 
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CTCCnXS CTTGCCTTTTCCTGGCTGTAT AGACT CGGTCTAAT CACTTCTCT AGGCCTCAGTTTTTGlTr'ri" ITr'l'AATCTGTAAAAT 
GGGGACAATACTAACAGACCCTACTTTACAGAATTGTTCTGATGATTAAAAGTA 

ACTGGAGGACCTTGCCTTG CAAT CTTGGCGACTTGAATT CAAT CTC CAGAACCCATGAGAAGGTAAAAGGAAAAAACAGATTT CAC 
AGAGTTGTCCCCTGACTTCCACATGTAAGCTGTGGCATTTGT 
5 ATAAATT ATTTTTTTAAAGGAAG CTGAGTGTGGTGGTTC CATAATC CCAG CACATGGGAGG CTGAAG CAGGAAGATCTTGCATTTC 
AGGCCAGCCTGAGTACXSCACTGAGACCCTGTTTAAAAGAAGGGCT^ 
CAAGTTTGATAACCTGAGTGCAATGCCCAGGCCTCACATGGAAGAAGGAGA^ 

CCAG CACTCGGGAGGCAGAGG CAGG CGGATTT CT AAGTTCGAGG CCAGCCTGGTCTACAAAG TGAGTGCCAGGACAG CCAGG^ CTA 
1 0 CACAGAGAAACCCTGTCTCAAAAAAAAAAAAAAAAATTT 

CCCATGGGCTGGTGGGCAGTGACCCCTGAGGAAAGGGGAGACCCACACATGAGACCTTGGATTG 
CACACACACACACACTGTAAAGCCTGGCATACACTAAAAATTC^ 

ATCAGCTGGAAAGCAGGAAAAGAGGAACAGAAGAGAGCCCTAGCATGCCTCTGCCTAAACCATTCCT 
TGGAGTCAGACCCTCCTGGGCTGTAGCCCCTCTCCGTTCCAACC^CT 
15 CAGCTOTGGATATAGTTAAAATGCACTATCAATGAATCTTGCAGCT 

GCTTG CTTCTACCGTATTT CTCCT CC CTGGAGGCTCT AA^ CTC CTTGAAGAGCGTTTCAGGGCAGAAGCAATCCAAAGGAGAG 
TCACTTTTG CTTTCCAGGG CTGCAACAT AATT AT AGT^^ 

ACCTGTATAAGGAATGGGTTTGGGAAG TGATTATCAAAC CT ACCTTGACATCTTGGG AGGAGCAG CCTGGCACAAAAGCAG C CAGA 
GACTAAAATGTTTTTATGCCAAGCATAGTCTGA&GGCAAAGAATCACT 
20 TG CCTTGGGCTGCTTGAACAAAGATCTTCATGGTGGTGGTGGTGTTGTTGTTGT^ 

TCCTATATTTTAACAAGACCTCCAGGTTCTCTTTTTTTGA 

ATTTCTG CTTTTGT AAGAAT AAAGAT AG AGGGGT'lTTl'TTTl" I riTTTTTCCTTCTTTCAGTTTTTAG AG ACAG AG TTTCTCTGT A 

GATTAAAGGTGTATGCCACCACT<3CC(X3GCTGGTTGTCATTT^ 

2 5 ATGGCAACCCACCTGCCCCAGCTTTCTGAATGTTGAGATT^ 

CAATGAGGGAACAGTAAGACACAAAAATCCTAAGAAAGAAGCAGTTTGGCTGCTOT 
GCTCCTGACCTTTGATTGGCTCCAGTTCCTGGCACCGTTGG 
GTTTGAGTAATCCTGTt^TTACTCAGC^CCCCCTCCCCAAGCGGGGTTCCCAT 
TGCTGTGGAAGATGTTTCTCCGCTCCTCTCATGCTGACTTCCCT 

3 0 TCTGACTTTGTTTGTOTGTOTCTCTTTCTCTCTATCT 

ACACC CC ACTGGTACTTTCTTTGATATTTAAGTGTTT AG GGTGG ATGGTACTC GGACGGAG AACTAG 
AG AAAGT CAAGAAGTTTGGTCATG TGGAGGATTGGGCACAGG ACCAGAACAAG CAGGGGTTT CCCACGATTT CAGTTGT CT CTC CT 

GGCAATGGGAAGATAGAAGAGGGGCTACGCTGGTAGCTAGGGTCCAGGGCTCCCATAGCCCCCAAACT 
35 ATATCTCCTATGAGATGGCAGGGTCTTACTTACACTGTGGTAA 
AGGTATGACIXMGGTCATCCATGCGGTCAGTGTAC^ 

GGTTGCCTG CT AGGCCAGGGCCTGAGTT CCCACCGTGGCTTGG CCTAATCTCCCTACCACTTGCrTTTCTGGGTGACCTATGTTGG 
TTTCCCCCTCTCTGGTCTTAGGGCCACTGAAGAGTGATATACCACAGACT 

GGACCCAAGTGTTACTGCCAGGCTGCCATTC CCATGGATTTGCTTTATCTCAGACAGTAATACCCTGCCCTGTAGTGCT CTGCCTC 

4 0 TCCACGGTGCTCCCTGTACTCTCTTGAATCATGCCATGTGACCTACACCACAGGCAATGTTGCT 

GGAACTGAGGTTGAGAGTGACTGGAAGTGTGTGTGCCCCCACATT^ 
TCTTCCCCACCAAGCATTTAACCCAGCTTGTAACTTACAGGAAGCCATTC 
GTGCAGTTGGCAGGTAGAAACTATGTCTGGAGGTAGGACAAAGAACTGAGTTTTAATC 
TCCCTTAAAGAGACAGCTACTTAATCTGACCCAGCTCTGGGCTGGCTTGAGAAGGTCA 
45 AAGTGATGGGCAAATCATGGTCTAACCATTCCATATCAGCAGACCTATGTGTGGTTATAGATT^ 
TTAGCCTTGAACTCATGATGTAGCCAAGGACAATATAGAACTCTTGATTGT^ 
ACGTAGCAGCTCTCATATGACAGATGCTAGGCACAGGCTGGACATGAGATCCTATGTGAG 
GTCCCTAGGGTGGGAGACAGGTCACAGATGAGATGGGACTGTTACTCTAGATCTCT 
ACTC!AGGAAAATCCAGAAGTCAGTCAGTGATTGGGAGAAGAGCCAGCT 

5 0 TGGACTTGTTGATGAAAAGATCTGGGGTGGCTGGAACCCCAA^ 

AACTGCAGATTGAACCTGGGGCTTGATGCATGCTGCTTTACC^ 
TTGTTTGTTTGTTTTTCAAGACAGGCTTTCTCTGTGTAG 

TCACTGAGATCCATCTACCTCTGCCTCCTGACTGGTGAGATAAAAGGTATGTGCCACCA 
AATCAAG CTC TAAC CAACAAATGTGTG TATGTGCACTGGT AT AG ACAATATAC CG CCAATCCAC^ 
55 TTCTTTTCTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCTTCCT^ 

GGATCTTGATATGTAGATCAGGCAAGTCTTGAACTCAAAGTAATCTACCTACCTCTGCCTC 

CT CAAACTGACATCATTTCAACAAGCCACATATACTTTTCTTCTGTG CACACACACACACACACACT CAAGTATG C CTCCAAACAT 

GTCAAGAAGATAAC^GATAATGAACTGTTCAAGAGAGTGGGGTTGTAGACCTGGC^^ 

TAGGTAGTTAGGTCTTCATCTGTGAGAAGAACAGCTGGATGGAGTGTGCTC^ 

TGTACCTGCTGTGTTGGTTCCCCTC&GTTCCTGTCCCTGTGTAGCCTTCACT 

CTCAGTGG CTCTGCCCCATACTCTGTTCTGGGAGCCAAACTrCAATGGGGGG CTGGGCGGG AAG 

CAGG TGTTACGAAG CAGTAAGGC CTCCTCTGG C C CT ATACAGG ATG AGAGG AGGGCACTGAAGATGTTCAACTTG AAGG ACTG AGC 

6 5 AGGTGGCCTCTTTGCAGCTGCTCTCTGTCTCCATAAAAGCTGATACAACT 

ACCCTATGCTGAGAATCATTTACACTTTACTTCCCCTTCCATAATGCAAGAGGGCGTGTTGGAGCT 
CA>CCATACACCTGCGTCTGCAGCCCAG^CTCATGGTAGTGAAATCCAGAAAAAAACATTATGGGCTAGCT 
AGCACAAATTGAAGTGATCCCTTGCAAGATCTGTGGGGACAAGTCATCTG 
AGGTGAGTTGTACATATTTGTCTGCATACATGCACTT'GGCTGTTTCAGCGGTCTCCCCAGGG 

7 0 ACCTAATCTCGATGTAGGAATGTGATCACAGGGTCCATCACAATTATAC^^ 

TTGAGACCAGTGCACATGAATTGCAGGTCCCTGGGACCACCTCAAACTCCGAGAGGGTGGGA 

ACTGTTACCTCCTGCCITGTTTAGCTCCATCTCCCTCCTGAGTAACTATCCTTAG^ 
7 5 CACCGTCTTCACCACAGGAGGAGCCCTGGGTGGAGTCGGGGGCATGAGGTGAGGAAGACCCAGAAGGAGCCTG^ 
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GTGCTCAAAATAACAAAGCCAAAGOSAAAGGAAA^ 

GGGAGGGGTTGGGGGAGTGAAAAGG CAGGAAAGAGAGAG CAGAAGAGGATGTTCAGAAG CAAG CCACCAG AGCCTGGGTTGGGCTG 
TGGTGAGTATCTAGGTCACCAGGGAGCCTGCAGGCCTGACCACAGGGAAACCTC 
CAGACAGGCAAGCTGACCCCAATACAGCCTGAGGCCCCTTACT 
5 . GCC CAAG CC CATTGCTTG AGTTCTGCACTATT AGCATCTAGTCT CTGGGGAG ATGG AAATGCTGGTGG 

AACCGATCCTAAAGAGAGCAGAG CCAGGCATGAAAGG^ 
AAATGGAGCATGGAGTCAGATTCATGTGCCTGTTG 

TTCAGCTGTCAAG^TTTGGCCGAATGTCCAAGAAGCAGAGGGACAGTCTACATGCAGAAG 
AACAGGAACAAGTGGCCAAGACTCCTCGAGCTGGGA 
1 0 CTACCACTGGGC^CCTCACCTGACCTACCCGAGGCCTCTGCTTGTCCCCCTGGCCTCCTGAGAGC 

ACAGCATCTATAGCACTGACGGCCAACTTACTCTTG^AAGATGTGGACTTC^ 
CCAGAACAGGGTCCAGACAGCCACTGCATTCCCAGTTTCTGCA^ 
GCATCTGGGAAGGGTGGGGGCAGTGAAAATGAGACAAGAACTTCCT 
15 ATTAGGGATCCTGG ACAAAAAGCC AAGGGAGGAGG CAGAG CAAGGGAGGCCAGG CAG AGGG CCT CT TTATGAACrTTGGGGTGGG A 
ACTGG CTGAT AT CTAGCTGTGACTT CAT CTTCTGG C C CCAGAGTACCTGGTACAG AATGTCTG CAAGTC CTTCCG AGAGACATGCC 
AGCTGCXSACTGGAGGACCTTCTACGGCAGCGCACCAACCTCT^ 

AGAGACATGAG»1AAGGGAGGG(^CC^CCACACGCGGGTGCGGTG CGGGCGCGCGCGCGCG CACACACACACACACACTGGTAGAG 
CCCAGATATGGCGTCTTTCACAGGAATGGGGGTAGCAATAGGGTATCAC^ 
2 0 GCAAGAT GACTATTT CTGTAG CT C CTGCTTATTG AG ATTTCACCT AGTTAG CAACCT ACAT CCTCTTCCACCCAG ACCTTGC CACA 

CTGCCTTTC^TCCTCCATCAGTAACrrGATGCTAAT(ZAGTAT(nt3AGG^ 

AGAAGCTTTGAATTGTGTTCIAAC CTGCT G CCCAATG CAAAAT CATG 

2 5 CTGTGTAGCCCTGGCTGTACCrrcGAACrTACTTGGTAGACC^ 

TGACCTAGGTCTGCTGAGTCCCAAGAGATGATTTGCCTATCT 

TTTCCTTTTCCTTTTCCrTTTCCTTCTTCTTC 

CCTGGCTGTCCTGAAACTTACTCTGTAGACCAAGCTGGCCTTAAACITAGA 

3 0 AAAGTCTTGTGCCACCAC^TCCAGTTTTCACCTGGTTT 

CCAGCACGTTGTTGACCTT CATGGT ACTTTTGGT AAATGTGAGGT CAG AAG CTG GGGT AGACTGATAAAGCAGAAGAGT AGCAGAA 

GGCAGCCTGGAAACCGCTGGAGOAAGAGCAGTTAGGAAGGCTTTCAGTGCCGTATGAAGAGTGAA^ 

TGTACTCAGAGCTGCTTAGGGAATTTCCTTCrGACCTGTC 

CAGT(2ATCAC!AGGAGTCCTGAGATACAGAAATCACTTCAAACAGTCTCTCCCTGAGGA^ 

3 5 ATTTAAAGAAGGTCTGGGAAAGCAGTTGGAATACAAATGTGGTTGGATCCACTTTG 

TTCATGAAGCCAAAGGTAGCCTTAAACTCATGATCCTCCTGCCT 

C CGG CAGAATCCACTTT CAAAAT AT ATCACGT GTCCCATGATGTG CCC CCC C C CC AT C CAGCT AT CAGC CACAATTGGACTATTAT 
ACTCCTAACTCAATCCCACCCCATCTTTGTCTGCC^ 

AGACAAGACAAAACCTGCCTCTTTTGATCTCCAGCCTCCTATTTTAGCCAAAATC 

4 0 ACCACCCTGCCTTCCTCCTCACCTGACGTTGCACCATCCTTTGACTCCCCTACTCTGACCACAGTATTTm 

CTCrGGGGCCTCTTATACCTGCrrATTGTCrCCTGTCATATTCT 

AGGTTTTCC TTAAACAGTTAC CX3G ACTGG AG AAGGACC CCAT CC CTTATCCTG CTTT AGTAT CAGAT ACAAT CATATG CTG CTT AC 

TAACAGGAATATGTTCTAAGAAATGCTTACCCATAGCTTCACATGGTG^ 

TGTACTGTCGTCCATACTCCCTTAGCCTTATGGGACCACTGA^^ 

4 5 AACTGTGTT TATGACTC C C TGGCTG CTGTG CACATTT AAT AAAAG CACATCTTTTT ATT CAC CGATGTTACACCAGAGCTCATGAC 

AGTGCCTCATCTAGATTTATGGAGTTTCTCAACAAATATCTTTTGAATGAATA 
AAGTACTAAAACACAAAATAAATTCAAGAGGCGTAAGGGACTCACTCTC 
TAAGCTAAGAAGGGGAGATGTCAAGAAAGAGTGGTTTTO 
GGGGTCTTTCTCCCTGCCTCTGTCCCCTGGGCTTCTGCACC!TTT^ 

5 0 ACTTACTACCTGCTGAGTTAGGCACCTTCCTAGAAACTG 

AGG GACAG AAGAT ACAT AGTGGCAAAAAGAC AG ATGG ACTTT AT C T AAGT ACATGTGGGGCAGG AAGG CAGACAGACTGGATC CTT 
TTTTCTACAGTGTGGCAAATGCTACACAAAGGGAAGCCAACAGGACTTTTGCCCAGCAAGGGATGC^ 

ATAA(2ACTTGAATTAGCAAAATAAG AGTTAG CAG AG CCTCACCCT C CAGAT GAAG AAACAACATG AAC CAAG GCAAAGGG CCATGG 
GAGCAAACACAATTCATTCTGGGGTAGGCAGAGCCATTG^ 

5 5 ACAGAAATGACCATCAAAATTGAACCTAGGAA 

TACATCTAACAAATGGGATTGGCTGCTCAAGTTGGGTCAAGGGT 

AGATGTGTTTGCT CI^SCTAAGCACCGAAGTAGTCATTCCCTCCTAACCTATCCTCAG CTGCCACCCAAAGGAAAACTCCGGAGCCT 

GTGCAGCC!AACATGTGGAAAAGCTGCAGATCTTCCAGCACCrCCACCCC^ 

AACTCTTCAGCACTGATGTTGAATCCCCTGAGGGGCTGTCAAAGTGATCTG 

CCAGCACCCGGGAAGATGCCCTTGACTTCAGTCTTTACGCTATGAGGAAGGAAG 

TGCTGACTCCACATGATGGAGAGACTAGGAACAGGACAGGGACCTGTTTCTCCTCCATAGTCTTGCTCAGAATTTOT 
GTAAAGCTGCAGACTCT CCT AGG AGGT AT AAG CAGCAGATGAG AGGG AGGG AGGTTTTT^ 
GAGGAAGGGTTTAGCCAAGTAAACATAAATCCCAACTTGTGCCATTC^ 

6 5 TGCGTGCTTGTACAAGGGTGCAGGGCACACATGTCGGGTGTTG^ 

GTTTTGTTT CTGTGT CAAG CCAGCAATTT ATCTGTTTTAT AAG AAT TTTAGCACACACACAT ACACACACACACCGCC CAAGATTC 
TCCCTCAGCTAAGCAATCCACCAGC<5AGACTGGTGCTCAGATACCTGACACAAGAGAAATGGCAAGCTA 

CT AGACTAAG AACTT CC AACAAATGATACCCACAGG CC CGTTT AG ATTTAG AAATTGCAC^ CTAAAAGACTA 
GAGGCTGTGGAAGCGGTGTCCCGGGAGCTCTCTAGCTCCCCAGGAGGAACCAGGATGTCAAAACT 
70 CTCraTAGCAGACCCTGCCCCAGCCCTGGGTCCCCrCTT 

CAGCCCTCCCTGGTTGCTTTATTTATTTATTTTGCACCAACAGGGTTGCTGCAGACTCATTC 
GGAAAAAAAAAAAAAAAGAAAAAAATGCTTCCTGGCTCTTTTCTCTCOT 
AACAGCAGTCTTGCCAGCCGGCTGATGCGGCAGGCTGCCGGGCAGTGGGGA^ 
CAAGGTGCAGCTCGGATGGGACAGGCCCCAGCCCTGGAGAGATGCAGCGCCCA^ 

75 
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GAGAGCTTCCCCTCTACCCTAGCTGACCATC 

AGGAGCCC CCCCCCCCCG LTITTri'TAG CCT CT AAAAAG CTGCCCT CCTAATCTGTGTG G ATACTCCAAAAATCTCTCTCCTAAGT 

GCCCCTTCACCATGCAGGTCCCCATGCCTCAATCTGGCATGCTTTA 

GACAAGAAAGCTCCATCT CCCTGAGGGGG CAGTGCCCCAG 

5 

MOUSE SEQUENCE - mRNA 

GACGGGCCAGGTGCTCCCTCCCTCTTCCCTCCTCCCrCCCTTGGGCCC^^ 
GGCACCAAGGGAGCTACCCXATGGACAGGGCCCCACAGAGACACCAC 
ACCTCACAAATTGAAGTGATCCCTTGCAAGATCTGTGGGGACAAGTCATCTGG 
10 CAAGGGCTTCTTCCGCOSCAGCC^GCAGTGTA^ 
GCAACCGATGCCAGCATTGCCXjCCTGCAGAA^ 

AAGCAGAGGGACAGTCT ACATG CAGAAGTGC AGAAACAACTGCAACAG CAGCAGCAACAGGAACAAGTGGCCAAGACTCCTC CAG C 
TGGGAGCCGCGGAGO^GACACACTTAC^TACACTTTAGGGCTCT 

15 GGGGCCTCCTGCC^CCTTGAGTATAGTCCAGAACGAGGCAAAGCT^^ 

TCTTGGAAGATGTGGACTTCGTTTTGAGGAAACCAGGCATCCTGAACT 

CCAGTTTCTGCAGTGCCCGAGAGGTACCATATGCCT 

GAGACATGCCAGCTGCGACTGGAGGACCTTCTACGGCAGCGCACCA 

GTCAATGTGGGAGATGTGGGAGCGCTGTGCCCACCACCTCACTGAGGCC^ 
2 0 GCTTCATGGAGCTCTGCCAGAATGACCAGATC^ 

T ACAATGC CAACAAC CACACAGTC'IT' mTGAAGG CAAAT ACGGTGGTGTGGAGCrTCTTTCG AGCCTTGGGCTGCAGCGAG CT CAT 

TCAATGCCAACCGTCCTGGGCrrCCAAGAGAAGAGGAGAGTGGAAC^^ 

TGCAAGAOTC^TCGACAAGGCCTCCTAGCCAAGCTGCCACCCAAAGGAAAACTCCGGAGCCT 

2 5 GCAGATCTTCCAGCACCTCCACCCCATCGTGGTCCAAGCCGCCrTCCCGCC^ 

CCCCTGAGGGGCTGTCAAAGTGATCTGGAGGAAGGACAACTTTCTATTTCCT^ 
ACC(IAGCCTTTCCCTTTCTGCACTCTATGAAGGGTGGTATCCCTAGGAGTA^ 
TGCCTTGTAGGACAACAGCAGCJAAGTGATGGAGAAAAGGCTrGTTA 
TGTGGGGTAGATGGGATAGAGATAGGATGACCAAGTCAAATAAAAAACAGACTGACAA 

3 0 ATAAGGAGAACTCAAATCTAGGCTTGAAAGCTAATAACAGTCCT 

ACATGGATCTAGCTCAGAGACTGGTGGCAAGCCCCCAGAAGGACCTG 

TTCTTCTTCCTAGTTAAGAAATGTTGTTGACCCCCrCrcCCTGTTTTC 

GGGGTGGAGATAAGGTCCCAAGATAACTGTTTTATGGGGT^ 

CTT TGCACACTTTAAAAGTGTCAG ATTCGTT AG CIAGT CTAATCAGAG^ C CACACAGGTGG AGCACACAG AGGCTCTG CC 

3 5 CCCAGTGACACCATTCTGTAGACTTTCCCTCTGGCACACAATC^ 

CATACAGGACACCAGAATTCATCCCAGCTCCAGCTGTCCTCT^ 
CCCAAGGACCAGCACCCGGGAAGATGCCCTTGAC 

CTGTGTGCTGCTGACTCCACATGATGGAGAGACTAGGAACAGGACAGGGACC^ 
TCAGTTTTGTAAAGCTGCAGACTCTCCTAGGAGGTATAAGCAGC^ 

4 0 TTCTCTCAGAGGAAGGGTTTAGCCAAGTAAACATAAATCCCAACrrTGTGCCATTC 

MOUSE SEQUENCE - CODING 

ATGGAOVGGGCCCCACAGAGAC^CCACCGGACATCTCGGGAGCTGCTGGCTC 

CCCTTGCAAGATCTGTGGGGACAAGTC^TCrGGGATCCACTACGGGGTTATC^^ > 

4 5 G CCAGCAGTCTAATGTGGCCTACTCCTGCACGTOTC!AGCAG AACT CATTGC 

TGCAGAAGTGCAGAAACAACTGCAACAGCAGCAGCAACAGGAACAAGTGG 

CACTTACATACACTTTAGGGCTCTCAGATGGGCAGCTAC CACTGGGCGCCTCACCTGACCTACCCGAGGCCTCTGCTTGTCCCC CT 
GGCCTCCTGAGAGCCTCAGGCTCTGGCCCACC^TATTCCAATACCTTGGCCAAA^ 

5 0 GT ATAGTCCAG AACG AGGCAAAGCTGAAGG CAG AGACAG CAT CT AT AGCACTGACGGCCAACTT ACTCTTGGAAGATGTG GACTTC 

GTTTTGAGGAAACCAGGCATCCTGAACTTGGGGAACCAGAACAGGGTCCAGACA 
GAGGTACCATATGCCTCTCTGACAGACATAGAGTACCT 

GGAGGACCTTCTACGGCAGCGCACCAACCTCTTTTCACGGGAGGAGGTGACC^ 
AGCGCrTGTGCCCACCACCTCACTCAGGCC^TTCAGTATGTGGTGGAGTTTGCCAA^ 

5 5 AATGACCIAGATCATACTACTGACAGCAGGAGCAATGGAAGTCGTCCTAGTCAGAATGTC 

AGTCTTTTTTGAAGGO^TACGGTGGTGTGGAGCTGTTTCGAGCCTTGGGCTG 
CCCA.CTTCCrCAGCGCCCTGTGTTTTTCrGAGGATGAGATTGCCCTCT 
CTCC^GAGAAGAGGAGAGTGGAAC^TCTGCAATACAATTTGGAACTGGCTTTCCAT 
CCTCCTAGCCAAGCTGCCACCCAAAGGAAAACTCCGGAGCCTGTGCAGCCAA 
60 ACCCC3VTCGTGGTCCAAGCCGCCTTCCCGCCACTCTATAAGGAACTCT 
TGA 

HUMAN SEQUENCE - GENOMIC 
CTTCTGGACTTCTTATTATGGAAGTCAAGTGTCCAT^ 

6 5 GATGTCAGCCTAAATGATGCCCIACGTGTTATCrTTAATCCrCATACCAACT 

AAGGT AAG CCCGAAAG AG ATTAAGTT ACATG CT AG AGGTCAC^CAG CT AGTC^ CAGCC CAT CTGACT 

GATGACAAAGCAATGAATACTGCAGCACTCCAAGAACATGGACCCTCC 
CATGAATGGAAGGTCTGGAGGCCACCAGGGATCGCCACAGAA 
TAAT^TTGCATTG&CTGGTTTGTTTCATCTTTTT^ 
70 TCTCCCCTTAGACAGCTCTCTATTCTATGTATCCCCCTAGCACAG 
ATTGATTCAAGTATAATTTTGCTTACTACTTCA^ 

AGATTCTTAGTCTAGTAAGAGAGGTGGACATTAAACAAATAACCACATAAATAGAGGGGGAAA 

AAAAGTATGAGGTACTATGAAAGTGTACAGCAGGTGCCCCTAATT^ 

TGAAGGAGAAGAGTGGTCCTAGGGGAGGAACTAGCCTTGCTTCCTTTCAG 

7 5 GATGGTCTCATCTTGTCTGTGGTGGCCTGTGCTTCCTAA^ 
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GATCCAGCTTTATCTTCAGTGTTTCTAAGTCCAGAAATCC^ 

TAGCCAGAATAATGCAGAC3VACTGATCACTGCAAGTCACT 

CAAGGCAGGATTTCAACCAGGAGGGTTTTACAGAAGATGGACTTCACCT^ 

GCCACTGTCCACTGATGGCCGTCTCCCTCTTCTCTGTAGCCCTCCC 
5 , CACAGGAATGGATTGTTGCTAGAAACCTTC^ 

TT ATCCAGGAAAAGAGGTAGGACGTAGGCAGCGGAGGCCCCTG C CAGCTCCTCCCATAAGGACTGCG CTGGGGGAGCAAGCTGCAG 

AGAAAGCTGCTCCCCTCTGGCTCTGAAAGTGTCAGGGCCTG^ 

GCGCGCGAOKSAGAGGGTOTCTAGCTCGGATCCC^GGGCAGAGGGTCGGA^ 

CGAGCGCGTGGTGGCGGAGCCTTCCGGAGACCGCTCTGAGGGTGGCGGAGA^ 
10 GAGAAAAGGAGTGGAGAGTAGGAGGGGCGACCCAACGGCCCGGCCCC^ 

GGAGCTGGACGGTCAGGGCTACrGGGGGTGGGATTCTGTGTCCCCGGGAGGTC 

TCAGCAGCCAGGTGTGTCCCCTGGCTGCCAGGAGCAGAAGTGAG^^ 

CCAGAGGAGGCAGGTGGGGCTAGGACTGAGGGATGGCTCAGGCCAGA^ 

CCAAGGGGCGCCAGACTGGATCCC^GAGGAAAACCCAAGTCTTCGAAATG^ 
15 GTGTCCCAGGG<3GGTTGGGATCTGAGTCCTTTTGCTATAATGCGGTTATTATCCTATTTGTCCT 

GGATACTATATGGGCCCTCTAACTGTAAGGTGAATGGGGCATCTTTAGTACCAGTGGGAG 

CGTATATGAAAATGTGACTGACTGGCCCTTTGCTCCCrrCCATTTCT 

CAAATAGGGGTCCGCGCTCCCAGTGGGTCQ3GAGACCGGGGCGCGGTGGCTGACCGC 

ACTCCTCTGCGCTCCTCGCCCAGCCCCGCTCCGCCGAGCACTCCCATCTGAGACGGGATTTGGA 

2 0 GGAATTCCATTACCCCACCCACCGCATGTGGCTCTTGGAAAAAGCT^ 

CGCCTTCCGGCCTGTTCTCCMGaSTCGCGCCCCGGGCCCGCCCACAAGCGCCTGCCCC^ 

CGACCTGTTCGCTGCCTCACCTGGCGGGCCIGCGAATGC^ 

TTCCACGGGCCGCCACCTGCCCCGGCCGGGGGACTCCCCGCGGC<3CAGTCCCGGCrGCACGTCTCC 
25 CCGGGCCCCCGACAGO^ACACGGCCTCGTC^CGGACTCGTCGCCCTTCGGCTCCCCGCGGCCAGGCCTC 

CCGCTCTTCCACCTGGACTTCCTGTGCTGCCAGCTGCGGCCCACGCGCGAGAGCGTGCTGCX5CCT 

GCGGCTCTCCACCGAATATCAGGCCGGGCCCGGGCGGCTGCGGCTC 

GCCCCGGGAGCGGCGGCGGACGAGGATTTTTTTTTCGACGGGCT 

3 0 GCTAGACAGGGGCGCGGGACTTCGCAGGGATGTGCTGCTGGGGGAGTGCGAGACGCCCCTCAT^ 

GGGGACTAGGTCCCGGGTCATCCCTGGCGCCCACCCATCTCAGCCTGTAGCCTGAGCCCCTGGC^ 

GTCTGCAGTCCACATTCTTTCCACCCTGCCO^CTTGTATTTATTTTTGCTAATAAAATGTCCCCTTGTCOT 

CCCTTACTGGCACCTTACIACGCTCGGGC^ 

AG ACGGTCTGAG TAG AG CAG ACCTTAAG CAGTGGTTCTCAAG CTTG AAGG AG CATCAG AAT AACCAG ACAT CAG AAAT C CTG GAAG 
35 AGCTTTTTAAATATGCAGGGTCCTGGGCTCCATCCCCATGAGATGCT 

GGCCAACAAGGCCCCCGAACAATCAGGGAGCCCTGCAATCACTGAGTCACT 

GGCTGCACAGAGGAAATTAACACTCCCTTCCTGTGCCCCTCTAAAACCCACATCTGTCCTGAAGACA 
GAGAGTGAGTCAGGGGATGTCAGATGAACACAGAGTGCCCTTTAGTCTTCCTTCTTCCCT 

CTTG AGGGGTGGGGCAAGTCTGGAG C CAAG AAC CAGAGAGGTG CTTG GGT AC CAAATGAATG AG ATAT AATGT CCCAGGG ATTG AA 
40 AGGGGGGAGGTGGGGTACTACCAAAGTGTGAGAACTGTTCTCAGGAATGCCAGAAGAGCCTCCT 
GGGGGGAG<3TGGATAATTGTAGGTCACATTGTGTTTACCTCTACCTGAATGTCCCTG^ 

TCCACCTTCCCAGGAGCCGGTTCCTCTGGACTCAGGCCAGGCrTGGAGGAGGGAGGGAGGAAGTGAGGTAGGCACTAGGCT 

AAC!TTCTTTCTCCCTCTCCCTTGGGCCCTTTCTCAT 

GACAAAACCTCAAAAGCCTCAGCACCTCGACAC^ 

4 5 AGAGACACACACACACACAC^CAC&CACA^ 

TCCCTGATTGGGTCCGCACCACCCAG CCCTAGGCCAGAGAACTCTTTGCTCCAGGTGTCCACCAGGTGGTGG CATTGACCAAACAAT 

CCTAAGTGCTAACTACTACACATGGATTATAACCACATGAGGAACTGTTCCAAC^ 

TTCTATTAGGGTGTAAATACTATTATTATCCAATGGGATCTC 

CTAGTAAAGGGAAGAATTCAGGGAGTCTGGCTCCAGAGTCTGGACTCTTAATCACTO 

5 0 GCCTCCCAATCAAGTCACTCTAAAAGGGGCCrCCTTCCCTAAGATGGTAGCT 

CCATGTTTGAATTCTGGGAAGTCTACCTATAGATTTAGGTCrACCCCrAGTACTGGATTTCTAT^ 
AACAACCCTTGGTGAGAAACTGGATTTGATGCCCATTTGAAAGTAT 

ATTCCTT CAACAAATACTTCGGGAAGGTC CAT CAGGTGG CAGG CATTGG ATACAG CAGTGATTAAACAGACATGGGGATCAG AAAA 
GTGATAAAATAAGC CATTAAAATG T AAAG AATG (HACTTGGTG ATATGTTACAGAG AAAAATAAAGCAAGAA 

5 5 AGGGGTCAGAT AAAGAGAAGGGAAGATT ATAGGGACAGAAAG AGGGAGGGAGGAGAGGAAACTTTTTTTTTTTAACCATAGTAirTT 

TGGACCAGGTCCTGAACTTTATCTGTGTTATCTCAGTTGATCATCAC^ 

TGGGGTGGGGGGACGGAGTCTTCACTCTGTTG CC CAGGCTGGAGTGCGGTGGTGCGAT CTCGTCTCACTGCAATCTCCACCTCCCAG 
GTTCAAGTGATTCrrCCTGCCTCAGCCTCCCGAGTAGCTGCCACTACAGGCGC^ 
GTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGGTXrrCGATCTCCTGATTTC 
60 CCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTAAGCCACCGCGT 
CTTTACIAAAGAAAGAAATGGATCAGATAGATAGGTTAAGTAAGCTTGC^ 
TGACTCCACCCAGACACCCC(ZACTGCCACCCCAACTGCTTAGGGGAAGCTC 

TCACT CAG CATTCACAGC C CTGAGGCTCTGTG ATGGGGGC CCTAG CATTG CTGG AGGAACAT CC TG GG TAGCT AAAGT ACAGAAAC 
ATGGGAGTGAGCAGGTCCTAAGAAACCCTCCAGTGTGCAGATTGCTCTG 

6 5 GGTGTGGTGGCrTCACGACTGTAATCCTAACACTTTGATAAGTGGATAAATC 

GAGTTTGAGACCAG CCTGGG CAACATGGTGAAACCCCATCTCTACAAAAAATGCAAAAATTAGTTGCATGTG CATG CACTTG 
TAGTCCCAACTACTTGGGGGGCTGAAGCAGGAGGACCTCTTGAAC^ 
CACTCCAGCCTGGGTGACAAAGTGAGACCTTGTCTCAAAAACAAAAC^ 
CCCATTTTTACTGATGTGGTAGCTGAGGGCTCAGATTTCCCTC^^ 

7 0 ATTCTG C CCCAG CATTT AAAC CCTT CTTG C CACCT C CAAGCCTGTGTG ACT ATAAATGTT AATTGG TAG AGGAGAGGT ACAAGCAG 

GGTCACTGTGGAAAGTTCAAGGCCAGGAGCTAGGGTCAAACTCCCACTGTTCCT 

GAGGGGTGGGGACTGTGAGATCTGGAGGCTAC CGAAGGTTACAGAGCCTGGG CTGT CCITTTTCAACGGTTGAGGCACAACCCATC 
CCTGAGTCACTCCAGTGGGTTGAGGCGTCCCCAGGCATGGAACCCCCTAGAACCCAAAGAA 
GCATCCCAAGGAGAGGGCCCTCCAAGGATGGGGCTGGGGAAGTGTGTGCCACCGC^ 
7 5 TGACCCCCAGGCCCTCCCTGAAAGGGCACATTTGTGCTGAGCTTTAAGTGGTTTCTGTGGTT^ 
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GATGCTGGGGACTGCTCCCCAGTTCCAAGCCACraCC^ 

CCAATAAGGGGACTGGGCACGTGGGGATGAAATGATTCTCTGATCCCTATCA£^ 
AAATAACC^CCGCCCTGCTTTGAAAGGACGATTCTTCdvG^ 

GGGGAACTTCCCAAAAGGAT AACCAAGGGATCITCTTG C CCTGGGGAGAAAAAAAACTCAAACTAT C CATG CTT AGGG CAGGGG AG 
5 AGGTAGAGAATGC CTAGGAAG CATGTGACACTCCTAGGCCTAGAGGTCAG GATACTGT CTT CTAAATCTCX1?IGTCACTTCCTCCCC 

ACCCTACATAGTAGCACAAAGTGGCACAACrG^ 
TTCTACGTGACCTCCA.CTCIGTGCCnGCAATTGOT 

AGGAAGAGAGG CCTAAAAAGGAGAAAAGATAGAGAGGGGACAGCTGGAGAAGA^ 
AGAGAGATGGGGGATAACTGAGGAGTTCTGGGGAGGAGGGGAGCAGAACGGCATCTCCC 



10 




CACTACTCATCCACTXaTCAATGACCCTAGGAGTTGAAAAGTCCTTCTCCA 
GACTCTGGGAGCTACCTAAGAACAGAGGTGCTGATC 
1 5 CGAGACGGCAGAAGAGGGAAGGGACAGCTGCTTGCCCT 

ACGAGGCTGTGGCATGGCTGACTCCTGCTCTGCTGTTTACCA 

CAGCCTGGCCCCTGGCCAGTTGCTTGTTGGGTAACCAGGAGAGTCAGGCCCAGCTAATCAAACrC 
AAAAGCAGCACCAGGATCCCCAGAAGCCTCGTCCTCT^ 
CTGGGAGCCTCTCCTTCTCCCAATGAGTCCAGTGCAGCATCTOT 
2 0 CCTCCAGGCCTGCCCCTTGGCCCAACAGATGAGGAGCACGTAGAAGCATCAGGGGGTCAGGGAGTCCT 

T AGGGG CAGGGGCCGAGCAG GCAGAGGC CTGAGGGAACTG AAC^ CTAGAG AAGAGACACA 

GTCCCCCAAACCTGCCCTCTCCATCTCCCCTATGCCTGTCACCTGTCTTAGGTAG 
GGGTCCAAATCATAGCITCACCCATTTATTGTTCTTGTGTGAGTCCTTGGGCA 
CTGATCAATCTGGAATAATAATGTTTCCCTGTGGTGGGTGCTGTGAATATT^ 

2 5 CCTGCAAAGGGCCCAGCACACAGTAAGTGATTAATA 

GCACTGGTCCACTGGTCTCTCCTCATGTCCCCT 

CAGGTGCTCCCGCCrrCCACCCTCCGCCCTCCTCCCTCCCCTGGGCCCTGCTCCCTGCCCT 

GG CACCAAG GGAG CTG CCC CATGGACAGGG C CCCACAGAGACAGC ACCGAG CCT CACGGGGTAAGAGGCCCATGG C CCTGCCTGCA 
GGTGGAAGAGACTGCTGGAGGTTCAGAGGAGGGCAGAAATGGCAAGAGACTG 
30 TGCAGAGGAGAAAGCCCTGGGTTGGCAGGGAGTGAAACTGGA^ 

ATCGCAAAAGG AG AG AAACAGGAAAGAAAGGGAGAAGCGGG AG AAGGAAG AAACAGG TTGG AGGGT AG ATG GGG AGGG AAG AGCCA 
AGTCAGGGGAGCCAGATCTAGACCCAG CGAGGGGTCAG AATGTGGACAGG CCCTCCCAGGTGTGGGCTTACCCCCCAGGAGGCACA 

GAGAAACCTGTCCTTTAAGGAGTGACAGACTTGGGGAGTGTGGC1AGGGGGAACCACACCCTGTCAGCTGCCCATGACCACGTAG 

3 5 TGGTCCTTTCCTCGCCACTCACCCAGGGATGTGGGATTTCATCTCTGCAGAGAATGT 

AGAATGTCAGGCCGTCCTTGGCCCCACCCATGGTCCAAGTCAGGAACCCCCTCAGAGCAGGATCG CAGGGG 

CTGTGAGCATGGAGGAGGAAAGTTTGAGGGCCAGAGCCACCAAGCTCCAGGGCCACCTGGTGGAGGAAGAACTC 

AGCAAGGGGAGCACCCCCACCACAGGACTGCTCTGAGGTGCAAGGCTAGAGGC 

GTGGGGCCAGGCAGGAGCAAGGGGAGGTGGGCGGGGCTGGGTGGGGCCTTGCCACCTCGGGGCAGTCTTTTCTCCTCCCT 

4 0 AGCAGAGCCCAGGCCATCAGGCCACTCGCCTCCACACAGCCTAGTGACC^ 

GCACTGTATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTCAGGTGTCAAGACT 

AGAAGGAAAAAAATAACAGAATAGAGTCAGAGGTTAGGGAAGCTAT^ 

CTCACCCCAGCAGAGCCCAGGCCATCAGGCCACTCGCCTCCA 

GCAGAAGGGCACTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATGTCAGGTGTCAAGACTGAACAGC 

4 5 GAATAGAGTCAGAGGTTAGGGAAGCTATAGGGGGTATAGGG 

GTGATGAGCTGCTATGTGGATGTAGGAAATGGGCCAAGGTCTGGCAGGGGAATCCTCTTTCCAGG^ 
ATAGATGTTAACTGCATGAATGGGTGACACCCCCCCACCCACCAACCATGCTCCAGCTAAGGCC 
CTGGAGGTCCGAGTATCCATGTGACCCCCATCGACCCACTCATGCT 
GCAGCAGAGAAGGGAGGCCTTCCTCTTCAGGCTGGTAGGCCAGGGAAGCCTG 

5 0 ACTCTCCCCTCCTCTTCCTCCAAACATCCTCTCCC^ 

ATAAGACCAACACAGTCCCTGCCTTCACAGGG CTTTACAATTTGGCAGG AAAGACCCA CA 
GTTAGGTGACAGTTGGTGACATGGTGAGTGCCAGGAAATAAAATGCAAGG^^ 

TGTGTGTGGCTTCT CCAAGG AAG AAACATAATCAG AG ACCTG AG GG AGAAAGAGAAG TTG G CTAG ATTAAGG GGT ACAGGAGGGAC 

5 5 AGGCATTCrTAGGCAAAGGAGCAGCTTGGGGGAGTCCCAGAGGCAGTGAGGAGCCTGACACTTCTGGGTACCTGGAAGAGGGCCAGT 

GTGGCCCAAGGGCAGAGGAGGAGAGAGGGGCACTGCAGGGGGTGGCCTGCCTGCCCTT 
CGGCAGTGGGATCCTAGGAGTGGGGAAAGTGTCTCAGCCCCTCATGATCTTC 
AGAGCTGCTGGCTGCAAAGAAGACCCACACCTGTGAGTAGGGGGTCCGGGGCAAGAAAATC 
AATTCAGCAAGTGATCTGTGGGGGGACTGAGGTTGGACAGAGGTGGAGGAGTCAGGA^ 

6 0 GAG GTGCG AG G AAAC CACAGAT CAGGGCAGTGGG AGGAGATG AGG AAGAAG CCTGAAG CATG AAAGGAAACACTG T AAAGAAACT A 

GGAGAGAACAAAGGAAAAGGGAACCGCCTTGTGAAAAAAGGCGAGGAGAAATGCT 

CTGCTGCCAAGGGAGACAGGGAGCCTGGACCAGGCTTTT^ 
GATGGCACAATCTCGGCTCACTGCAACCn'CCACCTCCCAGGTTCAA 

6 5 AGGCACCCACCACCACGCCTGGCTAATTTTTGTAT 

TGACCTTGTGATCCACCCCTCTCGGCCTCCCAAAGTGCTGAGATTACAG 
GGTCCTGAGCCAGGTTTGGTGTTCATCTCTGTGGTTAACACATCCrrGAGGTGTGAACAAGCC 
AGAGCAGGAGAGCTTTATCTCTGTAGGAAACAGAATGGACAGGGGCACTCTGTGT 
TAAGAAGGGCACTAGAGCTCAGAGAGGACTTCCCA 

7 0 TCrCCTCTCCTAGTTTGTGGCAAAGGCTAGTCCCCCAGAGAATGCA 

TTCTGTGAGCATGGGTCCCTGAATCGAGAGGATCCCTCCAGGGTAGGAGTGG CCACACCATTT CTCCATAGCCGTCCCTTGACTG C 

CTTGCCAGTTCCTCCTGAGAGCTCACTGCCATCTCG 

AGAGAGCAAAGCATTGCCCITCTCCATGAGCrGGGCAGCCCA 

TGCTTTGCTCAG CATAATCCTGATCAGTGTCTCG CCTGCCCCTAGCCACAGGGCCCCACGGAAATGCTGAGCAGGGCCCCX3GGAGG 
75 AAAGCGGGTCAGGGAGTGACGCGGGTGAGCTGGGGGAGGGGGGTGACGGACCTTATGTC 

2125 



WO 03/008583 PCT/US01/51291 



GCAG CAGGAACTGGGAGAGGT CAAACCCTATTCCTAC CCTTAACCCCTG ACCT C^VG A 
CCTTTCCCATCCCACCAGTCCTCGTGTCCCCACCCCAAGAGTGTTGTCCTGAGGA 
GCCCAGCTCCTGAAGCCCCAGGAAGGCCCAGGAAGGCAA^ 
AGACACCTCTCAGGAGTTTGCCCCACAGAATCCAGGAGACATTA 

ATTG AGACAACCCCAAGATGGGGGTC CAAGATGGGGAACCCGGGGGAACTCCC CAGAC CAAGAGGGAG CCTGAATACTGAGGGGCC 
CTGAAGAGGGACTCTCAAAGCCAATGTACCCCACAAAGC^ 
GACCCCCCTCCCCCAGGCCAAAAAACACTCCAGAGAGACTCACAT^ 
TTGAACCAGAGAAACCCTCCCATCCTTCAAGTGGGATCCGAAGCCAGTCTGCCTGGGCT^ 



1 0 TTTTTATTTTTTCTTTTTGAGACAGAGTCTTACT 

GTATTTTTAOTAGAGATGGGGTTTCACCATGrrTGGCC^ 

CCAAAGTGCTG<^ATTACAGGCGTGAGCCACTGCCCCCGGCCCACATAAAGTATTTTC 

CG AG AAC CATTTCTGACACATAAT AGT AAG CACT ATTAT T ATG ATTATGAGTATGATGATGGTGATGATG ATCATT ATCT ACACTC 
1 5 CAATTTCAGCAGTTTGGCTCCTAAGGAAATTTCTGGTTTCCTT 

TCATTT CCATGTATT CCC TAGC<XI7^AGATATATGTGGTGTCTG CAGGG AGGAG CGTGGGCATGGGAGTGGTGGGACAGCC CC CCAG 

GCTGCACCACACTGGGTATGGCTGTTGGCAGTCCACAGGCT 

ACCTCTGAAAACATCCCTGTCCCTGAAGTCACCTGGCAAAGAGGGTCAGGC^ 

TGGCTTC^GGCTCGTGTGAGCTCCATTCTGGCTCCCCAGCCAGAACT 
2 0 TCGCATGIKK3TTTTGGAATTTTCCA^ 

TGGGGCGCCTGTCATCCTACTCACTGCACCCCTT^ 

ACCCCCCAGTCCAGAAGCACTGGGGGAGAGAGCrTAGGTGCAGAGCTTCAGGCTGAGG 

GCCGCCAGCTGCACCCCACTC CTGGACCACCCCCCTGCTGAGAAGGACAGGGAGCCAAGGCCGGCAGAG CCAAGGCTCAGTCATGA 
GAAGTAAGTGAATGGGGCCACCTGGGGGCGGGGGAGCCTGGACCCTGTCX3^ 

2 5 GTGAGCAAATCTGAGCAAAGCAGGGTGCGTCCTGGTCTU^G^ 

GGCCAGGCGGCTCCCTGACAGCMCAATGGTAAGAGGACTGTTGAGGAATTCCTGAGTTC 

GGTATCCAGGCCCTTAATTTTACAAAAAGACTGAGAGCCAAGGAGGCCGCG 

GGGGCTGGTAATTACTAGGGGCCCTGAACGCACTCTCCTGTTACAGTGTTTCCCITGTT^ 

TG CTTGTGAGTAAAAGGAGGCCAGAGCAGTTGCTGTGCTCTGA CAG 

3 0 GGC^C^GAATCCCCGCGTCCAGGCCTCAAAGCCACCAGGCTTTCT 

GGTGCACCCGGGCACATGCCTGCGTGTGTGTGCATGTGTGTGTGTGCG 
CTCCAGGGTC^TCACCCATTTGACCGGGGAAGCAAGGCAATGGAGATC 
TGGAGCCCTGGGCCAACAGTTGGTGGGCAGATGTCTGGGGTTCCTGTGCCTTC 
CCCTGTATTTGTTGTGAAGGGAAGCCAGGGAAGCCAGAGAAGCTCTCAGAGCACAAGGTGA 

3 5 CCTGTGGCTGAGGCCTGAAGCAGCCCCAGCTTCAACCTGGGGT 

AGACCCCTGGACTGACAGTGTAAGACCAGGCACTTGTG CCG C CTAT CACTGGCTAG CCATATAG ACTT AG ACAAG C CAAGTCACTG 
GGCCCTGGTCTACCATCCGCAAAACAAGGGAGCTGAATTTTAAGATAGAACAACAAAACCA^ 
GG CCCAGGGCCAAGAGAG AGCCCAGTGCCCTAAGGCAGGCATC CCCAAGGGTGATG 
GGAAGGGAAACGAAGGAAAGAAAAGAATTCCACTTCTGTGACA 

4 0 GGAGGGCTGGAGAGAAGAGCCAACAGCTTCTACACAGAGGAAAGGGCCCCTTGGAGCTTCCGTTTCCCT 

AGCAGCAAACGCTGGCAGACTGGTCTTCCCAGA 

TAGCCCAGCAGCGGGGCAGGAGTCCCCAGGAGGGGCTGAGGGAGACACTTGGCCAGCT 
ACACAGCCCCACCCAGGCGGGGAAGTAGGAGAGAGGGCCTCCTCCAACGAGG 

TTACAGGTATCTCATAGCCCCAAGTTAT CTGACCCCAATGAGTGCCAACCACCACCAG CCCCTCCAACCGCCGATGGCAAAAGTCA 
4 5 AAGCACATGGGGCTTGGGACTGTGACACA.CCCAGAGACCAAGGGCCCG 



ATAGAGCCAAACGCATGGAGGTGCAGCCGGGCGTGGTGGCTCATGCCTATAATCCCACT 
GTTTGAGACCAGCCTGGGCAACAAAGTGAGCCGCTGTCCCTACAAAAAAAGTTTTTTTTAAT^ 

TGTAGT CCCAG CT ACT CAGGAGACTG AGGCAG G AGGAT CGTTTG AGC C CAGG AG TT CAAGGCTGCAGTGAG CT ATG ATCAATGTCA 
5 0 CAGCACCCTAGCTTAGGTGACAGAGCGAGACCCTGTCTCTTAAACAAACAAACAAAC^ 
GGAATGCAGAAGATCACAAGGCACATAGAGACAGGGA 

AGGGTCTAGGCAAGCGGTAAGAGGGAAGTAGGGTGGCTCTGACAAGGATGGCACCAGCACCCT 

GTGGAGATAATGTGTCTGTACGCCAAAGAGACTCCCAGCCAGCCAGACAGAGGGT^ 

AGCGTGGCGCG^TAGGCACTCAGGCCCTGGCATTTGGTCTGGATGGGGAGG 

5 5 AG CTG ATCTTT CTT AGGTTTGGT CATGATCATG AG GAACAGTG GG C CAGAAAGG ACTC CAAGT C CCCAGGGT ACACAG ATATCAC C 

CTGAGAGCCCAGGGCCATCTTAGGACTCCGT13AAGGGCTTCTTCCTTC 

CCATTCCTAACAGAAATACCAAACCA.CCGAGCCCAAGGTGCTAGACTCTGAAAAGACT 

CGGTCCCTACCAAGGGCTCCTCCCCATCCCCTGCCGCTGTGCTGCTGA 

GTCTCTCCCAGGCGGGCAG CCCAAGGCCCGCG CTCCTGTTCCCATATAGCTAGT CCTTTCTG AAGTCTCACCTCCACGCCTGCTGC 
60 TGCAGTGCAGCCCCTTCCCTGGGGCTCTGAGGCTC^AAAAGGATGGAGATGTCTA 
CTCAGGGCGAGAGCCAAGGTCTTTCTGGGTCCACTACCCTCCTGGAGCIAG 

AGGCAGGGAAGGTGCTGATGGTGACGTCTGGGCTCCCACTCGCCAGAGCTTCCTCCTAGTGATTCA 

TTGTTTTCACT CX3CCTTTCTCCAGTCCCAGACTGTGGGGGTGGCGGAGGGACCAG CTGGCTGG CTGTCA 

TTCATGGCTTTTCAAAACCCCAGACTCTCCCTCGCCCACCTGAGTTTTAGCT 

6 5 CAGCAGAAAATTCTCACCAGGAGCTGCGGTGAGGGCCCTGGCCTGGGCTC 

AGTCCCCACCCAGCCCTCCCGCTCCTGCTCCCCTGTC 

AGAGCTGAGAAAGGAGGGAGGCrGGTGACAGGTTGGGGTGTGCAAGTGACCCCTGACCGGAGCTGTGGTCTCGGTGGGGAT 

CAG AGAGTT CTGG CTGTGG GGAGAG GAG G AGCTCAAG CTAAGG AAAGAG ATG CATT CCTCTTT CTCACT CATTCATT CATTCT G CA 

AAGGGAGGCATACCAAGATGTATGGAAACCTAGTCTATACCCTAAAGGGGCTGTGGTGTTCCTGAGGAGTCCCCGGGAACC 

7 0 GTTGATAT ATAAG T CATCCCACTTG GCCTTAGAG AAAGGGTT AAGGGCAG GAGG AG G CT GGGAC CT CAGAGGAAG AGTTGGG ACAG 

AGGGAATAATATCTCTTGGGATCTTTATACAAACAGGAGTCCTGGCTTTAGTTCTGTGGGTTTCOT 
CCACC^CAATC^CCTCACACTTTTACCCCTACAGCCAAAT^ 
TCCTCATAAAGTATATAGCTTAAGGCTTGAGGGCACIXMCTGCA 
TGGCTGTGTGACCTTAGTCAAGCTGTTTATTCTCTCTAGGCCTCAGTTTTCT 
7 5 ATTT CACAG GGTGGTTCTGATC^TTGAATGAAAATGCATGTAAAG C CTGG CACAGAGT AAG AAT TCAG CCAATGCTTATGTCTAT C 
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ATTATCCITTCTCCrTAATCAGTATCCACTGGAGAAAA 

CACCAGGAGGAGATGAGGACCCCGGGGTTTCTCrGTQTGAACCCA 

GAGAGGAGAGATCCCCCTGGGTTGCAGCCCCTCCCCTGCTCCTACCACTTCCT 

GGTTGG CTT CCTAAGCTGG CACAGGG CCAGG CTCTGGAAGT ACTCAAAGCACn\AT CACAG CTGTGAG AGGCTCGAT 

5 TAGAGGJU3GATGGAGGACTTTGCCTCTACCGTATTTCCCCT 

AAAGCAATTCAGAGGAGATTCAGl"lTCrTTTTCTAGTATTGTCT 
GCAAGAATGAGCTCAGGAAAAGATCGCCAACCCCAGCCCCACTCCCTGGAAG^ 

AGTTCTTGCTACTCAAAGT ATAAGCTGAAGACCAGTAAG CTTAGGCAC CAACTAAGAG CTGACCAAAGTG CAGAATCTTGAGTCC C 
ACCTT CAGATCCCCAGGTAATCrTGTTTGAGATG CAGAAAG C CAT AGTCATT CAGTCATT AGATGACTTAAGGAGCTGG CGGGTGCA 
1 0 TTTGGGCATTTGTCTCTCCTITTTGGAAGATGTGGi^ 

GAGAGCTGCAGGTGJVGATGCAAAGATCTTTCCAAGACAGC^ 
CTGGTTCGAGGTTCCTGACCTTTGCTTGGCT 

CrrCCCAGGGGTAGCTTTTGAAGGTGAGTTATCATCAGAGGTGTTT^ 
1 5 CCGTTCTCCCCCACCCCTTCTCrrACAAGTAACACACACCCTCATGTACACAGCACT 
GCATTGAAGGGAGTGCAGX3TGGCAGGGGAAGG 
AAGGCCAGGATACAAGCGTGGATTTTCTGTGTTTCAGTGTT^ 

CTCACTGGGTACTTCTGTCTTGGACTGG^ 
20 C71TGGGTATTTGCGAAGGGGGTCCTTGACTGGCCGCATCTTCCTATGAGAAGGTCA 

AGGGCCAGGAGAAGCAGTGGGTCAAGCATGGGGCAGGGAGATGTGTCTGGGGTCACCATGTGGTCAG 
GTGGCTCTTGGAGAAGGGAAGAGGCAGGGAGGCCTGTGGTTGACTGCTGGGCCAGAGCCCGAGTTCCCACTGC 
CTCCCTCCCGCTTACTTTTTTGGGTGGCCCACGTTGGTTTCTCCCATCTCTGGTCTTAGGCC^ 
TTGGAGGGGGAGGTGTTGGGCAAGAACAGAGTCCACAGGGCCCGAGTGCTCCTCTCAGCCT 

2 5 CTGACCTCTGCTCAGGAGTAACTTGAATACAC!AGTGCTTTCCATGGGTGATATCATT^ 

CATGGACAGGCAATAATACTTTTATCCTGATCTTGCTG^ 
GGGGCCCCAACGTTTTCTGCCCCTGTTCCAGACACCCAGTGCCCTCT^ 
ATTTACrcGAAAGAACCTGGAAAACTCACTTACCGCCTGGTCCT 
CTGGGCAGGCACGTGTGGCTGGGGGTTGGGTGAGGAATTGA 

3 0 AGGGT CAGCTACTG CCT CTGACTCACCTCAAGTCTGG C CTG AGGG ATG ATGGTTT AAGAATGATGG AAAT AATATTTT<K5GGAATG 

GCGGGCAAGCCATGGTCTCCCTGCCCACTTCACATCAACAGGCCCTACATGT 

CACAGCAATGTGCATTCGCCCTCCAGAGAGGCTG CCACTTCTTTTGCCC CTTTGGTGGGAG ACACGTCATAGATTAAAAGGGGCTT 

TGGCCCTTCTCTTC^CTTTTCTGGCCTGTCCCGCCCCAGGGCTCCCCACTCAGAGG 

AGACCCATGTGCrCTGGGAAATGCTAAAAACACCTACAGTAACCAGGGATCCCAAGCT 

3 5 GGGAGAAGGCCTCCTGTGGTCCC^TGAAGCTCAGCAGAGAGTCGTGGGCATTGACAGGAACAGGGCAAAGCA^ 

G CCAG CAG CIAAACCACAGGGGT AGGGAAGCAGGTACAG GTTACAGTG AGTT CAGTGGGAGAGAGAAC CTGGGGT C CTAGTG CC C CA 
TG AGCAGACACTAACAT ACAAATATG CACGTGTGGACG CAG G AGCAACCAG ATATG TCGT AG G CTCACAAATGACACAT ACTTTT C 
TTGCATATGCACACCCACAAGCACIACCTTCAAAGATGCCCACT 

GAAAGCTCAGGGGAACTGGGATGTAGATCCAGCTCTCAGGCAAACTAGCCTATGACCCTGGCCCAAC03CT 

4 0 TAGTCCTTCATCGGTAAATGAAGAGGTTGAATGAGATGGTCTCTAGTCTCAGCCATTTTT 

ATGGATACAGATGCCTTTGTACAAACATAAGTGTTCACACTCAGACAC^ 

CCAAGAGCCTAAGAGCCCTGGCTAATTCTTTCCCTAGGCTCTCAGGCATCCA^ 

TTCCTCCCTGCCATCCCCTt^CCCTG 

CTATCCCCCACCCCTTGTGTACCACATCTTCGTGCTATACCCTACACTTTGCCCATG^ 

4 5 ACAGAGGAAGCCCCTTCGGCGGGAAGCAGGTGTTGTGAGGCCGTGAGGAGTCCTATGTCCCAAGGCGGGAGGGAGGCAACT 

CCACACTCCCGTCCTCTGTGAAACAAACACAGCTTCCTCACACCCT^ 

TGCAAGAGGGAAACZAGCAAGCTGGAGCl^TT 

TTGAATGCAGAAAAGCCTCCTGGGCTGACCTACTTCTCTTTCT 

5 0 GGACAAGTCGTCTGGGATCCACTACGGGGTTATCAC 

CGTGTGCATACACAAGCGCGCGCGCACACACACAC^ 

ATGGCCITTGTTCTGACrCCAGGGATGATCTCCTGGGCAGCCAGGAAAATGCCTG 

G G CAG GG C CCTGTCTTAAG CTGGGG AAATGACTACAGGATAAATTG CAATT ACACAAAT AG ATGG AGG AGAGAGAAAACTGACAGG 
T CGTGGGATGTAGAAAAGCTGCCAGAGCTTGTGGGCTGGAGGCCTTTGTGAGTGAGCTGGG C CTGACCAGGATAAGCAGTCTTGCC 
55 CTCCACCTGCTTTCCCCAGGGCTTCTTCCGCCGGAGCCAGCGCTGTAACGCGGCCT 
TCGACCGCACCAGCCGAAACCGATGCCAGCIACTGCCGCCT 

TCGACAGCCCCCTGGGGTTTTCCTGGTGTCTCCAGAGGGGCAGCCTGGCCTGCTGJ^CTAGACAAGGCT 

CATCOTOTGGTCTCCTCTCCATTCCTCCCCGTTACAACCCCrrGTCT 

TCTTTGGAGTCCCCAACTTTAGGCrTTGCTAGA^ 

CT CT ATCTCCTTTTCTAAT CCCCCATTCC CATACT TATAG C CAG GAATGGAAACAGAATTGG C CTGAG AACAAC 

CAGAGGGTGGTCGTGGGGGAGGGCTGGTGTTCCTGGTGCCTTATCC^CCCTCCT 

ACCTCCACCACCACAGAAGGAGCCTAGGGTGGAGCTGGGGGGGCATGAGGTGATGAGGA^ 

AGTGCCCAAAATAACAAAGCAAAAGGAAGCACGCAGGGGTGCAAAGGGG CAGGCGGGGCGT AGGCTGTG CCCCTACACCTGGGAGG 

6 5 GGTGGCGGGGGGAGTAAAAAGGCAGGAAAGAGAGAGC^GAAGAGGATGTTCAGAA^ 

GAGTATCTAGGTCACCAGGGAGCCTGCAGGCCTGACCAC^ 
GAGGCGAGGTGAC CCCAATACAGCTTGAGGCCCTCTGCTCGACCACCCCC^^ 
ACTCATTGCTTGAATTCTGCCATGATTCAAATCTGATTTAGAACT 

G AC CCCAG AAG CAAACAAG AGTGAGAGGTGGACGGGGTCTTGAGCAT AAGTT C C C T AAAGG CTAG AGAAGCTGTGCTTGG AGT CAG 

7 0 CCATTCAGAGAGCAGCAAGTTAATCCTTTAATGACC 

CTGGTCCCTGGACCTCTTTCAGCTGTCAAGTTCC^ 
GCTGCAGCAGCGGCAACAGCAGCAACAGGAACCAGTG^ 

GCCTCAGGCTCTGGGCCCTCATATTCCAACAACTTGGCCAAGGC^GGGCT 
75 GCGGGGCAAGGCTGAGGGCAGAGAGAGCTrTCTATAGCACAGGCAGCC^ 
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ACAGGCATCCTGQXTTTC^GGAAC^ 
GCCTCCCTGACAGAGATAGGTGAGOU3CTGGGG 
AATCAAACATGCGCXTTCAGGGAATTGAGGGGTCCAGACGAGGGGra 
CCCTTGCATGGGTAGGCGTAGGAGCTGGCTGAGATCAAGCCATGCCTTCCT^ 
5 CAAGTCCTACAGGGAGACATGCCAGCTIX5CGGCTGGAGGACCT 
GCTACCAGAGGAAGGTGAGGCCAGGAC^CCTGCAGGA^ 

GCACX1ACTGGGTCCTGGGGCAGGG CX5GGACATCA(^GACACAGGCTGGCCAACAAG CATGTGCACACCTTTT ATGTACAGTTGCG A 
ATGTGTGTAT CTGTCT ACAC C CCAG CAGATGGATGAG CGCTTTTCCTTAG^ CTTCATCCT 

1 0 GTTAAACACCCCTTCTCTGCCAGGCATTGTGCTAAACGCT 

GG CATTTATT CCTACTTTT AACAGACAGAAACTGAGCCTAGAG GGGT^ 

GTCAAATTCAGATT CATCAGGCTCCAAAG TTT ATG CTTGCCITTT CGATCACACT CTCATACCACCTGCTCTAAACACACT CG 
G CACTTCACATTTGCTT CTCC^GGTT ATTGAG ACCTTGGGCATAACC^ CAG CTGT ATGTCCTGTTCACCT C 

AACGAGAT AT ACC1TGTTGG AGGGCAAAGCTGT ATCT CATACTTTTTGG ATT C C CTCACAACAGCT AACTCGG TGTCAGCAG CC GG 
1 5 TAGGTGOTCAGTGTGTGGGACTCACTGGCAGGAATCTGTGC!ATTTGTC 

GGAGTTCAGAG CCTACC C CTTGCAGTTT ATTAGGTGGGGCTCCAGGGCTGAGGAGGATC AC AGGG CCAC^ CG CTACAGCGG 

GACCCrCCTCCCTCCCTGCAGTCCATGTGGGAGATGTGGGAACGGTGTGC 

GTTCGCCAAGAGGCTCTCAGGCTTTATGGAGCTCTGCC^A^ 

GCAGGCCTGGGGACAAGGGGACAGAGCCAAGTGGAGGGAGGTGGCTTAAGGAAATCA^ 
20 TGACACTGTCCCTGCATCTTCTCTCCCCACTGCCCAGGAGCAATGGAAGTGGTGCT 
CAACCGCACGGTCTTTTTTGAAGGCAAATACGGTGGGATC 
AGATTCTGATGCCAACCCCAGGCAAAGCTTTGTGACCCAGGGCACCCT 
TAAGGGCGGTGTCCTCGGGCACCATCGCTCCAGCCACTCTCTCACTTTTCT 
GCTCCATCTTTGACTTCTCCCACTCCCTAAGTGCCTTGCACT 

2 5 AATGCCCGTGAGTGTTGCTGGGCTTGGGTGAAGGACATTC^ 

GATGTAGTTAAATCTGGGAAATTGCTTTAAATAGCAGAATGAGCCCTACT 
ATTCAGAGGACTCK^GAGAGGGACAAGAGCAGCATAAGGTGGGGTTGTC 
CACTTTCCTGACAGAAATGTGTCTGATTGTTAGTCTATGGCAGT 

AAAACTATTGCTGCCTGGGTCCCACTTCCAGAGATTCCAGTGTACATGTGCTCCATGACTTATGA 

3 0 CTATCGTAAGTTGAAAATATGGTAAGTTGAAATTGCATTTAACACGCCTAACTTACTGA 

AAATGTGCTCAGAATG CTIT ACATT AGCCTACAGTCTX^ CAAAAGCAT AT AAC^ C 

TCATGTAATTTATTGAATATGGTTCTAAAAGTGAACAGCAGGATGGTTGGATGGGTATTCAAAG 

TGGCTTTCTCAC^!AA<^TAAAATCAAAAAATAGTAAOT 

GAATTTTAACAGCGACCCCCAGGCCACCAAGATTAAGAATCA 

3 5 AAGACTCTGGAT ATCC TATCCACTCCCCTGG CATGG AGT AG CTGGG CTG AG C CAG ATG AATACTAAATATTCAGAGAACCTAGGGA 

AGTGGGTCAAGCTGCTAACCTGAGTTTGGAGATCGAGACTATCAG 

TCAAACTAGAAGATAAAAACTTTAAAGATGACTTCTGGATATTTTATAGAATCAGGG 

CCCTGGTGCAGGCCTGGGCACCTTCAGGAAGGCCACCTCCrrATCAGGAGCCCTT^ 

GTTACCCCTATGACAGAATCCTAGTGCCTAGCATTGAATGAGCTATGTGGAATATCAGGTTACCAGGGAG 

4 0 AGGAGATCAGAAATTCT CAT ACCTTT CAT CT CTT CCCAGTTTTCCTTGTAGACAG C T AG CCAAGC CCAACGTC CAGAGCTTACCAT 

. TATCAAACCAAGGCACACACATGCACGCGCTCAGCTTAGAAGACCT 
GGGAGTTAATGTCCATGTTCTTTCTTGTTCTCATTACGGTCCCAC^ 
GTAGAACAGCTGCAGTACAATCTGGAGCTGGCCTTTCATCATCIATCTCTG^ 

AGCAGTCCCTGGGGTAGAAGAGGCCAGGCCCATCGCTAGCTCTGTAACATCAGAGTTTGCGAGGGCC 

4 5 GAGGGAGTGCGGGAGTACCACTCTCTGTTAGAGAGCTTGCATCAGCAGTGGGAACTAAGGGAATC 

AAAGACTGG AAAGTT AG AG GG CCTGGG ATTG GG AGGGAC CT C CAGGG AACAATTCAGTTT AAT ATAG CCAG CACTTACCCAGCAC C 
TGCTTGTACAAGGCACTGTGGAAAGACACAGAGATTTGGTCGCTGCCCCCACCAAGAGATTT^ 

GTGTATCACT AACTC T AACATAGAGTAGAATGTGGT ATG TGAT ATAAT AATAATG CAAT T AACAGAG TG C TTTTGCTG ACATGCTT 
TCTCATCCTCATGGCAACCCAGTGAATAGGACAGGTGTTCAAATCT 

5 0 CCCAACACTTTGGGAGGCCAAGGCGCtGTGGATCACCTGAGGTCAGGAGTTCGAG^ 

ctactaaaaataaaaaaaattagccaggcatggtggcaggcgcctg ctacttgggagg ctgaggcaggataatcgct 

tgaaaccgggaggcagatgttgcagtgagccaagattgtgccattgcactccagcct 

AAAAAAAAAAAAAAAATCTCCTTGTAGCTATCAGCtAGACTTCAGTC^CTTAAATGCAAGATTGAATCCCAGTGCTCTT 

ttctatccctgtgtcccctatgtataactataataagtgacaccaggaaaatgttatgagagtataaaacagg 
55 ttgggggtaaaagc^gtgggtcataaatacttcccagc-gaagatgacat 

cattcatgggggtggggcaggciattccaggcttagggaacaataggagcaaaacaaa 
tttaacc^aaaaaatggatgagatgagtatgagaggctggggataattgtc 
agaccaagaacaacaggagaaaaataac^aggcaaaatagtgtgtatgtggagaatc^ 
aatovggagctgggatagactggtggggcagaagagcaccagatgatcj^ 

6 0 ttcaatgccatataagagttaaagcittatttctaggcc^^ 

ttaggcaatttactttggaacc^gtataaaggaaccatttatgtaattattcaa^ 
gggtgttaaggatacagcaatgacttacacggtctgtgctctcaagaacttgaactttaatctc 
gagagacac^aatctgc^agagcaattggaaagcaaatccggttggatct^ 
c^ccctggcccctaccagctctcagctttcacttggactttaag^ 

6 5 tgtagcaacc^cacctactccgcacccacg^ 

TTTTTGTTGTTGTTGTTTGTTTGTTCT'GTTTGTTTGTTTTGAGGCAGA 

CTCGGCT CACTGCAACCT CCACCTCCCAGGTTCJiAG CGATTGTCC TGCCTCMCCTCCTGAGTAGCTTGGATTACAGGGATTACAC 

GCCOSGCTAATTTTTGTATTTTTAGTAGAGACAGGGT^ 

GCCCACCTTGGCCTCCCAAAGTGTTGGGATTACAGGC^ 

7 0 TGCCTTGTGATGCCXaAGAATCAAATCTAGAGTCTGCGTCATGGTCAA^ 

TCTTCCACTGCCCCTTCJ^^CCCATTGC^GCCACACTTGCr 
TTGTACCTGCrTATTGTAGTCACCTGGAGGGTTCTrCCCCC^ 

TC C TGAC CAGTAAC CC CAT ACAAAAAG CTTTAGTTTT CTTTTAAAGAACTT ATT ATCIGATACACT TTATTTTCTGGG CCC 

CTCAC CAGAATGT AAATTTAATCAAC^TACAGATT ATACITT ATTGACTG CACT AGAGG C CAT CACAGTG C 

7 5 CTAGCTCAGATCCAGATGTGTTCTCAACAAATATTTC 
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AAGCAGTGGTGAGGAAGAGAGAGAATCTAGGAGATATGA^ 

GG AAACTAAGCTGGGGGAGGGAGGCAATGG GAGGCAAGAGACATTTTT AAG C CTCTTGGTTGTAGAGGAGACTGCAAGAGATAAAT 
TG'XTC'rr I'CAG CCTTGCTCCAGTGAGGTCTCCCTGCCTC CGT CTGCTCACTGGTTTCTGTGC L*rrr ITCATCTCCCCTTTGGCTGA 
GTGCrcGCAGCATTGGTTTGCTACTTGCMTGTTAGGTC 
5 ATGTGCTAAGCATTATGATAGG TACAGAGG AGACAGGGAAAGAGAGACCTCAG CCAAGGAGCTGAAAT CTAGGGTG GGAAGGCAGA 

CAAATTGG ACGATTTTC CTGCAATGTAGTAAGTC GG C C CTTAATCCAGATGGGAGGCAG T 

TAGGGAAATAGTCTTAGGAGAGGTGACACTAGAGG 

TGTGACAGCATACAACAAGAGT CACAGACAATCACTCTTAT AAAAG CAGGAAGTG CACATTAGAGTCTTTGACAAGGTTCATTCTA 

GAAGTATTGGGAACATAAATTGAGGGCTTC^CCCAGAAAACATTC^ 
1 0 ACACACACGTAOTTCTACCAGTGGGATTTGGCrGGTG 

CAGGAAGAATAAGCGGACTTCOTGCTCTGAGGAGAGCTGAAGTATT 

CAAGGGGAAGCTTCGGAGCCTGTGTAGCCAGC^TGTGGAAAGGCTGC^ 

CTTTCCCTCCACTCTACAAGGAGCTCTTCAGCACTGAAACC^ 

TGCCTCTCCCTATGGCCTGCTCGCCCACCTCCCrGGACCCCGTTCCACCCrCA 
1 5 TCCCCACCAGCTCTTTGGAAGTGAGCAGATGCTGCGGCT 

GTGGGGCTGGCAGAACACCATCTCCAGCCTCAGCTTTGACCTGTC^ 

GGGGTGGCTGGGATTTAAGGACTTCTGGGGGACCAAGACATCCTCAAGAAAACA 

GCAATTCATTCAGAAGCTCAGAAGCTAAGAATAAGCCTTTGAAATACCTCA 

GATCAAGCT CAGAGACTGG CAGTGAGAGCCCAGAAGGACCTGTATAAAATGAATCTG GAGCT TTACAT TTTCTG CCTCTGC CTT C C 
2 0 TCCCAGCTCMCAAGGAAGTATTTGGGCACCCTAC CC TT T AC CTGGG GT CT AACCAAAAATGGATGGGATGAGGATGAGAGGCTGG 

AGAT7U\TTGTTTTATGG G ATTTGGGTGTGGG ACT AGGGTACAATGAAGGC CAAGAGCATCT CAGACAT AGAGTTAAAACTCAAAC C 
TCTTATGTGCACTTTAAAGATAGACTTTAGGGGCTGGCACAAATCT^ 

G ACT CAACAGCAATCATGCAGTTC CAGAGACAC^TGAACCTGACACAATCTCrrcrTATCClTGAGG CCACAGCTTGGAGGAGCCTA 

2 5 ATCCAAATACCACCCCAGCTCCAGCTGTCnTCTACCACTAGAAGACCCAAGAGA 

GCAAGATCAGATCCTGGAGGACTTTCCTGGCCTGCCCGCCAGCCCTGOT 

CCCGTCATTGGGCACOSCTGACTCCAGCATG^ 

AACCTCTCTTGGCTTCATAAA<ZAGCTGTGAACCCTC^ 

GG GGTGGGATTGTGTCCTCTAAGGGGACGGGTT CATCTGAGTAAACAT AAAC CC CAACTTGTGCCAT T Cl"i"l AT AAAATGATTTTA 

3 0 AAGGCAAGAAGTGTGTGTGTCAGAGGGTGGGGGAGATTCTTAAATTAGATTACCT 

CAAGACTCAGGTTTCCAGCTCAGCAATCCATCAGCAT^ 
TGGAGCAGTCCCTCAGTGAGGACTGTGAACGAATCTTCAGGAA 
CTAGACTGAGAACTTCCCATAAATGATACCCACGGGGAACGT^ 
AAAAGACAGTCCTCTGGAAGGTGGCTGGCCCAAGCTC 
35 erTGCTATTGCATGACCCCTCCCTGGCACCAGAGCCTCCCTCCTGGCT 

TG CAG CC CTC CCTG^TTGCTTTATTTATTT ATrrX'GCACC^U^C AGGGTTG CTG CAGACTCATT CTTGC CTGGTTTAAAAAGAGAGA 
GAGAGAGAGAGGAAAAAAAAAAAGGAGAAATGCTTTCTGGCTCT 

AGCAGCAGCAGCAGCG^CAGGCAGCAGCCGGGCAGCCAGGCAGCGGGGGTTGAGGCACA 
AGCTCGAATGGGACAGG^CCCCCAGCGCTGGACAGATGCAGTGCCAAACTTGAT^ 

4 0 ACTCITCTGTCCCAAAGATGCAGCCGCCCTTTTCC ATAACATT CTCCGAGACAGGCCAGACTAGGGGCCAGACAGGC CCCTCAAGGC 

AAGAGGGTTTGGGCCCCCACACTGCTAACAATTAATCCT 

TGGGTGGAAGGGCTCCAAGGAGACCTCCCTTCCATCTCCCCAGTGCTTCCCCTTCCTCT 
GGATCTTCCAGAAATCTTTCCCCCAGGAACCCTCTCCCCACAC^ 
ATACCCAGGTTCAAAAGTGTCTCTGGACTGCCACAT7^A^ACAAGAAAAAGGCT 
45 CATGTCTTGGTGATGTCCCCACCCCACTGAAGGGACAAAA CGCTTCCTG CAAAGTGGCCGGAAGCCGGT 

CCTGCAGCCATGGAGGTTCGCAAGGGAAATTTGTGATTAAATTAGCCACCTTAGAAATAAA 
CCTGGCCCAGCGGACTCAGGGGAAACTCACAGGAGCAGGAAGGAA 

AGAGCGAGGATGGGOVGGAACTGCAGCAGTAGTGATGAAGAGCTAGCTGTCAGGTGGACTT 
GGGAGGCTGGGCCATCTTTCCCGGGAGAGCTCrrGGGCATGCCACAGAGACCTGTTAGGTCAGT^ 

5 0 GCAGTCTTTGCCCCACGTTATTCTTTTTCAT^ 

AGTTTCCCAACTAGGACAATCCTCTCAAGTGGCrrcTAGGATAGCCACTAGAATGATCOT 
GGGGAAGAAATGACCATCTCTTAAACTGCCTCTCCTAAGTTCCACATGA^ 
GAGCCCAGAGAGCCTGAAGCCCACAAGCCCAAAGGTGATCAGGAT^ 
CATCCATGGGCITTTGCTCCTATCTCTGTAGCCATCCCTTGCCAATTCC^ 

5 5 ACAGGATGAGTCAGGGG CCCTCATCACCTAGAGGGCCCCTTCTCCTCTGTCACCTCAGCCATTGTAGT CACCATCTTCCTGAGGGT 

TCCCGGAACCTGGTACCCAGAAACTGACTATAAGTCTACAGGCTCTGCACACTGTCTGTGCCCAGATA 

TGGGAATTTCTCAGGCAGCTC1\GGCCAAAAGGGACAAGAGCTC 
GGAGGGATATGAGGGAAGCCCAAGACTGAAGACCAAGCAACMAACTCAAAACCTG^ 

CACAGCTAATGTACAGCITGGCAGTTGTCCACACAGGCATTTTGGGGAGATTGGA 
AGGCTGGAAGAGGGCTGAGCATGGGGGTGGAGCTGAGGCCTGCATGTTTCCCA.GCCT 

CCAGT CAGGACAGGG AACT CTG GG CCAAGGACATGAT CTC CCTGAGC CTTAAG CAAAACT AT GGGT ACATAATGTGAAAG AATAAG 
CGTGAGCAGAGAGGGACAGAGCAGAGCTTGGGAAGAGCAGGTATCAGGGAGAAG^ 

6 5 CAGGAAACCCC^TACJVGAAAGACTCAGTACCTCCCTGCTGTTCCCTGCCCCATTC 

TGGCT AGGGTT C CATGAGC C!AGTAAGCACTTGGCTGGTrATCCAGGG CTGGAAGAGGGAG G AAAGC AG AGATT CC CCAAAGAGAAG 
CTCCAGGAAGCCCC^GGGAGGTGGCACACAAGAATTCT 
ATTTCTCTCCTCTCTAAGAAGGGCCAGGAAGTCCAGGGTCACCGTCA 
TTCCAGGACTGAAGAG4GGAATGGATGCAGCCACAGCrrCCAAAGCAAGCCT 

7 0 TACCTG<3AGGGAGCGGTGGCAGCTGCCCTGCTGTGTGTGACTC 

GAGGCTGTACCTGGAGGACTCCCACTGGACACTGAGCTCCTGGACCTGAGTGGGAACCGCCTGTGGGGGCTCC^ 
CTCCCGCCTGAGCCTGCTCCAGGAATTGGACCTCAGCrACAACCAGCT 
GCCTACT CACCCTGAGGCTGCAGGGCAATCXSGCTCAGAATCATGG^ 
GACCTCCGCCTCAACCAGATTGTTCTCTTCCTAGATGGAGCTTTTGGGGAGCT 
7 5 CCACCTGGTATTTGTGGCTCCGGGGGCCTITTGCAGGGCTAGCCAA^ 
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TGCCTGGCCTAGCCCTTGCCCGTCTCCCGGCACTAGTGGCCCTAAGGCTTAGAGAA^ 

CTGCGGGGGCTGGGGCAGCTCAAGGAGCTGGAGATCCACCTCTGGC 

CAATCTCAGC^^CCTGGCCATCACTCGCTGCAATCTGAGCTCGGTG^ 

TGGATCTGTCCCAGAATCCCATCTCAGCCATCCCAGCCCGAAGGCTCAGCCCCCrrGGTGCGG 
5 GCATGC CTCACCTCCATTGCTGC CCATGCCTT CCATGG C C CACCTCCTGGATGTGGCAGATAACGCC CTTCAGAC 

ACTAGAGGAAACAGCTTTCCCTTCTCCAGACAfiACTG^ 
TCTGGCTGCTCXIGGCTCCGCCGCCACCTGGACTTTGGCATGTCCCCCC 
CTGAAGGAGTTTTCAGACATCCTGCCTCCAGGGCA^ 

1 0 CTTGGCTGGGCAGGGCTGGGAGAGTAAGGGTCCTAG 

TATGTCTGTGTGGTTAGCAATGTCGCTCGGAATGACTCC^ 

ACTTTCTGACCCCAACATCACCGTGCCAGGGATCCCAGGGC 

GCrTCCTCCCCTTCCTCACCTCAGTGACCCTCTGCTTTGGCCTGATTG^ 

ATGACCTTTGACTTTGTGGCACCTCGGCCCrrCTGGGGATAAAAACTCTGGGGGTA^ 
1 5 CCTTC C CCAGTGGGGAACCCACCAAGTCCGC1TCAGATACCAAAGGGGAAGACAGAAC CAAGG CTG CT TGAAC CAGAAC CTAGTCC 

CGAGCAGCACCGCTCTCCTGCACCTCCCGCCTGCGTTGTGCCTCCTGCCGGAGAGTCTGCTTCCTGAGCTTTTCCGCT 

AGCATTGTGATTTCTTCTCTGAGGGTCCCAGGGAGCTGCAGATGCAGA^ 

ACACAAAACAGG AAACATAAT CAAAGCGCT AGT CAG CT AGT CT AACCACT AGG CTTTCTTCACACATGCTT ATAT CCTTT AAT AAC 
CAATTGCCAACCACGGCTATAAGATTATTTCAGAGGTGGGGCrGGGAAGTC 
2 0 AGAGTCCCTTTCTTTTCTGCTCCCCACCCCAACCCTGCCCCTATGT^ 

GGATGGGGACAGAGTGTGGGATGGAGAGGACAGACCATATACTGCACT^ 

AGAT CATT AAACCTGCTG TCAAAGGGC CACAACAGTAGCAGCGAAAACT CT CTGGAGTT TTCTTTACTT CAGTCTAT 

TTCCTACCCTCATTTCTGTTATATCTCCCCAGCTCCTTCTC^ 

TCTAACCTGCCTGACAAACAGGGTAAGGAGTCCCCCTCCCACCTTCACTTT 

2 5 GGGATCAGCACTCCTAGCCCCGGTTTCAGCCTCAATCCTTTCCCT 

ACCTTCCAGGGGACCCC^ACATAGAAATTCCACCTGGGACACCCAGTTGCT 

TCATTTTT GCGT CACAGATCC CTAGTGCCCTTGGGGAAAACTCAG AACTCCAAG AT AATG ACTAACAA^ C CGCAGTT 

GT CAAAGG AG AGAGACCCA.GG ACACTGCAG AGACTAGGCTTGG AGGAA(^ G A 
GGGAAGGGGGAACAGCAGTAACTGCCrrGCCTAGTTAATTTCCACCATCCTTCTGCAGCTTCCT 

3 0 CCACTGGGAAGCTGTCTCCGGGTAGAGGTCACTGATTTACAGAGACCCCCAGATGGGGAGGTGGAGTA 

TACCACTGACTGTGCTTCAAATAGAGGGCAAAGCAGAGCCCAAAAAGAAGTA^ 
CTCAGCATCACTGGAAGGX3AGGAGTGACAGGGAGAGGAGAGGAGTCGGAGGAGG^ 
GAGCCTTTCTGCGCAGTAGCATCTACCCAGCGTCOTTAGTTXCCTCCCIACCCACCCCT 
CTTTCTAGTTGATTCCTGCTGCrGTTGTCCTTGCTCTGAGGAGACTCCACTCAT^ 

3 5 TGAGACGCTCAGAGCCACTGAGGAGGTGAAGCTGGACTTGACAGGCM 

AGTACCAGAGGTGAATGTGAGAACCAGGCAGACAGGAGAGTGG 
CTGCAACCTCCCCACCCACCATGCACACACAACTC^TCAGCAT^ 
ATTTCCTGGGG(^CAGCCCTCCCGCTCCTTTCCATAACCTTCCACCAGCAGCT 
GGTTCTCACTCTAGTTGCCCTCTCTGCAATGCGGAAGCAGCAATGCT^ 

4 0 GGGAACAGATGACrrGTCTGGTGCTGGGACATGCTGCTGCCCAAGGTCTCrGAAGTOT 

GACACAGAAT CT AAGT CACACATGT AATTTCAAATTTTCT AGTAGCTACT AAATT AAAAAAT CAAAAT GAAACGGAT ACATT T AAA 
TACTTACTTTATTTAACCCAATATACCCAAACTATTA^ 

TTTTTTCATATTAGGTCTTTAAAATCCAGTGTATATTTTACACTTACAGTACATCT 
CCAGAAAAATCTATATTTCATACAGGTCACTGTTGAGAAAGTGGATTCACAC^ 

4 5 TAACTGAATCGAATCTCTGTCTTAAATTTTAAATTAAACAAATTTAAAATTCCAT^ 

TTCAATAGCCACATGTGGCCAATC^CTACCATATTGGACAGCAAAGCTT 
TGX3CAGTGGTGGGTGGAGAAAGTATCCCTCAAGAAACACATTO 

TTAGTCTCCTTTCACAGCTTTACCAGAACACTTTTGAGGC^TGAAGCCCATTCAGGAAACAAGCT 

AGCTTGTC CAAGCGTTAG CATGCCAGTCCCTCATCTTACCTGT CGGGAAG ACTGCCCTCAGCTCAGCTGTGATGGCACAGGCTGCT 
50 GTGTGTGTGCTGAT6AGTGTCCAAATGCAGCCAAGGACATGAGTGGGTGGTGTG 

TGTATGGTACCACATCATGGGGACAATCTAATGGAGGCTCTGCCCAGGGTGGGAC^ 
TGAGGGCCTATTCTATGTCAGTGGCTTTATATATATTTTCTTATTTAATCCTTACATC!AAC 
CTGGTGAAACTAAAGGTCAAAGAGAGTAAGTGCTCAAGTGCAC^ 
TGTCTGATTGAAAACTCCCCCTTTCCACCACACTGCCATCCTCACTGCCATGCCCT 

5 5 GCAGGACACGtACCTGTAAGAAAGGAGAAATCTTCCAATGCATCCACT 

TACATTCTTCATTAACTCTAATGCACCTGATTGAGGA 
AATGTCCTTACAGCCTCTGCCTCAGGGAC 

HUMAN SEQUENCE - mRNA 
60 CCCCTGGGCCCTGCTCCCTGCCCTCCTGGGCAGCCA^ 

GAGACAGCACCGAGCCTCACGGGAGCTGCTGGCTGCAAAGAAGACCCAC^ 

GGGACAAGTCGTCTGGGATCCACTACGGGGTTATCACCTGTGAGGGGTGCAAGGGCTTC^ 

GCCTACTCCITGCIACCCGTCAGCAGAACTGCCCCATCGACCGCA 

GG CG CTGGGGATGTCCCGAGATG CTGTCAAGTTCGG CCGCATG TC CAAGAAGCAGAGGG ACAGCCTG CATG CAGAAGTG CAGAAAC 

6 5 AGCTGCMCAGCGGCAACAGCAGCAACAGGAACCAGTGGTCAAGACCCCTCC^ 

ACCTTGGGGCTCCCAGACGGGCAGCTCCCCCTGGGCTCCTCGCCTGACCTCCCTGAGGCTT 
AGCCTCAGGCTCTGGGCCCTCATATTCCAACAACTTGGCCAAGGC^ 
AGCGCGGCAAGGCTGAGGGCAGAGAGAGCTTCTATAGCACAGGCAGCCAGCTC 
CA.CAGGCATCCTGGGCTTGGGGAACTGGGACAGGGCCCAGACAGCTACGGCA 
70 TGCCTCCCTCACAGAGATAGAGCACCTGGTGCAGAGCGTCTGCAAGTCCTAC^ 
TGCGGCAGCGCTCCAACATCTTCTCCCGGGAGGAAGTGACTGGCTACCA^ 
CACCACCTGkCCGAGGCCATTC^GTACGTGGTGGAGTTCGCCAAG^ 
TGTGCTTCTCAAAGCAGGAGCAATGGAAGTGGTGCTGGTTAGGATGTGCCGGGCCT^ 
AAGGCAAATACGGTGGC!ATC<5AGCTGTTCCGAGCCTTGGGCTGCAGCGAGCTCA^ 

7 5 AGTGCCTTGCACTTTTCCGAGGATCAGATTGCCCTCTACACAGCCCTTG 
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AAGGAAAGTAGAACAGCTGCAGTACAATCTGGAGCTGGCCTTC 
AGCTGCCACCOUVGGGGAAGCITCGGAGCCTGTGTA^ 
GTCCAAGCCGCXTTCCCTCCACTCTACAAGGA^ 
GGGACTCCTTGCCTCTCCCTATGGCCTGCTGGCCACCT 
5 GAGGGTGGTCCCCACCAGCTCTTTGGAAGTGAGCA^ 
GCCAGAGGGTGGG 

HUMAN SEQUENCE - CODING 

ATGGACAGGGCCCCACAGAGACAGCACCGAGCCTCACGGGAGCTGOT 
1 0 CCCTITGCAAAATCTGTGGGGACAAGTCGTCrGGGATCCACT 

GCCAGrcCTGTAA(X3<^CCTACTCCTGCACCCGT(^^ 

CGCCTGCAGAAATGCCTGGCGCrGGGGATGTCCCGAGATGCTGTCAA 

TGCAGAAGTGCAGAAACAGCTGCAGCAGC^ 

CAGATACCCTCACCTACACCTTGGGGCTCCCAGACGGGCAGCTG 
1 5 CCCCCTGGCCTCCTGAAAGCCTCAGGCTCTXSGGCCCTCA^ 

CCTTGAATACAGCCCTGAGCGGGGCAAGGCTGAGGGCAG^ 

GACTTCGTTTTGAGGAACA(^GGCATCCTGGGCTTGG^ 

ACACCGGAGGCZACCCTATGCCTCCCTGACAGAGATAGAGCACCT 

GCGGCTGGAGGACCTGCTGCGGCAG^GCTCCAACA^ 
2 0 TGTGGGAACGGTGTGCCCACCACCTC^CCGAGGCCATTCAGTACGTGGTGGAGTTCGCCAAGAGGCTCT 

TGCCAGAATGACCAGATTGTGCTTCTCAAAGCAGGAGCAATGGAAGTGGTGCT 

CCGCACGGTCTTTTTTG AAGGCAAATACGGTGGCATGGAGCTGTTCCG AGCCTT^ CAG CTCCATCTTTG 

ACTTCTCCCACTCCCTAAGTGCCrTGCACTTTTCCGAGGATGAGATTGCCCTCTACACA 
CCAGGGCTCCAAGAGAAAAGGAAAGTAGAACAGCTGCAGTACAA 
2 5 CCAAAGCATCCTGGCAAAGCTGCCACCCAAGGGGAAGCTTC^ 
ACCTCCACCCCATCGTGGTCCAAGCCGCTTTCCCT 

CCAAGTGACOTGGAAGAGGGACrCCTTGCCTCrCCCTATGGCCTGCTGGCCACCT 
CTTTCCCATGAACCCTGGAGGGTGGTCCCCACCAGCrrCTTTGGAAGTGA 
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Table 104 

MOUSE NOMENCLATURE 
ICSGNM N/A 
Celera mCG15938 

5 

HUMAN NOMENCLATURE 
HGNC BAT1 
Celera hCG1641022 

10 MOUSE SEQUENCE - GENOMIC 

TGTGGGCAGAAGGCCGTCCX3TCTCTTAACAGACGGGCCTCTCCTCCAGTTCT 

CTGTCACTGCAAAGGCAACACCTACGCrGTTTCCAGAACTGT 

GTCAAGAGGCTGGGGGAGCAGGGGC^GGAGAGTGCAGGGTTCTC 

TCAC!ACCATAGCCGCCATGTAGGTTAGGTGAGATCTGCITGAT^ 
1 5 AGACTGGCCTTGAGGTGTTCCCGAGGGCTGGGATTACAGGAGTGTGTTGTGAACTTCC 

GGG CTGGCTATCTGCCAGTAAGTAACA(ZAAATGAACTAGAAAG AGG CTGTCTG CACTGTGGAGC CACTACCG ATG CTGACTGGAGA 

TGTTTTGTTGACGTCTCTCTACCGTGGCTCCGGCTAGCTTGGAACT 

GACTCCCTCXX3CCTCGAGACCGCACCCACCAGCT 

GCG CCTGGT AAGTGTCAAGGG ACATGAGTCCATC C CTAACACACAG CAG CAGATCT C CGGCATAACACCAAGTAAAAGTTT CAAGT 
2 0 CCT CTGGCCGG C CCAG CTG ATCATGTGACAAAG CATGG ATACT TTG T AAGGTGTTTGTGGCATT ATT ACAAGG AG AGAAGCAAAAC 

TAAGGGGGTGGGGGTGGGGTGAGTGGAGCTGGGATCTGACCTGAGCAGTCJyVCTGCAAGGTAAAAA^ 
GTCACTGAGCTGAAAGCTAGGAAATGTAAAGAAAGAGTTGCGAGCTATGCGGCT 

ACTTGGGATGCAGAGG CCAGCCTGJVGAGACTCGGTAA^ CTGGTGAGATGGCTCAGTG 
GGTAAGAGCACTGACTGCCCTTCTCAAGGTCCTGAGTTCAAATCCCA 

2 5 TGACGTCCACTTCTGGCACGCCTGAAGTCAGCTACAGTGTACTTATTTATAA 

T AAAATT CAATTCCG AACAACGAG ATGAAGACTCACAACTATC CGT ACCG CTAT AGTG TGT ACT CATATACATAAAAT AAATAAAT 

CTTCAAACAAACAATCACACCACGCTAGCACACACCCCTTA 

TATGGCCACATCCCAACCAGGGATCTTCAACAGTTCC^ 

AGGGT CTCT CTG TAGTCTTGGAG CTCACTGTGT AGGCTACAT ACTT AGAG AGATT CT C CTG CCT CTG CT TG CCCAGAGT CAAAAGA 

3 0 CCTGCGCCACCACACCAGGCCnTTATTTCCCACGCATTCTTTCTAG 

CCACTCACCATCAGGGCCCTGGCGGGCAGC!AGCGTGCAGCGCCGTGTCG CCATGG CGGTCCTGGTGGGCAGGGTCAGCGCCGAGAC 
GCAGCAGCAGGCACAGGGCAGGGGCATCGTGGCGGGCGCAGGCCCrrGTGGAGAGGCGGGGG 
GG ATGTCGCTGGAGGAGAGCCTGGGCT CGCACCAGCCGTCCTGCAGAO^ 
GGAAGCGGAGGCCATGGAACAACTCTGGGGCTGA 

3 5 TCTCCCTTTACCTCCCAITTCCTCCCCCrTGCATCTCCAGCA 

AGTTTGGGGCGCIAGGGTGGGGATGG^GTTGGAGTTGGGGGCTGGTGGGGAGGGGGGAATCACAGA 

TG AGAAAATGAT ACATAGTTGTTTTAAGGTT C CAG AG C C CAAG AAAAAAGTT AATT CTG AACTTTG G AGGGAAACAACAACAACAA 
CAAAAATCAAGT AAGGC CAGATG GGGG CAATTTTCATT AGC CACAAAT CTGG AGAAAAGTCACAAATTC AAAGAG CGATCTAG AAA 
TGGAATATAATGTACCCGACAGACAGACGTAGAGGCTTCTTCCT 

4 0 GCCCCCCCCAAGGACCCTCAGAGACGGAGGCCGGAGGCCTGGGCTTAAGA03CTCGGAGAAGGGAGGTGGG 

CCAGGATGCAGGAAATGGCGCAAGCACACGCGCAAGTT^ 

CTCGGACTGGGTGACTTGTGATTAGCTCTCCGGAGGGGAAACAAGACAATT^ 

AGGCGAGCTTTCGAAGGGAGGCAGAGGCAGAGGCAGGCCTC 

CCCTGT CTCGAAACACACACACACAAT CTCACACACACACACATAATC ACACACACAC CACACAAC CACAATGAGG CCAAATATAT 

4 5 GGGTAG ATCAAAAAAG AAAAACACATGG TT AG ACACT AACAAGGCAAACG T AGGACAG AG CAGGG AGATGTGTGCAGACG TTCGG A 

TITGGGGGGAAAGTCTGAAGAGAAGATGGAGGCTGAAAAGAGG^ 
AGTGAAACCCAGCGTACTGTGTCCAAGCGCATCCTTACCTTCCCACACCATTCT 
TCAAGATCTCACACCCAAACACTCTGCCCTCCGCTTCCCCCAGACCCATCTCTGTGGACCCTG^ 
TGCTGCAGTAGTCAGCACAGGGCTGGTCTCTGCTGCCCT 

5 0 CAGTCATCAGTGTGAGGATGACAGACACCCTCCCGGTGCAGGA 

GACCGCATCIAAGGGCCCGGAGGAGTAGGCGCTGGAGGGTGGGAGCCGTCT 
GTGATAAAAATGGCGAGTCAGACCCAGGGTATCCAGCAGCTCCT 

GAAGAGTGAGCCCCCCTTTCCTCTCTTAGGAATATGGAAGGAGAATTTAGGAATAGAGCATAG 
AAAAAGCCCAAGGACAATGAGGGCAGATGGGGGACTGAGGOTATTAG 

5 5 TTGGTTGACAGK3AAACTGGATAACTTTCTAGAGGGCACTC 

CCTTCTGCCTCTAGGGAAGGCCCGGCGACTGAAGCAGGCGAAGGAGGAGGCTCAAATGGAG 
AGCAGGAGTTTCAGAGCAAGCAGCAGGCGGTGAGTTGAGGGGGACAGGG^ 
GCCAGGTGTATACAGGTGACTAAGAAATTAGGGTGTCT 
CGGGATCTGTAAGGTCTGAGGGTTTTAAAGGGTACATGAGACTTGTGGG^ 
60 GGTC^CTCTCAGAAGTAAGGATTGTGCCCACCTGCTTGGTATCTACATGTTTTTGTGGGGTGGGAA 
AGGCCTTCCTCTGAACCCTTCCTCCTACATGCTGG^ACCCTCCTTTCT^ 
CCCCAGGCCATGGGCTCTCAGGGGAACCIXSTCTGCT 
GCAGAGGAATCGGGAGCGCGTCCTGGCTCAGCTTCTCGGCATGGTCT 
CTGTCTAGAACC1ATCGCTCAGGGACACATCCCTAGAGTGACTCCTTCTGTCA 

6 5 CACTTGTGTCGCATGCGCAGAGCCTTGGGTTCAATCCATAATATC^ 

G CTGGAGGGGAGGCTTOTATGTGACAGGATCCAATAT^rCCCCC CTGAGACTTACAAGTAGCCTGTTCAAACCCGACAAACCCGAAT 
CTCTCTGTTCCATGACACCCACTGGAAGTTTACTAATGAATGCCCACGTC^ 
GTGACAGTTATGAAAGCACCCTGACTTCAGAGCAGGGGAACCTCT 
GACAACTATTTAACCTGGAAGCCTCAGTGTCCTCATCTTTGTGACT 

7 0 GTGAATTATCTGTGAGJ^TTCAGCATAGTTCCAGATGACGGAGTGCCTGTATCATG 

ACCCCCTCTCCTGACTTTTTTATTCCTCGTGTTCCAGCTTTGTTCAGCCACT 
CTGGCTCTCCCTTGGCAC^TCATGCCCCCG^CAGTTAAATGATAATGTTATTTAGCCCCTCTC 
AATCTCrrAGCACCTCTCTGTTTCTCCTGAGCTGGTCTTTCTCATGGTTGGCC 
TTCCGGGAGCTGAAGATATCTATGACATTCACATGCATGCATTCCTT^ 
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AGTGTCAATAAAATACTGGTGGATGAGTCAATTGTCG^ 
TCCTCAGAGTTATGCATTTATATAGTITCCAA 

CTGTGTCCCCTCTTGAACAGAGAGTTGCATAGATAGTTGTAAGGGAAA 
CACTG CTCATCAG<IACCAGAGACAG AGACAG 
5 CACACGAGAGAGAGAGAGAGAGAGAGAGAGAGACAGAGAGACAGAGAGAC^^ 
AGTATCTCTGACTACCCTGGAATTTGCTAGGCCAG^ 

GAATT AAAGACGTG CACCACCACGC CTGG CTTGACATTATGTTTGGGTATTGGC TXTTCATT ATGTAT CTGATGATAAAGGTCGTTG 
TATCCCCAGAGCCTAGAIAGCACTTGGCACAC^^ 
GAAGTTGGGGTGGTGGTGGTAGGGGGTGAGGGGGTGGGACACTACT 
10 TCATTTCACCCTGAACTTG AATCAATGAGGTAATTCATAGT CTTAT ATGACT TT TCTGTAG AG AG ACATTAAAAAAAAATCT ATAA 
GCCAGGTGTGGTGGCGCATGCCTTTAATC^ 

CAGAGTGAATTCCAGGACAGCCAGGGCTACACAGAGAAGCC CTGT CTCAGGAAAAAAAAAAAATCTATG CCAAATAAG GCAGTCAT 

GAAACACCAAAGATATCTCACCACCCAAGTCTCACTTGGC^ 

GTTACTTATAGGAGTGAGACTGGAGTAGCTTCCCA^ 
1 5 ACCAGGAGCTGTTTATTGGTGACACAATCTGGGAGAA^ 

CTCTCATCAGGAGGAAATGTTTCGGGGGAAATAGCTATACAAAAGCAAGGAGATTT 

TTGGGGTGCTTACACCCTGGTGTTTCCCTGCrAGCTG 

GAAGAGTGGAGCGGGGCATGGGTCTGCATTCAAATCCTCC^ 

CTCAGTAACAGAGAAAGGAAGAAAAAAATGGCAAGACAGATGAGGGGCACT 
2 0 CATTGTTAAAATCACAGCCAGCTGTTAAGACAATCCC1TCCTGCCCTGGCT 

TCTTCTATAAAGTCTTTGAGGAAACCACACAGCTAA 

TGGAACTTCCCATAGTCCAGTAGCAGGAAGATGTGTTATCACTGATGCCAAAGGGACAATATT 

TCATTTGC^CGAGTTTGCTGTTTGCTGTGTTT 

GCTGGGACAAACCTTACTATGCTACCCTCCACAAT^ 

2 5 CTAGGTCACTAGGAGAGTTCCTGGAAACGCTTCAAAGCTAAAGGCCCAGGTACCGC^ 

ACCAGAGCCGAGGAATTCCC^CTCCTCrCTGATTTATACAAAACCCGCCGGCCT 

AAAGCTACCAAG CC CCAGC CCAGAGAG CTTATGCG ATGAGCAGG ACACAG CCAGCTGGTTGG CAACACTGAT ACTT AGTG TTACTT 
TGTAT TAATATTAAGTGTC CTTAATAGGGAAAGCCCGAGCTTGTGTTTATGTAGCAAGGC 

AACATCTCAAAGGGAATAC CACATGGGGGGAGGGGGATGTCCAGGCTGAATAATGGAACTG^ C 

3 0 AAG AATTATGG AAAGGTGCAT AACCTCAAACC C CACCATTT ATTT AG CACCTAGACAGAATGT AAG C CT CCATCTTTCG ATATAAT 

TTTGGGTAAATCCTGCCTATTCTCTGCACGACTGTAATGGGCGTGGTCACGTGTCCCCCT 
CGCTCTCGGGTCACCTTGACTACGAGGCTAAGGA 

AGGTGCCCAGGGAGAAGCCTGGCGTGGCAAACAAAACTAAAGTAGAGCC^ 
AATGGAGAGCTCTGCGGGGGAAGACGTGAGTTTC 
35 TTCAAGTGGGttSTTCTGCAAAGCCAGCC^ 

GGGAAGGCGGGAGGCCGGGGCGAGCCTGGAACCGGAAGTGAAGGCAGCTTC 
CGCAGTGCTTAGGTAAGCTTTGGCCCTGTGCACCATCCACCGCCATCTGCTTCTCCCGCGGCT 
TACCCGGTG€CGGGTCGGCGCCGTGGCCG€CGGCGCGCAGGCGGGCGTCCAGTTCTGTTGTCTTCACCC 
TTGGCGGGGAGCTGGGAGACAAGGGGGGAACGATCCAAAATGGTGGCCTCGGCCGCCATTGTGTCTC 

4 0 AGTGGGTCCTCTGCCCACCCGGGGCCGCGGCGGCC^ 

TTTTCTTGCTCGCCCAGGGATGGGAAGATCTCGCTCCGGGGCC 

CTCTGTGAATGGCTGCCTCTGTG CGTGGCCGATGGTGTCGCGCACTTCTGGGGAAG CTTTTCCGAGAGCACTG CAAGGCGTTCCTT 
TCTAAGGCCACCCAGCTCACACTCTTAATCCTGCATCGTTCTCAGCTGTGACCTTA 
TGGCAGCGCTGAGCCGAACCITGCGGCCTCTCGCGTTACTAAGCAAGCACTGCCATTG 
4 5 AAAAAAAAAATTTTTTTTTCTTGAAATGGAGCGA^^ 

CCCCGGTCCAGGTGTGCGCCGCTGGGACGGCCACTCGGGGG^ 
GGAAGTGAGCGCTCTGCGCAGTGCTGACCCTT^^ 

GGAGAAAGCGCAGTCTCTGTCTTCCCTTGTCCTTTGTGTCTCAACCCTCGTGT 
CCCAGTTGGCTCCTGCCTGTCGAAGTGTGTGTTACAAAGTCTGGTTAGCGGTGGAGCCCACTCT 
50 GTTTCTTATGTCATGAGACTCTTTGTTAACTCT 

TCGACGAGAAGTGACGAGTCCATTCGTGACTGATGAGTTTTTCCGGTTTTTTTGTTGTCCCCTCTCC 

TGGTGTCTTTCCCCTTGCTGTTCTTCAACCCCTCTCTTTGGCCCTTGCTTCCTCACCTGCTCTGGGACA 

CCTTCTCCCCCTGCCGGCCCAATTATGGC^^ 

CGCTGGGGCAGATGGGACCGAGGCTCCCGCCAAGAAAGACGTCAAGGGCTCCTACGTCTCC^ 
55 TTCTACTCAAGCCAGAGCTGCTCCGGGCCATCGTT^ 

AGACCTTATTTAG CACCTCTGGTGCAAGAAAAGGGAGGTTQ^GTCTAGGGTCATAGGATTTG AATTTAT TTGGGATAGGCCAAGC 
TCTGAGAATGTACCCAAGACGGCAGCTGTAGAAGAGG CnTTTGTTCTCCATTAAACCGAGGGCTGC CATTTGTTCTGTG CCTGGCT 
TTTTTGCTTGGTTTTTGTCATTCTGACTGTTCGAACT 

6 0 GGTTG CAGGGTT AGATGAC CCAAAA 

TAAGAAAAGTAAGTTTAGCCAGGAACGTGGAAGGTGGCCTGTTGGTAGCTTCTCGA 
GGGTAGGAGGGTTTGACTTGAGTGTAAGAGTGATGAGGGTCAGGGAAAGGATGGGGTCTGAAA 
GTCGGCTGGGGAGAAAGGGGTTTGGCATGGTGCAAAAATGTCTCTCCTTTCTCCTTCTAGTCCAGCATG 
ATTCTGGGGATGGATGTCCTGTGCCMGCCAAGTCAGGCATGGGA 

6 5 GCCCGTTACTGGGCAGGTATGTTGGGGGCAGTGCrrGGAGAGGGTGTGGAGATTGAATCACCAGGAG 

CTCTCAGCTGGTGTCCTCTGCAGTTCAAGGGGAATC 
ACTTTACTGTGTCTGTCTCCATTTGCTCCCT 

CAAGGAAT ATG AG CG CTTCT CT AAGTACATGC CGAATG TCAAGGT AAGGGG GG AAAG AAACCTG GG ACAGGAGG CTGTGGGG 
AGGCACTGGGAGGGAGGTTGGCTGTCCTCGGGCATCCCT^ 
70 CCCTTCTGTGGCTGCCCTTCCAGGTTCCATACT^ 

AACACTTTGTGGTTAGTTGGTTTTGGTATTGCAGATTCCATTCCAGAGCCTTGAGCTACTATGTGTC 
TCCTTTGTTTTTATGTTCACGACTGTCTCTTAACCACCACGTTGTCCTGG 
CCTTGAACCTGAGGCTCCTGCCACAGGTTCCTGAGTAGTTGGGGCAGGCCTTTGC 
CTCCTGAATTGCTTGGTAGATTGTTTGAGAGTTGGGTTC 

7 5 ACTTTAACTCTAGAGTGGTTAAAGCTTTCCGGGGGTCATAACAAGTGGCCTGTGTCCTCT 
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GTTGGCATCCTTTGGAACTTTTGAGATTATATATTTCCATGCT^ 

CCTGTGAGTAGAGAGATGTGTGCAGACTTCCTATTCTGGTTAGGGTGGGGA 

GTCAAATCCACCTTCCTCTGCCTCTACTGGGATTAARJGGCGAGCATCT 

TCTAGAAGAAGAAAAGGAAAGGCAGTATGTG CAAAATG CTAAAGGAGGCCAGAGATCTTGTGATCTCTGAGGTTT CCATTG C T CGG 
5 CATAAC CAAGCATTAGACATTGTTTTCCTAACCTGTCTGTGTG CAAGCTTACT CGTATTTAT AT ATC 

TTTTTCCTAGGATTTTGAAAATAGATTGTAAT 
CCTTGCCAAAAAACTAATAGACTTTCAGTTCCTTGGCTG^ 
C^TGGCTCATTCCTGTATTCCCAAGGCCTGGG 

AAACAAAGAGGGAGGGTGGCAAAGCTGCAGTCCTTCTCTGCAGGTCCCACA^ 
1 0 GGATCAAGTCTGAGGTTGGGTTTGGTGTAGGAGC 
TGTACTTCTCCrTGGCTGTGCAG 

TTTGTAACTATGTCTGAAAGCTTAACTGATACCAGTAAACCATCGTGAACAGG 

TGTGGTTCAGGATTGAGAAGAGGAAGAGGTTTACTGTAGAGTAA 

CGTGTCACAGCTCACACATCCAGCACTCACCTGGTGTCACAGCTCACACA 
1 5 TCAGGCACCGAGGAAGCAAAGGCTCCAGGCCAGGGAGACTGTCTGACAGTTCAGTTAOT 

GTGGCGCCTCACACTTGCnK^TGGCTTCTTGCTTT 

GTAGATCAGCTAGGGCTX3TGCAGAGCGATCCTGTCTTC 

CCTCTTGGCCCCCTGGTGCTGAGGAGGTCGCTTTGTTTCTTATGGGAGGCA 

TTTACAGTCAACCACAGAATGGACACACTGGTATTGGCAAGATGG 
2 0 CTACAAAGTGAGTTCTAGGCCAACCAGGGCTATACAGTGAGGCCTCTCAAACCA 

CATTGTCCTTTCTCGTGGTTGACAGGTGGCAGTGTTTTTTGGCGGTCTGTCTATCAAGA^ 

GCCCACACATCGTCGTGGGGACTCCTGGCCGAATTCTAGCCCTGGCTCGAAATAAGAGCCTGAACCT 

ATTTTGGACGAGTGTGACAAGATGCTTGAACAGCTCGGTGAGT^ 

2 5 CCTCCTTCTCTTCCCCTGGTGCTGTACAGTGATG CGGTGTGCTCAGCCCTGCG CTCCTTTCTGTAGCAGGGAACGGAATGTGCTGA 

CX3CTACAAGTCCTCACCACCACCCTCTCCTTAGGGATC 

TCCCAGTATAGCATGCGTGATAAGGATGAAATTTCATTAAGAGTTTCTTTGGGGAGTO 

ACTTGGCTGCTCTTGCAGAAG ACCTCG TTTCATTCCCAGCAT CCAT ATG AAAG C ATAT AAC CACT ATACAT AG CTC CGGT TT C CAG 
GGGTGCCTGAAACCCTCCTTTGGCTTTTTTGAGCACT 

3 0 CATAACGGAGCTCaSGGTGCTAGTTTTGGCACATGCCTTT 

AGGCCCATCTGGTCTAGAGTCAGTTCAAGGATATCCAGGACTACACAGAGAAATTCT^ 

CCAAGGGCTACATATCGAGCTCTTATCTTTAAAAGAGGTGGGGGAGA^ 

AGCTCTAGAGAAAAGTCTTTACAAATGGCCTTTGACTGTCT 

GG AACAG GG T ATGTG AGG CGAG AGG TG AAGG TGG AAGG CACAGTGGACTGGTG TAG AG ATT AAGTTCTGAGATGGG CGTAGGGAGA 

TTGGACTGGAAATTATGTTAGAAACCAGGCTGGCCTCAAACTCAGATTT 
TAAGTTTTTTTTTTTTTTTTTTTTTTTGAGTCAGGGTTTCTCTGTAGA^ 

GGCCTCGAACTCAGAAATCCGCCTGCCCCTGCCTCCCGAGTGCTGGGATCAAAGGCGTGCACC^ 
TTTTAAGTGTGTATGTGTGCATGCATGCTTGTGCCCATAGAGGACAGAGGAGTC 

4 0 GGGAACTGGGAAGCTAATGTGGATCTCTGTAAGAGCAGCAAGTGCCCGTGGCT 

TAAGGGCTTCTGTCCCCTGAGCCTCTCGTGAACTTTTTCCCCACTTTGTAGATTCGGTCATAT 

CTTGACCCGAAGTCTTTTTCTAAGCACATTCTTTATT^ 

TTACAGGCATGTGCGACGAAGCCTGGCTCGTTCCTGA^ 

GG CCTCACCAGACTTATGTCACAGACTCAC'l'T CTT ACTGTGTGTGTGTGGTT IT TTTCTC CCCTGCAGACATG CGT CGGG ATGTCC 

4 5 AGGAAATTTTTCGCATGACCCCCCATGAGAAGCAGGTCATGATG 

TCTTGTGTCCATGCAGCTCAGAGGCCAGGATGTTCAGAAATTAAGGCCTAGATGTACAT 
ACATTGTGAAAGGATATTTGGGTTGGAGTAAGGGACGGAGGAATTCCT 

C CTTT AAAC C CAG CATT CAGGAGG CAGAGGCTGG CAG CT CTT AAGTCTACAAAGTGCAATG CAGGACGCCAGG AC TACACAGAG AG 
GGGCTGCCTTGAAAAGCAAAAAGCCAAAGTTCTGCCAGGTGTGT/rAGCA 

5 0 GATCTCTGTGAGTACAAGTGGGGCCIAGCCrGGTCTCGTGTTTCA 

GCTCTCCAGAGAATGCACATTGAGTTCATAGCCCTGT^ 

G C CAGACACC CACAAAAC CACACACACAGAAAT CTTGAAAACTCCAAAAAT CAG ATTT CTGGGGCTGGGGAAGTG GCTCAGAGGTC 
AAGAGCACTTTTGTTCTTCCAGGAGTTCCCAAGTTCATTTCTCTATGTCAGCT 

GGTCCAATGTCCTCTGCTGGCCCTCCAGGCACCAGGCACACATGCACAAGTGTGTGTAGGCAAACCCCACACACT 
5 5 AAACAAGACTGATAAGCTGGTGCATACATCATGCCCX3TGCTCCCTCAGGTGGT 

C CTGG CC CAG CTT CTAG TG GAACAGAACTT CCCAG CCATTG CTAT CCAT CGTGGAATGCCC CAGGAGG AGAGG TG GGTTGG AG GCG 
CGTGCGTGCCrTGCCCGTCCATCTGGGTCCTGTCCTGTGGAGGAGGCAGTGTGGAGAGAAGAGAGTCT 
CTCACAAAGGCTCTCTCGGTATCAGCAGTTCAAC^ATTTTCAGCGGAGGAT^ 
ATATTGAGCGTGTGAACATTGCT^rrCAACTATGACATGCCAGAGGACTCGGACACCTACOT 



60 



AGAGCGGGCCX3GTTTGGCACCAAGGGCTTGGCCATCACATTTGTGTCAGATGAGAATC 



GTGTGCCTGTGTGCGCGTTTCCCATTTTGTATCTTACTGTGACTTTTTTT 
TCAGTCTGCTGTAGAAGAGGACACGGGTCAGGAGGAGACACT 

6 5 TTCTCTCTGCX3TCACCACTCCTAAACCTAGTCCTC 

AACAACCGTGTCTGTGTTGTCrrGTAAGTGCTCTGTTCATGGCTTGAACCAGGGTCATTCTGAGGGCCGTGA^ 

AGCGTCATTGTCTTCTTTCTAAGGTGGCTGTGGACAGGGAGGCTGGGAC^ 

GTCCTGGTACCTCAGCTCCTTCAGCTGAGTTTCTTGTAC^^ 

TGCACAGGCATGCGCGCGCGCACACACACACACACACACACACACACACACACAC^ 

7 0 GCTTAGCTGTGTAAAGAGCCTGGAGTCCCTAGCTGGACTA^ 

GGTCTGGCCTGCCTTC CCTTTCTGTTTTTGGTCTOAGGCTCAGAAGGTCATAGGTGAAGCCCAGG CAG CTTCTTACCTCAGCTGTG 

CATCTGAGGTAGAGCGGGCCTGTTGCAGGATTATCrrTGGGGTTCTTGG 

CCTGAGCTGCTGCTCCCTGGAACCGGAACCGCCTCTCTGTAGACCTTGATTGC^ 

CATGGGCCCCCTGCCCGCTCTGACCTGACAGGCTGGAACTTGTGTTGCTC^CTTGAGATCTCAGATCCA 

75 
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ACTTGGGGGAGACACCCATTTCTTCA^ 
TCTGTGCCCCTGCCraTACCTCTCTAT^ 

AGGGTCTCATCAGGCGGAACTATACATACAGCCAAATGCTTCAG 
AAGTTTTGACAGCATCTATCrAAATGCACTAAGTGTGCATTGCCTATG 
5 ACCAGGAG CTGTGAG C CACCTGACTTGAGLAGCAAGTGCTCCTG C CCCTTAGCCCTCTGCTCCTGAGATCCTCATTTCTGAAGAG AT 

CTGGCC^CCACATCCTGGTAGGAAGGGATGGATGTTCACAGAGAAAGC 
ATTAGGGTTCCCTTTGCCCAGTCATOTGTGCTGGATAGTTTTATA 
TTAATTAAGACAATTCCTCCATAGGACTGGGCTC 

GTGGTCCTGGGTTCTATAAGAAAGCGGGACTGGGCACATTCTTGGAAAGGGGAGGGAGG 
10 CTAGGAGGAGATGGGGTGCTGCAAAGTGAATACATAATGGAAAAAATGAAAGGGCATGA^ 

GGGAGGCAGAGGCAGGCAGATCT CTATGAGTT CAAAGCCAGCCTGGTCTTCCAGAGCTGTCAGAGAAACTCAATC^CAGCGG TGGG 

GGTTGGGGGAGCAGGCTGAACAAGCC^VGTAAGCAGCCTT 

GACTTCCTTCAATGATGGACAGTGATGTGGAAGTATAAGCCAAATAAATTCT 

CACAGCAATAGAAACCCTGTGACATGTCACTGTGAGGTTGGTCCTCACACAGT^ 
1 5 CCAATGTCTAACAGGAGAGTCCAAGATTCAAATCCTATAATCGGTGCAACCCCrrTAACT 

C CT CTTCCAACTATCTCTAGAAATGTT CTTG CAGGTGGCAGCTG CAGT AGGAGCTAATG AGC C CC CAGATC TATGTAAAAACGAAT 

GAGATGGTGAGCXZTGGAGAGGTCGCTCAGTGGTTAAGAGCACT CACACAC 

GTGGTAGCTCACAATCATCTGTAATTCCAGTTCCAGGGGAGGGATCTGTCACCTTGT^ 

GGTGCACAGACATATATACAGACAAAATGCTC^TACATAAGGTG 
2 0 G T AATGTTGCT CCGTGG AT ACAGCATG CATATGATG CACAG ATGTG CATG CAAG CAAAACACAT ACGTAG AAATAAAT CAG AAG TT 

AGAGATGGTTCAGTAGTTAAGAGC^CTGGCrGATCTTCrAGAGGAGC 

G TTTGCAACT C CAGTT CCAGAGACTCC CACAT CCT CACACAGAGATACATGT AGGTAAAACACCAATGCAC^TG AAATAAAAATAA 
TTAAAAAACAT ACTTATTGAGT ACCTCGTAGT AGATTGAGGCAT CT C C C TGGAAGCAAGAGCTAACAGTG 

CCGATGGG CTTTTATTCT TTTCATTTACACTCTTTCATTACACAC TGGAAGGTGGGACAG AC CTGCT CACTT CAG CAC C CTG CCAG 

2 5 TTAAACAGAAC CCCCTTC TGG CAGAAGC CTGGTCCAAGTCTGGTGAACGTTT ACAAGG AAAG CAGGCAGTCAGCAACTGAGCTTT A 

TGTGTGTGTGTGCGTGTGCGTGTGCGTGTGTGTATAAGAGAGA^ 

GCACTTGGTAGGCAGAGGCAGGCGGATTTCTTGAGTTCGAGGCCAGCCTGGTCTACAAAGTGAGTGCCAGGACA^ 

3 0 ATTGAGTGTGAGTGTGTGTGAGAGAGAAAAAGTGTGTGTGAGAGAGAAAATGTGTGTGTGTAAGAG 

AAAGTGTGTGTATGAGAGAGAGAGAGTGTGAGAGTCTGTGTGTGATTGAGTC 
TGTGTGTACGTGCACACCAGCTCTTGTCTCTGCTCTT^ 

TCAGAACCCTGGGCCGCTGAGAAGAACCTAAGCCATGTTATTTGACAGCAACTGAGATGCA^ 
CATGGCAGCTGAGTTTTCAATGCTCTGCCA.CATTAATTAGTTAATTAA 

TGGTCATGGTGTTTACCACAGCAATAGAAACCCTOTGACATGTCACT 
CCTAGCAAGCAAGAGGCCAATGTCTAACAGGAGAATC 

CCAAAACAGCTGCTGCCCTOTTCCAACTATCTCTAGAAATGTTCTr CTAATGAGCCC CCAGAT 

CTATGTAAAAACGAATCAGATGGTGAGGCTGGAGAGGTCGCTCAGTGGTTAAGAGCACTGGC^^ 

4 0 GGTTCATAGGAGACACGTGGTAGCTCACAATCATCT 

G G G CACTC CACACACATGGTG CACAG ACAT AT ATACAGACAAAATG CT CAT ACA 



MOUSE SEQUENCE - mRNA 

CGCTCGCAGTGCTTAGCTCTTCTGTCGGAAACTGGTGTCITTCCCCTTGC!TGTTCTTCAACC 

4 5 ACCTGCTCTGGGACACCTAACTCAGAGACCTCCCTTCTCCCCCTGCCGG 

TTGGACTACGAAGACGACGAGGTGGAGAC^CCGCTGGGGCAGATGG<3ACCGAGGCTCCCGCCAAG 

CGTCTCC^TCCATAGCITCCGGCTTCCGAGAITTTCTACTCAAG CCAGAGCTGCTCCGGG CCATCGTTGACTGTGGCTTTG AG CATC 

CATCAGAGGTCCAGCATGAGTGCATCCCGCAGGCCATTCIX3 

GCAGTGTTTGTCCTGGCCACACTGCAGCAGCTGGAGCCCGT^ 

5 0 GGCTTTTCAGATCAGCAAGGAATATGAGCGCTTCTCTAAG 

TCAAGAAGGACGAAGAGK3TGCTGAAGAAGAACTGTCCACACATCX5 

AAGAG CCTG AACCTCAAACACATTAAACACTTTATTTTGG ACG AG TGT GACAAGATG CTTGAACAG CTCG ACATG CGTCGG GATG T 
CCAGGAAATTTTTCX3CATGACCCCCCATGAGAAGCAGGTCATGATGTTCAOT 
GCAAGTTCATGCAAGATCCTATGGAGATCTTCGTGGATGACGAGACCAAGTTGAC 
5 5 CTGAAGGACAACGAGAAGAACCGGAAGCTCTTTGATCTTCT 

GCAGCGCTGCATCGCCCTGGCCCAGCTTCrrAGTGGAACAGAACTTCCCAGCCATTGCTATCCA 

CGTGTGAACATTGCTTTCAACTATGACATGCCAGAGGACTCGGAC 

CAAG GG CTTGGC CAT CACATTTGTGT CAGATGAG AATGATG CCAAGATC CTG AATGACGTT CAGGAC CGTTT CG AGGTCAACAT CA 
60 GCGAGCTGCCCGATGAGATTGACATTTCCTCCTACATTGAGCAGACACGGTAGA 

AC^CGX3GTCAGGAGGAGACACrACCGCCC<^CCCGAC^CCGACGCCTCTGCCCACCCTATCTAT 
CTAAACCTAGTCCTGATTTATCAGAGTTGTTTGTTTGTTTGTTTTTGTTTTTT 

MOUSE SEQUENCE - CODING 
65 ATG^3GAGAGAACGATGTG<xACAATGA^ 

TCCCGCCAAGAAAGACGTCAAGGCCTCCTACGTCTCC^TCCATAGCrCCGGOT 

GGGCCATCGTTGACTGTGGCTTTGAGCATCCATCAGAGGTCCAGCATGIA.GTGCATCCCG 

TGCCAGGCCAAGTCAGGCATGGGAAAAACAGCAGTGTTTGTCCTG^CCACACT 

TGTGCTGGTGATGTGTCACACTAGGXSAGCTXKSCTTTTCAGATCAGCAAG^ 
7 0 AGGTGG CAGTGTTTTT TGGCGGTCTGTCTATCAAGAAGGACGAAGAGGTGCTGAAGAAGAACTGTCCACACATCGTCGT 

CCTGGCCGAATTCTAGCCCTGGCTCGAAATAAGAGCCTGAACCTCAAACACATTAAACACTTTATTTTGGACGAGTG 

GCTTGAAGAGCTCGACATGCGTCGGGATGTCOU3GAAATTTTTCG<^ 

CCTTGAG CAAAGAGAT CCGCC CAGT CTGCCGCAAGTTCATGCAAG AT CCTATGG AGAT CTTCGTGGATG ACG AG ACCAAGTTGACG 
CTGCACGGGTTGCAGCAGTACTACGTGAAACTGAAGGACAk^ 
75 CAACCAGGTGGTGATCTTTGTGAAGTCCGTGCAGCGCTGCATCGCCCTGGCC^ 
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CTATCC^TCGTGGAATGCCCCAGGBGGAGAGGCTCTCTCGGTATCAGCAGTT 

AACCTGTTTGGC CGAGGCA.TGG ATATTGAGCGTGTGAA CTGCA 
CAGGGTGGCCAGAGCGGGCCGGTTTGGCACCAAGGGCTTGGCC^^ 
^ ACGTTCAGGACCGTTTCGAGGTCAACATCAGCGAGC^ 

HUMAN SEQUENCE - GENOMIC 
ATCTTTTATTATCACTTATCAATA 

GGAGAGGGGTGGGGAGGTGAACCGGCAACAACTATGGCCGGCGG^^ 
GAAAGGGAGTGGAGTGTGACAGAGGGTCTCACCCATGGGCTGAGAGAAA^ 
1 0 TTCAGTCCCAACCITGCTGCATCrGGCCCAAGGTTAGCTGAGTC 

TT TCTTTCCCCGGGGTAAGAGGAAATGGG CTAGCAGTCCTTAAATCTT TATTTGGTAGTGCT 
AAGCTCGAATAGGGGAAGGCAAGGCCAGGAAGGCAAGAAGAC^ 
AAATCACATCAGGGTTGGTGGGGGGGGCACTTCTCCCTAGTGC^ 
1 5 AAGTACCTGGTCCTGTCTCTCATTCGCTTGTTC^ 

TTTACATTCATAGGTTTTAAGGAGTAGGGCCTCCAACTATAAAAACA 

AAAAAGCTGGCTACAAAACTGTAGAGAGGATCAGATGTGGCCAGGCA^ 

GGGGGGGGGGACAATGGATCACAAGXSTCAGGAGTTCAATACCAGCC^^ 

AACAAAATATATAT AT AATTAT ATTTTAT ATTAXAT ATAAAATTAGC CAGG CGTGGTGGCTGACG CCTGTAAT CC CAGCT ACTTGG 
2 0 GAGGTTG AGGCAGAGAACTGCTTGAACCTGGGAGGCGG AGGTTGCAGTG AGC CG AGATCG CGC CTCTG CATTC CAG CCTGGG TGAC 

AGGGTGAGACTCTATCTCAAAAAAAAAAAAAAAAAAAAAAAAGAGAGAGAGAGA^ 
ACCAAATCOTATGGTTGGAAAAAAAGAGGCCGGGCGTGGTGGCTCATGGCCTGTAATCC 
GATCACCTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCT 
TGGTAGCATGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGTCAGGAGAATCA 

2 5 TGAGATCATGCCAGTGCACTCCAGCCTGGGCAACAGAGGGAG^ 

TGGCAAAAAATATGTAAAAATTCTAGCCTATGGGTGGTGGG^ 
AAAGGGATATAATTAAGTTGAAATTTTCTTTTTAAAGAAGGACGAACA^ 

AGT CC C!ACCTTGGTGT ACCTAAAAGCATACATATG ACACATTTATT TGG AG ATATGTAACCATAC TACAAATATTG CTTGGCAAC T 
TGCTTCTTTCATTTAACATCTCATTTATATTTTTCCA 

3 0 TGTTCCACTGTGTGTACAAAACAAATCTCCTTACCCCAATATCGATAGACTTTG 

ACAGTCTTTCTGTGGGGGTAGATTTTCATAAGTGGAATTTCTGAGT 
TTTGTGCCAATATACAAGTGTGCCTATGTCTTCCTTTACC^ 

T AAAAAATT ATTTTATTAACAATGT rr'I M l"l"r C1TTTT CATGTCTTTTGG CT ATTTGT ATTTTTTGTGAATGAATTG CTC ATATTCT 
TTATCCACTTTTTCTTTGGAAATTATATATATTGATATGAGGGTTCTATA^ 

3 5 CTATCTATOTTTAAAAAAGCTTTTTTTTTCTTTTTGAGACAAGGTCT 

GCTCACTGCAAACrCTGCCCCACCTGGG CTCAAGCGATCCTCCCACCT CCTCCTG AGTAACTGGGACTACAGGACGACAGCCA 

CGTACCACCACGCCTGGCTAATTTTGTATTTT^ 

AAGTGATCTCCCTACCTTGGCATCCCAAAGTGTGGGATT^ 

TC T AT ATCAATATATT ATTTT CTG ACTTTTT ATTT CAC TTCACT CTT AG GAAG G CT CTATTTT AAG AT T ATAAAACATT CT CTG CA 

4 0 TTTTCTTCTACTACITTAGATGTATACTCATTTTTAT^ 

CCCAGCACTCITGGGAGCCCAAGGTGGGTGGATCATGAGGTCAGGAGTTC^ 

ACTAAAAATACAAAAATTAGCCTGACGTGGTGGCGGATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCA 
CCCGGGAGGCAGAGGTTGC^GTGAGCCAAGATTGCGCGACTGGACTCCAGCCTGGGTGA 

CAAAAAAG AAATTAAAAAACT CT CAAAAAT ACAAAGT ATAAT ACAATACACAAC C CCATT C CTAC CACAG GG TTTGT TAATGT ACT 

4 5 GTCATGTTTCTAGTATAATTTATTTTAAGGAAATAAAAT 

ATTAGGTGTGAGGTGGGACTCrTGTTTTTCCCTAGATGAG 
CAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAAGCT 

TGG CCAACACGGTGAAACCCC CATCT CTACAAAAA.TACAAAAAT TAGCTGGGCATGATGG CAGGTG C C CAG CTACTCAGGAGGCTG 

5 0 AGG CAGG AGAATTGCTTGAAC CCGGGAGGCGGAGG TTGT AGCAAGCTG AG ATTG CG C CATTG CACTC CAG CCTGGGTGACACAG CG 

AGACTCTGTCTCGGAAAAAAAAAAAAAAAAAGTTCACCITTCT 

CG CCTCTAATCCCAGCATTTTGGGAGGCCAAGGCGGG CGG ATC ATGAGGTCAGGAGTTCGAGACCAGCCTGG CCAACACAGTGAAA 

CCCTGTCTCTACrrAAAAATACAAAAAATTAGCTGGGCGTGGTGGCGGGCGCCTGTAATCCCAGCTACT 

GAATGGCTTGAACCTGGGAAGCGGAGCTTGCAGTGAGCCGAGATCACACTACTG 

5 5 TCCCCCACTTCGCCAAAAAAAAAAAAAAAAAAAAAAAAAAGAAATC 

CTCTTTAATACGCIX3GAGTTTCATACTAAGAAAAATAAACATAAAAACATTTTGGCCAGGCGCTGTGGCT 

ACACTTTGGGAGGCTGAGGTGAGCAGATCACAAGGTCAAGAGATCGAGA 

AAAATACAAAAAATTAGCCAGGCX3TX^TCGCAT^ 

AGAATCTCI^GAACCTGGGAGGTGGAGGTTGCAGTGAGCCGAGATTGTGCCACTGCACTC 

6 0 TTTTCAAAC!AAAAC!AAAACAAAACAGAAAAC!AAA 

TCCCAACATTTTGGGAGGCTGAGGTGGGTGGATCACTTGAGGTCAGGAGTTTGAGACC^ 
TCTACTAAAAAC!ACAAAAATTAGGTCGGGCATGGTGGCTCATGCCTGTAATCCCAGCACT^ 
CTAAGGTCAGGAGTTCGAGAGCAGTCTGGCCAACATGGTGAAACCCCATCTCTATT 
CAGGCTCCTGTAGTCC03GCTACTCAGGAGGCTGAGGAAGGAG7^TCA.CTTGAACCCAGGA 

6 5 ACTG C CACTGCACT CCAG C C!TGGGGAACAGAGGG AGAC TCCGTCTCAAAAATAAAT AAACAAAT AAAAA 

AAAAATAAAAAAATT AGC TGGGCATGGTGGTGTGTGC CTGTAAT CTCAG CT AC TTGGGAGGCTG AAG CAGG AGAAT CGCTTG AA.C C 
CAGG AAG CAG AAGTTAT AGTGAG CCGAGAT CATG CCACTGCACT CCAG C CT GGGCGACAAAG CAAG ACTC CX3TCTCAAAAGGAAAA 
GAAAAAGGAAGCTGGAAGCTGAATGAGATGGGCCTTTCAACCAAGGAG 
GGAGTAAATAAGGCCAGAGAGGACACCAGGGTCTGGGAGTGAAGGCACT 

7 0 GCTGTGAGATTTCAGACAAGTTCCCGAACCTO 

TTAAAATTCAATACAATGGAGTTGGGTGTGGTGGCTCACGCCTCTAGTC 
AACTCAGGAGATGCAGAACAGCCTGGATAACATAGCAAAACAGTCT 

TGCTTGTAGTCCCAGCTACTAGGGAGGCTGAGGTGGGAGGACTGCTTGAGCCCACGAGGTAGAGGCTGCAGTGAG 
CCACTGCACTCCAGGCTGGGAGACAGAATCAGACCCTGTCTCAAAACAAACAAGCA^ 
75 TATCAGGGGCATTTTGGCTGAGCGCGGTGGCTCACGCCTGTAATCCCAGCACTT^ 
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GGG AGTTTGAGACCAG C CTGACCAACACG GAGAAACCX3TCTCT ACCAAAAATACAAAATTAG CCGGGCGTGATGGTGCATGCCTGT 
AATCC CAG CTACTTGGG AGGCTGAGGCAGGAGAATCTCTTGAACCCAGGA^ CG CG CCATTGC 

ACTCTAGCCTGGGCAACAAO^GAAACrCCATCrCAAAAAAACAAAACAA 
TTGGGCTCACAGAGGTAAGCrGCATGTCCCTGTTC 
5 TTGC CTTCCAGGGT CCTGC CTCTTACAATATGAGCTGT CAAGTTAGATGCCrc CACT CAGTAAAC CTACTCTGTTTTAAGTAAAAA 

CAAGAAGAAACAAATCTGAATATGCTAGCCTATCTCAGGT^ 
GAGTTTTGGGGGAGGGGCCTTCTGTCTATACTTGAGC^ 

TT CTT ACTC OVGAAGGAAATCTCT AG ATGGGGAAAGAAGGTTTCAGCTrri'ATT CTAGTAAGCAGGGCTCTACCTATAAAGAGCTG 
CTTCCACCACTCTTTITTTTTTTTTTGA 
1 0 CAACCTCTGTCTTCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGAG 
TAATTTTTGTATTTTTAGTACAGATGGGGGTTrCACTATGTTGGTC^ 

TGGC CrrCCGAAAGTGCTGGGATT ACAGG CATGACCCACCGCAC CTGGCCTC CACCACTATTAT AATAT CACCAGGTTCCC CATTTG 
AATCCTTCAGTGCCAAAGGTTTTGCAGAATTCAAATGTTT^ 

GGTAG CCCAGG AGGGCCTAGGCAG CCCAAGAACCAAACACATG AGTC CAGTAGGA 
1 5 TGAAAATAAATAATAGTCTTACITTAGTTCAGATTAGGTTTCrrGT 

TT CT CAG AGTG1TTTTGAGTTTAGAATTGTGGTT AAG GGAGTATGG ACCTGTTC 

GGCTCACACCTGTAATCCCAGCACTTTGAGAGGCTGAGGTTGGTGGATCACCTGAGG^ 

TGGCGAAAACCCATCTCTTCTAAAAATATAAAAATTAGCT^ 

CG CATGAG AATC^CI^GAACCTGGG AGGTAGAGGTTG C^ TG 
2 0 AAACTTCATAAAAAACJUUVAACAAAAACAAAAAA^ 

GCCAGGTACTCTAATTATGTACTATGTCAGTTGATTCTCAAAA^^ 
ATGTACAATGCTATAATGTATAACACAGGAC^IATGTAGCTGTTAAAAG^ 

ACTCACCTCCTAAGATTGTTGTGAGGATTAAATGTGTTACTTTATAAGAAATGTCTGGCACATAG 

2 5 TTTTTACACATGGGGAAACTAAGCCT 

TTTG AATCCAGG CAATCT AACT CCAG AGCCTAC CTT CT ATG CTACTTTTTGGCTACGAG CAAACAATCTGTCAAGAAACAAAGT AG 
CTAC TAATCT AAACAGATGTGAAATTTGAAGACCAG TTG ATCTTTGGGGAATGTTGGGTTCTT CAG ACAATGGTAG CT CAGTAATG 
T AAAGGG ACAAATG ACAG C CACATG CC^CTTGGTT AACT CCTTCC CTAGGT C CTGATGACCAAAT AACCAG C TTATTT CTCAACT 
ATTGGTTGGCTTTCATTTCAGGTCTGGTCAGCTGCTTAT^ 

3 0 AGCAGCATGCAGTGCCGTGTCCCC^TGGCGGTCCTGGTGGGC^^ 

CGTGGCGGGCR.CAGGCCCGGTGCAGTGGTGGGGGCTGCCCAGCATCrACATCGAGGCC 

CGGACCAGCCGTCCTCCAGACAAGTAACGACGAAAGCGACGTTCTCGGCGTTGGCGGCGGGAAGTGGAG^ 

CTGGGGAAGGAAAAAAGGCAGCAGTCAGGACITCAGCCTTGGCTGGTCCTTCT 

CTTTGGTT CCGTC T TTTTTTAATAT CTTC AGCAAGAGATGAGG CCTAACCT AACC CTG AT CCTTT ATCAGATGATAGATTTGAAAA 

3 5 AAAA TT T TTTTT GAGGGGGGTGATAGGATCAGAGGTTTAA 

GCTCAGGAGCCC!AGGTAAATTTTTAATTTTTAACAAAGAACIT 

AG AAATCTGAGTTTTAAAAAGT CACAGATAATCTCCAAT AATG AT CTAGAAATTGAAT AT CATGT ACC CGG CAGACAG ATGTGGAG 
GCTTCTTC CTCTGGAACCTGGGGGG AGGGGTTACITCATCAGACCTGCCCCCGCCCCCCCAAGTACC CCCAG AG CCGTAGGCCCAAG 
GCCTGTGTTTAAGAAGCTCGGAGACGGGAGGCGGGAAGGGCGGAGACIACTCCAGGCTGGAGGAAATGGCGCAA 
40 TGGAGGACGGAAGTGAACTGTGAGGGGCGTTACCGGATGT(^TTCCGCCCCGACCGGGTAGTTCTTGGCCAG 

ACTAGGGAACTTAAATT AAAGGG GC CGTCTGAAAC CAGAAGACTGGACTGG AGG CGAGG AAAAG G AGGCGAGG GG AGGGGAGG GAG 

AAAAGAGAGTTATTTGGAGGTTTTTCCCCGCCTCCTCTAACTTG<3CAGAGA 

AGACAGAGAAAATAGAGGAGATAAAGACAGGATAAAAATCACATTAAAC^ 

CAGG GACAGAT CAAAAAAAAG AAAAAAT ACAG ACAAAAGACGG AAGAAGACTATCGTAGG ATGGGG CAAGTC 

4 5 GGACTTGAGAAATATGTAGAAAAAGATGGAGATGTTAAC^CGGGAGGCAGGGGAGGGGGCGGGATGGTGGAGAGAGAGAGAAAGG 

TAGAGAGTTAGTTTAGAATTAAGCCCAGAATGCTCTTTTCCCAACAC^^ 

CTAGACCTTTCCGACTGGGATGGCTAACCTGTTGTAAGCCCGCAGCTTTGGGCCTGGTCTCT 
GTACTGC CTG AGCAAG AAGTG CTGGAGAGGAGGACCAGTCATCAATAGGAGGATGAGATTG 

5 0 GAGTGAGCAGGGGAGCACTAAGACCCAGGGGTAGTGGAGGACTGCAGCAACGAGCTGGAGGAGGA^ 

GAGCCATCTGGTACTTTGACAGCATTCAAAACAGCATCGGCC^^ 

TCTGCAAGCTGAGAAGCGGGCAGCTGAGAAGGTGGCAGATGCC^GAAAGAGTGAGTCTCCTCTTTCCTC 
GAAAATTGGGGGTGGGGGACAGCAAACATTTTGGGAAAACCCAAGGCTC 

GGAAGAAATGGATGGATATTAGAATCTGGCACCTGGTTGGCTGAGAGAAGG CTGTATAACTTTCTGGAAGGGACTGACTCCTG CTA 

5 5 TTACATTGTGTGTGTGTGGGTCCATCCCCACTCACTGTCCTTTCTTCTX3 

AG GAGGCACAG ATGGAGGTGG AGCAAT ACCGCAGAG AG CG AG AG C ACG AATTCCAGAG CAAG CAGCAGG CGGTGAGTTG AGGCAG A 
GTCGGGATGAGACCCCACTGCAAGTTGGTGGGTGCATCTAGTGAGGTGTC 

GCTGAGGCTGAGGGGACCCTGGCAGGGACCACAACATTGGTGAAACTTTGTGATGATATGTAGGA^ CTGGGAGTTTTGAAGG C 
CACATAGAGCTTGTGGGCGGAATGCCACAGTCTGTGTAAAGTATAACIATCT^ 
60 (3AGTTTTATGGTCATGGATGGTGAGGTGGTGGGGATATTACGGTCTGTTTTAGGATGAAGTTGCATGTTAGGT 

GGACTGTGTTGATCTCTTTGGTGTTGGGATATTTCTGTGGGATGGGGGTGGTTTCTGAGAGGGCCTTTCTTCTAGGCTTTGT^ 
GGATCTTTCCCCTCATATGCCTGGACCCTTGTCTGTTTCTGCT 

TGGGCTCCCAGGGGAACCTGTCTGCTGAGGTGGAGCAGGCTACAAGGCGCCAGGTGCAGGG 
CGAGAGCGTGTCCTGGCCC^GCTTCTTGGCATGGTCTGCGACGTCAGGCCCCAGGTCC^C 

6 5 CCACCGTAGGGCCTGACTCCTTCTGCCAGTTCCCTCCCn'C^ 

TCTTTCCATC C C CT AGAAATC CTGGG AGG CAGGATC CAAT AAT TTT CCTGTG ACACTT AT AAAT ATCCTGCTCACATCTGAAT CTC 
CTTGTTGTTCTTTAACCCTCACTGGGACTTTGTAAACTTCCAAGTCATTCTCA 
AGTAGGAATGTGGAAAACATCCTGACTTCAGTGTCTGGCCGATGTGGGTCCCTCTCTTGACCCTGT 
AGGACAAGCTACTTAACTTGGTAGCCTCGATGTCCTCCTCTGTGA 

7 0 AATGATTAGGAATCATTCTGTAAAGTCTAGCACAGTTCCTTGCATGTTGTA^ 

TTACCACCACCTGCTCACTGGTCAAAACCTACACAGCTGTTC 
TGACCCTAGTTATTTTCTGAAAAATTGX3TTCTTCTCTTTTC^ 

CCCCTCTCTTATAATTCCAGGTAGATAACTGCATTTTGTAGCCTCTCTTTGTTTTTCTTTTGCTGATCTTTGTCTTTA 
TCCTCCTTTCCTATTTCCCCAAAGACTTATCAGATGCTCATTG 
7 5 CCCTTCCTTTCTATATCCTTGACACCTTACTTTCCC^TTGTAACAATAAAAAAAGTATCAAT 



2137 



WO 03/008583 



PCT7US01/51291 



GGTGAGTTGAAGCAGCCTCCTTTTGCCTCAT 

TTGTTGCCCAGGCTGGAATACAATGGCGTGATCTCAGCTCATTGCAACCTC 
CTCCCAAGTTGCTGGAATTATGGGTGTGTGCCACCACGCCTGG 
GTCAGGCTGGTCTCAAACTCCTGACCTCAAGTGATCCACC^ 
5 GCCTGGCCTTCAGTCACX'r i ' L "r i ^ rrn ' riX 3TTTACAT^ 

CAAGAACAAGTAGTACTTT ATTATTTCTCT^ CATTTGGGAATCTG AGGACTAT AGATCG CAACTGT 

AGAGAAAAGCTGGAGTGTAGGAGCAAGTGCTCTTTGCCCOT 

AGACAAGGTCCTGCTGTGTTGCCCAGGCTGGAGTTCC^ 

GCCTCCTGAGTAGCTGGGATTACGGGCGAGTGCCACTATGCCTTGCTA^ CACTTGCC 
1 0 CAGGCTGGTCTGAAACTCCTGGGCTCAAGCAATCCTTCGGGCTC 

CTGACCACTTATCGATACITGACATTATATTTGTGTTTAT 

ACCGTTGTGTCCCCAGATCCTAGGACAACATGTGGCAC^ 

GAAAATACCn^CTATGACACCATTCTTGAATTAGTTACTTCAT^ 

TAAGGTAACCTGCAGTGTATTTACTAACCAGATTCTTTGAGCAGG^ 
1 5 GCTGTAACCTCACCAGCCGTACTGCTTGAGATATGTTGCTTTGCTTCGCTTCT 

TAGTATT AAATG ATT AAGTATGTTAACAGGG AGAGGGCCAAACGT TTGTTGTTTT ATT ACACAG CAGGACAT CAGGT CTT ACT TTT 

GTGGCTCCCCATCTCAAAGACGGGGATAGCAAATGTTTCATTCAGGA 

CCAAGAACATTCTGCTCGAATTAACAGTAT^AATGGGCCX^GCGCGGTG 

GTGGGTGGATCACCTGAGGTGATACATGGGTGAAGCCCCGTCT 
2 0 TGT AAT C CTAGCT ACTCGGGAGG CTGAGGCAGGAGAAT CG CTTGAAC CCGGGAAG CAG CGGTTG CAGTGAGCCGAG ATCAGGACAT 

TGCACTCCCGCCTGGGCGACAGGGCGAGACTCTGTCrCAAAACAAAA^ 

ATCAGCTACCTCGGACAGCATCCTTTTGGGAAAATACCGCCCACCAGCCCCACGACT 
ATCCAGTTCCCTGAGACTTCCCTCCTCCCrCCCCTCAGCT 

2 5 TTCACTACCGGGCCAAGGACCCCGTGGGAACTCGCAGC 

AAGACCTGCGTGG CAAAAAC CTAAACG AAG AGATGAGGGG CATGGAG AGGAGTAG G ATAAGAGAAT^ 
GACGTTAGTTTCCTTTATATCTTTTGTTACTGGAGGTAGCAGTC 
CCAAAGGCAAGCCTGGAGCGCA.CGGATCTGTATAACCGCGGAAGGCCCTGTT^ 
AAGGCGGGAGGCTGGAGTAGAGGGAAGCCTGCAACCGGAAGTGAAGGCA^ 

3 0 CGCTCTCCCTCTTTAGGTAAGCTTTGGCCTTCGCTACAATCCGTTTC 

CTGATACCCTTTTCACAGGCGATGGTCTGGT(X3CTG^ 

AGAGCTCCTGGGGGAAGGAAGGGAAGCTAAGGGGGGACCCAATCCAAGATGGTGTCCTCGGCGCCAT^ 

TCTT CCAATGGGTTCTTTCTCATATTGGAGGCCTCAGCATCAATGAGAGG CGGTGCTCGGCGTCCCTTGGTCTTGGTATTTGCGGAG 
GGCGGGGCTOTTCTCACCTTCCT^GTTCTTTCITGAGCT 

3 5 CCAGTTGGATTTTTCTCACCTTGACTTGCCCAACTTAATTTGGAGTGCCTT 

TTTCTCCAAAAGGAGTCTCACCTTCGTAGCGTAACAGTGATGTGAGACCACT 

GCCTTTCTCTGTCACCTATTAGCTTTCrrATTGTAGGGTGGAGA 

CGCCCCTCCCCCCCATGCTAATTACACAAGGCTTGCTTAAACAGC 

GAGGTGGACGGCGGCCATATGATGTTTTCTTTTCGAAAGGTGAGCGCTTTGCGCAGTGATGACCCT 

4 0 TGGCTGCTGAGTTAGGCATCCATAACGGTGGGATTATA^ 

TCATCTCCATTTTCCACTTTACTAAGTTATTAGTCATATTTTACCTTTTATTATCTA 

AAGAATTTATAAGTTGGTCTTCCCCTTCCAACrTTTCTGGTTTCCGCrrACTGTGATTGCTAATCT^ 

CCACTTTCCCTGGTACTGCTTTTTCTGTTCTGTTATATTTGCTTTTCGTTTTTATGT^ 

4 5 CCTCCAGCTCTTCTGTTAGAAATAGTATCTTTGTTTTCCTTTGCTGT^ 

TTTGCGAGAACCCATCCAGATCCCCCTTCCCTTCrTCCCCTGCCGGCCCAGTTAT^ 

GACTATGAAGATGATGAGGTGG AGACAG CAGCTG GGGG AG ATG G G G CTG AG GC CCCT G C CAAG AAGG ATGTCAAGGGCT CCTATGT 

CAGAAGGTAAATTTTCTCTTGGGCATGTAGTGCTCATTGGGCTCTTTAAGGGTACA^ 
50 GGTGGTAAGGGTTTATACTTAAGGCTAGATCAGGGCCAGGTGCAGTGGCTCACGCCTGTAATCCC^ 
AGGAGGGTGGCCACTTGAGCTCAAAAGTGCAAGAGAAGCCTGGGCAACATAGCGAGACTC 
AAGCAGGTGAAGGTTGAGGCTTCAGTGAGCCGTGATTCCACCACTGT 

TAAGTATCAAAAACAATTTTGGATAGAGGAGGCTTATACAGGCTTATTCTTTCTTTCGTGATAG 
AAGTGACTTCCAGGTCTGCCATTCATTCTTGTGACTGGC^ 

5 5 AAATTTCCAGAAAGGTCCTGCTTGGACCTGTAGTCTCCCTCTGTTGGGCCAGGCCAACT 

TTTAGAGATGACTGGCTTCTGGGTCAGGTACCAAGTCCTTCATTTTGTCCAGGGTTGTA 

CCTAAG CAG G AG AAG TGGT CTGTGGCAGGTATTCAAATGTCATG AATTGTTACAG AT T AAGAAAAAT AAGG ACAGAGCT AGG AT C A 
TTGAAGGTGAGCGGTTGGTAGATGCAAGGGGTTTGTTACTA 
TTAGGAGAGTCTGATTTTGAGGTGAATGTAATTGAGCAGAGAGAGGTAA^ 
60 AGACTTTGAGAATTGAGTGGTAAGAAATGGGCTTGGCATGGTGAAAAAGGTAGAGTTATCTGGAGACTGAAGTCTAA 
CTCCCCCCCCAACTTTTAGTCCAGCATGAGTGCATCCCTCA^ 
GGGAAAGACAGCAGTGTTTGTCTTGGCCACACTGCAACAGCTGG^ 
GGGGATTTTGGTTAGGACTATAAGGGAAGGGTGTTTTTGTCCTAACTACA^ 

65 ATCGGCCCAGGTGTGTTTTTGTGAC^TCACTTCCCTAG 
TGCCTAAGGTGTCTGTGCTGGTGATGTGTOkCACT 

ATGC CCAATGTCAAGGTAAGCCAAGG TAAAGAGAC CTGAG AGTG AGGGTGTGG CAAGTTGGAGG GAT AAG AAACTTGT AGG CCAAT 
AGTCTCTTTAATTTTG<^GAAGCTTTAGTTTGCTGTGGTGTAACAGAGTGTTGAGTT 
CTATGGATCATGCTTAACACAAGACCACCCTTTTCT^ 
7 0 TGTGG CAGAG AAAGC CGGAAACTTT AAAACAGCTC CAG TGG TGTGTG AAT AT TGAG GATTCTG GCCAAGTG CACAATGGCTTACAC 

CTGTAATCTCAGCAGTTTGGGAGGCCAAGGCGTGTXjGATTATTT 
CCTCTCTCTACTAAAAATCCAAAAATTAGCCGTGCATC 

AT CG CTTGAACC CAGG AAG CAG AGGTTGT AGTGAG CTG AG ATTG TG CCATT ACACTC CAG CCTGGGTGGCAGAGTG AGACTCCCAT 
CT CAAAAAAAAAGAAAAAATCTGATTGAAG TTAAGC!ATTTTTG G CAAG AAT C CTT CATAGGTG AT G CTGTATCTCCTGTT ATG CCA 
7 5 aVAATCTGGTCGACTTATGTTAGTTATTTTATTTTATTTTTATTT ATTTGTT TTGAGATGGAGTCTCGCTGTGTCCTTCAGGCTGT 
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GAGTGTAGTGGCGCGATCTCAGCTCACTGCAAC^ 

GACTACAGTGTGCCATCATGCCTGGCTAATTTTTGTTTT^ 

ACTCXnX^CCTCAAGTGATCCACCCACCTCGGCCT^ 

G ATTTT AGATTTACCATAGGATTAG CGTCGTGACAGTCTGATTCCACAGTTGTTCTTTTCC CC TTCT 
5 GGTGTTATTTGATACTGTACCAAGG (XCAGAT CCCCAAACAACTATTCAC CT AATGGTTTTAACATGAAATGAT AATATTT AGCCC 

GAAGCAGTAATTTC^TGGGGTTTGTGTGAAAft 
TGGTACTTTAGACT AATCTGGTGATTTCTTG C^ 
C^GGCTGGAGTACAGTGGCTCAATCTCX3CCTCACTGCAAT 
TAAATTTTGTATTTTTTGTTTTTTAGTAGTAAAT^ 
1 0 GGTCTTGAACTCCTGGCTTCAAGTGATCCACTTTCCTTG 

TTATTTTATTTTTATTTATTTTATTTCTTTTTGAGA CTTGTTGCCCAGG CTGGAGTACAACGGTGGGATCTTGG 

CTCACCACAACCTCTGCCTCCCAGGTTCTraTGCCTCAGCCTCCTGAG^ 

C TTTATTTTTTAAATGAAG CCTGGCCTCATAAATGAAGGAAGTTGG TTAGATTAAGT CAGTAGAACTGAATTATTGTCCTG ACTGC 
TCCGACTAGCCATGTAACITTAGGCZAATCACCCrrCAGTC 
1 5 TAGTTTTAGTTTGGAAACATGCTAAAAATTGGTTTAGCT 

T TTTGAG ACTCTTTGCTTCTGG CT CGGCAGGTTG CTGTTTTTTTTGGTG GTCTGT CT AT CAAGAAGGATGAAGAGGTG CTG AAGAA 
GAACTGCCCGCATATCGTCGTGGGGACTCCAGGCCGTAT CCTAGC CCTGG CTCGAAATAAGAGCCTCAACC T CAAACACATTAAAC 
ACTTTATTTTGGATGAATGTGATAAGATGCTTGAACAGCTCGOT 

TTTGGAG C CAAATGATGTTT ATTTGAAACAGGAG CAC CT CAGTGCAAGGACGACT CTTATCT ATCAC C CATG ACTG ATGGCTCTGG 
2 0 GTTCCCTGGTTGGTCTTTATTATGCTTTTAAGCACAGTAA 

AG AATAGGGGACTTT GATAATTTTAGGCATAAGTCATCAC CACCACCAC CGTTTTCATTAT AGATTCATAT ACTGGG AGTCAT AGG 
GGAGATTCTAAACTGAAAGAGAAGACAGTACCCTTCTGGCATCTCCAGC^ 
TCTAGACTGAGGTCTGGGATTAATGTAGAGAGTGTTTGTAGC^GTTTGTGTGATGT^ 
GAAGATTTTTCTGTAAGAATTGAATCTTGGTGAAT^ 

2 5 AATTCT ATATGGTGAG CTAT AAAGGTGGG CTCCAGGTAGGGATGT C!AT ATTTG CCTGACTTGATAG AAAAGTAATC CAGAG AGTCA 

TAGATGGACTCTGATATCTGGAATATAATATGTGCTTGATATTTGT 
TGGGAGAAGGTCCCATAAAGCIATGTTTTGAAGGCCTTGAGAGCCTCTGCACTGGG 

G GGTTTTACCTT A'lTT CT1G CTTGGTTAAAACAAACAG CTGGAAT CTGATCCCACTTCTTGATTCCAAGTCCATTGCTCrTTCCAT 
TGTGTTGTT ACT ATTTCCAGCAATCTTCAC CTCACTGGG AAGT CT AC CTCTAAT L^lTGTTTATCATAC CTGCTT ATrrTCTCCTA 

3 0 CAATTTTTTTCCTTGTTCTTGT AGACATGCG CAGGTCATGATG 

TTCAGTGCTACCTTGAGCAAAGAGATCCGTCCAGTCTGCCGCAAGTTCATGCAAGATGTAAATACCC 
ACTCCCCGCC03CTGCCTCCTCCCCITC 

AACTGGGAGTAATGACTCCTTGAAGAGACATACAGAAGCAGAGACAGCTAGTGT 
AAGACTTGGTCGGGTTAATGTGAGAGCGGGTAGTGTTCGACTr^ 

3 5 GAGGGCAGGATTTTTCTGCCCTACCACCCA^ 

AG CATACAGCTGGAG C CT TCTGCTCACCAAAACTC CTAC TT C CCGGTGG CAG GAG AGCAAG AG AGGG ACAGACAGATG GCAGGG CA 
TGTCCAAAAGAAGAGCATCAGCACAAATGAATCCTCCCCTTCCCCACCTCCAGGGGT^ 

CCCTACTCCTTCCCACCCACATCTCCTTGCACCCATCTGGAACCT CGTGG CGCGG 

CGAGGGAATGCAGACGGCACCCAGCGGTGGATGGCGGCAGCGGAGGCCGCGGGGAAACCTGACCAGGA 

4 0 CCTCrTTTTCCGTTCCCGGTTTTTTTC^ 

GAATGGGGTGGTAGATTTTTTTTOTTAAAAAAATTTTTTTGTTT 

GATGTAATCGGGTGGTTGTTAGGGTTTGGGGCTAGGTGGGGCCAATTGCATAAGCAGTG 

TTCCTTC CCGTG GGATGATCACT CTTTAG CTG TATTTGGGG CT AG AATG AG ATTTG AAGGAG GCCATGGAAC TT CTCTTTAG AAAG 
CCTGCCTTGGCTGGGCCTGGTGGCTCACCTCTAATCCCAGCACrTTGGGA 

4 5 GAGACTAGCTGGGGCAGTGTAGTGAGACTTTGTCTCTACCAGAAAAACCGGGCGIGGTGGCGCATGCCTG 

GGAAGCTGAGGCAGGAGGGTTTGCTTGAGCCCGGGACGTGGAGGTGGCAGTAAGCTGTAATTGTGCCACTGTACTCCAGCCTGGGT 

GATAGAGTGAGACCCTGTATCAAAACAAAACAAAAAACAAAACCTO 

CACATCCTTTCCTATTTTGTCCTCTGGGTCTTCATATTA^ 

TTGGAGTTGTGGT AATTTCTGT C TTTGTTACTTTC CT CC CCTTOVGGGGG i"i r rri AATTTTAAAGATGAATG 

5 0 TGGTGTGTGCCTGTAGTCTCAGCTATTCAGGAGACTGAAGCAGGAGGATCACT^ 

GATTGTGCCAGTGAATAGCCACTGCACTCCAGCCTGGGCAACATGGTGAGATCCTGTCCCTTAAAAGCGTATCTGCTGCT 

TTGGTATTTTAACACCACTTACTGATACCTTTCCTGTAAACCrrGTAGATG 

ACTATCCTGTTGTGGTCTGACGGCAAGTAACTCATCTTGAGTA^^ 

CATT CTG AT C TT AAAAGACATC CACTTTGCTAATGC!ACACGAGATTCTCT TAGTTGAAGTAGG AGAATCAAATGGAG CAGTTG TCC 
55 TCCCCCCACCCCATGTTCTTAGAAGCACCTCTGATGGAGTTAT^ 

GTCCTTGCTCTCCCTTTGAGAATC^TCTCCCATTTT 

GTTTC!ACTGCTCCTTGACTTCIATTTGCAGTTTCTTTCCC^ 

AAATTAGATGTAATGTGAGAGAGACCACAGTAAAATGTGGGTATTT^ 

CATCCCCCGATGGGGTGTATTGGAGATCIAACTTCCTCC^ 
60 CCCCTTCCTCCCCCCC^GCCAATGGAGATCTTCGTGGATGA^ 

ACTGAAGGACAACGAGAAGAACCGGAAGCTCTTTC 

GG ATGAG CATTG^ AGCACT CCAGCTGTAGCAGAAACCTGGAT ATT AAG TACACTTTT AT TG AGG AAAT CACATGTGTG ATGTGGGA 

GAGAATAATGAGGGTATAAATATCTTAGGGGCTGAGCATGAGTAAGGTGGGAGCTGCTTTTCTAT^ 

TCCTCAATAACCTCAAGGAAAATAAACTTCAAAAATTAAGATCCT 

6 5 GGGAGGCTGAGGGAGGTGGATCACTTGAGTCCAGGAGTTTGAGACCA^ 

GAAAAAAAT/TAGCCAGGTGTGGTGGTGTATGCCTC 

AATTGAGGCTGCAGTGAACTATGATTGCACTACTGTGCTCCAGCTTGGGCAACAGAGTGA^ 

GATTTTAGGAAGTTGTTAAAAGTCTTGAAACGATGTTTGGGGGCATGTTAGGGTTCTTGAATGT 

ATTCAAGAGAAGCATTTCTGACTGGGTGCAGGGCAGTGGCTCATC 

7 0 GCTTGAGCCCAGGACTTCAAGACCAGCCTGGGTAACATAGGGAGACGCCC^ 

TAGCTGGGCGTGGTGGTGTGCACCTGTGGTCCCAGCTGCTTGGGATGCT 
CTGCAGTGACTTGCGATTGCACCACTGCACTCCAGTCTGGGGGACAGAGTGAGACCCCAT 
TTGTGAGTGGTCTATTTGC^TTTAAAACTGCTTTCTAGAATTAGGATAGCTCCCT 
AGTTACCCCTCCAGATCTTAATTCTAGTTTTTTATCACTC^ 
7 5 TTGATTTCTGAACTCTGTGCTACCTCAGAGGCC^GCTTCCTTAGGGAAAAATCAGTGCTGAAATAA^ 
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TCTAAATATATAGTGGGGGAATAAGAGAAATGAAGAGGAATTCCT^^ 

CTCTCACACACCATGGACCCCTATTCCCCC^TTTGCGACCCCCCACCCCACCCCACAAC^ 

CAGCGGTGCATTGCCTTGGCCCAGCTACTAGTGGAGCAJG^ 

GTGAGCTGAAGATGGGAAAGATATTTTGTGTCCTTGGGAGAAAAAGACAGTTGAGW 
5 TTTCTCACAAAGG CrrTCTCGGTATCAGCAGTTTAAAGATTTT CAACGACGAATTCTTG TGG CT ACCAACCTATTTGGCCGAGGCA 

TGGACATCGAGCGGGTGAACATTGCrTTTAATTATGACATGCCTGAGGATTC^ 
TGAAAAAATCCCACTCTCCCATTCCCTTGTTTTCTGTTTGTACATCr^ 

ATGATGCCAAGATCCTCAATGATGTGCAGGATCGCTTTGAGGTCAATAT^ 
1 0 AGTGAGTACTG ATCT CATGAAAC CCTTTAGGTCCT CCCTGTTC CTT AGTGTGTTTGTCCTAAATC CCAT CACATAGGT CATGGG CA 

TCTGATGCATAATGGAGACTTGACTGGTTCATGCCCCCTGGTCT^ 

TTCTGTCTTCTCTCATCATATTACATCCCTTCCCTCACCCCCACGTCCGTCCTCTGAACC 

GTGCCGTGTGTTCCTGCCTCACTTTCCCCTTTTCATGCCTTATTCTGACCATGCT 

GAAGACTCGCCCATTTTGGAATGTGACCGTCIX3TCCTTCAGGAGAGGACACCA 
1 5 CCCTGACAGCCCCCACCCCATCGCTTCCATCTTTTGCATCACCACCACTCCTGAACC^ 

TAACAAAACTAAAAATGAAACACATGTGTCTGTGGTATCT 

GGCATGGTCCAGGGTGAATTCXrTATAAGGCCTGGGTGCCCTG^ 

CTGGCTCCCCXTACTGCCTGCCAATTGCCTGCGGTTTGTGGTCTCT^ 

TCCCAAGGCTCCTGGTGGGTGGTGCGTGTATTTTCAGTGCCGGAGGGTGCTGTGGGCACTGGG 
2 0 GGCTCTCTTGGTGCCTCCTCATCTGTTCCTTCAGCTTCTGGATCTTGAGCACCAGGG 
TCAAGGAGGGCCTGGTACAGCTCCAGCTGCTGCTCCAACAACT 
CTGGTCAGGGAATTAAGGGAGGGAGCATCAGCCAGGGCAGGGGGCCGAGGCCCT 

CATAGGCCTCTGGGTGGTAGTAGGGGGAGCAAATAGAGTTCCCTGGCCCAGGGGCT 

2 5 GT CACTGGAAAGAGG AT C^GCCGCTC CTCCC CX5CT AAGAAATAATTAACTGTTAGGTGAGGGGGAAT AGATC CTGTT CAAGGACTT 

TGTGGACTGTGCTGTTGTGGGTGGGGGTGGGGTGAGGGAAGGGAA 

AG CTGAACTAAGAAAAGAGCTGCAGGGGTAGGCATGGTGTGGGGTGGTG CAGGGTGGGATTGAGGGTTTTTTTTTCCCACACCCCA 

GTGTAAATTCTCACACCCTCTGTTCCTACCTGTGGTGCCACTTACCCTGGGAGGGGACGTCA 

TCTGCTCTTCCATGCTTGCTTTGGGGTTTTGGGAGCAGCACCCATGGGAGCCCT 

3 0 CGAAGGAAATGGTACCGAGAGAGCCAGGGCAGAGGGAGGACCATC 

CACCAGGAACTGGTTCGCTCCAGGACTTGGCCTCACITGAGTGCCTGGCCCTGC 

CTGCCCCACTCTGCCCCACGTCTCTCCCAGCCTGGCCCCAGACAGAGTCCAGGAACAACTCCTGTTCCTGATOT 
TG CCAGTTTAGGCAGAACTTG C TT TAG AGCACTGGTG CC C AGC CT AC CACAGGT CTGTGA rrrriri r 1 TTTG ATCT AGTGTTT AT 
TAGGTATGAATTTTACAAACATTAGCGGTAGCTGTGGAGCTGGAGAGTATTGCAC 
35 TAGTGTCGTAGAACTTACAGCCCTTTCCAAGGCCGTGGCT^ 

ACCACTCGGTGTCAGGTTTCCTTCTGATAGTGTTATGGAATGGATCAACTAGGATA 
ACCAGCCCAGTGAGAATTCAGGACTCAGAGCCCCACAGTGGCATCCAGCTTGCTCTTCTGTA^ 

TAG CTGGTT AACAC CATG AT AGACAG GCTTG CTGTT AAG TTGC CTT CTT AGGAACT AG ATT TTCAGC CAC CGTGGCGCTTAT ATGT 
AAGATAACCTTGCTTGGCTGTAGCCCAGTAGCGTGCT 

4 0 GGTACTGCCAGCAGCGGACCCTCAGAAGAAAGCTCATGACATCA 

TCTTGGTTTACCCAGTGGCTGCTGCCAGA(mGAAAGGAAAGGCTACCACAGGTCTTGTGTCT^ 
GAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCGCAATCTGGGCTCACT 

TCACCGTATTAGCCAGGGTGGTCTCGATCTCCTGAC^^ 
45 AGCCACCGCACCCGGCATGCCTTTTTCTTAACACTGTTTTCTCACT 

GAGGAAACTGGCTCAGATGGTTTCATTCAGCATTCACTGGGGAAGTGTCTGTTGGGGGCAGCTC 
CCACAGCTGGTTGTTGGTAGGGCCrGGAGGGTTCATGTCTAGGTCCACCTGACTTGAAAGCTCA 
ATTCCCCTTTGCAGATGTACCITTTATTGTGCTTCCCTTTATTGCT 
TGGCAACCCTGTGTCAAGCACATCAAAGAGGTCT 

5 0 GTGGAATTCTCAAAATGTTT CT^GCGTTTTCAT TAT T AT ACTTGTT ACAGTGAC CT GTAATCAGTTACTGAAGTTAACTATTGTGA 

T TGT TTTGGG ACACCATG AG CGATG CT CATATAAG ACAG CAAACTT AATGG AAAAATGTGTGTGTTGTGACTGCTTCACCAACT 

CCATTCTCCCGACTCTGCTTTCCAGGCCTCCCrrATTCCCTGAGG 

ACATCAITGTCTTATTTTAAGAAGTTGTCAAAGCAGCCTTGAG(^ 

AAAAG ATCAGGATTAG CTGAAG C CTCAT ATG ATTGTT AG CAT TTGTTT AGC AATTAAGT AT TTT AAAATTAAAGT AT ATGG CCAGA 

5 5 TACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTG 

CCAACATGGTGAAACCTCATCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGATG^ 
GGCTGAGCAGGAGAATGGCTTGAACTCAGGAGGCGGAGGTTGCAGTGAGCC^ 
AGCGAGACTCCATTAAGTATATACAO^TTTTTTGTACACAATGCTACTGTAC^ 
TTTTATGCACAATAGGAAACTAAAAAGTTTGTGTGACTCACTTTGTTC 

6 0 TGCCTGTCATTTCCCTTTCCCTCTTCTTGCTGGCCCAGAATGACCTTGTTTCTTGCCCCTC 

TGCCTTCTCTGGTAGGTCTGGGCACTTTGTATCCCTTX3TAACCTTGGCT 

TGTTGGACTAGTGAGCCTCCCCCAACACCTCCTGAAGTAGAACCAAAGGC CTGTGCACACACCGTGCATGTGTGAGTC TGCAT AGA 
GATGT CAG CTTC CTG CAG GGTGTTCTG AAGGGATGT CCTGTTGTG ACTGG ACTGTGACATAG CCACAGGC C CAGAGGCAGGAGTGG 
CTCAGAAGGGAGTGGCTGGTCCCAATTTTGATGATCTAGGAACAGG 
65 GGAGAGTGGACAGCTGOTAGCCAGCTCGCTATCTGGATATCACTCTGCATTGGGAGGGAA 
CCAGGAACCAGGCAGTGAGGACTTCCCAGCG<3TCAGTGCTTCTCACACTTGCGGGCCA 
TACGTTCCTCATTAGCTGACTTTTTCCCACTTAAGTGGAAAAAGAACCCAGAAC 
CCTCTTGTATCTTGGTGATAAGGTTATGC^TGACTCATACTTTAATTGCAATGTGTA 

AAGAGCCCCAAACTACTGTTATTATAGATAAGCGAAACTATGCAGTATATGGTTAAACAATCCACAACTAATTAACA 

7 0 GGCCGGGCGCAGTGGCTCATGCrrTGTAATCCCGGCACITTG^ 

AGCCTGGCGAACATGATGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCrAAGCGTGGTGGTA 
ACTTGTGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCT 
GGGCGACAGAGCAAGACTCCGTCTCTCAAAAAAAAAAAAAAAAAAAAAAAAGAAAAGAAAA 
GGACAGAGCAATGCTCTGTGA£CCAGGCTGGAGTC 
7 5 TTCTTATGC CTCTG CCTCC CAAG AAG CTGGGATTACAGACGTGTGCCACT ATG CCGAGCTAATT TTTCT 
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GGTTTCACCATGTTGGCCAGGCTGCTCT^ 

GGTGTGAAC CACCGTGCCTGGCCCTCTATCTGTTAA.TTT AAAAGATTAGCAG CCATTT 

AGACAATCTAAATGATACACTAATAACAATCCITTGGGAA 

AAAATGGACTAAATGGACTAACCCCCAATAATTTATAGTAGCTA^ 
5 AAATAGAATCTGAATTTTGGCTTTGTTCACCTGTGGGA 

TGAGGCCAGGCGCAGTGGCTCAC^CCTGTAftTCTCAGCACTTTGG 

C^TCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAA 

ACTCGGGAGGCTCAGGCGGGAGAATCACTTGAACCCGGGA^ 

GGCGACAGAGCGAGACTCTTGTCTCAAAAAACAAAAAACA 
1 0 CATAATTGTAGAAATGAAGGAATTAAGCTAAAAAAATACATAAACCAGA^ 

CGTGTTGACATTTAATTGCTATCCTAATGGTATTAAGATGCAAGACm 

GCCCAGGCTGGAGTGCAATGGCACAATCTCAGCTCACTG 

G AGTAG CTGGAATT ACAGGCACATGCCACCACG CCCAGCTAATTTTTC 

GCTGGTCTTGAACTCCTGACCTCAGGCGATCCACCrrcCCTTG 
15 GCCGATGTGGGACCTTTCMGGTTGATTAGATTGAATAGATTAATGCCATTC 

CCTTCCACCTCTCACTATGGGATGATACTGCAGCCAGGCC^ 

TCTGTGGGAAAAACATTTATTTTCTTTATAAA 

GAGGCCAAGGTGGGTGGAAGGCTTGAGCCCAGGAGTTTGAGACCAGCCTGGGCAACA 
AAAAATTAGCTGGTGTGGTCGTGCGGGTCTGTGGTCCCAGTTATTTAGGAGGCTGAGGTGGGAG 
2 0 GAGGTTGCAGTGAGTCGAGATCATGCCACTGC^CTCCA^ 

TACC CAGTCTGT ATT AT TCTGTT AT AG CGGCAGGAAACGGACTAAG AGACAT AGATTATGTT ACTGTG TTT ATTTATTT ATTGTTG 
TTTTTGTTATTC CTGACTCTTAAT AT AG AGT CTT AATCAG ATG AG CATTC TGGC CTGG CCT CCGCAGAAGGGG CCTGT CTTTAG C C 
ACGGACAAGAGGAGATTAAGGCCAGCATCATCCACAAGG 

CAATATTGTATCCTACCCAGTGGATTGGCAGGACTGGGTGACTGACAGGAATCATTGTTGCCTCTATGGGAAAGTCT 

2 5 GGGGGCTGAGGGATGTTGAAGTTTAGCCATTACATTACAGTGAGAGAGATTACATTACrAAGTC 

TCTGTTACTGTCACAGGTGGCTTTCACAGTAACCT^ 

GCAGCATTTGCTGAAAAGACTATCTTTATTGTATTGTCTTTGCTCCTTGTATTT 

TCAACCCCATCTCTATTAAAAATACAAAAGTTAGTCGGGCATGGTGGCAGGCTCCTG 

GGAGAATCACTTGAACCCAGGAGGCGGAGGTGGCAGTGAGTCAAGATACTGCCACTGCACT 

3 0 CCGTCTCAAAAATAAATAAAGAAATAAAAATTTAAAAATTAATAAATAAAA 

GTAATCTCAGCTACTTGGGAGGCTGAAGCAGGAGAATCGCTTGAAC^ 
GCACTCC^GCCTGGGCGACAAAGCAAGACTCCGTCTCAAAAGGA 
ACCAAGGAGTTAGAAGGCCATCTX5GTGGCAGGGCTGGCAGAGGACCAG 
TGAAGGCACTGAGCTTGGGTCCCCCTTTGGAAGACAATGACCTGAGAGCT 
35 CTGCTTTCCTCATCTGTAAATGGGATAATATCAGTCrrCACCAGCTrCT^ 

GCCTCTAGTCCCGGCACTTTGGGAAGCCGAGGTGGGCAGACTGTTTGAACTCAGGAGATGCAGAACAGCC 
ACAGTCTCTACCAAAAATACAAACAATTAGCTGGGCATGGTGGTCTCT 
GACTGCTTGAGCCC^CGAGGTAGAGGCTGCAGTGAGCCATGATTC 
CTCAAAACAAACAAGCAAACAAACAATAAAGGAAATCCCTACC^ 

4 0 GTAATCCGAGCACTTTGGGAGGCTAGGCTGGCAGGT 

CTCTACCAAAAATACAAAATTAGC CGGGCGTG ATGGTGCATGCCTGTAATCC CAG CTACTTGGGAGGCTGAGGCAGGAGAATCTCT 
TGAACCCAGGAGGCAGAGGTTGAGGTGGGCTGAAATCGCGCCATTGCACTCTAGCCTGGGCAACAACAGGGAAACTCCAT 
AAAACAAAACAAAACAAAAAACAAAACTCCCATTTTTGCGAGGCA^ 
CAGAGCTGGGGTCTGGATGCAGGTCTGCTTCGGGGTAATCCGCTCTTTre 
4 5 AAGTTAGATGCCTGCACTCAGTAAACCTACTCTGTTTTAAGTAAAAACAACAAGAAA 

TACGTTAAAGGAAATTTTTAAATAGGGGGTTTTTTGACCATTTGGGGGAGTTTTGGGGGAGGGGCCTTCTGTCT 

GGGGATGTTAGGGTTGTTCATCTGGATCTAGAGGTTTTCCTGTAATGTTCTTACTCCAGAAGGAAATCTCTAGATGGGGA 

GTTTCAGCTTTTATTCTAGTAAGCAGGGCTCTACCTATAAAGAGCTGCTTCCACCACTCTTITT 

50 AGCCTCCCAAGTAGCTGAGACTACAGGTGTGTGCGACCATGCCTG 

GTTGGTCAGGCTGGTCTCGAACTCCTGACCTCGTGATCTGACTGCCTTGGCCTCCC^ 
GCACCTGGCCTCCACCACrATTATAATATCACCAGGTTCCCCATTTGAATCCTTCAGTO 

TTTGGG ACTTGATAGGGCTGACCT AAAAGT ACACT CACT CT AT ATTAGGTAGC C CAGGAGGGCC TAGG CAG C CCAAGAACCAAACA 
CATGAGTGTTTCTGCAGGGAAATGTATGAATATTGA 
55 ITCTGTCACCAAATGAATTTTGGTGGCAGCCTGATGAAAAATGTTGGTTCTCAGAGTGTTTTTGAGT^ 
AGTATGGACCTGTTGATAACAAAAACAGGAACAAGGCCAGGTGTGGTGGCTCACA 
TTGGTGGATGACCTGAGGTCAGGCATTCGAGATCAGCTTGTCCAACATGGCG 
TGGGCGTGGTGGCATGCGCCTGTAATCCTAGCTACTTGGGAGGCTGAC 

GTGAGC CAGG AT CG CACCATTG CATT CCAGC CTGGGCARG AAGAGTGAAACTTCATAAAAAACAAAAACAAAAAC AAAAAACAG AG 
60 AAACAGGAACAACAATCGCCAGCATATACCAAGTC 

AAAACAT AT ATGG CACAACATGGGTACT CTG AACATGGG CACAAT CAATG TACAATG CTAT AATGTAT AACACAGG ACAATG TAGC 

TGTTAAAAGCATGGACACTCTATCTAGTCCATCTGGGTTATAATCTCrrGCTCTACCAGTGAATAACTGTAACT 

TCTCTATGCCCTCTTTCCTCAGCTGGGAAATGGGGGATAATATCAGTACTCACCTCCTAAGATTGTTGTGAGGAT 

CTTTATAAG AAATGTCTGGCACATAGCAAAGGTTGTTATTATTTTAATTTTTACAC^ CAAGT AACCTGTC 

6 5 CAAGAATACGTAGCTATGAAGTGTGGAGCTGGGATTTGGAGCTGGGGTTTGAATCCAGGCAATCT 

TG CT ACTTTTTGGCTACGAG CAAACAAT CTGTCAAG AAACAAAGT AGCTACTAAT CT AAACAGATGTGAAATTTGAAG AC CAGTTG 
ATCTTTGGGGAATGTTGGGTTCTTCAGACAATC 

TCCTTCCCTAGGTCCTGATGACCAAATAACCAGCTTATTTCrCAACTATTGGTTC 
GACCTTGTTCCCCACTGAGCAGACTCACCATCTGGGCCCTGGCGGGCAGCAGCATGC^ 

7 0 GG CAGGGTCAG CCCCGAGCCGAAGCAG CAGG CACAGGGCAGGGGCATCGTGGCGGGCACAGGCCCGGTGC^GTGGTGGGGGCTGCC 

CAGCATCTACATCGAGGCCTGGGTGTCGCTGGAGGAGGGCCTGGGCCCGGACCAGCCGTCCrGCAGACAA 

CGTTCTCGGCGTTGGCGGCGGGAAGTGGAGGCCATGGAACTCrrTGGGCTGGGGAAGGAAA^ 

TGGCTGGTCCI^CTCCCTCCATCTCTGACATCCCCTGTT^ 

G AGG C CT AACCTAACCCTGAT C CTTTAT CAGATGAT AG AT TTGAAAAAAAATTTTTTTTG AGGGGG GTGAT AGG AT CAG AGG T TTA 
7 5 AT TTTTTTAATGT AAAAT T CGAG AAAAGG GTAAAT AATTGGTTTAAGG CT CAGGAG C C CAGGT AAATTT TT AATTTTT AACAAAG A 
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ACTTT AAAAAAACC CAACAGGG CCGGATGGGG AAAATTTTTAT CAGCAGAAAT CTTG AGTTTT AAAAAGT CACAGAT AAT CTC CAAT 
AATGATCTAGAAATTGAAT AT CATGT AC C CGGCAGACAGATGTGGAGG CTT CTTC CT CTGGAACCTGGGGGGAG GGGTTACTCATC 
AGACCTGCCCCCGCCCCCCCAAGTACCCCCAGAGCCGTAGGCCCAAGGCCrGTGTTTAAGAAG^ 

GCGGAGACACTCCAGG CTGGAGGAAATGG CAAGCAGAG ACGCAGGTG^GAGG ACGGAAGTGAACTGTGAGGG G CGTTACCGGATG 
5 TCGTTCCGCCCCGACCGGGTAGTTCTTGGCCAGATCTCCCAGGGGAAACTAGGGAACT^ 
AGACTGGACTGGAGGCGAGGAAAAGGAGGCGAGGGGAGGGGAGGGAGAAA^ 
ACTTGGCAGAGAGAGGAGATGGTTCAGTGATGGACG 
CACATTAAACATGGAAAACAAAAACAAAAACCACAGTGGGACAAC^ 
ACGGAAGAAGACTATCGTAGGATGGGGCAAGTGAGATGCAAAAATTTGGACT^ 
1 0 AACGGGAGGCAGGGGAGGGGGCGGGATGGTGGAGAGAGAGAGAAAGGTAGAGAGTTAGTTT^^ 
CCCAACACAGGTTGCATGATGACATCCTTACCTITr 

TCCTTCTCTCACCTCAGTACTCCCCCGTCTCCGCCCCTGCCTCATCCCTAGACCTTTCCC^CTCGGAT 
CCX3CAGCrrTGGGCCTG^TCTCTGCTGCTCCCAGGCGGCCCCTT 
CATCAATAGGAGGATGAGATTGGGAGAGACACTCGGTGGA.GGAGGCTGA^ 
1 5 GACTGCAGCAACGAGCTGGAGGAGGAGAAGTAAGCGGTGGGGGGTGGGAG 
GCCATAACAACAGAAATGGCCAGTCAGTCCCAAGGTATCCAGCA 
TGCCAGAAAGAGTGAGTOTCCTCTTTCCTCCCTTAGGAGTITGGAAAGAAA^ 

CC CAAGG CTGGCGGGAAGACAG CTAGGGTCTGGAGGCTGGTTAGGAGGGAAG AAATG GATGGATATTAG AATCTGGCAC CTGGTTG 
GCTGAGAGAAGGCTCTATAACITTCTGGAAGGGACTGACTCCTGCTATTACATTG 
2 0 CTTTCTTCTGCCTCCAGGGAAGGCCCGGCGACTGAAGCAGGCAAAGGAGGAGGCACAGATGGAGGTGGAGCA^ 
GAGAGCACGAATTCCAGAGCZAAGCAGCAGGCGGTGAGTTGAGGCAGAGTCGGGAT 
AGTGAGGTGTGTAAGGGTGACTGAACAAGAAAATATGGTGGC!AGAGGGCTG 

GTGAAACTTTGTGATGATATGTAGGAGAGTCTGGGAGTTTTGAAGGCCACATAGAGCTTGTGGGCGGA^ 
AGTATAACATCTATGTGGAGTATGATTAACATTTGTGGTGGAGGGTAGAGTTT^ 

2 5 ACGGTCTGTTTTAGGATGAAGTTGCATGTTAGGTCTAAGGGGAAAGGGGACTGTGTTGATCT 

GGATGGGGGTGGTTTCTGAGAGGGCCTTTCTTCTAGGCrrTTGTT^ 

CTACAAGGCGCCAGGTGCAGGGCATCCA^ 

GACGTCAGGCCCCMGTCCACCCCAACTACCGGATTTCTGCCTAGGG 

3 0 CAAAGAAATCCTCCAATCAAAATCACCTCCCACCATAATCCCTGTCTTCTTTCCATCCCCTAGAAATCCTGGGAGGC 

TAATTTTCCTGTGACACTTATAAATATCCTGCTCACATCTGA^ 

CCAAGTCATTCTCACCTAAACCCTCTGTGAAATTTGTAATATGGGGAAGTAGG 

CGATGTGGGTCCCrrCTCTTGACCCTGTCACTTGCrrGGCrrGTGAAACCAGGAC^ 

CTGTGAAAC TGGG ATGAT AAT AATG C CTAC CTTGTGAGGGTTG CTT CAAT G ATT AGGAATCATT CTGTAAAGTCTAGCACAGTTCC 
35 TTGCATGTTGTAGCAGTGATTCAGTAAGTAGCA^^ 
TTTCCTCACGTCCATCACTGGCTCTCTAATTCCAC 

TCCCAGAGACCTTCTGATCTCCAAAAAGAGGAGATGACTACATTTAGCCCCTCTCTTATAATTCCAGGTAGATAACT 
AGCCTCTCTTTGTTTTTCTTTTGCrX^ 

ATTGCTTTCTAAGATCTAAAATGATACTGTGTTCCCrrCATATGCATGCCCTTCCTTTCrATATCCT 

4 0 GTAACAATAAAAAAAGTATC^TAAAATAATTATTGGCAAATAAATTGGTGAGTTGAAGCAGCCTCCTTT 

ATTTTCAGTCACTTTGTTTTTTTTTTTTTT 

CATTGGAACCTCTGCCrrCCCAGGTTC^GCGATTCTCCrGCCTCAGCCTCCCAA 
CTGGCTATTTTTTGTATTTTTAGTAGAGATGGGGTTTCGCCATGTTGGTCAGGCrrGG 

CTGCCT TG<3CCTCCCAAAGTGCTGAGATTAGAGGTGTGAGCCACTGTG CCTGGCCTTCAGTC^CTTTCTTGTTTTTTGTTTACATA 

4 5 TTCCCTAAACAGCCCAAATGGCTATCCTTTGAAACTTCTrGGAGAAAC^ 

AACATGGTTCCTCATTTGGGAATCTGAGGACTATAGATCGCAACTGTAGAGAAAAGCTGGAGTGTAG 
CTTTACCTTGCATTTTCTTCATAGCACT 

CTCACGGCAGCCTTGACCCCCTGGACTCAAATGATCCTCCCACTT 
GCCTTGCrAATTTTAAAATTTTTTGTAGAGATGGGGTCT 
50 GGCTCGGCTTCCTGftAGG^TTGGGTTACA 

TGTGTTTTCTTTCCTGTAATGTAAACACTGTGAGAACAGGGCTGTTCACCGTTGTGTCCCC^GATCCTA 
AGGGAGGCAGTTCATAAATACTTTTGAATAAATTAAATGATACTTG^ 
TCATTTGTCACTGAAGACAAGCTTACTTCACCAAGAATTTGAACCAR.TAAGGTAACOT 
AGCAGGGAGGCAGAATACAATAGAGAATGAGAGATGTTTGCAT^ 

5 5 TTTGCTTCGCTTCTGTCAATAAGATGAGAATAAC 

AAACGTTTGTTGTTTTATTACACAGGA.GGACATCAGGTCTTACTTTTG 
ATTCAGGAAAAAAATCCAGGTTGAACAATOGGGCTGTTGGGGC^ 

GGGCGCGGTGGCTCACGCCTGTAATCCCAG CACTCTGGGAGG CCGAAGTGGGTGGATCACCTGAGGTCAT ACATGGGTGAAGCCCC 
GTCTCTACTAAAAAAACAAAAATTTGCTGGGCGTCGTGGCGG^^ 

6 0 GCTTGAACCCGGGAAGCAGCGGTTGC!AGTGAGCCGAGATCAGGACATTGCACTCCCGCCT 

AAACAAAAACAAAAACAGTATTAATGGAATGTAGTATAACCCTCAAGCCCT 
TTCCCGCTCGGATTCAGAACACCTTCCTGACTC^^ 

TAGGGCCTGCCGGTTCCTAGTGCGTGCCCAGXIAGTCCTCAGGTCACCTTCACTACCGGGCCAAGGACCCC 
6 5 TTCGCCACACTCGTTCCTCGCGCATCCACX3GAGGGGTGCCTACAGAGAAGACCTGCGTGGCAAAAACCTAAACGAAGAGATGAGGG 
G CATGG AGAGG AGTAGG ATAAG AG AAT AAAG ATAACAGTGGGG GG G AGACGTT AGTT TCCTTT ATAT CTTTTG 
CAGTGAAGTTAGAAACGGTTTTAAAACAAATTTCAGACAGGCATTTO 

GGAAGG CCCTGTTTCCXK3TCCCTTGCGCCTGCGCT CCTGCAACCGGA 
AGTG AAGGCAGATTTCCCTCCTTCGTCG CTGTTGCTGCCGCCATACGCGCTCTCC CTGTTTAGGTAAGCTTTGG CCTTCGCTACAA 
70 TCCGTTTCCATCTGCGCTTCTCCGCACC 

CCTAGTTGGTTCGCTATTTCCTTAGCTTGCATCCCTTTCGAGAGCAAAGAGCT 
CAATCCAAGATGGTGTCCTCGGCGCCATTGTGTTCG 

CAATGAGAGGCGGTGCTCGGCGTCCCTTGGTCTTGGTATTTGCGGAGGGCGGGGCTCTTCTCACCT^ 
CTTTTTCGGCCCTCGGTGGGACTGGGAGGAGGAGCTGGTTTCTGGGCCCAGTTGGATTTTTCTCACCT 
75 TTGGAGTGCCTTCCAAGTGTTTACGATACGATTGGTGTCATTGTATGTTTCTCCAAAAGGAGTCTCACCT 
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ATGTGAG ACCACTTGGTAAAGAT CCTGTT AAAG C CTGGG CGGGGATTGCCTTT CT CTGTC^CCTATTAGCTTTCTTATTGTAGGGT 
GGAGACATGAATTTTGTTTTTTTGTGG CCGAGCCATTTGTCTTG CACCG CCCC TC C CCCCCATGCTAATTACACAAGGCTTGCTTA 

GTGAGCGCTTTG CGCAGTGATGACC CT CATCTATCACCCTTGACTGATGGCTG CTGAGTTAGGCATCCAT AACGGTGGGATTATAA 
5 TAGGGAAAGCGGAGTCTTCCTTTGAGGACTTTTCAGGACT 
TTTACCTTTT ATTATCTATTCT ATTTCCT CAC^ 
GTTTCCGCTACTGTGATTGCTAATCTTGTTGGGAACCT 

AGAGGTCGTCATTAAGTCTTGC ri^rrriTl CCCCCCTCCA^ 
1 0 CAATCCCCTACTCTTCACCCCTTGTTTTCACCTATTTTGCG^ 

TTATGGCAGAGAACGATGTGGACAATGAGCTCTTGGACTATGAAGATGATG^ 

GCCCCTGCCAAGAAGGATGTC^^GGGCTCCTATGTCTCCATCCACAGCT 

CCGGGCCATTGTCGACTGTGGCTTTGAGCATCCGTCAGAAGGTAAATTTTCT 



1 5 ACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGG 

GCGAGACrrCCTGTCTCTACAAAACGTTCAGAAATTAAGCAGGTGAAGGTTGAGGC^ 

CAGCCGGGGCGAC^GTGAGGAAGAAAAAATCAGGGGATAAGTATCAAAAACAATTC 

TTTCTTTCGTGATAGCACCAAAGTGCTAATGATCCAAAAGTGACTTC 

TGCTTATTTTTAATTTTGT CACTTGAC TT CTAATTTTAAATTTCCAGAAAG G TCCTG CTTGGAC CTGTAGTCTCC CT CTG TTGGG C 
2 0 CAGGCCAACTGTGGTCTCTGGAAACCTCTATGACTGGT^ 

CAGGGTTGTAGTAGTTACGTGACCCGAAGTATAGCAACCTAAGC^GGAGAAGTGGTCT 
TTACAGATTAAGAAAAATAAGGAC^GAGCTAGGAT<^^ 

AAGGCCTAGATCTGGAGGAGGCTAAAGCTAGGAGGAATTAGGAGAGTCTGATTTT 
AATGGGTCTGGAAGTTGGCAAGAACCAGGTAAATACrAGACTTTCAGAATT 

2 5 GTAGAGTTATCTGGAGACTGAAGTCTAATTTATCTTC 

TGGGAATGGATGTCCTGTGCCAGGCCAAGTCGGGCATGGGAAAGACAG 

GTTACTGGGCAGGTATATTTGGGGAGAGTGGTGGGGAGGGGATTTTGGTTAGGACTATAAGG 

ATGATGCTTGCAGAGCCATGAGCAGATGACCTCTGTTACCCTTGACAACCTG 

30 GAAGAGCTACATTTACCATATGTCTCCGTCTACTTCCTGCCTAAGGTGTCTGTGCTGGTGATGTGT 
TCAGATCAGCAAGGAATATGAGCGCTTCTCTAAATACATGCCCT^ATGTCAAGGTA 
GTGGCAAGTTGGAGGGATAAGAAACTTGTAGGCCAATAGTCTCTTTAATTTTGGAGAAGCTTTA 

GTTGAGTT CCTATGT AACAG G AG GATTCGT AATTGGG CTATGGAT GATGCTT AACACAAGAC CAC CCTTTTCTTACT ACTTT ATAC 
TGACTTTGAATCATATCAGTTTAATAATTT^ 

3 5 TATTGAGGATTCTGGCCAAGTGCACAATGGCTTACACCTGTAATCTC^ 

CAGGAGTTTAAGACCAATGTGGCCAACAGGATGAAACCCTCTCTCTACTAAAAATGC 
CTATAGTCCCACCTGCrTGGGAGGCTGAGGCAGGGGAATCGCTTGAACCC^ 
TTACACTCCAGCCTGGGTGGCAGAGTGAGACTCCCATCTCAAAAAAAAAGAAAAAATCTGATTC 
ATCCTTCATAGGTGATGCTGTATCTCCTGTTATGCCACAAATCTGGTCGACTTATGTTAGTTATTTTATTTTATTTT^ 
40 GTTTTGAGATGGAGTCTCG CTGTGTCCTTCAGG CTGTGAGTGTAGTGGCGCGATCTCAGCTCACTG CAACCTGCGCCTCCCACGTT 

CAAGCGAT^CTCCCGCCTCAACCCCCCGAGTAGCTGGGACTACAGTGTGCCATCATGCCT 
TGACAGGGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCACCCACCTC^ 
ATTACAGGAGTGAGCCACTGCACCTGGCCTCATTAATGATTTTAGATTTACCATAGGATTAGCGTCGTGACAGT 
TTCTTTTCCCCCITGAAACCAGAAAGTAGTTTCT 

4 5 AATG GTTTTAACATG AAATGATAATATTT AG CCCG AAG CAG T AATTT CATGGGGTT TG TGTGAAAAGAGTTTGAGATT CTGGG TTT 

ATTTAGGAAACCTTAATGTTCCATGTGTTTTTTGTGGTACn^ 

TTTATTTTTTGAGATGGAGTCrCGCTCTGTCATCCAGGCTGGAGTACAGTGGCTCAATCT 

GGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGGCTAAATTTTGTATTTTTTGTTTTT^AGTAGTA 

AGTAGAGATGGGGTTTCACCGTGTTGGCCAGGCTGGTCTTGAACTC^^ 

5 0 CTGG GATT ACAGG CGTGAG C CAC TGGGCCTG G C'i u i"r ATTTTATTT TTATTTAl'0"i ATTTCTTTTTG AGATGG AGTATCACTCTTG 

TTGCCCAGGCTGGAGTACAACGGTGGGATCTTGGCTCACCACA^ 
TGGAATTATAGGCGTGTGCC^CCACACCTGGCTCCTTTATTTTT^ 
TAAGTCAGTAGAACTGAATTATTGTCCTGACTGCTCCGACTA^ 
ACATAAAATTTTTAAATTAGGTGACCTCTAAAGTTAGT^ 

5 5 AACCCTGGGGGGATTGGACTCTTTCCTTCCTCTGTTTTGAGACTOT^ 

TGTCTATCAAGAAGGATGAAGAGGTGCTGAAGAAGAACTGC^ 
CGAAATAAGAGCCTC^CCTCAAACACATTAAACACTTTAT^ 
GTGCTGGGGCTTGGCTAATGCTGGGGAGTTGTTCTTTGGAGCC^ 
ACTCTTATCTATCACCCATGACTGATGGCTCTG 

6 0 ATCATCTTTCTATGATTTTTGTTTTTAACCTTTGAGA^ 

TTTCATTAT AGATT C AT ATACTGGGAGTCATAGGGG AGATT CTAAACTGAAAGAG AAGACAGTAC CCTT CTGG CATCT CCAGCACA 
GCATTTACAGTCAGAATTTATAGCTGAATAAGTCT 

ATGTGGTATTCTAGTGTGCCAGGTGGGGTTAATGGAAGATTTTTCTGTAAGA^ 
TAGGCCCCATAAGTCATTACAAATGATCTTTGGCAATTCT 

6 5 GCCTGACTTCATAGAAAAGTAATCCAGAGAGTCATAGATGGACTCTGATATCTGG 

CTGAAGGCTGGCTGGGGCTTGGGCAGGAAAGGGTTGGGAGAAGGTCCCATAAAGC^ 
GGGCTTTATCCCCATTTCATAGTTCGGAACTTTGGGGTTTTACCTTATTTOT 
CACTTC1TGATTCCAAGTCCATTGCTCTTTCCATTGTGTTGTTACTATTTCCAG 
ATCTTTGTTTATCATACCTGCITATTTTCTCCT^ 

7 0 TCGCATGACCCCCCACGAGAAGCAGGTCATGATGTTGAGTGCTACCT^ 

AAGATGTAAATACCCFTCTACCTTCTCTCCCTC^ 
TGTCATTCAAGTGCCAAGAAGGCGGCTTCTGCCCAACTGGGAGTAATGA 
GTTAGGGCCTGCGCGGGTCCCAGGGAAACTCCGGAAGACTTGGT^ 
TCACAACATTTTTGAACCTCTTCTCCCTTCGGGGGAGGGCAGGATTT^ 
7 5 CACCCTACAG CCACG CACCCT C AAGGTGG CATCGAG CATACAGCTGGAGCCTTCTGCTCACCAAAACTC CTACTTCCCGGTGGCAG 



2143 



WO 03/008583 



PCT/US01/51291 



GAGAG<IAAGAGAGGGACAGACAGATGGCAGGGCATGTCCAAAAG^ 
GGGGTGGGGGCCTTTGGCACCTCAATCCCCGATACCCTACTCCT^ 
TGTGAGCCGGCAACAGAGAAGCACCX3TGGCGCGG03AGGGAAT^ 
GGAAACCTGACCAGGAAGCTGAGGACCAAACCAGCCTCTTTTTCCGTTCC 
5 GTTTCCCCCAATATGTGTTGGGGAGGGGTGTCCrGAATGGGGT^ 

ACTCAGAGGAGAGGGACATAGGAAAGGTAAAGTGGATGTAATCGGGTGGTC 
AGCAGTGGAGTGTGTTCTTCCCCTCCCTGCAGTGTTCCriTCCCGTGGGATGATC^ 

CCAAGGTGGGAGGATTGCTTGAGCCCAGGAATTTGAGA 
10 CGTGGTGGCGCATGCCTGTAGTCCCAGCTACTTGGGAAGCTGAG^ 

AGCrGTAATTGTGCCACTGTACTCCAGCCTGGGTGATAGAGTGAGACCOT 

GGG ATTGGGCTTCTGGr r r rrri'CCCATGACACACACATCU"l"'l''rC C TAT1' r i\iT CCTCTGGG TCITCATATTAACTAT CTTCCCCC 
AGGATAGTATAAAAAGTGTTAGGAAAGTTGGGCTITGGAGTTGTGGTAATTTCTGTC^ 
TTT TAATTTTAAAGATGAATGCAGTG AGGTAT AATGGTGTGTG CCTGTAGTCTCAGCTATTCAGG AG ACTC 
1 5 TTGAGCCCAGGAATTTGAGGCTATAGTGTGCTATGATTGTGCCAGTGAATAGC 

C CTGTC CCTT AAAAGCGTATCTGCTGCTCTGAAT TTGGTATTTT AACAC CACTTACTGATAC CTTT C CTGTAAACCTGTAG ATGGT 
TTAATTCTTAGTCAAGAGACCAGTCTCATCTAAAACTATCCTOTTG 
TCTCCTTAAGTGGCATTTTGACTGTCCATTGCAGCATTCTGATCTTAAAAGACATCC^ 
GTTGAAGTAGGAGAATCAAATGGAGCAGTTGTCCTCCCCCCACCCC^ 

2 0 GAGTCACTGCCTCCCATCATTTCCCAGATGTTTGGTCCTTGCTCrCCCTTTG 

CTATTTGAGGTAATGGCATCTGTGCCATTGGGTGGTTTCACTGCTCCTTGAC^ 

GTTGG GCAGTCTT AAAACT CAT CTGAT AGGAAG GAAATT AGATGTAATGTGAG AG AGACCACAGT AAAATGTGGG T ATTTTTGGGA 

GTCGGGTGGGGTTTTCAATCTTCTCTTTCCTCC 

AACCCCCCCACTCTGCCCTCCTCCCGTTCCCCACCCCCTTCCTC 

25 

HUMAN SEQUENCE - mRNA 

CTAAAGGCTGCCX3CCATACGCGCTCTCCCTGTTTAGCTCTTCTGTTAGA 

CCTACTCTTCACCCCTTGTTTTCACCTATTTTG 

C^AGAACGATGTGGACAATGAGCTCTTGGACTATGAAGATGATG 

3 0 GCCAAGAAGGATGTCAAGGGCTCCTATGTCTCCATCCACAGCTCTGGCrTTC 

CATTGTCGACTGTGGCTTTGAGCATCCGTCAGAAGTCCAGCATGAGTGCATCC 

CTGGTGATGTGTCACACTCGGGAGTTGGCTTTTCAGATCAG 
TGCTGTTTTTTTTGGTGGTCTGTCTATCAAG 

3 5 GCCGTATCCTAGCCCTGGCTCGAAATAAGAGCCTCAACCTCAAACACA 

GAACAGCTCGACATGCGTCGGGATGTCCAGGAAATTTT^ 
GAGCAAAGAGATCCGTCCAGTCTGCCGCAAGTTCATGC^ 
ATGGGTTGCAGCAGTACTACGTGAAACTGAAGGACAACGAGAAGAACCGGAAG 
CAGGTGGTGATCTTTGTGAAGTCTGTGCAGCGGTGCATTGCCTTGGCCCAGCT^ 

4 0 CCACCGTGGGATGCCCCAGGAGGAGAGGCTTTCTCGGTATCAGCAGTTTA 

TATTTGGCCGAGGCATGGACATCGAGCGGGTGAACATTGCTTTTAATTATGACATG 
GTGGCCAGAGCAGGCCGGTTTGGCACCAAGGGCTTGGCTATCACATTTCTC 

GCAGGATCGCTTTGAGGTCAATATTAGTGAGCTGCCTGATGAGATAGACATCTCCTCCTACATTGAACAGACACG^ 
GCCCATTTTGGAATGTGACCGTCTGTCCTTCAGGAGAGGACACCAGGGTGGGGTGAAGGAGACACTACTGCCCCCACCCCTGACAG 
45 CCCCCACCCCATGGCTTCCATCTTTTGCATCACCACCACTCCTGAACCCCCATTTCTGATTTGTC^ 
CTAAAAATGAAAAAAAAAAAAAAA 

HUMAN SEQUENCE - CODING 

ATGGCAGAGAACGATGTGGACAATGAGCTCTTGGACTATGAAGATGATGAGGTGGAGACAGCAGCTGGGGGAGATGGGGCTGAGGC 
50 CCC TGCCAAGAAGGATGTCAAGGGCTCCTATGTCTCCATCCACAG CTCTGGCTTTCGTGACTTCCTGCTCAAGCCAGAGTTG CTCC 

GGG CCATTGTCGACTGTGGCTTTGAG CATCCGTCAGAAGTCC!AGCATGAGTGCATC CCTCAG^CCATTCTGGGAATGGATC 
TGCCAGGCCAAGTCGGGCATGGGAAAGACAGCAGTGT1TGTCTTGGCC 
TGTACTGGTGATGTGTCTVCACTCGGGAGTTGGCTTT^ 
AGGTTGCTGTTTTTTTTGGTGGTCTGTCTATCAAGA 

5 5 CCAGGCCGTATCCTAGCCCTGGCTCGAAATAAGAGCCTCAACCTC 

GCTTGAACAGCTCGACATGCGTCGGGATGTCCAGGAAATTTTTCGCAT^ 
CCTTGAGCAAAGAGATCCGTCCAGTCTGCCGCAAGTTCATGCAAGATCCAATGGAGATCTT 
CTGCATGGGTTGCAGCAGTACTACGTGAAACTGAAGGACAACG^ 
CAACCAGGTGGTGATCTTTGTGAAGTCTGTGCAGCGGTGCAT^ 
60 CCATCCACCGTGGGATGCCCCAGGAGGAGAGGCTTTCTCGGTATCAGCAGTTTAAAGATTTTCAACGAC^ 
AACCTATTTGGCCGAGGCATGGACATCGAGCGGGTGAACATTGCrTTTAATTATGACATGCCTGAGGATTC 
TCGGGTGGCCAGAGCAGGCCGGTTTGGCACCAAGGGCTTGGCTATCACATTTGTGTCCGATGAGAAT 

ATGTGCAGGATCG CTTTG AGG T CAATATTAG TGAG CIX3CCTG ATGAG ATAGACATCTCCT CCTACATTGAACAGACACGGTAG 
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Table 105 

MOUSE NOMENCLATURE 
ICSGNM Iqgapl 
Celera mCGl5312 

5 

HUMAN NOMENCLATURE 
HGNC IQGAP1 
Celera hCG27443 

10 MOUSE SEQUENCE - GENOMIC 

TCATGGATGGCTGTGAGCCACCATGTGGGTGCTAGGAACCAAACCT 

CCGTCTCCGGGCCCTGAGCCGAGAGAGCCTGGTAGAAAGCCT 

TGCACCACCACTGCCGGCTTGTCTGCITTCAAAGA^ 

TTGGCTTT CCCTTT CACCTTT ATGTCGATTTC^GG ATTAAACT CAG ATTGTCTGG CT CGAGGAGGGGGT CTTACC CACTACC CT AC 
1 5 TCCAGTAGACCTGTTGTTTTGTTTTGTTTTGTIT 

CA.CAAACTGGAGACACCI^G AAGAGGGAACCT C^ CAGG T CCGGG GG ATAT 

TTTCTTGATTAATGATTAATGTGGGAGGGCTCAACTACTGTGGGTAGTGACACCCCTGGGCA 

TAACAAATTAAAAAACAAAC CAAAAACAG CAATAACAAGCAAG CTGAGCAAGCT ATGG AG AGCGAT CACTTTAAAG CAACATTCC C 
TTT CTG CTTTACAAAC CAGTTCTGCCCACAGG CTCCTG CTG TG CGGTCCAG CTTGG CrrcrCTTGATGATGAGATG ATG AATTCCA 
2 0 ATCATAAGATGAAATGAACCCTTTCCTTTTCAAGGTGGTTTTGGTC^ 

TTTCCAAGATTTAAAAAAAATTTTTTTAAATAGTGTGTGTATGTGTGGTGTGTC . 
TCAGTGCAGTTGCCCATGGAGGTTAGAGGAGCAGGACCT 
CTGGGAATGGAACTTGGGTCCTCTGCAGGAGCAGCCTGCGGAGTTGTTTCTC 
AAAGACTAGGAGAGGATCTTATGTAACACAGGCTGGTTTTGAACTTGCTATGTAGTTC 

2 5 ACCTCCACCTCTCAGTCACTGGGATTCTGTGTGTGTGCT 

AAG AAGTATCAAAGGAGAAAAG CACTCTAG CG C CT CT CAT CCAGAG CATCC CCCATCATCC3VGGAGCCCAAGGTACATAACAAAGA 
GAGGAGATGACACTCTCCCAGGGGCACGGCATGCTTCCAG^ 
TTCTGCTCCTGGAGAAGACTGGAGTGAGGACAGCCAGGCAGTC^^ 
GATTTCCTGGCTTTGTTAACTTCCCITCCATAACrcrCTG 

3 0 CCAG CAGTTCT CAAC CTGTGGG TCT CAAC CTCTTTCAGGG TTG AATATCAAATATCTGGCATT TCAGATATT^ 

TAACAGTAGCIAAAATTACAGTTATGAAGTAGCAATAAAAATAATTTTATGGT^ 
C^GCTTTAGGAAGGTTGAGAACCACTGCCTGAAAACCTTAATCAAGC^ 

GTTTTTACTGAGGTGTGGTAGTATATACACCTGTAATCCCAGTACTCAGGAAGTAGAGGCAGGAGG^ 

ATT TGG TCGACAAAG AG ACT CCTAGGC CAG CCAAGGCTACATAGTTAAAC C CTG TT AGAAACAAATAAACCTG^ 

3 5 AGCAGAGATCTGAAAGGACTGAGCGAGGAAAGGTGCTTGTTACCM^ 

AAAGTTGTCCTCTGACCTCCTCATGTATGTATAAGTAC!ACAC^ 
CCACCACCCAACCCCACACTATACAGCAAATAGTAACAGCAAAGTTCCriXSGCATG^ 
AGGTAGAAGCAGGCAGAGCTCTCTGAGTTTGAGGGAAGCCTGGTCTACAAAGCGA 
<nX3TCTCAAAAACTCTGTCTCAAAAAACCAAACTAACCAACCAACCAACCA^ 
40 AAAAAG AC CAAAAC CAAAAC CAGT AAAG CAAAGCAAAG CAAAAC AAAACAAAAC AAAACAG CAAAAGAGT ACCATCACT AAAACTG 

CAACACAACACAACACAGCACAACAACAGCTTTAGTCC 

TCTGTGGCCACTACATCTTCTGTCTGTTTGAGACAGGGGCTCACTATCGGACCCTGG 

CCTCAAATTTGTGACAATCCTGGGAATAAGGCCTGCACCACCAC^ 

ATCTGTCAACAAGACTTGTAAGGTGGCTGCTGCCTGGAAAGACACAGTGTATCCGT 

4 5 GGCCAAGTGCAGTTTAAATTCTGGCAGTGTTCA^ 

CAG AATT CTACACACACT CACACACACATGTG AGAGCACAT AG ACATACAG ACACACAC CACAT ACACAG AG AT ATACACTAAATA 
AAAATTTCTTTTTAAAAAATGAGATGGTACATATCAAATACT C AT AAGAGGACCTAAATGG CTGG G CATAT AGTG CATGTTTG ATG 
TGTATGAGACCCTGGATTTGGTGGCCAGAACTACCAGAACAATGTCGTGTT^ 
ACGAGTTAGCTATTGATACTTATAATTCATTGACATCCCTGGAGACCTGCTGGGGCAATCTTGTOT 
50 ACTGTGCTATCTCTGGCTCTAATTACTTCCCTTCTCCCACACTTTAGA 

TTGACTCGTTTTATGTGAGTCTACCTGCACGGTTCGAGGGTGGGGAGCCTGAAA^ 
GCCTCAGTAGTCCTAAGTGGTGCTGACTTCCTGGGTGACTCCTGGACAGCCCT 
TGCATGCTAATGGCAGGGACAGTGGCTTTGACCAGAC!AAAGGAGATCAAAC^ 
ACCATCCTGGGAGGTCCTGCCCACCCTGGGTGGGCTTTCTTCCTCAGTTGTCCTTCATC 

5 5 ATTTTTTAGTTGAGTCCGGAGCCAATaVAATTC^ 

GAGG CAGGTGG ACC T CTGAGTTTG AGG CCAGCCTAGT CTAGAG AG AGAGAG AGAG AG TTCT AGGGCTTCAG AAAG AAAGG CTGTTT 



TAT CCAG GGTG TT CAGTGAACAG GACAAAGAAATG AACTCTACTTGTGT AACCAGATT ATT ATG GG CT ACCTCTAAT CTG TCAGAA 
6 0 AGAAGCAAGGGGCTCAGCTAG CTCATAGrriT TATTTATACAAGCTTTTCTCATTAGTTTATTT GACAGTGCT ATTT AGG AG CCAC 

CTCAGACTCCTCTTCAAATATCCAAAATGTCCCTGGAAACTTATATTAGTTTTCT 



TTCCCATTCCCACTCCTATTCCTATTCTTCTTTGGAGACAGAGTCTTGCT^ 
TCCTGCCTTTATCTCCTGAGCACTAGGATTATAGGCATGAACCACCATGCCC^ 

6 5 TCTAGCATGCTCTCTATCAACTGAGCCACATTCCATCTCTGATGTCACCCATCTTTAAGCAA 

TTTTTCATGTGGTGCTTAGCTATTTGGCTTCCCT 
TGTGAGCAGCTTTCCATTTAATTTTGGGGATGTATGATTT 

AGCAACATCCCATTTGGGGCTGGGACGTTAAGTGGGAATACCCCAGGC^GATGAGA^ 

GGGTTTCTATTCCTGCACAAACATCATGACCAAGAAACAAGTTGGGGAGGAAAGGGTTTATTCGGCTTACACT 

7 0 TTCATCACCAAGG AAGTCAGGATTGGAACTCMGCAGGTCAGAAAG CAGGAGCTGATGCAGAGG CCATGGAGGAATGTTCATTACT 

GGCTTGCTTCCCCTGGCTTGCTCAGCCTGCTCTCTTATAGAACCAAGACTACC^ 

CCCCTTGATCACTAATTGAGAAAATGCCTTACAGTTAGATCTC^^ 

TCCAGCTGTGTCAAGTTGACACAAAACTAGCCAGTACACAGGCCAAG 

TTAATATTGCCCATTCCCATCAGCTTATAGATCTCTTGAGGTCATGAACCC^ CAGAT AGCT CATTAT 
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TTTGCCATTTGATGCCTACTTATG6ATTGATTTGGTATX3 

GTATTT CACTGTAAAGAACATAGAGACGTAAACAAACAAGCTCCACAAGTCC^ 

ATATTGTC CTGGGTGCTT CTCTTTTCTTTTTTGAGACAGGAT CACATCATGTAGCTCTGGTTTG TCTAGAACTCT CTATGT ACACC 
AGGCTAC CCTGGAC CTCATAG CTATCTGC CTCTCTCTG C CTCTAGGCATGAAAGACAAG CACACCATCATCCCCAAATCT CTC CCT 
5 XCTGACCAAGAGAGGTTCTTGAACCTATTATTCTTCTGTCTTAGCCTCCTGSVGATCA^ 
GACTGTTTTCTTTCrTTCTTTCTTTCTT^ 
XTTCTTTCTTTCTTTCTTTCTTTCTTTC^ 
TTTCTCTCTCTTTCTCTCCCCCCCTCTCrCTTTCTTCTT 

1 0 AAAGAGTCCGA^ 

CCCAAGGACTGGGGTTATAGGTGTGTATTACTG^CGTTGGATGTCTTTATATTTTA 

CAGAATTTATCTT CATTT AAAC^TCACTT CAGAAAT C CTGATGT CTCTTGCTCACCCAGCTT CCCACTAAAGAACTG CAG ATTCTC 

TTCTG CGTATGGTGGCATGTCT CAG AC CACCCACACAAGTTGAAAA 

TCTCTGCCCAGAGTAGGGATGCTATCCTTCAGCACGACAT 
1 5 TGAAGTCTGGTTCCTATTGATCATGTATCTTTTACAGCATCAAAAAAATTGAAAAGAATCT 

CAAGTTCTT ACT ATGTGACT CAAG CTGGC TT CAAAT TTGCT ATTCTC CTGT CTTAGCCCTCCCACGTGCTATG ATT AATGTATGTG 

TACCTCCATGCTGAGTGGTGATTAGAGCCATTCTAAGAC^ 

TTGCCATTTATAGTTCAAGGGCAGCCCACCTATGGACTCATATACTAG 

CCTGATCCTCCTGCCCTCTCTCTCCCAAGGAACCATTACCACAGACTTTGTATCT 
2 0 AATGGAAGGCAGGCCAGGCTG^AGTGGCC CTAAAGAG CTT CAAC C CTGTGCTCAGTGAC CTTAG ACCAGAATTGCTCTAATTTTCA 

TAG CAG AG ATGGCAGG ACTAATT CTCACATTCAAGATGG CT CCTATT AACCT AGAATT GT AT^ CATAGG AC CTGG ACTG C CCTCC 

TTTGAACCTCACGAATTATACACCTAGAGCTGTACACAC^CACAAAAAACCCT 

CCCATGTGACCCCTTTACTATTAACCTACAGGTA^ 

AGTTTTGTAGACATTCTTTTAAATGCTTGAAGATACAGAAGTAA 

2 5 GCT ACTGGTATTGTC TGG AT AGATGCCTCAG AGACGGAGGG AAAG CAGTGGCTG CTCTT CCAC^ 

GCCCAGATGGGTTACAATTGACTGTAACTCCAGTCCCAGGGG^ 
TGGAACAGAGACACATGAATAGACAAAGCACCATACACTGGGGCT 

TAC C CAC ATGGTAGCTGACAACCAACTGT AACTCCAGTTTG AG G GGATCT CCTAC CTT C CACTG ACCTCAG ATGC CAGGT AAC CAC 
AAG GC ACAGATACGT ACAT AGAT AGAACT AGGGCCTCATGCATG CAAGG CAAGT AAGTGTTTTG CCT CTGAG TTTTATCCC CTG CC 

3 0 CT CTTGT ATGC TTTCTCTTTCCTGTTTCTTTATAGACAGAAT CTT 

ACTTCTGCCTCTCCCCCAAGAGTACTAAGATGACTGGTGTAGTC 

TCAGGGCTATC^GTGTAGCTTAGTGGCAGAGGGCTTGACTAGCGCATGTCAGGGCATGGGCTGT^ 
CTTAACTTTTC^TTTATTTATCTTGACACACAGACACACTCTCT 

TGGCCTTGAACTCACCX5AGATCTGCCTGCTATTGTCTCCTGAGTGCTGGGATTAAAGTCGTGCCT 

3 5 AAGTTTGTAAGGAAAGAGCTAGTAAATTGAAAATGGCTGGAA 

GAAAGTACACAAACTGCACTGTGAAGGGAGTAACAGGAAAAATTAC^ 
GCTTTTAGTTCTAGCAATCAGGAGGAAGAGACAGGTGGATCTTCAAGGGTAGCCTTC 
ACTACATAGTGAGACTTGTCCCCCITTCCCAAAGAACrCCGTACTTTGGAAAGAGC^ 
AAGTGAGAATAGAGATTATGATGATCAAAAAAGTGTTATCACAGGTGGGGACAGA^ 

4 0 AGTCGAGGAACCTGGGGTGTGTGTGTGACTCK3TGAT^ 

TTAGTGTCTGTGAGAGGTCCAAAAATGCACGTTAGATAGTTTGGAACTACTGAGGGGCTTCT 

GCCATCAACraTTTATGGTGAGGGTAAGAGTCCTGAGAATGGGACTTT^ 

TATTATTATTATTAGAGCCAGTGATTCACTCCAATCCAGGCTGGTTCAGAATAGA^ 

ACAAGGGT AAAGAGACGAG G CAT CCTT ATTAATAGGGTCAAG AT AACC CC C C CCT AAG AT CTTG CAG AT TGT AAAGAGCTGAT AAA 

4 5 CAGCTCTTCCGCGTGAGGATCGTCATCCCCCAGTTTAAGAGATTGAG 

AGATTGACAGAATGTAGCAGAGAGTAGGAGTGGGAGTCTTTCCrTGACAGTG^GTCCT 
ACTTGCAACGGCTCCTTGAAGGAGCCGCGGTGCAGAGGGGGTTAACTCAGTTGTCAGGAGAAGG 
TTTCAAGCTGTAGACATTGGCC^CCTTTTAGTCAGAGCCGCAGATAG^ 
GCCTAGGGACTCGCTACAGCACCCroCCTCCTCGCTTCCGCAAGTGGAGAGCGAAG^ 
50 ACGCCCCTTCCGCTCGCTCCGGGGAGCGCAGCTGGGCGTGGACrCAAGGGCGGGGTCTGCGGGGCGGGACCTGC^ 

ATGGCG CGGGGCGGGGCTCTGAGG ACAG CGACAGCCCGCGCACTGGGCAGG AGTT ACTG CTGCTACGGTCGCCCGCGTCTTCAAGG 
TCTCTGCG^TTCCTCACOJGAGACCTGGACTCGGCCGCCATC^ 
CTATGGCTGTGAGTGCCGAGGGCTTGGGGGTGCGCGGGCGAGGGACCAA 
CCGGGTGGCTTGGGACCGCACGGACCCKX3AAGAGCAGCTGGCTGTATGGCTC 

5 5 CGGAGGCGGCGGGTAGAAACCTGGAGGCCTCTGCCTTCGCTAGGCGCTGCGGCAGGGGAGC 

TTCTACTGAGATTGGGGAAGACGGCCTAGGGTGAGGTGGCAGGAGAACTGACTGTGCAGGGCGTGTCCTGGGCACTGGACCCAGGG 

GTTCTGCTCGGGCGTGGTGGGAAGGTGCTGCCCGGATTGCTGCCTGCGAGGGTGACC 

AGCTGCGAGGTGCAGTTTTAGGCAAGCAGCTTATGCTTTCTC 

TAAACACACCCAAACCGGCTCCCCCCACTTCCTCATTTT CTG TGACTATCCCTGCAA 

6 0 ATAGTAGTTACAAGAGGATCCCTTTGAGTCCCAGGTTTAGATTTTATGTTTTGGGAAAAG 

ATCC^GTTATGCATTGGATAGTGTTATAATGTGTGCTACGTTTG 

TGTC CGAGAAAGAGACIAGGTGTAC^GAG ATTG AT AATT AG ATGAGG GGTTGGG AATTT AG CC CAG TGG TAGAGTGCTTG CT TAG CA 
AGTGCIAAGTCCTCAGCTCCAAGAAAAAAACCAATAAGGTGCAATA 

AGTAATG GGGGTG AAGAGAAGGGACTTAACAAGG C CATG GGAAGGTAGGTG CTGACTTT TAAACAGAGG CTTGGAGG ATGCAGTGT 

6 5 GGGG CAGGGTG AGCAAG ACATGACTTTG AGAGGG CAG ACACAAACTAAATATGGTATT GGAG AACAT AGGGTTACTGTACGTGG AA 

AACCG AG GGCAT CGAAGTGCAG^GGTGGAATTG AAAGGG AGAGGAG AAAAGTAAAAAGGATTTCCCG AAG T AGCC CAGG CT AG AGT 
GAGAATAGCTCTGATTGCCTTCCCAAGGAGTTATCCCTTGGGAAGACAGTGGAGAAGG 
CCCTGGAGTCAGGTGGGCTGau^GCCCTrGATTAGCCCTCCGACTGAGTT 
GGTGATCACAGGATGGGTCTGGGCTAGAGTAGGAGCATGTOTGTAAACTGTG™ 

7 0 AGTGTG AC CAACAAGGATAGACT ATAGTTG ATGC CTCTTGAAG CT ACAAGAAAATGT AG TACAT ATAT AT TTAAAACACTTAAAAT 

TTTGGTTCCTTGATAATTTCTTTTTTAAA^ 

GCCTTCATGTGGTTGTTGGAAATTGATTTTTTTAGGACCTCTGCTCGCTCT^ 
TTATTATAAATAAGTACACTGTAGCTGTCTTCAAACGGAAA^ 
CTGCCTTTCrAGGTGACTAGCCAGC^TTTGATGGCGGTGGCCTTCATTTATTGCTC 
7 5 TCTTTCCTTTCTGTTTTAGCCGTCCTGGATAATGAGAGGCTCACTGCAGAGGAGATGGATGAGCGGAGACGGCA^ 



2146 



WO 03/008583 



PCT/US01/51291 



TGAATACCTTTCTCATCTCXsAAGAA^ 
CCTTTAAAAGAGTTGTTTTCATTTCTGTGTATGGTT^ 

CC CTGGGGCTGGAGTTACAGT CACCTG ATGTGGGTGGTGGCAGC CCAG CT CTGGTCCT CTGAAAGAGCAGCAAGTGTT CCTAACTG 
CCGAGCTGfTCTACCCAGCCTCCACAGGTG&CTCTCT 
5 TAAGTAGGT AGGATAAGTGTAG AGTGTTTATTC^TGGGAACAAGTGGG CCGTGGTGGGTT ATGATGAAACACTCTGT CGAACACTT 
GTGCCTCCTGAAAGATCTCCCACAGTTGTGTTTTC 
AACGTTTTAAGTTAGGGTTTGATGTTCTTGATCTTCTGAGTC 
TCCTGCTGACTCATGGAGTGGGTATTAGGTTTAGGGGGCTGGTA^ 
AGTACTTGTTGAGTGOTCAAGGTCCTGGGAGACACCCAGTGCCAGAAGAAAACTO 
1 0 TATAGATCTGTCCACCAGCAGCATGGTGAGG CCTTTCCGCAAACTCAATG CT CT 

CAGCCCTAGGCTGGCCTCAAACTCATCCTGCTGCTGAGGCrTCTCTGGTCCT 
CATGCGCCACCACACCCATCTCCTCTGCATTCTTTATGAGGAGG 
TGTGATCTTAATTTTAGGCTGTCAGTTTCTTATCAGTC 

ATAAACACAATTATAGGTCGTGTACACTGAAGAATAATGGGAGATAATTGTGATCCAGGCTT 
1 5 TGATTAAAACAAGAGTGGAAATTTGATTTGATTGTCAAGAGCT 

AGCCCTTCAGAGCTCCCTTTTCTTTAAAGATGACAAAAACTGCCCC^ 

AGACTGAAGT AGCGG CTTCCCTGG AACTGGAGTTG AAGGAG ATTGTAGGC CAACCT AGGAGTTC CTAGGTG CT AGAATGGGACC GG 
CTCTTAACT CCAGAGAATGGAAGCTT AGTACGATGGT^ CAT AAAGAG CAGTTGCTG CCCC 
TCCGAGCTGCTTTCCCCATGTGATGGCAAATGGAACTGCAGGGA^ 
2 0 AGGAAGTTCGTAAGGGGATACAAACACTTAAGTCCTCTG 

AGCTGCAGCTGGTCTCAGAAGCC!ATCTTGTGTGCTCCCTTGAGCC^ 
TCCTACCTGTAAGAGTTTTACTTTTCCAGAAGTGCTTGAGGTTTCGAGCCTTC 

TAAGTCAACTTCTGT CCCGACT AGCCAC CT CATTT CAGTG AG AAAAC CAAGG C CTGAG CTITCTT AG CCT AC CTGTAG CTTCTCTT 
TTAGGGGGAGGGGTCATGTAG CCTAGGCTGG C CTG GAGTTCACAATGTAG C TGAGAATGACAG AG CC CTGAG C CCAGTGCCAAAC A 

2 5 /UVGTTGTCTGCCACTACACCTGGCATAGTTTCCCATAGTTTTCT 

GTATGTGGTATGTGCATTAGAATCTCTTGCGAAAGTTTTTACAAATGCGGATAATCT 
TCCTACTGCCGAGCCATACCCCAGCTTCATACCTAAATATTTCAAAT^ 

G AAGTAGGATATT CGT ACTTTT AACACGTTTAAATTTTCTTTGAAT GGTTTT AG ACTCTG AC CT ATT AG AAGTT AATG ATGC TC T T 
CACACTTTCTTTCTCGGTGCAGCGCTGGCAACATTTTATTT^ 

3 0 ATAGAAAAGTAGCCTGATTATGCAACCCATCGAGAGACCACTAGTCC^ 

AAGATGTTTATGATATTCITATTTACITCCTTCTTTACTTAAAGC 
AATGTGGAAATGTTXKICTGGTTTATTTTAGGGGAGTAA 
CCCCCCCAAAAAAAAAACATTCAACAGCCACCTCTGG 
TGGGAGTAGTATTTCTACCGTCGCCTAATGTCATGCCTGCTGGCCM 

3 5 CCTGGGCAGCTGGGTGGTGTCTGTCTGCTCCCGGGGCTCATTTTCCCTCAAG^ 

TCCTATTTCCTCCCAATTTGGAGGAGCCCCAAGGTCTGGAGTTTACATCA^ 

GGGAGAAAGGCTCCCTCATACTTCTATTTATAACACCTCTGCCCCTCCT 

GTATCCTGAGGCTCTCTTTCTCCTGTACCTTTCACAGAGTTTGTTCTTGG^ 

4 0 TAAGTCAGGACATTAAGTGTGAGTTTCAGAAAACTGTATACTGGGAA.TGGTTG 

TGAAAAGTGG CTTCAGTATG GAACTT AAACTG ATG CT CT AATAAAAT ATTTTAG GATG CC AAG AT ATAG CAG AG CAAAGG CTGT CA 

AGGGATGCTCTTTAGTGCTTACACCTATTTCCTTACTTATTTATGTGTC 

TAGCTAAGGAGTAGCTGTATGAGAAAGGGATGCC^TTTCATCTGT^^ 

GGTCTTG AGACG AGAATGTAACAG AG ACGAAT CAGAAGGTGGCTT AGGGAG AAAAAGAG AAGG AGG CAAG AG AG AATGT ACTGGGT 

4 5 AAGCCAGAGGAGTCGGAGAGCATGAGGGGCAATGCAGGCAAAGGAAACGTCAGTTAG<5GTAACGGTCA 

TACTCGTCAGTCATCGATTTAGTCGTGTCTTTCAGGGCCrTTCTGGTGGATCCCT^ 
GCACCC^CAGCCCAGCTTAC^GAGTGAGGACACTCCCCCAACTCCCTCTGTAG 

TGTCTGAGACTACGTGATATTAACAAGAGCTTAGCATTAACTGGAAGCTCTTTACTCCACTTGAGGGGAACTGGAGA 
GAAAGAAAACAGCTTTTGAAAACAGATTCCCTTTTGTGTGTACATCTTTC 

5 0 AG ATGCCT CTC CAGAATGGT ATT AGAATTCCAG AAAAACAT CT GGAT ATTTCCC AGTT CAGACTCTTG ATTTTGAAAACAG AAGT T 

GGGATATTTAATTTAGGCAAAATCCCTAAGTGTATAGGC^ 
GATGGAGAAGTATAGCTTAAGGCTGAAGGAAGAAGCAGTTGTGCTACCGAC^ 
CACAGGGTAACAGCGCTACTCACAGGGTAACAGTGTTACTCACAGGG 
AACAGCGCTACTCIACAGTAACAGTGTTACTCACAGGGTAA 

5 5 CTCACAGTAACAGTX5TTACTCACAGGGTAACAGCGCT 

AGTTACTCACAGGGT AACAGCG CTACTCACAGT AACAGTGT^ CACAGGGT AACAGCG CTACT CACAGTAACAGTTACT CACAG 
GGT AACAG CGCTACTCACAGTAAC AGTGTTACT CACAGGGTAACAGCACT AC T C ACAGTAACATGTTACTCACAGGGTAACAG CGC 
TACTCACAGTAAC!AGTGTTACTGACAGGCTAACAGCACTACT 
ACAGTGTTACTCIACAGGGTAACAGCGCTACTCACAGTAACATGTTACTCACAGG 

6 0 A(IAGGGTAACAGTGCTACTCACAGTAACAGTGTTACTCACAGG 

ACAGCACTACTCACAGTAACAGTGTTACTCACAGGGTAACA 

AGGGTAACAGCACTACTCACAGTAACAGTG TTACTCACAGGGTAAT AGTG GT ACT CACAGTAACAG CTACT CACAGGGTAACAG 
TTAC TCATACAGCAACAGTG CT ACTCACAGG^T AACAGCGC TACT CACAG GGT AACAGCG CTACT CACAGGGT AACAGTGCT ACT C 
ACAGTAACAGTGTTACTCATAC!AGCAACAGC!ACT 
65 ACAGTGCTACTCACAG CTACT CACACAGCACAG AG CTACT AATGTGAGT TTT CTTT CCTAGT AT AACTGT CAGTCACTTT TACCAG 

AGTCCATTGTAAACTTTACAGATGTGAAGCTGAGGCAGATGATGTGTC^ 

GAACAGACAGGCCAG C CT AAT CT ACACAGTGAAAT C CAGG CCGGC CAGGGCAAT AAAGAG AC TTTGTCTTCTAAAAAAAAAAAAAG 

GTGAAGCTGGGGCTGGAGAACGGCTTAATGGTTACAGGGATTAAAGAACACTTAAGTTCAGATCCTCAGAAAAATO 

ATGGCCTGTCTGTAATTCCAGCCCCAGAAAACAGTCGAACACTAGCCATCT^ 

7 0 CAATGAATAAGGT AGCAGAGTG ATGT CGGATGATG ATTCCTGTGAT AAAC CTTAGG CTTC CATGTT CACAAAT ACACATGTGCGTG 

TGGCCCCATATGTGTATACATACATAAAGAGAAATATGAATACACACACACACACACAC^C^ 
CACTGGAAAAAAAATACAAACTGTGAAGCTGTCTGGGTAGCTCGAGTTGTAACTCCAGCCTT 
AGGAATGAGAGTCACTGTCAGGACGATCATCTCCAGCATG^ 

TGAGCCCTCXSGGGAGACTGAGGCAGGAAGATCCTGAGTTTGAGGGCAGCCTAGACTACAT^ 
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TGGGCTTCCAAGTGAGACCrGCCTCAAAACAA^ 
AGTTACTGTCATTGCTAAGTCACACAGCTGCCCCCATGTrACT 

AGTCTTGGCTGCTGAGGT AACTGGGCCATGGAGAWl^TGTTG AGATG CTTGG GTGAACTGG AAAAG ATTGGGACTTTGAATTTGA 
GCAGGTATAGGCTTTGATTCCGGCTTTACAAGCTGTTTAACCCCA^ 
5 CAGAGTTGCTATGGGAGTCAA.GTAGTAGCCCCGAGAGATACAGAAAACACT 
GTTTCTTTTACATGAGGGTTTTGCCTG 

GCCATCTCGCC^VGGCCCAATGGATACATTCTTTAACCTGGTCTGTGGCCAAAGGG^ 
AGCACGTTTTACCAAGTATGTAGCCTAGG^ 

AGACACTCCTGTAAGGTTGGGAG CC CTCGTTTCCACTGGAGACTTCAGTGATTTTTC C CAGTCACAGAGAAG AGTG 

1 0 G AGGTGG AGGGAGT AAGGAGGCTGC CCTTTTATTGACT ACTATACT CTAG CAT CCCTTTACACTAAGATAGCAATAAGG 

GACGGCAGGGATCGAGGGATGCTCTGTGTCAAGTGCATAGTTTTAGGTAGT^ 
TT ATTTTTAT AGACATG TGCACATGTGTGATGGGG AGGTCAGATG AGGGAGTC 

TTCTCACTGTTTTTAACATCTATATAAAATTTATTTAGATCr 
1 5 ACATTTTGTTACGTTTTTGTCTATAGTTTTATTTTTATATCCAGATTTTCT 
TCTTTATGTTAACTTTTGTAAGCTCTCCAGTAGTGTGTCT 

GTGGGTGAG AGTGCACATATG CTCATATATGTGG AAGCCAG GAGACAACTCCAGGTGTCAATTCCTCAGGCAAATATCCZACT CTTT 
TGGAGACAAGGTCTCATTGTCCTGAGGTTATGTCAAGTAAGGCT 
TAGGTTCTGGGTTTGACTCAGCTCTTCAGTTTGTAGTGAATGCCTrCCTGCT 
2 0 GCCTGCTAACTTCTCTGTGATTTAATAACTTTACTTAAGATTTTCTTTA 
GGACraGCOTGTCATGGTGAGTTAAGGACTTCTCTTGACTGCCCAGCA 
CAGTTTTATTTTGGCTCCTTCTGAAGAACTGTGACTTC 
GTGTTAGCAGTTTTGTCTGTAATGTGGGCTTTTCAGTGGGCCrCnTrr 
GTCAGTCCTCAGTCAGACTCTTTTTGATTTATAATGTATTTTAAAAGCTTTCCCC 

2 5 TTTTATTTGTTTACATTCCAAATGTTGCCCTCT 

TGTCTGTTGTCATCTTTCTGCAGATGTGTCATTATATTTTGTTTTTCCATGTGCCCTCTGCCCTC 

CAATAT CTITTTTATTT ATTTTAG CT CTAGG AAATTATC CATTTTAC CTTTAAT AATTGCTTCTG ATTGATTAGTTCTGTCTTGAGG 
GTGAGCTGCACAATCTCTTTATGGCTTCTCTTCATTTCATTTOT 

GCAGTGGTCATCCTAGGTGTGTGTTCTGCTGTCTGTTTGCTGCTGGCTTGGTTTAGTTOT 

3 0 GAGCGTTCATTTTGACCTGTAAAGTCTTTCTATT 

ATAGCAGTCCCCCCAGAGGTGGAGAAAAGGAGGCTCATGGGAGGCGTGGGTTTGAC1ATTTTACAGTTC 
AGGTGGACGTGACCACACACCAGTGTTAACCCGGCGGCTGAGTGTACAGACrrc 

CTAAG CAG CCTGTT CATTTGACCTAATTTGGT ATAT ATTT CTCCTGG ATTTAGAG AT CAAAG CAGTG ATTGTG CTCATGTATTT TA 
AACCCCATGAGACAAAATGGTGAAAATGTATTTGAAGTCTTGAGTTAGCTTAT^ 

3 5 ATCTGTTTCTGATTCTTTATAATTTATTCTGCTTCACTATGAACT 

AGTACTAGTAAATGATATGATATTAGATGGACCTCTGGTT 

GTAAATAAAGTGCTGGTAGGCAATGTGAGACCTTCAGGAAGTGATTCATATCACTCTG 

TTAGAGAC CAT AT CT TTG AGGAGG CTGAGTGTGAAATACTAG CTG AATGC CAAGGG CCAGGGGCT AC CTG ATCAAATTAATCACAG 
CCCTGCTTCrTTCACTTTAGTTTTAGAAGGCTCCTTATAGTGGCTTGGC^ 

4 0 AATG AAGGAT CTGGCT AATGATTT AATTTG AAATGT AAATATTTACTC TG AATTT T CAGG AAACAT TTTTTTT TGAT AGAA 

TTGGTTATTTCTACCAGGCATTATATATATATATATATAT^ 

CCCTCCCATACCCATTTCrrTGAGAATAGACATTCTTTAGATCTGGAAATCCCCTACTCTCCCC^ 

TATAAAAACTGGGACCT CTTTCCCCTCGAGGTCGACTC CTCTACCT CTGCATGGGATATGAGTCGTCCCCAGAGCTCTGGCTTTCC 
CCGAATAAAGCCTCATGTGGTTTGCATCAAGCTTGGTCTATCGTGAGTTCTTGGGTGTCCGCT 

4 5 GG CTCCTCTTGGAGTCrrTCAGTTCCAGACAACTGGTGTCAGCT CATATGG CCGAGGACTGGAGCTGTGTCTGCTCATTAAAGT 

CTCATCCACCCTaVAATGTCC^GTTTGTCTTTTGGTTTGTTTT^ 
ATTACCTGCTGCTAAAACACACCCCAAATCCCAGTGATTAGAC^^ 
TCTGGTTTCAGTCCGCTGTGGTTCAGGAGTAGGTTGGGTTTACAGGCAT 
TAACTGGATTGTTGGGCTATTCCTCACCGTTTTTTTTTT 

5 0 GTCCCACCAGACTGTCCTTTGCTAAATAGTGACCTCATGCAGACAGGAGTGGCA 

• GCAAGAGCCGTGTTTGGGGGGTGTGGGTGGAGGAAGGACTTGTCTGAAGGCCTCTATTTCTTCTTTACT 
GACTCACTGGAAATCATAACX3AAAAGCAGAGG 

GAGG GC CTGG GG AGACAC CAGGCGCCAAGATTGGTGAGAT TTTTG AGG AG AGAAAAT AG AAGG GG ATT CTGGT ATT ATTGTGAAAG 
GTAATTTTTCAAAAAGTATAGTTATGGCCCAGATTGAACTTTTCTTT^^ 
55 ATTCAGAGGGGATGGGAGATCCAGAGAAGAAAAAGCCCACCTGTCTCCT^ 
TTAGATGGCTGTTAGCATTGTCCCCTGTGTATGCC^TATGTAT^ 
TAAGGTGTTTATAGTAATAGCATCTTAAGACATGTTTGAATTTATTAATCTT^ 

TTCATACAGTCCTAGT AT AT ATATGAAATT ATAGAAAAT CATTT GAGTTT GGAG AGATGG CTAAACAGTT AAGAG CAC TAAT TGTT 
CTTCCATAGGACCTGAATTTGATT'CCTGGCACCCATACT^ 

6 0 T TGG CCT CTTAG ATG C CTGTAATG ATCTTGCCCATGTGT ACTTC ACAATT AATG AAAGC7VAAAT AAT AGCAAAAAGCA 

AAAAAAAAAAAAAGGAGGGGGGTGTTGTTCAATGTAGCCATGTTC 
CCTAAAGTCCGTCCTCAAAGGGTCTGCCCTAAAGGTAGGAAGCAAGA 
CCATCTCAGCTCCAAGCCTCCTGCAGGGTCATCTGAGGGGCGCGAGATAT 
TTGTTTCCTCTTTTAGCAGGAAGGAGTCCTGGGAACrCTCCA^ 

6 5 GAGCACACAC CT CT CTTT AAGAG CAT AT C CTG CT AAACAGGTTCATTACATAC T CAT ACAT ACACACAGCT AAAAATCAC C CAAAA 

CTAAGTCTTAAAAATACTCCCTTATGGTTTCCTCCCCTTCTCCCTTCAGTAATGGAGCTTGAAC^ 
CAAGTATTCTGCTGGGCTTTACTTCCAGGCGAGCCCCTGC^^ 

TTAATATTTACATGTGTATTATAATATATTAAATATGAATGTATGTAATATATATATACATAC^ 
AAACTCGACATGTGTATATATGTGTACACACATATATGCAGCGT^ 

7 0 CATGAGGACTTGGGTAGATATCTTATTTATTTTGTTTT 

TCACTGTGTGGACCAGGCTGGCCTTGCACTCAGAGATTCA^ 

ACTTTCACTGTTTGGGAAATATAATCGCTGGGCTGTO3 
GCCAAGGCTACTCCTGGTTCTCCTACTTGTACCACCTAAGCACT 
7 5 TG TAAAGGACAG AGTTGTG AGTT CTAAGAG TTAGGATTG CAAATGTACT AATC CT AATTATTT CAAACAAAAAGAAAACAT ATCA 
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AAAATCTGATCTAGTGAAC T AATTTCTGAO^IAAT ACAC^AGTGGCAAAATTCCAG GGT ACATT ATCAGATAGCTGTCTTCTGAGG 
ACTTAGTAAGACC^VGAATGAC^TGATGCTAGGAGCCACCCTGGCTCCT 
TCTACTCTTCAACAGGGTCACCACATTGXn^ 
GTGATACTTTGCAGAGAGAITGGTTAAATATGAATTAGATTTC^ 
5 TAAAAGGGCCCTTAGAGGGGCGTGGCTGCTAGAGGGATATAGGT^ 

GTGGTTGTACAACATAGCGAGTATAACAAACCAAAAACACCACATTGCT 
GAAAAAGAGCGATTGGTTTTGTTTGGTTTTGCTTTTG 
GGTGACCTTGGCCTTCTGGTTTTTCTGTCACTGTCTCT 
CTACCAACTGAGCTGCGTACCCAGCCCCGGCTTGTTTGTTCCTTTGGT^ 
1 0 C^GACTCCTTGTACACITAGAGGATGACCTTGAACT 

CACCTGTCATCCCTCCCAGCTCTAQTTGCTGGATTTTAACGCAACGGAAGCTC^^ 

CAAATTTGGACTGCTAGAGAAGTGGTTAACACAACAAACTGTGTTTA 

TACGTTGTGGTAGAAAAGGTCTAACTGCAGAOT 

ACCATTAAGTCTGGTGACCTGCATTTGACCCITGGAAGCACTGTAAAGGT 
1 5 TGTGGCCTGGGATACACATACGTACAATAATTC 

TGAGGTTGACCTCTGGCCTTCACACACATGTGCAAGCGTGTGCAGATGGACCCT 

TGTGGGAACCCACGTTGTAGGAGACTTTCAAACAC^GATCCACAGTTC 

GTTA 

20 MOUSE SEQUENCE - mRNA 

AGCGACAGCCCGCGCACTGGGCAGGAGTTACTGCTGCTACGGTCGCCCGCGTCTTCMGGTCTCTGC 
GGACTCGGCCGCCATGTCCGCCGCGGAGGAGGTTGATGGCCTGGGTGTGGTCCGGCCGCACTATGGCTCCGTCCT 
GGCTCACTGCAGAGGAGATGGATGAGCGGAGACGGCAGAACGTGGCTTATGAATACCTTTGT 
ATGGAAGCATGCCTAGGTGAGGACCTGCCGCCCACCACAGAGCTAGAGGAGGGCCOTAGA 

2 5 GAACTTCITCTCTCCCAAAGTGGTGTCCCTGAAGAAAATCTATGATCGAGAACAGACCAGATACAAGGCTA 

GACACACGGAT AATGTGATTCAG TGGCTG AATG CCATGG ATG AGATTGGGTTGC CTAAG ATTTTTTACCCAG AAAC CACAGATATC 
TATGACCGGAAGAACATGCCAAGATGCATCTACTGTATCCACGCCCTCAGTTTGTACCTGTTCAAAC 

TCAAGAC CTGT ATGGAAAGGTTGATTT CACAG AAGAAG AAAT CAACAACATGAAGAT CGAGCTGGAGAAGTACGGGATCCAGATGC 
CTGCCTTCAGCAAGATCGGGGGCATCCTGGCTAATGAGCTCTC^ 

3 0 GAAGCGATTGATCGCAGAGTTGCAGCTGACACTTTTACGGCT 

GGGTCCCACGTACCAAGACGTGCTTTACCAGGCCAAGCAGGACAAGATGACAAACGCTAAAAACAGGACGGAAAAC^ 
AAAGGGACGTTTATGAGGAGCTGCTCACIACAAGCTGAAATCCAAGGGAATGTAAAG 

ATCAGCCrGGCTTTAGAGCAGGGCTGTGCAGTGACCCTGCTCAAGGCTCTGCAGTCACTGGCTCTGGGCCTCCGAGGGCTG 
C CAGAACAGCGA.CTGGTACATG AAG^IAGCTACAG AGTGATCTGCAGG AAAAG AG ACAG A 

3 5 AGGAGGTACAGGCCGGAGTGGATGCTGCCAACAGTGCTGCCGAGCAGT^ 

GCCATCCAGAAGGGCATCGCTGAGAAGACCGTGTTGGAGCTAATGAATCCTGAAGCC^ 

TGATCTCTATCAGAAGGAGTTGGCCACCCTG GAGCAGCAGAGCCCTGAGCATAG CCTCACCCATCCTGAGCTCACTC CTGTGG 

ACGGGCCTTACCAACATCGAGGAAGAAAACrrGTCAAAGGTATCTCGATGAGCTGATGA^ 

4 0 TAATGCATTTATTACATGGAATGACATCCAGGCGTGTGTGGACCATGTGAACCTGGTGGTCCATGAGGAGCATGAGCGGATTTTGG 

CC^TCGGCTTGATTAATGAAGCCCTGGATGAAGGGGACGCTCAGAAGACTCTGCAGGCCCTC 

GG CGT C CTTG CAGAAGTGGCACAG CAC T ATCAAG ACACG CTG AT CAG AG CAAAGAG AG AAAAGGC C CAGG AAACAC AGGATGAG T C 

AGCTGTGTTATGGTTGGATGAAATTCAAGGTGGAATCTGGCAGTCCAACAA^ 

TCTCTGCCATCAATGAAGCAGTAGACAGCGCTGATGT 

4 5 GTCATCCCCGAATGTGGGGAAACGTACCAGAGTGACCTTGCT^ 

GTGGGTGAAG CACTGGGTGAAAGGCGGGTACCATTACT ACCACAACCTGGAGACG CAAG CAGGAGGATGGGCTGAGCCCCCAGACT 
TTGTGCAGAATTCTGTGCAGCTTTCTCGAGAGGAGATCG&GAGCTCC^ 
TGGCTGGCCAACGAAGGCTTGATCACCAAGCTGCAAGCCT 
TTTTCTGAAGAAACAGATCCCTGCCATCACCTC 
50 TGGCTTACCTGCACTCCCATAAAGACGAAGTTGTGAAGATTCAGTCCCTTGCCAGGATGCATC^ 
CGCCTACAGTATTTCCGAGACCATATAAATGACATTATCAAAATCCAGGCTTTCATTCGGGCCAA 

GACTCTCATCAATGCTGAGGACCCGCCTATGATTGTGGTCCGAAAGTTTGTCCACCTCCTGGACCAAAGTGATCAGGACT 

AGGAACTTGATCTCATGAAGATGCGCGAGGAGGTCATCACCCTCATCCGTTCCAACCAGCAGCTGGAGAACGACCTCAACCTCATG 

GATATCAAAATCGGACTGCTGGT<3AAGAACAAGATCACGCTGCAGGATGTGGTTTCCCATAGTAAAAA^ 

5 5 GGAACAGCTGTCCGAGATGATGATGATAAAC^AAGCAGAAGGG 

CCTATCAGCATCTCTTTTATCTCCTGCAGACCAACCCTACCTATCTGGCCAAGCTGATCTTT 
AAATTCATGGACTCTGTGATCTTCACGCTGTAC^CTATGCATCTAACCAGCGGGAGGAGTACCTGCTC 
AGCTCTGCAGGAGGAGATCAAGTCAAAGGTGGATCAGATTCAAGAAATCGTGACAGGAAACCCTACGGTTATTAAGA 
GTTTCAACCGTCGTGCCCGGGGCCAGAATGCCCT'CCGGCAGAT^ 

6 0 AACATCAAAACCGACCCTGTGGATATTTACAAGTCTTGGGTTAATCAGATGGAGTCGCAGACAGGAGAGGCGAGCAAACTGCCCTA 

TGAT GTGACC C CTGAACAAG CCTTGTCT CATG AAGAAGTG AAG ACG AGGTT AG ACAACT C CAT CAGG AACATGAGGG CTGTGACAG 
ACAAGTTC CT CT CAGCCATCGT CAG CT CTGTGG ACAAAAT CCCTTATGGGATG C!GATT CATTGC CAAAGTC CTG AAG GATTCAC TT 
CACGAGAAGTTCCCTGACGCTGGTGAGGACGAGCTGCTGAAGATTATCGGTAA 
CGTCGCTCCCGATGCCTTCGACATCATTGACCTGTCAGCAGGX3GGCCAGGTGACC^ 

6 5 CCAAGATGCTCCAGCACGCGGCGTCCAACAAGATGTTTCT 

TCCTACCAGAAATTCAGACGGTTTTTCCAATTGGCTTGTGACGTCCCA 
CCTAGTC!ACCCTCACTAAGCCAGTTATCTACATCTCCATTGGCGAAA^ 
CCATTGCTCCAGAGCATAACGACCCCATCCACGAACTTCTGGACGAC 
AGCTGTGGCAATTCAAACX^CCCCAACAAGGAGGCTCTGGCT 

7 0 TGGTGACGAGAACGCAGAGATGGACGCTCGGACCATCTTACTCAATACAAAACGTTTAATTGT 

G AG AG ACCTTG ACTG AAATT CTAGAAACCCCAGC CACCAATGJ^CAGGAAGCTG AACAT CAG AGGG CCATG CAGAGACGGGCTATC 
CX3CGATG CCAAAACC C CTGACAAGATGAAAAAAT CAAAGCCC ATGAAGGAGGAT AACAAC C T CAG C CTC CAG GAG AAG AAAG AG AA 
G ATCCAGACTGGCCTAAAGAAGCTAACGGAGCTTGGGACGGTG G ACCCAAAGLAACAGATAC CAGGAACT CIATCAACG ACATTG CA 
AGGATATCCGGAATCAGCGGAGATACAGGCAGAGGAGGAAAGCTGAATTGGTAAAACTGCAGCAGACG^ 
7 5 AAGGCCACCTTTTACGGCGAGCAGGTGGACTACTACAAGAGCTAC 
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CTCCAAAAAGCCTAGGGAAATGJU^AGGCAAGAAAAGCAAAAAGATTTCTCTGAAGT 

T CCTTCIGGAGATTGAAGACCTTCAGG CAAACCAATTTAAAAATGTTAT CTTCG AAATTGGTCCAAGAGAAG AAGTTGG AGACTTT 
GAAGTAAAAG CCAAGTT CATGGGAGTTCAGATGG AGACTTTCATGTTGCATT^ CAG CTACAGTATGAAGGAGT 

TG CAGTTATGAAATTATTTGAT AGAGKTTAAAGTGAATGTCAAC CTCCTG AT CTTCCTTCTCAACAAAAAGTTCTATGGG AAGTAAT 
5 GGGTCGCTCGCCGCCCAGAAGAGTTAACAAAAAACGGCAGCACCTCA 
AAGGACCTGGTCCAGTGGCGGTGCCTCAGTTCACACTCC^ 
AGCCATAAGCCTGACTTCCCCTCAGCCCTGCTCCCTCATTTTCTCT 
CTGTTAAAGCCTCCTAAGTGGTCATTACTACnTTGTAGACAT^ 
CTCGTTTAGAAGTGAAAGCTGTCAGGTTGATAGCAAGCATGC^ 
1 0 ATGGAGTCTGATGTCTTCCGGCTGCTGCCTTTGTGGGCAGAGCT 
AGAAAAAGAGCAATTACTTTTTGAATTCTCCAATAAATCTGGAC^ 
GGTCAGAGCAGGCCATTTAGAGACTGAGTGGGCGGGGCACTTACCAT 

ACTGTTTC T AAACTGTATT CCAAGAGTGC CTTACCAAAGTATAAATTTATTTCTTAAAATGTG AGTAATAGGAATTTTAAAGATT 
ATATAATGCTTTTGAACACTCTGAGAAAGGGTTGTTTTTTTTCA 
1 5 GCCT ACAGT AGGGT CGATCTGCCGAGCTGCTGAAT AAC CCTGCTTGATTTTAG CAACAAGAGACAATCCATGCCATGCTTT TAT CC 
TG AGAAACCTGGACCCTTAGG CTCTG ACTAACAAAG ACAGTTCAG CT CCAGACACTGAACTTGGAATCCTTTGTCTTTGGATTAAA 
AGTCATAAACTTAAAAAA 

MOUSE SEQUENCE - CODING 
2 0 ATGTCCGCCX3CGGAGGAGGTTGATGGCCTGGGTGTGGTCCGGCCGCACTATGGCTCCGTCCTGGATAATC 
GGAGATGGATGAGCGGAGACGGCAGAACGTGGCTTATGAATACCTTTGTCATCTGGA^ 

TAGGTGAGGACCTGCCGCCCACCACAGAGCTAGAGGAGGGCCTTAGAAACGGAGTCTACCTTGCCAAGCTAGGGAACT 
CCCAAAGTGGTGTCCCTGAAGAAAATCTATGATCGAGAACAGACCAGATACAAGGCTACCGGCCTCCA 

TGTGATTCAGTGGCTGAATGCGATGGATGAGATTG GGTTG CCTTUVG ATTTTTT ACC CAG AAACCACAGATATCT ATGACCG G AAG A 

2 5 ACATGCCIAAGATGCATCTACTGTATCCACGCCCTCAGTTTGT^ 

GG AAAGGTTGAT TTCACAG AAG AAGAAATCAACAACATGAAGAT CG AGCTGGAGAAGTACG GG ATC CAG ATG CCTG C CTTCAGCAA 
GAT CGGGGGCAT CCTGG CTAATGAG CTCT CAGTGGATGAAGCTGCG CTACATGCTGCTGTT ATTG CTATTAATGAAGCGAT TGATC 
GCAGAGTTGCIAGCTGACACTTTTACGGCTCTAAAAAACCCCAATGCCATGCTCGTCAATCTTGAAGA 
CAAGACXSTGCITTACCAGGCCAAGCZAGGACAAGATGACAA^ 

3 0 TGAGGAGCTGCT(^GACAAGCTGAAATCCAAGGGAATGTAAACAAA 

TAGAGCAGGGCTGTGCAGTGACCCTGCTCAAGGCTCrrcCAGTCACT 

TGG T ACATG AAGCAG CTACAGAG TGAT CTGCAGCAAAAG AGACAGAGTGG C CAGACTGACC CCCTGCAG AAGGAGGAGGTACAGGC 
CGGAGTGGATGCTGCCAACAGTGCTGCCCAGCAGTACCAACGACGGTTGGCAG 
GCATraCTGAGAAGACCGTGTTGGAGCTAATGAATCCTGAAGCCCAGCTGCC^ 
35 AAGGAGTTGGCCACCCTGCAGCAGCAGAGCCCTGAGCATAGCCTC^ 

CGTGGCCCTC!ATOUVCAGGGCGCTGGAGTCIAGGAGACATGACCACTGTGTGGA 
ACATCGAGGAAGAAAACTGTCAAAGGTATCTCGATGAGCTGATGAAGCT 

TAATGAAGCCCTGGATGAAGGGGACGCTCAGAAGACTCT 

4 0 AAGTGGCACAG CACTATCAAG ACACG CTGATCAGAG CAAAGAGAG AAAAGGC CCAGGAAACACAGGATGAG TCAG CTGTGTTATG G 

TTGGATGAAATTCAAGGTGGAATCTGGCAGTCCAACAAAGACACCCAAGA^ 
TGAAGCAGTAGACZAGCGGTGATGTTGGCAGAACCCTGAGTGCCCTACGTTCT 

GTGGGGAAACGTACCAGAGTGACCTTGCTGAAGCCAAGAAGAAGAGACTGGCAGCAGGAGATAATAACAGC^ 
TGGGTGAAAGGCGGGTACCATTACTACCACAACCTGGAGACGCAAGCAGGAGGA^ 
45 TGTGOVGCTTTCTCGAGAGGAGATCCAGAGCrrCCATCTCTGGAGTAACCGCTGCATATAA 
AAGGCTTGATCACCAAGCTGCAAGCCTGCTGCCGTGGGTACCTCGTTCGACAGGAATT 
CAGATCCCTGCC!ATC!ACCTGCATTC!AGTCACAGTGGAGAGG 

CT CC CATAAAGACGAAG TTGTGAAG ATT CAGTCCCTTGCCAGG ATGCATCAAG CTCGAAAGCG CTATAGAG ATCGC CTACAGTATT 
TCCGAGACCATATAAATGACATTATCAAAATC(IAGGCTTTCATTC^ 
50 GCTGAGGACCCGCCTATGATTGTGGTCCGAAAGTTTGTCCACCTCCTGGACC^ 
CATGAAGATGCGCGAGGAGGTCATCACCCTCIATCCGTTCCAACCAGCAGCT 
GACTGCTGGTGAAGAACAAGATCACGCTGCAGGATGTGGTTTCCCATAGTAAAAAACT 

GACATGATGATGATAAACAAGCAGAAG GG CG GGCT CAAGG C TTTGAGCAAAG AG AAGAGGGAGAAG CTGG AGGC CTATCAG CATCT 
CTTTTATCTCCTGCAGACCAACCCTACCTATCTGGCCAAGCTGATCTT^ 
55 CTGTGATCTTCACGCTGTACAACrATGCATCTAACCAGCXaGGAGGAGTACCT 

GAGATCAAGTCAAAGGTGGATCAGATTGAAGAAATCGTGACAGGAAACCCTACGG^ 

TGCCCGGGGCCAGAATGCCCTCCGGCAGATCTTGGCCCCTGTCGTGAAGGAAATTATGGATGACAAGTCTCTC^ 
ACCCTGTGGATATTTACAAGTCTTGGGTTAATCAGATGGAGTCGCAGACAGG 

G AACAAG CCT TGTCT CATGAAG AAGTG AAGACG AG GTT AG ACAACT C CATCAGGAACATG AGGG CTGTG ACAG ACAAG T TC CTCT C 

6 0 AGCCATCGTCAGCTCTGTGGACAAAATCCCTTATGGGATGCGATTCATTGCCA^ 

CTGACGCTGGTGAGGACGAGCTGCTGAAGATTATCGGTAACCrGCTTTACTACCGATACATGAACCC^ 
GCCTTCGACATCATTGACCTGTCAGCAGGGGGCCAGCTCACCACAGACCAGCGCAGAAACCTGGGCTCCATT 
GCACGCGGCGTCCAACAAGATGTTTCTGGGCGATAATGCCCACTTAAGCATCA^ 
TCAGACGGTTTTTCCAATTGGCTTGTX3ACGTCCCAGAGCTGCAGGATAAATTTAACGTGGATGA 
65 ACTAAGCCAGTTATCTACATCTCC^TTGGCGAAATCAT 

GCATAACGACCCCATCCACGAACTTCTGGACGACC!TTGGGGAGGTGCCCACCATTGAGTCCCTTATAGGAGAAAGCT 
CAAACGACCCCAACAAGGAGGCTCTGGCTAAGACGGAAGTGTCTCTCACGTTGACCAACAAGTTTGA 
GCAGAGATGGACGCTCGGACCATCTTACTGAATACAAAACGTTTAATTGTGGATGTCM 
TGAAATTCTAGAAACCCCAGCCACCAATGAACAGGAAGCTGAACATC^ 

7 0 CCC CTG ACAAGATG AAAAAAT CAAAGCCCATG AAGG AG G ATAACAACCTCAGCCT C CAG GAGAAG AAAG AGAAGATC CAGACTGG C 

CTAAAGAAG CTAACG GAGCTT GGGACGGTGGAC CCAAAG AACAG ATACCAGG AAC TCATCAACG ACATT CAAGGAT AT CCGG AA 
TCAGCGGAGATACAGGCAGAGGAGGAAAGCTGAATTGGTAAAACTGCAGCAGACGTACTCGGCGCTGAACTCTAA^ 
ACGGCGAG CAGGTGGACTACTACAAGAGCTACAT CAAAACCTG CTTGGATAACTTGGC CAGCAAGGG CAAGG T CTC CAAAAAGCCT 
AGGGAAATGAAAGGCAAGAAAAGCAAAAAGATTTCTCTGAAGTACACAGCAGCGAGGCTGCAT 
7 5 TG AAGAC CTTCAGGj CAAAC CAATTTAAAAATGTTATCTT CGAAAT TGG TC CAACAG AAG AAGTTGG AG ACTTTG AAGTAAAAG CCA 
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AGTTCATGGGAGTTCAGATGGAGACTTTCATGTTGC^^ 
TTATTTGATAQAGCTAAAGTGAATGTCAACCTCCTGATCTTCCT 

HUMAN SEQUENCE - GENOMIC 
5 CCAAGGGAAGGATGCGGAATCTCTGTTTTCTACTAAGGG 

GGCCTCCGCAAAC^CCCTGCAGGCTCCGGCCXSCCGGGGATGAGGTG 

GAGCTGCTCTGCGCTGGGGAGGGGCGAAGTGCCAGAGTOCGGGAGACCT 

AGGCCAGGAGATTCACATCrrGCAGATGCCACA^ 

GGTAGCTGGGGAATTGGGTTTAGCACTTTCCAACGCTTGGAACTGGAC^ 
1 0 TACGTTTCCCAGAAGGCTAGGCGATAATTGACAGCTCrATCAGCCT^ 
AATAGCAAGAGGATGGGGGCGGTACITTCCGGCCXSCrGGCrGTCGA 

CGGCTGGGCCCTGGGGCGGTGAGTGATTCAAGGAGGGAGACCGCGGC^^ 
CCGCCACCCAGACX3TTTTCTCAGCGGGGTTCCCGCGCTGGCCTTTG 
1 5 TGGTCTCTATACCGTCGGCGGGATAAGTCATTCTCTTAATC^ 

GTGACCCTCTGACCGCCG CACCCCGGTTAGGGCGCIGAGCACCGAGAAGAAGA 
TATTAATAAACCCGGTGAACGCACTGGAGCTTCTCACAGTGGCACTTAG 

CTGTGCATGGGGAAAGGGACGGGCTGGGAAACAGGGTTTCTATAACCGTGGCTAAGACGTCTCCCCT 
AGGAAGTGGGCCCTGCAGCTGTGTTTTGTAATGJUVTGTA 

2 0 TCGCTCTTGTTGCCCAGGCTGGAATG<?AATC 

CTCAG CCTCCCAAGT AGTTG GGATTACAGG CGTGTG CCACCACACC CGG CTAATTTTGTATTCTTAGT AG AGACAGGGTTT CAC CA 

ACAGCACCTGGCCAATACrGAARACTCTTATTACAATTTTTA 
GAGATGGTTTTTAGATCCTTAAAAACTCAATACTTTATAAATGACCA^ 

AGAGTAA^CTTCTTCACCACAGTCTGGTCCTTCCT 
TTTTTTTTTTTAAGGGAGAGAC!AGGGGCTCACTCrGTTGCTAG 
CTCCTGGGCTGAAGTGATCCTCCCACTTCAGCCTCCTGAGTAGCrrGGGACC^CAA 
AAGTTCTTTGTAGAGAAAGAGGTCTCAATATGTTGCTGAGGCTGGTTTATTGGAAAOT 

3 0 TCTATGTGACTCCTATGCCTGGCAACATATTAATTGTTGAATAAATGACTGATCCATCCAGTAA C CTCCAA 

AAAGTAATGTTTTTCATGATATTG^ 

ATTAAGATCTTGAAATAGAATGGTATCCAACACAGTTCCCTTTGGGGTTCTGTATAATCTGTCCCTTGAGG 
CAAG CCAAG AAGTTAGC TG C T CCCATAACTAATATGAGTAT AGTTAAGG C CACAATTTCCTTGCITG CTG AG^ 

3 5 GGAGAATCAAAAGCTTTCCAGAAATCAAATTACACCT^ 

CAAAAAGTCAATGTGTCTAAAAGACTGGGTTCTGTCGATGTGTTTGGGAGGCCTAGGCCTCTOT 

GAGAAGACTGTATGTAAAAACAGACrrAGAAAAGCACCCCTCTGTGGTTGACAGAA 

GGGGTTGTTTGTGTACACTGGTGGCCATTTTTTGGC 

CTCTTGCTTTAGGAACTGATGACGCTTGATAATGTGGCTGGGGACTTCACK^ 

4 0 CT CTC CAGG AATG CCAC CCAGCACAGTTAGAAGAATGGGCTCTCAAAGGGTAAG AATGCTACT CT CCTTTATTT CATTT ATTTAT C 

TTGTTTTTTTTGAGACGGGGTCTCGCTCTGTCATCCAGGCTGGAGTGCIAGTGGCGCGATCTCGATTCA 
GGGCTCAAGCGATTCTCC(2ACCTCAACCTCCTGAGTAGCTGGGATC^ 
TTAGTAGAGACAGGATTTTGCTTATGTTGGCAAGGCTGGTCTCACTA^ 
CCAGTGGCCTCCGCCTTCCAAAGTGCTAGGATTATGGGCGCCkCC^^ 
45 TTGGAGAGGGCCCAGGTCTTGAGGAGTTTCTG^TATGTTGGCATCTTAGAGCTCC!AGACCTCTGTAAGAT 
GGAGAAAAACAATTACTTCCTTATGCIATTTTATGGTTTGCAAATGTCTTTGCT 
TTTGACTTATGTATTACTATCCCCCrTTTCCAGCCT 
TGTGGAAAAGGAGGTAATTTTTTCTGTCACTTAAAAACAG 

TGAGGTGGGCGGATCATAAGGTCAGGAGTTCGAGACCAGCCTGGC C^ CTACT AAAAATAAAAAAATT 

5 0 AGCTGGGTGTGATAGCGGGCGCCTGTAATCCCAGCTACTCTGGAGGTTGAGGCAGGAGAATTGCTT 

GCAGTGAGCTGAGACCACGCCATTGCZACTCCAGCCTGGGCAATAGAGCIAAGACTCTGTCT 
AGGCTGGGCGCGGTGGCTCATGCCTTGTAATCCCIAGCACTTTGGGAGGCCAAGGCAG 

TCCTGGCTAACATGGTGAAACCCCGTCTTACTAAAAATACAAAAAATTAGCCAGGCATGGTGGTGGGCACCTC 
TCGGGAGGCTGAGGCAGGAGAACAGCGTGAACTGGGAGGCGGAGCTTGCAGTGAGCC^ 

5 5 CGACAG AGCAAGACT CCATCTCAAAAAAAAAAAAAT AAATAAAAAAAAC CAAAAACAC CTC GAATGCT CTTTCTCAAATATCATGG 

CTACCCTTC CTC CTG CTTTAT ATTTTTAGTAG AGATG GGTTAT CT C CATGTGGT CAGG CTG G TCTCAAACTACTG ACC TCAAGTGA 

TCCGCTCACTTCX3GCCTCCCAAGTGCTGGGATTA 

CCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAAGTGAT^ 

GCCACITGCGCCTGGCCTACTTCATCTATTATAG^G^ 

6 0 TCCCTAATCCCACCAATGAAGACTTACTATCCTACCAGCATTTT^ 

TATTGGGATTATGCTGTATATTCAGCTGTACATTCTTTCT^ 

ACCAC CTAGTATTTTGTT TTATGG AATACAAT ATTATTTTGTTG ATGAAGT CAAACATTAG ATTG AACC^ TTTT 
TTAAC CT ATAAAAAT AG CAATTGC CTGAGCTACTTGGG AGG CTG AGGCAGGAGGATC CCT T CAG CACAG G AGTTCGAGGCTGCAGT 
GAGCTATATAATTATTCCITTGTTGTGGGACATTTAG<3^ 

6 5 TTTT AAAAAGTATTTTTTATTGAAG TAT AATTTTAT CATTTTCTTAG AATT CT CAGAATTAGAATTGGTGGGT CAAAGAACAGATA 

CATTT CTAAGGTATTTAAT ACTTTG CAAAATG CCTT C CAAGAAG AT TGAAT CAATTG AT TT AT T C CCACAAACATTATATGAAAGT 
GCGCATTTCTTTATATTCTACCTTTCACAGTGGGTATTATAA^ 

C CAGCATTT TGGG AGG CTG AGG CAGGAGGACCACTTGAG C T CTG G AGTTCCAGAC CAG C CTGG G CAACATAATG AG ACCCTGTCTC 
TACAAAAAGTTACATTAACCAGGTGTAGGGGCTTGTGCCTCT 

7 0 G&GGAGCTCGAGGATGCAGTTAACTGTGATTGTACCAGTGCT 

AAACACACA<ZACAAAACACAAAAAAAGAGTCTGGGAAAAATAGCAAT 
TCTCAATGTGTTGCCCAGGCTGGAATGCAGTGGTGCAATCTCAGCTCACTGCAA 
CCTCAGCCTCCCAAGTAGCTGGGATTACAGGTGTGGACCACCACACX3TGGCTAATT 
CATGTTGGACAGGCTGCTCTTAAACTCCTGGCCTCACAAGTGATCCA^ 
7 5 AGCTACTGGACCTGGCCAATGGATCTITATTCTTATCCCAATTTTTTTTTTTTTTTTT 
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TGGAGTGCAGTGGTGTGACCTCGATTC 

GGGACTACAGGCCTGTGCCACCATGCCCGGCTAATTTTTAGTAGAGA^ 
TGACCTCGTAATCCGCCTGCCTCGGCCTCCCAAAGTTCrGGGATTATM 
ATTGATGAAGACATTGAGAOICAGAGTTTAAGTAACTTTCTTAAGGC 
5 GGCAGTGTGACT CTTGAGTTTGCACTCATTACCTCTACACTATATTGC CTCAGTGT AT CATTGTT AT AATGGAATTTCTTTGATTA 

TAGTGAGGTTAAACATTrTTCGTATATTTACAGGCT 
CTGGGAAAATTTTAAAGCCCAACATTATAGTTAACTTCCTTC^^ 
ATCCTCTCCATATTTAGCTTTAGCCTTGTACXrTTCAGAA^ 
TTTTTCTTTATGGGaGQAAAAGGGCTTTTTTTTCTTAAC^ 
1 0 TTTTAATGACACCAAATACACGTTAGAGCTCCCCAGGT^ 

TTGTGTGATTG<5CTCAGACTCACATCCTATGTTTACATTCCT 
AAGAAGAGCCATGGGTCCTACCACTCCAAAACTTTGAGGCGAGGAAGAT^ 

C CCAGTGG AAGGAAATCTAGCATTTC^GCCITGTTT AGATG A^ C CTTAG AATTTTGTTCTTACT AG CAGAC 

ATTTTT TT TT TTTTTTTTTTTTT GAGAOGGAGTCTCGCTCTGTCGCCCATO 
1 5 ACTGCAAGCTCCGCTTCCCGGGTTCACGCCATTCTCCTGCCT 

CGGCTAATTTTTTGTATTTTTAGTAGAGACG^SGGTT^ 

GGCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCC 

GTAGTTGACrrAAATAAAGTCTTCATTTTGATTAAGCATTTCACT 

GTCTCCCAGCCTOraAGTGCAGTGAC^ 
2 0 CCCAAGTAG CTGGG ACT ATAGGCAC CCG CCAC CACG C C CAGCTAATTTTCATATTTTT AGT AG GG ATGGAGTTTCAC CACGTTGG C 

CAGGTTGGTCTCGAACTCCCAACCTCAAGTGATCTGCCTG 

CTGGCCTAAACCTATAATATCTTCTAAAGAAACCGCACAATATTGACAATATGTGG 

GG CCATCTT AAAATTT AAAAAGi r I'AGGAATT CACGTG it u i<3 ACAGCTCAATG AACCT CAATAAACTITTT CACAAAGTG AAAACA 
GAGCGGTTGAATTAAGTGCAGTCTGCTGCAGAAACAGG<3ATTTTAAT 

2 5 T AACTTT ATT CTTCTGTT ACTT ACATT AATTTTT CAGACTGTGAGCAT CAGGTG GCAAAG CT CAAT CAGG ACAATT CTGAAACAGCA 

GAACAATGTGGAACATC CTCAGAAAGGAC CAAT AAAGATCTTT CTCAT ACT CTT AGTTGGGG AGGAAACTGG G AG CAAGG CCTAGA 
ATTAGAAGGGCAACATGGAACCCTTCCAGGAGAGGGCCAGCTGGAGT 

GAT ATGT AGG AG AGAAG CCTATGTG TGCAGAATG CG GGAAAAG CT TTAACCAG AGTT CCT ATCTCAT AAGACACCT AAG AACCCAC 
ACTGGCGAGAGC^CCTATACGTGCATTGAGTGTGGGAAAGGCT^ 

3 0 AGGAGAGAAG C CCTACCAATG CAAGGGGTGTGAG AAG AAATTCAGCGACAG CTCAACACT CAT CAAACATCAG AG AACCCACAC AG 

GGGAGAGACCCTATGAGTGCCCAGAGTGTGGAAAGACTTTTGGGCG^ 

G AGAAGCC CT ACG CGTGCCTGGAATG TCACAAAAG CTTCAGTCGAAGCT CAAATTTCAT CACT CAC CAG AGGACCCACACAGGGGT 

GAAGCCTTACAGGTGTAATGACTGTGGGGAGAGTTTTAGCC^GAGCTCGGATTTGATTAAGCACCAA 

GGCCCTTCAAATGCCCGGAGTGCGGGAAGGGCTTCAGAGATAGTTCTCATTTTGTAGCT 

3 5 CCTTTCAGTTGTCCTGACTGCCACAAAAGCTTCAGTCAGAG 

TTTTAAGTGCGAAAACH'GTGGGAAAGGATTCGCCGACAGCT 

ACAAATGTGGAGAGTGTGGGAAGAGCTTCAATCAGAGCTCCCACTTTATTACCCATCAGCGAATCCAOT 
CGATGTCCTGAGTGTGGCAAGACCTTCAATCAGCGTTCCCATTTCCTCACACACC^ 

CTGTAGTAAATGT AACAAGAG CTTCCGT CAG AAAG CG CATCTTTT ATG C CATCAAAACAC C CATTTG ATTTAGG AAGT AG TCTTTG 

4 0 GTGTTCAGCTGCTCCCTTGCACATTTTCAT^ 

TGC<3CCCTGATCTATTCTCCCTCTTTCTTGTCTATGTTATAACAGAGAGGATAAACTTAAAG^ 
AAAGGCATTCCTTCAGTGTGTGACTGACTCTTAGGGAAATGTGAGTTTAATAGTTGATG 
TAATCCC^GCACTTTTGGGAGGCC^GGTGGGTGGATC^CTTGAGG^ 
ATCTCTACTAAAAATGCAAAAATTAGCTGGGCATGGTGGCACGTGCCT^ 

4 5 ATTTGAACTCAGAAGGTGCAGGTTGCAGTGAGTTGAGATCATGCCACT 

AAAAAAATT AAAAAGT TG ATGCCT AGTT ACT AAATG AGAAGTGAGAAAAATGTGGC CTAG AG ATCACATGT T CACCACCT AGTACA 
GTGCCTGGCACAACATAGATGCTCAATAACTAAATGGTCCCATCATTATTAAT^ 

ATG CAG TACCTGGC C CTTAAAAG ACACT CAG T ACAAGATTGG TGG CTTTTATCAGT CTITATTACTCAATT AGAAATTT ATT AGTGT 
AGTCCCCCCGCCCCAAACTCAGAACGTGAAAGAAATTAGATTGGAACTAGAAGGTT^ 

5 0 GGAAATGCACTGCCAGTTCAGTGCTGTGCAGGCATTAGTCACCAGAGGTCT 

GTTTGCGGTCTTGTGACTTGGAGGCTGAAAGAATTTCAGAAGCTCTTTTAAATGGCAG 

G CTGTCATCIX1ACACAGAG AAAATATCC TACAATGAACAAGC CAG AGGG A CTG GTAGAGG ACTAT AAAATTGTGG AAGCAAAATT 

GCTCAGAATGTCAAATGATATTACAGGGATCCTCCCTGGCATTTAGCTGAAG 

TGGCCATTTAGTTTT AGGTT AAT ATAATT CTCTG AT CCTTTT AGGG CC AT C(^ 

5 5 AAATAAATTGAGGGACGGCAAGTGTGTTGGAAAGAACACCGACTTCATTGAGAAGGT^^ 

AATTTGGCCAGACTTTCATCTTCTCCCAGCCTCAAGTTTTACCTACCT 
ACACACACACACACACACACACACACACrrTTGT^ 

X T T T C CAAGAAAATTTTC3G CT AATATTXCZTTT AGX3XATTCCTTT TT CTC T CAT AGTG AG CM AXXAAAAAJU^AAZVAAAACTGTTGAA 
AAATTAGGGCGTAAAAATGCTAAATGACATGACTCATCATGGGCCACG^ 

6 0 GATTTCCAGCCTGCAGTCCTCCTCTGCAACAACAGACCAGCTCT 

AC CCAT ATT ATATC CATTAAAC CAGTCTG AATTTGG AAATGATGG AGGGTG T AGTCT AAGTTGT AGG GAG CTTTGCAAG AACCTGT 
GCTGGGGTCCTTGATCCTGGTGGAATC^GGGTGCAGGAGTGAGCGCAA^ 
TTTTAGAGTGCAAAGTGGTGGATTAAAGGTTTCTTTTCTTTCATTCTCT 
TGAAGGATAGAACTCAGCTGATAACCTTACCTCAG 

6 5 CAGTGCAGTGGTTCTCAAACTGTGAGCCCAGTCCAGTAGCATCAG 

TGCAGTtKSCTCAC^CCTGTAATCCCAGCACTCTAGGAGGCCGAGGCGGGTGGACCACCTGAGCT 
CCAACATGGCAAAACCCCACCTATACTAAAAATACAAAAATTATCCCAGGTGTGGTtSGCATGTGCCTC 
AGGCTGAGGCAGGAGAATC^CTTGAACCCAGGAAAGGAGGCTGCAGTGAGCTGAGATCACACA 
TTGGCTCACTGCAACCTCTGCCTCTCAGGTTCAAGCGATTCTCCT 
70 CCATGCCCAGCTAATTTTATATGTTAAGTAGAGACGGGGTTTTCACCATGTTGG^ 
GATCCACCTGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGCGAGAGCCACCT 

TCCTGCCCACCCCCCCCCACTCCCCCGCCAGCCATGAGCAGGGAATATTTCAAT^ 
TAGTTTTCTTTTGTTTCTACCTCATGACTAGATGATTCACTG 

7 5 AGGTCATCA.CCAAGCCCCATTAATTTTACCTCCCAGATTCCT 
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CAGTACCAAATGACCAGTCTCTTACCT 

TGGTTTCACTCACAGAAACCAGAGTGAACTTTTAAAATCCTAA^ 
ATGAAGACCCAAATCCTTACCCAGGCCTAAGAGGCCCTTGTGGTTT^ 
TCTCCCTTGCCTCACCTCAGCACrrCTTGAGTTCTCT 
5 TGTTCCCTCTGCCCAGAATGTTCCTCGCCCA^ 

TAGGAAGACTTTCCCT AACTATCCCATCTAAATTAGTCACCCT CCATC^ CAAAGTCCTTACACCTGT 

CTGGCAATTTCTTATTAATTGATTTGTTTTTGGTAAACT 

ACTTAGAACAGTGCCTGG<IACATGGTAGGTGCT 

CATTGTG CTGGAAAAGCAGCAGAGTGAG ACAAAACIX^ ACAGGTGAG TGAAGGT CACACCAAGAAGGTTCTTAGCTGCCAAACTAA 
1 0 AGGGTTTGGATCTTACTCTCTGGAGAAAAGGGTGTAGCTAGTGGTTTTA 
AATTTGGCCGCTGTGTGTAGGTCATATTACGGAGAAAAAAACTAA^ 
GCCCAGCACGCAGTAGGTCTCCAGCGAGTCGTTATC^ 

TCTTTTACCTCGAAATAATTC CT CCTAGAC CCTATCATCGCATT CCTTCTCTT C CCCT^ 

ATTCTCAGGGTT ATCATGCCTAGGCCITTTTAAT AGGGCCACT T AC C CTAG CATGCTT CT ACAACTGGAC CCTCAAAGG TCTTTTA 
15 GGGCTGAGTGTGGTGGCTCACAC CTGTAATGCCAGCAT TTTGGGAGG CCGAAGC CAG AGGACTG CTTGAG CCCAGAGGTTCAAGAC 

CAGCCTTGGGCACCATAGCCAGACCCTGTC^ 

CAAGTACITGGGAGGCTGAGGTGGGAGGATTGCGGATTGCCTGAGCCCAGG^ 
GTACTCCAGATTGCGTCACAGAGCTAGAACTTGTCT 
TCTTCTTCTGATTCAACAACTCTTTACTGTTTGATGCC^ 
2 0 TGAGGCTGGAGAGCAGTGGTGOSATCATGGCTCACTGC^C 

GTAACAG CT AGGACTACAGGCACATG CCAC CAAAC C CGG C CTTTT AAAATTTTTGGTAGAGG CCIAGATGTGGTGG CTCATG 
AATC C CAG CACTTTGGGAGGTCCAGG CGAGTGGATCAC CTGAGGTGAGGAGT TTG AG ACCAG CATGG CT AATATGG TG AAACCCCG 
TCTCTAGTAAAAGTACAAAAATTAGCCAGGCGTGGTCGTGGGCGCCT^ 

ACTTG AACCTGGGAGGCGGAGCGTTG CAGTGAGCTG AG ATCGTGTCATTGCACT CCAGCCCAGCTGACAAGAG CG AAACTCCATCG 

2 5 CGGGGGATGATAATAATAATAATAATAATAACTTTGCTAGAGACAGGGTTC^ 

CTCAAGCT^CTCTCCCACCrTGGCCrCCCAAAGCXjCT 
AAATTATCTTTTCCATTGGTGACTATGAGGTTGAG 
TCCTCrrcGATTTTTTTGTTTTGTTTTGTTTTGT^ 
TGGCAGGATCTTGGCTCACCAGCTCACn*GCAACCTCC^ 

3 0 GACTACAGGCCCAGCTAATTTTTGTACTTTTGGTAGA 

CCCACCrrCGGCCrrCCCAAAGTGCrGGGATTCTAGGCATGAGCCACCGCGCCTGG 
CCTGACCCTXn*CAACACTTTGGTTGAGGTCl u rCTTTOTT 
TCTCAGAGCATGGGTCTGGCTCAACCTCTCTCCTGAATTTCAGACCTACAAG 
CTGCTGCTTCCC!AAAAGCAGACTCTCC!AAATTA£!AGTCAGTATCTCCC 

3 5 AATTCCCCATTCT CCTACATTGC CTTGC CAG AAGCCTG CTGGT CAG CTTGGATTTCTTTTTGT CCTT l"X rr TT CT ATATT TTG CTG 

GTGCCTAGTCATCTAGTTGCTGCCTCTACACTTTCTCTTCTTTA 
CAGAAAATACAGAAGTGCTCAAAGTGAAAAAATGGAAATGCCTTGTCCCTTCCT 

AG CTATGTAATAGTCTGTCATGCCAAGTCTTATTTCCAGCTCCTCTTTTCCATCC CATCTG AACTAAACG GATTGT 
TTTCCATCTGGTCTCCITGGCTTTTCCTTTCAGTGCAGCT 

4 0 AGATCCTAGAGGTTCAGAGACAAGTAAGATAGTTAAAGAGATCCACATTCCAGAGCTGT^ 

TCTGACCCnTACTTCCTTATCTGTAAAATGATGCTAATCC 
TGTAAAACTCrTTGTACTAACTACCrGGTACACTGTATCTGCTCCATA 

GAGG AGT TT AT AG CTT AATGGAGAGACTTAAAG CAT AAG AATT AT CT AGGCG AAGAAT GATGAGAAAATATTTTTGGAAAAGGAAA 
ACAAACAGTTCTACTAAAATTAAAAGGCTGATGTAGAGACTTGGGAAACTG 

4 5 TAATTCCCGAAGTGTGAGCAAAAGTCCACCTGCATCAGTCTTACTTGGGGTGATTGCT 

CG AGCAAGTTGGTAATTTACATATCGGAATGCT AT CCATCAAGGAAAATGGG CAGACTACAGTTACATG CATCAACACAG ACAAG C 
TTO^CAATATTGAGTGTAAAAAGCAAGACATAGAAATATATATTTAGTAAGAGTA 

AACT AAACAATAT ATTGCTT AAG CAAT AAGG AT ACACAAACT AATG AAAAT CAAAGG AT TTAC T AAT ACAAACTTCAGTAT AGT AA 
TTAATTGGAATGGGAGAGAAAGATGCAAAGTTTCTATTTCTTTTTT^ 
50 AGGAGTGCIAGTGGCAGGATCTCIAGATCACTGCAAGCTCAGCCTCCTGGGTTCAAGTGGTTCT^ 

GGGATTACAAGCATGCACCACCACACCCAGCTAATTTTTGTAATTTTAGTAGAGATGGAGTTTC^ 

CGAACTCCrTGGTCTTAAGTAATCCGCCC^CCTCTGCCATCAAAGTTT CTGCCGAGTACACAGGTTTT CTT 

TGTAAAATAATTATTTAAATTGTTAATATGCATTACTTATATGCTTTTCATTTACA^ 

AAATAAGAAAACAGAACAC^CTCCAGAGATTGATCTTTTTCACTCAAATCT 

5 5 TGAGTGCCTGGGCCCTGAACAGACAGACTAGATCAGTTCCT^ 

TTCTTATATTTCCrT^GC^AACCCCCCGCCCCACCAACCMC 

ATAAATGTCCAGAAGATTATAGGCCATCTTTCTATAAACAGCCAATCTTGGACCTGGT 

AAAGATGCTCATGTGAACATTTTGTTGTTTC CCAGAAAAAAAG TT TC CACATTTT AG ATT ATTTT CACAGGGTGAGAACAATTTTA 
CACCTCCCTTACTTGCTGCAGAATCTTTTTTTTTTTTT^ 

6 0 CAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCA^ CCTCAG CCTCTCGAGTAGCTGGGATTA 

CAGGTGCGCACCACCATGCCC^GCTAAT TT TTTGTATTT^ 
CCTGACCACAAGTGATCCGCCCACTTCGGCCTCCCAAAGGGCT^ 
ATTTTTAAAGTGATTGTATCAATTrAC^GTGTTCCGAACAGTAAATGGCTA 
TGCTAATTTAATAGACATGATTCTTACATATTGTCTTAATTTGCATTTCT 

6 5 T AGAAT<rTTrCTTGTATTTCTTCK3TT GTTTTGATTAG AGTAACT C AGAGCAGGATTGGGAATCTG AAAACATGAACATTTGATTT 

TGAGGAAGTCATTGTTTTCCTGTTGTCTTCCITCCCCCTCTCT 
AAAATACAATTAAGCAAAC^xAGTAAATTCCACTTCCC^ 
GCCAAAAATATTCACAAATATATCGCTTATTCTTTCrrc 
AATTAACTGGTGGGAGCACATGAAAGTATTTTATAAAGTGC^ 

7 0 ATTCTGCCAAGCAGAATGATGACACTTTGTTTCCTGAACACAGCCTGCTCTCTCC^ 

TGCTTCAATTCCTCCTGCTXSAACTACTATCTACTCT^ 
ACCACGTACCCACTATACAGCTTAGGAAAAGAGTCTGGCATT^ 

CTCT ATAT ATACATACAT AT ATATTT AGTAAG AG CAAAAAT ACAGTAAAGAT CIAGGG AAGGTG CGGTGGCTCACG CCTGT AAT CCC 
AGCACTTTGGGAGGCCGAGGCGGGCAGATCACGAGGTGAGGAGTT^ 
7 5 AAAAATACAAAAATTAGCCAGGCGTGGTGGCTCATGCGTGTAGTCCCAGCT 
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TGGGAGGTGGAGGTTGCCGTGAGCCAAGATTGTGCCACTCACT^ 

AAAATACAGTAAAGGTAAAAAACAGG CAAAACTAAATATATTG CTTAAGCAATAATGATACTCATACAAACTAATGAAAATCAAAG 
GATTTACTAATACAAATTTATAGTAGTTAATTGGAAGCATATOTATT 

ATATATATAGAAAAAAGOJVATATATGTATATATAC^TATATATCTATATATGTAAAATTTT^ 
5 GAAATGATATATCATTCTGGTGTTAATATGCATCTTA 

GGCAGGAGGATTTCTTGAGCTCAGGAGTTCAAGACCAGCCTC 

ATCTGGCGTGGTGGCTGATGTCnn' AGTTCCAG CTACT CAGGAGG CTGAGGTGGGAGG ATCAC CTGAG CACTGGAGGTCAAGGCTG 
CAGTGAGCTATGATCATGCCACTGC^CTCCAGCCT 
AAGTATCTCCCTGATTAGTGGGGAAGTTACCCAACTTTTTTTTT^ 
1 0 GCAGTGGCACGATCTCGTCTCACCGCAACCT 

ACAGGCATGTGCCACCATACCCGGCTAATTTTTTC 

CCGACCTCAGGTGATCCTCCCACCTGGGCCTCCCAARGTACTGGGATTACAGGCGTGTGCC^ 

TT ACAAG TTTATTGTTGTTCATGTTG C CTCTT CAGGGAATTTTGTTT ATAAAGTAC CTATTCT T CAAGG CC CCATTAAAACACC T C 
CCTCTCCATAAAGTATTTCCAGATCTCCTCT^ 
1 5 GGTTTCACTCACTTCTTATTTCATCTTGCATTACAGTCACT 

TGCC CTTGTT CAT ACTAGTACAAAAGT AAACATTT CATAAATCTTTGTTG AATCAAATCCTACTCCTCC CCTGCTTATAAAAT CTT 
AAAT CATTTCCTATTGTCTACAGACGAAAGTAGAAT CT CGTTTAATGTAG AATTT AAGACC CTTCCACGGGCTCTAG CATCAACCT 
TCCTTTCTTGCTCTTCTCCTGGGAGGACCTTC 
TGTTTCGCTTTTCTCTCTGTAGCTGGGATGGCAGGTTGGTCCC^ 

TTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTATAGGCAACrGCCACCATGCCCG 
GGTTTCACC^TGTTGGCCAGGCTGGTOTCGAACTCCTGACCTCAGATGATC 

GGGJTCAGCCACCGaSCCCGGTCTGAGCCACCGCGCCCGGTCTGAGCCACCACACCCGGTCTAATT^ 
GGGGTTTCACCATTTTGGCCAGGCTG^TTTTGA^ 

2 5 GGCTTGAGCCACTGCGTCCAGCCCAAACTCTTTAAGA 

TTGCCCTGAATCTGTCACTTATGTACTAAAAATACATAAATTTCCT^ 

ATCTAAATGTTAG CAC CT AAGCCTG AGT CTAGGTGC CAG CAT AGC CTGTCTTCCCT CTCCAGAGTCAGAGG CATGTTACAAACAGA 
AT ATATGG ACACACGT CGAAC CCCACATTG T ATCT CTCT AATTT ATTCAATTT AATT CAGT AAACAC TCATT AC CT CTTATAT CCA 
GGCCTAGTTGAGGGTG CTGGG G ATACAGGT ATAAACATG ACAGTT CTCTTGAGGAG CACAG AGTGG TTG TGGGAAACAGACATACA 

3 0 GAC^CAC^GTTTCTTTCTTTCTTTTTTCTTTTTTTTTTTTTT^ 

AGGTGATCTCAGCTCACCACAACCTCCACCTCCTGGGT^ 
ATGCGCCACCACACCCACCTAATTTTTGTATTTTTAGTAGAGACAGGCTTT^ 
CTCAGGTGATCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACACGTGTGAGCC^ 
TACAAGAAGTATATGTCAGCCAGAGAAGGGCTATCTT^ 

3 5 ACATG AATG AGTAAATG CAAACAAG CCATGATACTGAAAT AAAGCTTTGTTTAAT^ CAG C CACAGAAG 

ACATTCTGATACATGGGGGCATTGCCZAGTGATTCCACGAAAiCTGGA 

T ATTGT AAGGAAGG TGGTACAAC CTG GC CTTT CTT'TTT^AAAAAAAAAAAAAATTCAAACTGGACATG TGT CAATGAACTGACTGG A 
GTCITAGGGAGGAACATTGCrCCATAGCTATATGTCTCTCCCGTTTTTTCCTG^ 
TTTTGAGACAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGTGTGATCT 
40 AGQUVTTCTCCXGCCTCAGCrTCCCAAATAGCT 

GACAGAATTTCACCATGTTGGCTACACrGGTTrGGACCTCCT 

TTACAGGAGTGAGCCACCTTGTCTAGCTCTTTTTTTTTTTTTCTTTTGAGACAAGTTCTTG 

TGGTGCAATCACAGCTCCCTGCAGCCTCGACCTCCC 

GCATGCACCACTACGCCCAC4CTAATTTTTTTTATTT^ 

4 5 GGGCTCAAGCAATCTTCCTGCCATGGCCTCCCAAAGTGCTAGGATTA 

AACTGAATAGTGACTGAAGTCTTGCAGTTTCTACCTTGCCCTGCACTGGCATTATAAGT^ 
TAATOTCATGAGAGGCCTAGTTAGATTTTTCT^ 

GCTTTCTACCACCCAAATATCAGTCGTAAAGAAAATGAGGAGTGATAAAG 
TAGGTTGGGTGGGGACTTGCAAAACTTTTCTGTCTAGCTAGA 

5 0 TTGTAAATGCACCAATCAGCACACTGTAAAAACGCACCAATCAGC^ 

AC TCTGTAAAATGG ACTAATCAG CAGG ATGTGG GTGGGG CCAAATAAGG CAAT AAAAG CTGG CCAGCG AAG CCAG CAG CAGGCAAC 
CCACTCGGGTGCCCTTCCATGCTGTGGAAGCrTTGTTCTTTCGCrCTTC^ 
TACCTTCATGAGCTGTAACACTCAATGCGAGGGTCTGCX3GCTGCATTCCT 
AACAACTCCAGATGAGCCACCTTTAAGAGCTGTAAC^CTCACTGCGAAGGTGTGCGG 

5 5 ACCCACTGGAAGGAAGAAACTCCGGACACATCTGAACATCTGAA.GGAA.CAAACT 

CACCGTGAAGGTCCGTGGTTTCATTCTTG&AGTCAGCGAGACCAAG 

GCCCAGTGTCCACCCATTCCCCATGTAGCAACCCCTGGTACCTGAGGAAAAAGGTCACAGCCTOT 

GGGCATGATCACAC^CGCACACACCrTTGCAATTTTTCTGGCTTTGT^ 
60 TCATAGGTCTTCCCTGOlAAGTCTTTTTTrTTTTTTTTTTTTGAGACGGAGTCT 

TGATCrCGGCTCACTGCAACCTCCGCCTCCTGGGTTCACGCCATTCTCCCGCCTCAGCCTCCCGAGTAGCT 
TGCTACCTGGCGCAGCTAATTTGTGTATTTTTAATAGAGACG^ 
TGATCTGCCCGCCTCAGCCTCCCAAAGTGTTAGGATTACAGGCCTGAGCC^^ 
CTCCTAGATCTCTATCTTGATGTCrrGAAAGCrTCTTCAGAGTGCTTTG 

6 5 CTCACAAATCTTTCAACTCA.GCCCTCTAGGTCCTCATGTGCA 

CAGTTTGAATCCTGAGTATGTCACTAACTAGTTGTGTAATTTTTGGAACATTGTTAACCTT^ 

AATTGAAAAT TCT AGT AGTTCCT ACCTCA.TAGG ATTGTTGTG AAGGTCAAATG AGAT CAAATACTTACAACTGG ACCTAAGG G G CA 
CTCATTATGAGTTAGCTGTTGTTACTAGTAACCCATTGTCATCCCT 

CAGGCTTGTCTCAGGTTCTGACCCTGTGATGGGGTCTGGTCTTCCCTGCCTCGTTTACCACT 

7 0 CTCC^TGAGCTCATGATTCCTATGAGCGATTT^ 

AGTAGAACCTTGTGGCCTATTCTAACCCCAGCCCGGAAACCAATGCCCCAACT 
AACTTCCGGGTCTCTGTTTTTATATATTGTTCCTAGACTA^ 
TTAGGCATATTT AAGAAGTGAATGAACAC CTG CTCATTG 
GAGATTTTAGAATCAGAGAGATGGAAGCTTGATCCTAACTCTGAATCT 
7 5 TTTTCTCATTTGTAAAATGAGGATAATTATATTTCCTCATAGGG 
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GCATTTTCTGGAACATGGTAA^ 

AGAGTTGAAGAAATGGCTGAAGTTGGAATATCTGGCAGGAGCTGTGCT 
AAGATGAAATAGGTTTCTCTCAAATGAGTTGGTTAAGGAGTTAA£ 
GGTACTTGACAGACAGCX^CTTGATGGTATTCTGAAGTTC^ 
5 CCCAAGGTGTTCAAAGTATAGAGAAACAAAGAAGATCAGTAATTACATAGACCAAATGGA 

TG CTCAAAAGACTCATGAACTATATGGAGTTAGTCCTATCAGAGAGAAG CAGGGTATC CAGCTAGCTCAT AGGCCTTTTTATACAA 
AAGTTTAC CATTAGCATGTTTGGTAACATTACTTATAA^ 
ATTAGTTTTCTATTGAAGTACAGGCAAGCTTGACTGGGTTCTTTGCTCA 

T AAGGACrTATCTTGAGACTGTGAGGAG AAT CTGCTTCCAAG CTCATTTAGGCTAT AGGTAGAATCAAGTTCCTTGGAGTTATAGG 
10 TCTGAGGTCCCTGTTTCCTTGCTGACTATCAGTCAGGGGCCACTCTTAG^ 

CCCTCCATATTCAATACCAGCAATGGCCCATTGAACCATCCCATGCTTTGAGTCTC^^ 

CT ACTTAT AAATGG CTCATGTGATG ACCAGGAT AATTT C CCCAT CTTAAG AT CAACTGTTTAGTGCCTTAATTGCAT CTGCAAAAT 
CC CTTCAAAATAGTGC CTAGATTAGTGTTTGATTGAAT AGCCAAG G AT TGGGAATCTTGGGCAGT CAT CT CAGAATTCTGCCTAC A 
CAGCATG GTTACAATTTC CTG AAAAG CTATC CTGTG AG G C CTG ACAGTT AT AGT ACAAGCATTGGAT AAACATTC CCATTTCAGAA 
1 5 TGGAGAAATCAGCCAACAAGAAAGGTGCTACAGGCCCCATGCAAGTTTGAAAGCT^ 
ACATAATCTTTGACTCCATGTCCCATATGCTGGGCAT 

GG CTTTG CAGGATTCATCACCCATGACTG CTCT(^TGGGCTCGAGTTGAGTGCCTCTGGCTTTGCCAGGCACAGGGTGCAAGCTG C 
TGTTGGATCTACCATTCTTGGATCTGGAGGATGGTGGGCCTCTTCTCACAGCTCCACr 

GTGGGGAATGTGTATTAATCTGTTCTCACACTGCTAATAAAGACATACCCGAGACTGGGTAATTTATAAAGGAAAGGGG 
2 0 GACTCACAGTTCCACCGTGGGGTGGGGCTCACAACCATGGCAGAAGGCAA 

AAGAG AGCTTGTGCAGTGCAACTCCCATTTATAAAACCAT CAG ATCTCAG GG AACTTATT CACTACCACTAG AATAGTATGAGGGA 
AACCACTGCCACGATTTAATTATCTCCACCTGG^ 

GAC!ACAGCCAAACCATATCAGTGTGCCAACCTTACAATTCCCCTTGGCATTGACCTAGTAGAGGTTCTCTG 
TGCAGCAGTCTTCTGCCTGGACACCCAGACTTGTCCATACATCCTCTGAAATCAAGG 

2 5 TCTGTAC!ACCGCAGGC!TTAACACTACATGGAAGCC^ 

GGCCCCITTGAGCTGTGGGTAGAGGTGGAGCIAGCCTGGATG 

CTTGGTCCAGGAAACCATTCTTCCCTCCTAGGCCACTGGGCATATAATGGGAGAGGCTGCCATGAA 
GGTCTTTTTCCCATTGTCTTTGGAATTAGTTCTTGGGTCCT^ 

CTGCTTG AATT CCT C TCCTG AAAAAG Cl"rri"r uttt Lri"rrG CCACAT AG CC^GGCTG GAAATTTTT CAAACTTTTATG CTGTG CTT 

3 0 CCTGTTTAAATATAAATTCCAACTTTAAGTCAT^ 

CAATGCTTTGCTGCTGAGAAATTTCTTCTC 

GGCGGATCACCTGAGGTCAGGAGTTCGAGACCIAGCCTGGTCAAGATGGTGAAATCT 

GGCATGGTGGTGGGTGCCTGTAATCCCTGCTACTCAGGAGGCTGAGGCATGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAGT 
GAGCCAAGATGCACCACTCCACTCCAGCCTGK3GTGACAGA 

3 5 CACTTAGATCATCACCCTTTAGTTAAAAGTTCCAC^^ 

CAAATGTGACCTTTGTTTTAGTTCTCAAGAAGTTTCTCACTTCCATCTGAGACCTTGT 

AT CAG CATTT TGGTTGAAACT ATTTAAC CAGTCT CTAAGAAGTT CCAAACTTTCCCTCA.CCTTCCTATCTTCTTCTGAGCCCTCCA 
AACTCTTCCAACCTATACTTGTTACCCAGTTG 

40 HUMAN SEQUENCE - mRNA 

GGTATTAAAACTGATCITTTGACATTTTTGACAATGTTCTTATAAATTACTTTCTTTT^ 

GAGTAAGATAGAGTGCACAGCAAAGGGGATCTGCCCCTCCTATCTGTCCAATAC^ 

TTCCAGTCAAACCTGCCTCCCACTTCTCACTAAAGTTAGTGAACATGTGACCCACATTCCCCAA^ 

ATTCT TGGCITTTTCATT CAT AGAGATAGCTATTTT ATG AG ACAT AGAT AAAG CATTTTTTAGTG ATGTGCACGATG C CTTTTTTC 

4 5 TTAATTATTAACTTCTCAAAACATAAAC!ACATTGGAGGCACTTAATAAAGGGAGCTGTACGTACCGC 

TCACGGCTTCCTCJ^CAGAGACTCGGGCTCGTCCGCCIATGTCCGCCG 

CATTTGGAAGAAGCGAAGAGGTGGATGGAAGCATGCCTAGGGGAAGATCTGCCT^ 
TGGGGTCTACCTTGCOUVACTGGGGAACTTCTTCTCTCCCAAAGT^^ 

5 0 ACAAGGCGACTGGCCTCCACTTTAGAC2ACACTGATAATC 

TTTTACCC^GAAACTACAGATATCTATGATCGAAAGAACIATG 
CAAGCTAGGCCTGCCCCCTCAGATTCAAGACCTATATGGJ^ 

TGGAGAAGTATGGCATCCAGATGCCTGCCTTTAGCAAGATTGGGGGCATCTTGGCTAATGAACTGT 
CATGCTGCTGTTATTGCTATTAATGAAGCTATTGACCGTAGAATTCCIAGCCGA 

5 5 GCTTGTAAATCTTGAAGAGCCCrrGGCATCCACTTACCAGGATATACT 

ACAGG ACAGAAAACTCAG AGAGAGAAAGAG ATGTT T ATG AGG AGCTG CTCACG CAAGCTG AAATTC^^ C 
AATACATTTTCTGCATTAGCAAATATCGACCTGGCTTTAGAACAAGGAGATGCAOT 

C CTGGGGCTTCG AGGACTG CAGCAACAGAATAG CGACTGGT AC T TG AAGC AGCTC CTG AGTG ATAAACAG CAGAAGAG ACAGAGTG 
GTCAGACTGACCCCCTGCAGAAGGAGGAGCTGCAGTCTGGAGTGGATGCTGCAAAC^ 
60 GCAGCAGTAGCACTGATTAATGCTGCAATCCAGAAGGGTGTTGCTGAGAAG^ 

GCCCCAGGTGTATCCATTTGCCGCCGATCTCTATCAGAAGGAGCTGGCTACCCTGCAGCGAC^ 

TGGAAGCAATTGAGCAGTTCAGTTACTGGTCTTACCAATATTGAGGA^ 

GAAGGCTCAGGCACATGCAGAGAATAATGAATTCATTACATGGAATGATATCCAAGCTTGCGTGGACCATGTGAACCT 

6 5 AAGAGGAACJVTGAGAGGATTTTAGCCATTGGTTTAAT^ 

CAG ATT CCTG CAGC T AAACTTGAGGG AGTCCTTGCAGAAGTGG C CCAG CATTACCAAGACACG CTGATTAGAG CGAAGAGAGAGAA 
AGCCCAGGAAATCCAGGATGAGTC^GCTGTGTTATGGTTGGAT 
AAGCACAGAAGTTTGCCTTAGGAATCTTTGCCATTAATGAGGCAGTAG 
TCCCCrTGATGTTGGCTTCTATGGAGTCATCCCTGAGTGTGGTGA 
70 GG CAGTAGGAGATAATAACAG CAAGTGGGTGAAGCACTGGGTAAAAGGTGGATATT ATTATTACCACAATCTGGAGACCCAGGAAG 

GAGGATGGGATGAACCTCCAAATTTTGTGCAAAATTCTATGCAGCTTTCTCGGGAGGAGATCCAGAGTTCTA 
GCCGCATATAACCGAGAACAGCTGTGGCTGGCCAATGAAGG^ 

ACAG GAATT C CGAT C CAGG ATGAATTTCCTGAAGAAACAAATC C CTG CCATCACCTGCATTCAGTC^ C 
AGAAGAAGGCATATCAAGATCGGTTAGCTTACCTGCGCTCCCACAAAGATGAAGTTGTAAAGA 
75 CAAGCTCGAAAGCGCTATCGAGATCGCCTGCAGTACTTCCGGGACCATATAAATGACATTA 
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AAACAAAGCTCGGGATGACTACAAGACTCTCATCAATGCTGAGGATCCTCCTATGG 

ACCAAAGTGACCAGGATTTT CAGGAGGAG CTTGACCTTATGAAGATG CGGG AAGAGG TT ATCACCCTCATTCGTT CTAAC CAGCAG 
CTGGAGAATGACCTCAATCTCATGGATATCAAAATTGGACTG CTAGTGAAAAATAAGATTACGTTG CAGGATGTGGTTTCCCACAG 
TAAAAAACTTACCAAAAAAAATAAGGAAGAGTTGTCTGATATGATGATGATAAATA 
5 AGGAGAAGAGAGAGAAGTTGGAAGCTTACCAGCAC 

CAGATGCCCCAGAACAAGTCCACCMGTTCATGGACTCTGTAAT 

CCTGCTC CTG CGG CTCTTTAAGACAGCACTC CAAGAGGAAAT CAAGT CGAAGGT AG ATCAGATTCAAGAG ATTGTGACAGGAAATC 
CTACGGTTATTAAAATGGTTGTAAGTTTCAACCGTGGTGCCCGTGGCCAGAATGCCC 

GAAATT ATGGATGACAAATCTCT CAACAT CAAAACTGACCCTGTGGATATTTACAAATCTTG GGTTAATCAGATGG^ 
1 0 AGGAGAGGCAAGCAAACTGCCCTATGATGTGACCCCTGAGCAGGCGCTAGCTCATGAAGAAGTGA^ 
TCAGGAACATGCGGGCTGTGACAGACAAGTTTCTCT 
GCCAAAGTGCTGAAGGACTCGTTGCATGAGAAGTTCCCT 

TTATCGATACATGAATCCAGCCATTGTTGCTCCTGATGCCTTTGACATCATTGAC 
AACGCCGAAATCTGGGCTCCATTGCAAAAATGCTTCAGC^^ 
1 5 ATCATTAATGAAT ATCTTTCCCAGT CCTACCAGAAATTCAG ACGGTT TTTCCAAACTG CTTGTG ATGTCC CAGAG CTTCAGGATAA 
ATTTAATGTGGATGAGTACTCTGATTTAGTAACCCTCACCAAACCAGTAATCTAC^ 

CTCTCCTIGTTGGATCACCAGGATGCCATTGCTCCGGAGCACAATGATCCAATCCACGAACTGCTGGACGACCT 
ACCATCGAGTCCCTGATAGGGGAAAGCTCTGGCAATTTAAATGACCCAAATAAGGAG 
CCTGACCAACAAGTTCGACGTGCCTGGAGATGAGAATGCAGAAATGGATGCTCGAACCATCTTACT 
2 0 TGG ATGTCATCCGGTTC CAGCCAGG AG AG AC CT TGACTGAAATCCTAGAAACACCAG CCACCAGTGAACAGGAAG CAGAACAT CAG 

AGAGCCATGCAGAGACGTGCTATCCGTGATGCCAAAACACCTGACAAGATGAAAAAGT 
CACTCTTCAAGAGAAGAAAGAGAAGATCCAGACAGGTTTAAAGAAGCTAACAGAGCITGGA^ 
AGGAACTGATCAACGACATTGCCAGGGATATTCGGAATCAGCGGAGGTACCGACAGAGGAGA 

CAGACATACG CTG CTCTGAACT CTAAGGC CACCTTTTATGGGGAG CAGGTG G ATTACTAT AAAAGCT AT ATCAAAAC CTG CTTGG A 

2 5 T AACTTAGCCAGCAAGGG CAAAGTCT C CAAAAAGCCT AGGGAAATGAAAGGAAAGAAAAG CAAAAAGATTTCTCTGAAATAT ACAG 

CAG CAAGACT ACATGAAAAAGG AGTTCTT CTGGAAATTG^5GACCTGCAAGTGAATCAGTTT AAAAATGTTATATTTG AAAT CAGT 
CCAACAGAAGAAGTTGGAGACTTCGAAGTGAAAGCCAAATTCATGGGAGTTC^ 

GCTGCAGCTACAGT ATGAAGGAGTTGCAG TCATG AAATTATTTC CTAAAGTAAATGTCAACCTCCTGATCT TC C TTCTCA 

ACAAAAAGTTCTACGGGAAGTAATTGATCGTTTGCTGCCAGCCCAGAAGGATGAAG 

3 0 TCCTTCTTTCCrCATTGGAAGCAAAGACCTAGCCAAGAA 

TCGCTCTCATGGGACATTTGTTACCCTTTTTTCATAGTGAAATTGTGT^ 

TT C CATTACTT AGG AAAGAGTGGAAACT C CACTAAAATTTCT CTGT GT TGTT ACAGT CT TAG AGGTTGCAGT ACT ATATTGT AAGC 
TTTGGTGTTTGTTTAATTAGCAATAGGGATG GTAG GATT CAAATGT G TGT CATTTAG AAGTGGAAG CT ATT AGCAC CAATG ACAT A 
AATACATACAAGACACAGAAOTAAAATGTCATGTTATTAACAGTTATTAGG 

3 5 TCCCATOIGGAAAACTGAAGGATATGGGGAATCATTGGTTATC^ 

CAGTCATGTTCAAAGGAAGCATTTCTAGAAAAAAGGAGATAATGTC 

AACTCCCCGACTAGTGGATGGGAGAGTCCCATTGCTAAAATTCAGCTACTCAGATAAATTCA^ 

TTTTGTTGGTGCACAGAGATTGACTTGATTCAGAGAGACAATTCACTCCATCCCTATGGC 

AATGTCATTGTTTTTAAAACTGTTTTATATCTTAAGAGTGCCTTATTAAAGTAT 

4 0 TTTT AAAGATTT ATATAATGCATCAAAAG CCTT AGAATAAG AAAAG CTTTTTTTAAATTG CTT TATCTGT ATAT CTGAACT CTTG A 

AACTTATAGCTAAAACACTAGGATTTATCTGCAGTGTTGCAGGGAGATAATTCTG 
GCCAACCTATGTTACACGTGAGATTAAAACCAATTTTTTCCCCATTTTTTCTCCT 

TTTT ATGAGC C CCAGTTTT CTGGGCTTAGTTT AAAAAAAAAAT CAAG TCT AAACATTG CATTT AGAAAG C TT 'ITGTTCTTGG AT AA 
AAAGTCATACACTTTAAAAAAAAAAAAAAAACTTTTTCCAGGAAAATATATTGAAATCATGCTGCTGAGCCT 
4 5 TGATGTTTTGATTCAGTATTCTTTTATCATAAA 

TGAATAACAAACTATGTAGTGTGTCCCTATTATAAATGCATTGGAGAAGTATTTTTATGAGACTCT 

AGCCACAGGGAGGCATGGAGTGCCATGGAAGGATTCGCCACTACCCAGACCTTGTTTTTTG 

AAAGAAACATTTTCCTCAGATTAAAAGATGATGCTATTACAACTAGCATTGCCTCA 

CIGTTT CCTTAAAAGAGGC TATGAAT CCCAAAGGC CAC^ TTT ACAG CTC CTTT AATAAAA 
50 TGTGT CAGT AATTTT AAGGTTT ATAGTT C CCTCAACACAATTG CTAATG CAG AAT AGTGT AAAATG CG CTTCAAGAATGTTG AT G A 

TGATGATAT AGAATTGTGGCTTT AGT AGCACAGAG GATG C CC CAACAAACT CATGGCGTTGAAACC ACACAG TT CTCATTACTGTT 
ATTTATTAGCTGTAGCATTCTCTGTCTCCTCTCTCTCCTCCT 
TACTTCCTCCCAAGAGTTTGGACTGCCCXjTCAGATTGTTTCT 
TGTTC 

55 

HUMAN SEQUENCE - CODING 

ATGTCCGCCXSCAGACGAGGTTGACGGGCTGGGCGTGGCCCGGCCGCACTATGGCrCT 

GGAGATGG ATGAAAGGAGACGT CAG AACGTGGCTT ATGAG TACCTTTGT CATTTGGAAGAAGCGAAGAGGT GG ATGGAAG CATG C C 
TAGGGGAAGATCTGCCTCCCACCACAGAACTGGAGGAGGGGCTTAGGAATGGGGTCTACCTTGCCAAACTGGGGAACTTCT^ 

6 0 CCXAAAGTAGTGTCCCTGAAAAAAATCTATGATCGAGAACAGA 

TGTGATTCAGTGGTTGAATGCCATGGATOAGATTGGATTGCCTAAGATTTTTTACCCA 

ACATGCCAAG&TGTATCTACTGTATCCATGCACTCAGTTTG 

GGAAAGGTTGACTTCACAGAAGAAGAAATQVACA^ 

GATTGGGGGCATCTTGGCTAATGAACTGTCAGTGGATGAAGCCGCATTAC^ 
65 GTAGAATTCCAGCCGACACATTTGCAGCTTTGAAAAATCCGAATGCC^^ 

CAG GAT ATACTTT AC CAG G CT AAG CAGGACAAAATG ACAAATGCTAAAAACAG G AC AG AAAACTCAGAG AG AGAAAGAG ATGTTTA 
TGAGGAG CTG CTCACG CAAG CTG AAATT CAAGG CAAT AT AAACAAAGTCAAT ACATTTTCTG CATTAGCAAAT ATCGACCTGG CT T 
TAGAACAAGGAGATG CACTGGCCTTGTTCAGGGCTCTGCAGTCACCAGCCCTGGGG CTTCGAGGACTGCAGCAACAGAATAGCGAC 
TGGTACTTGAAGCAGCTCCTGAGTGATAAACAGCAGAAGAGACAGAGTGGTCAGACT 

7 0 TGGAGTGGATG C TG CAAACAGTG CTGCC CAG CAATAT CAGAGAAGATTGG CAG CAGTAG CACTG ATT AATG CTG CAATCCAGAAGG 

GTGTTGCTGAGAAGACTGTTTTGGAACTGATGAATCCCGAAGCCCAGCT^ 
AAGGAGCTGGCTACCCTGCAGCGACAAAGTCCTGAACATAATCTCACCCACCCAGAGCT 
GGTGG CCCTGATCAACAGGG CATTGGAATCAGGAGATGTGAATACAGTGTGGAAGCAATTGAGCAGTTCA 
ATATTGAGGAAGAAAACTGTCAGAGGTATCTCGATGAGTTGATGAAACT 
7 5 ACATGGAATGAT AT C CAAGCTTG CGTGGACCATGTGAACCTG GTGGTGCAAG AGGAACATGAGAG 
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TAATGAAGCCCIX3G31TGAAGGTG 

AAGTGGCCCAGCATTACCAAGACACGCTGATT AGAG CGAAGAGAGAGAAAGC C CAGG AAAT CCAGGATG AGTCAG CTGTGTTATGG 
TTGGATGAAATTCAAGGTGGAATCTGGCAGTCCAACAAAGACACCCAAGAAG CACAGAAGTTTGC CTTAGG AATCTTTG C CATTAA 
TGAGGCAGTAGAAAGTGGTGATGTTGG CAAAACACTG AGTG CCCTTCGCTCCCCTGATGTTGG CTTGTATGGAGTCATCCCTGAGT 
5 GTGGTGAAAC!TTACCACAGTGATCrTGCTGAAGCCAAGAAGAAAAAACTGGC 

TGGGTAAAAGGTGGAT ATTATTATTACCACAATCTGGAGACCCAGGAAGGAGGATGGGATGAACCT CCAAATTTTGTGCAAAATT C 
TATGCAGCTTTCTCGGGAGG AGATCCAGAGTTCTATCTCTGGGGTGACrrc CTGGC CAATG 

AAGGCCTGATCACCAGGCTGCAGGCTCGCTG C CX3TGGAT ACTT AGTTCG AC AGGAATTC CGATCCAGGATGAATTTCCTGAAG AAA 
CAAATCCCTG CCATCACCTGCATTCAGTCACAGTGGAGAGGATACAAG CAGAAGAAGGCATATCAAGATCGGTTAG CTTAC CTGCG 

1 0 CT CCCACIAAAG ATGAAGTTGTAAAGATTCAGTCCCTGGCAAGGATG CAC CAAGOT CG AAAGCGCT ATCGAGATCGCCTG CAGTACT 

T C CGGGACCATATAAATGACATT ATCAAAATC CAGGCTTTT ATTCGGG CAAAG&AAG CTCGGGA^ CT CAT CAAT 

GCTGAGGATCCTCCTATGGTTGTGGTCCGAAAATTTGTCCACCTGCIX3 
TATGAAGATGO^GAAGAGGTTATCACCCTCATTCGTTCTAACCAGCA^ 
GACTGCTAGTGAAAAATAAGATTACGTTGCAGGATGTGG 

1 5 GATATGATGATGATAAATAAACAGAAGGGAGGTCTCAAGGCTTTGAGCA^ 
GTTTTATTTATTGCAAACCAATCCCACOTATCTGGCCAAGCTC^ 
CTGTAATCTTCACACTCTACAACTACGCGTCCAACCAGCGAGAGGAGTACCTC 

GAAATCAAGTCGAAGGTAGAT CAG ATTCAAGAGATTGTGACAG G AAATCCTACGGTT ATT AAAATGGTTGTAAG TTTCAACCGTGG 
TGCCCGTGGCCAGAATGCCCTGAGACAGATCTTGGCCCCAGTCGTGAAGGAAATTATG^ 
2 0 AC C CTGTGGATATTTACAAAT CTTGGGT TAATCAG ATGGAGTCTCAGACAGGAGAGG CAAG CAAACTGC C CT ATGATGTGACC C CT 

G AGCAGGCGCTAGCT CATGAAG AAGTG AAGACACXJG CTAG ACAG CT CCATCAGGAACATGCGGGCTGTG ACAGACAAGTTTCTCTC 
AGCCATTGTCAGCTCTGTGGACAAAATCCCrTATG 

CTG ATGCTGGTGAGG ATGAGCTGCTGAAG ATTATTGGTAACT TG CTTTAT TAT CGATACATG AATC CAGCCATTGTTGCT CCTGAT 
GCCTTTGACATCATTGACCTGTCAGCAGGAGGCCAGCTTACCACAGACCAA 

2 5 GCATGCTGCTTCCAATAAGATGTTTCTGGGAGATAATGCCCACTTAAGC^ 

TCAGACGGTTTTTCCAAACTGCTTGTGATGTCCCAGAGCTTCAGGATAAATT^ 

ACCAAACCAGTAATCTACATTTCCATTGGTGAAATCATCAACACCCACACTCTCCTGTTG 

GCACAATGATCCAATCCACGAACT 

TAAATGACCCAAATAAG GAGG CACTGG CT AAGACGGAAGTGT CTCTCACC CTGAC CAACAAGTTCGACGTG C CTGG AGATGAGAAT 

3 0 GCAGAAATGGATGCTCGAACCATCT TACTGAATACAAAACGTTT AATTGTG GATGTCAT C CGGTTCCAG CCAGGAG AGAC CTTG AC 

TG AAATCCTAG AAACAC CAGC CACCAGTG AACAGG AAGCAGAACATCAG AG AG CCATGCAGAGACGTGCT AT C CG TGATG C CAAAA 
CACCTGACAAGATGAAAAAGTCAAAATCTGTAAAGGAAGACAGC^ 

TTAAAGAAGCTAACAGAGCTTGGAACCGTGGACCCAAAGAACAAATACCAGGAACTGATC^ 
TCAGCG^AGGTACCGACAGAGGAGAAAGGCCGAACTAGTGAAACTGCAACAGACATACGC 
3 5 ATGGGG AGCAGGTGGATT ACT AT AAAAG CTAT AT CAAAACCTG CT TGGATAAC TTAG C CAGCAAGGGCAAAGTCTCCAAAAAG C CT 
AGGGAAATGAAAGGAAAGAAAAGCAAAAAGATTTCT CTGAAAT AT ACAG CAGCAAGACTACATG AAAAAGGAGTTCTTCTGG AAAT 
TGAGGACCTGCAAGTGAATCAGTTTAAAAATGTTATATTTGAAATCAGTCCAACAGAAGAAGT^ 
AATTCATGGGAGTTCAAATGGAGACTTTTATGTTACATC^ 

TTATTTGATAGAGCTAAAGTAAATGTCAACCTCCTGATCTTCCTTCTCAACAAAAAGTTCTACG 
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Table 106 

MOUSE NOMENCLATURE 
ICSGNM Zfp29 
Celera mCG15309 

5 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG27579 

10 MOUSE SEQUENCE - GENOMIC 

GATCCITCAATCGCCACCTGCCTCCCTCCCCTGTA 

GGGATTTGAACCCAGGTCCTCATGCTTGCACAGAAAATGCTCT 

TTTAAAAATATTTTTATTTTATTTTTGTTGG^ 

AGCCTCTGGAACTGACATTATGGACAATT^^ 
1 5 GAAGAGCTCTTAACTCCCGAGCCATCTCTGGAGTC 

ATACTAGGCGAGCACTCCAGTCCITAGCTGTATCTCT 

TTTTATTTTTCTTATTTGTTTGGCTTAGTAGACGTGTTA 

GCATGTGTGCATACATGTGTATGCATGTGTGTATG3m3TC 
20 TGTG(^TGCATGTGTGTATACATGTGTGTATGCATATGTGCATACATGTTTGTATAC^ 
TGTATACACGTGTGTATGCATGTGTGTATGCATGTGTGTATGC^^ 
TATACATGTGTGTATGCATGTGTGTATGCA.TGTGTGTATACATGACTTTTCCTGT 
CATGTGTGGACCAGAGCACGTGGAGGGCCGAGGCTGAT 
ATCAAACCCAGAGCTCACTGATATGACTAATCTTACTA 

2 5 ACATAGGAGGCAGGCCJATCATGCATACCTGGCATTTACTCG 

CTTAACCTGAGCCATCATGCGATCTGCTCTAATTTTTTAAGAC^ 

CCCACACTGGCCTTGAACACTTGCCCTTTTTTAAAATTTATTTTTA 

GAGCACTGTGTGTGTGCCTGGTACCCGCTGAGTTCAAAATC 

TTTGGAGTCAC CT CT CCAGCCCTTCTTTTGTGTTTTGATAAATGAGG TTAATTAGTGAAG AAATGGTGCATT CAAACTGTTACAGT 

3 0 TTCTTCTAGGCTCCACACXZATGGTTACCTGGCAACAGCTAGG 

TACTCTTTTTTTTTTTTTTTTTTTTTTTTTTTGAG 
AGGTTCX3CCTCGAACTCAAAAATCCGTTTGCCTCTGCCTTCTGAGTGCT 
CTCTTACTCTTTTACTCTCTAACTCTCCTCCCTTTCTGCCCCTTCTCT 
TCAGCCrrCrOTCTCTCTCTOTCTCTCTCrCTCTCTCTCTCT 

3 5 TCCCCTCCCCATGCCCTAAATGAACTCTAGTTTATACTATACCTCGTCCTGTGCCTGAGCATGGG^ 

CTCATCATACCACACCACCTCCAAACATATCCTTGGCCT^ 

TGGAAAT CCAATT CAGAATCT AT AAAAAGACAGG ATGAACGAAATTGACTC TGGGG C TG G AGAGG TGGCT 

CGGTGATTAAGAGCACACACTTACTCTTGCAGAGGAGCT^GTTTGACTCTTAGCACTC^CGTTGGGCAGTTAATAACCTTCTCTA 
ATTCCAGTTCCAGAGGAT CCAATACCTCTGGCCTCTGTGGGCATC CAAATTCAAATGCAAATACCCACACAGAAACACATAATTAA 

4 0 AAATAAAATAAACCGTGATGGAGAGAGAGCTCAGCAGTTAAGAGCACTGACTGCTCTTC 

ACCACATGGTGGCTTACAACCATCTTTAATGGGATCrGATGCCCTCTTCT 
AAAATATATAAGTCTTTTCnTAAAAAATTAAAAAATAAAATGAACCTTAAAAA 

TGG CTTGGTGGTT AAGAG CATTGGCTG CT CTTT CAGAGGACC CTGGTACAAC CT CCCG CACCCACATGGT AGTTCACAACTTTTAT 
AACTC CAG TCAGGAGATCCT ACACCCT CAAACCAGTG CACATAAAAT AAAATAAAAT AAAAT AAAAT AAAATAAAAAATTTAAAGC 
45 CAAATATCATTTTAAGTCAGAAGAAATGAATTTA1TTT 

TGCGAGCC!AGAGTTACAGGGAAGTTGTAAGGCACC(IATACGGGTGCT 

GCTATTTACCAGTAAACTATCTCTCCAACATTCAAATTCTCT 

ACAGTTCCTGGGTCAGCGAGATGGCTCAGTGGGTAAAGGTGCATTT^ 

TAGATTTC CT AC CTG AGTGT C CAGGTAGTTACTCTCTTGTTTTTATTT AGT ATGGAT CCCATGG AAC CAGG C CCACAAACTGTTGT 

5 0 GAGCTGTAATGTAGGTGTTGGGGTCTTCTGCAAGAGCAGCAAGTG 

CGCTGCAACAGGATCCTTCTGCAC!ACGTTTATTGGGAGAGCTTGATTGTAGA^ 

TTACATAGGCCTAGGAGAGGTGTGTCTCACATCTGATTGGTTAACTC 

TGGCCAGCAGTAGCCAACTGCCACTCTGCAACTGCCACTCTGC^ 

ATCTCTCC^GCCCCCAAGTCAAGGGTTTTAACTAGAGCGAGTAAGGGAGTAAAAGTTTGTATGCTCT^ 

5 5 TACAGAGTAGGAATGGCTGTGTCGGGGACAGAAGTGGGTTTAGATTTATCCTTTATCT 

GCGGGCACTAAAACAAAGTG GCACAClT"l'l'Ci'l'GG AAGCTCATTT CACC CAGT CTTG GGG CTTAAGTGCCCT CAAAGCTG AAAAGT 

TCACTTGGTGAAGGGGTAGCAGGCACTCCATGCATACTCTT^ 

ACCACTACCATCTATTGCTGTTCTTTACATCCTGTTTTGGGGACCTT 

TCCTGTTTTAGGGTGCCTGGGTGGTCATTC CCTCCCAGAGCTAACCTCTTGCCTAACTGTAGGG CAGGTTC CAAGGAGCCGACTAT 

6 0 AGACTAGGACTTACAAAGCAGAAAGAGCGGGGCGGGGCTTTCGTGAAGACGCA 

ACCCCGGAGTGCAAATGCCTGTGCGTAG CTGCAAATTTCCAGGAGAGGTG CGAACTTGGGCCAAGAGGAACTTTGATTGCGCAGAT 
TTCTTCTCTGCTTAGAGTCAGCTTTTGGCTCTCGAGGGCTCATTT 

AGCTTTCATTTTTT AT AGATAAAT ACACG AGC TTTAGAGAATTOT CACTATCTGT CTCAGGTG AAAGCTTTGGG CAGT ACT AG A 
' AGACAGACAGCTAGTCCTGGGT CGTCCTCTGCCTCTGATCCGGAGGCATTAC^ CTCAGGTAC 
6 5 AAACCACCCCCCCCCCCX3TCTCTGGGTAAACTACATTTCCCGCAATGCATCGGGTGGAACT 

AGTTTTCACCAATAGTGGAGCAGAATTTCAGGAACTGTTGTGGACAGCCAAT 

GTTCTCTGTGCCCCTTGGGCTTGAGCTGAGGTGJ^TCCAGAGGGGCCGGGCCGGGCCG^ 

AGAGAGATAGCGGGAACAGGACCTGGTCCCTGGAGAGGCGAGCGGGAGAGGCAGGTCAGGAGCCGCGC 
^ ^ AGAGAGACAGCGAGTAACGGGCCTGGAGCCAGCTCAGGGCAGTTT^ 

TTACACTGGGAAGGGATG CTCGACGACATCCCACCGAGCCG CCTCCTTGACC&GX3TGGGGAAACTGAGGTTCTGAGGGGGCGTGTC 
GGAGCCAGGCTTAGCTAATAAGCTGTCTAGGTTGAGTGTTCCCGGG 
TCATTTTGGACAAATTGAC^TTGGGAGCTTCAGTTTTCTTTCTGTTC 
GGGCATGTTATCTGATTCTATGGTGTAATGATTTTTTAAAAATAAATATTCATTGOTT 
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ACTGCTTTGAATATTTTGTAGAGCTACT 
GTTCATTTAAAAAGGAATTGGGCAGGC^TAATCT 

CT ACATAATGAGATCTTG CTTCAGAAGCAAAAAAATAA^ GAG CGTTCT CGTTATTATTCC 

CAAACAGTACAGCATACCAACTGTTTACATATTATTTACACTTAAGTATA 
5 ATTTrATATAGGGGCCTTGAGGAGTCCTGGATTCTGGGATCTGGGAGATTATCT 
GACTTGTAGTGAAGCTGGGGGCACCTTGACATGCGATCCATT 

ATTCATTAAGTGGT CACTGAGGTTGAATGACTAACCTCTTTGTTAGGT CACAGTCT CTGAAGGGAGT CTGAACTGACTTAGGGT AG 
GATATGTCAGTTCTGTTTCTCTCAGCCCAGAT(^CTG C^TCCCACTACT CTTCC TTTGGT CTT CTG AAAAGAGCAATGCTGTGT AT 
GTGTGTGGTAGTTGTTATTATTATTATTATT^ 
1 0 TCTTTAAATATCTACTAGGTGTTTCrrACCCCACnTCAG 

AAGCITAAAAGCATTAAAGTTGGGCTATTTTGTCTACCTCC 
CAGGCC^GCCCCTATAGCCATGGGCTCATGAACTAC^ 

CTCAGATCCCAAGCCTGGGAACCTTTGCTCCACCCCCTGCCTTCTGCCCAGTCATCTCTAT^ 

1 5 ATAGTGATATCAGACCAATG ri'G'l'GTGTGTGTGATATTTCTT CTTTTTTGGG GGGG GGGGGGTGGGTGTTTCAAG ACAGGG TTT CT 

GCTGGG ATTAAAGGTGTG CGCCACCACGATGGG CTGTCAT ATT CCTTTTG CC^ CACTGCTTTT AG AAGTAAGG CAACA 
AGCCTCTGTTGGTTTTAAATATGCGGAGGCTGCTATGAGATGG CTGTAG CAGTTTCCTTAGTGATTCTGGTTCTCTGCCTTGTTTC 
TCAG CAGTGTTTCTTGTTGAGATTGGAGG AAAGACGG CCTTCT CAG AG AGC CTGACTGGAGACAGGTGTTAGGCTTGAAG CCTTCG 
2 0 TGACCATCCAGGAAGTTGGACAATGGCAGCCG 

AAGATGAACAGGCAGAGGTCACCACTATGATCCTGGAGGATGACGCGTGGGTGCAGGAAGCAGTC 
TCTGAGCCCTTTCCCCAGAGTGCTGGAAWVGGCAGCCCCCAG 

TCGGGAGCTCTGTCGGCGCTGGCTGAGGCCAGAGGTGC^CACTAAGGAGCAGATGCT 
GGCTGCAAGAACATCGGCCTGAGAG^GTGAGGAGGCAGTGGCCC^ 

2 5 ACAGAACCACAGAGGGAGAGGGTGGGAGCCTI^CGGAGGTTGGAGTAGTGTCAGGGTTT 

CTCCCTGCAGCACTC^CAAGATCCTAGCTCTTTTAGATCCCTCAC 
CX3GAGGGCTATTCTTGGAGCTCCGTTTGCATACGAAACACAGCTTT 
GAGATAAAGGTCAATGGTAAATAAAATGCCTGCCTAGCACGCACAGG^ 

3 0 TTTCCTTGCTATAGAGAAAAAGTGTTTCTTCCTCCTCCCCCTCCTCTTCTTCCTC 

TCTTCTCCTTCTTCTTCTTCTTCTCrrrCTGTCT^ 

ACTCACTGTGTAGACCAGGCTGACGTCAAACCCACAGAGATCTGCCTGCCTGCC 
GGGA7ACAGAAAATTTCTGTTGTTTTGCAGGTATGAGAACAAAAAAGACCTC 
AATCATT CAAATAGTT CCAGCAATTGGAATAAAAGCATGCCACr TTTTT CAAAGT^ 

3 5 GTAAATTGTCTGGGCTTTCTGTGATATGATGT CATCTT AT ATC CTTTT AAAGAGT AATTGGC CAGG CT AGG AAGGG CACACCT ATT A 

TCCTAGCACTTAGAACCCAGGCTAGTATCTACTTCAGTTCTTCCTTTCCCT 

GTGTTG ATGAAGT ACTT CAGGAAT ACAGAAGGGCATGT AG AGAG GACAG GCCAG AC CAG AG AAGG CTT CGT AG AAGACATGG ACCA 
TGAGTAG G AG CTTGCAG AC TGAAGT AG AAATTG G AAAGT C ACAG GG CG AGGAG ACAGCTGTG ACTGCTG AG AGG AGCTGT AG GTAG 
CCAAGGCTCTGAAGTGGGGGAGCAGCAGAGGAAGGAGGTTTCTT 

4 0 GGTGGGAGTTTGGGGGGAGGGAGGATGGAGGTTTGAATGAAACGCCATTGGCCTCAGCAGGCTCTTGATCTAGGGCC 

GG AG AGGAAGCTGAG CCTGGCAACAGGATGT AG GTTTG ATT AGG-GGAG AG AG GCT AG AGGTTGGGG AG CAGTCAGGAAG CAG C CTG 
GCCATGGTGTGTGCAGGGACAAGAGTCTGGCCXJTGGGACGATGGGACGTGAGAGAAAGAACAGCG^ 
GCCCTCAGTCTGCCGGAAGATGAGTGGACTATTGTCCTTAGACGAAGAGTGACCCACTCTAAGATCACGCAGC 
AGCCAAGACTCTTGTCCTTAAGCATGCCATTGCAGGAGGGGTCCTAATGAT^ 
45 GGGATTGGGGCAGAGCCTCTGTACTGTCCACTCTGGAGGACTTAGTCGATTAGAGAGGGAGATCATGAGGAG 
CAGAAGGTGTGACTGCAGCCIAAGGACTGAGAGATATGGAGAAGTCTG 

ACTAGCCCCTCGCTGGGCACCTTATATGGCACCCACACCAAGAATCTTGTGAGATGAGTGGGACCGTCAC 
AACAGCTTCACACAGCTGTCATTCGGTAGAGCCAGGATTCAAACTCGGGTCTGTTTG^ 

TTATTT GAAGTG AAGT ATTTT ACTTT CTTTGACTTTG ACAGAT AACTT C CTT AG GGGTG TGCTTGAG AT CAAGCCT AGGAAGTGGG 

5 0 TCATGTGAGGAAGGG(n^3TGTCACTGAGCCT 

C C CAGCTGG CCTAAAG C TGGTATAGT (2AAGGATAAACTTGAACTTAAAAAT ATTTC TTTTACTTAG'rr'rr ATGTGTG TGG GTATTT 
TGCCTGCATGCATGTATGTGCACTGTGCATGTGCCTGGTGCCCT 
ACACTCTGAGCTGACAAGGAGGTGCrGGGAATCGAACCTTGGTCCT 
CAGCCCCrACCTTGAACTTTTGATGGGCCCACTTCACCGGATAGGATTC 

5 5 GGCACCAAACCCAGGGCTGCTTGCCTCITTGACAAGCAGGGTACC^ 

GTATCCTAAATTGGCCTGGGTCTTOTGGTAATCCTCCTGTTTCAGCCCCCTGAAGTCrrGGGACT 
ACTTGATTTTATACTTTGTACAAAGAAGAAGAGGGGAAAGAAACTCTGTCATCCCGTTTCAGATCTCCCTTGCCT 
AGGCGAGTTTATGATGACCTTCAGGGTTTCCAAGTCACCCCAGAGGCrrTGGGC 
GTGCCAGGTCAGTGCTGTGCCTCAGCAGGGGT 

6 0 TCATCTGAGTTGGTGGTGGGTTCTCCCTAGAGGAATCAGGAGGCT^ 

TGTTTGATAACACGTTGTGCCCTTCTTCCTCATAGCTGCTCACAGGAATCTATATTTTCCTTCATTCCTTCTTAA^ 
CTGAGAGCAGCCGGCCTCTATTCTGATTGTTAAGATAAAGGCCTT^ 



v ATAGTTACGTGTAAAGTTGAGCTCTATTTGTCCATGAG^ 

6 5 • ATTTCCTGACT<3AATTTTTt3CAGTAGTCTCrGAACTAA 

CAGTCCACCTGCAGCCTCCACAGGCTTCTCCCTCCTTCT 

CTCTCTCrCTGTTTTTGTTATTTTCTGGCTTTTTGTTTTGAGATGAGGCCTCATTGTTGTG 

TTGTTTTGGGTTTTTGTTTGTTTGGTTGGTTGGTTTGGK^TTGTTTT^ 

TTTGTATAGCCCTGGCTGTCCTGGAACTCACTCTGTAGATCAGGOTGGCCTCGAACTC^ 

7 0 CAATCTTGTTTTTCTATTTATATACTCTTATATCTTTTAGG AACTT A^ 

TAATCCTTTCTCAGTTCCTGCTAACAGTGAGCACTCAGTGAGTATTGTTGGGA 
GAGACCTCTACTTGGGATTTCCTGACTGAATTGTTGCAGTGGT 

TTCCCTTTGGTGC^GAGGACAGGGAAAO^CTCAAGCTGATGATGAGCTGGGTCTGAGCGATGGAGAATGAGACT 
TAGTCAAAGCTCAGAATAAAAGATGGAGGAGTTGGCTCAGCAGCTAAG 
7 5 CTAGCACCCAGACAACAACTTAAAACCATCTGTGGCTCCAGTTCCAGGGGGTCTGATGCCCT 
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ATGCATCTGGTGCACGTGCGGGCAJ^TGCCCATAAAATAACATCTCAGGATGACA 

CTT AGGGTTTGTATGAGATTCACCAGTGT CCTTGTTTGATGG CTG TTTTCAGTCCATGACAGACCTG TTAAAAG ACCACTTATACA 
AAGCAATAGCTAACTTCTGTAATGGTTACAGATTATTACTGAGGTAAAAAACA 

ITriTCCCCCCATAAATGCTTTGGGGAATGAGATTCAGGTTTTCTAGAAAAGT CCTGTATTT ACAGT TATGAG C CCTGGCTGTT CT 
5 CCCAGAGGACCTAGGTTTAGTTTCCAGCACCTACATGGCAGTTCACAGCT^ 
GGACATACATGCACCTCACATGTACTCACACACACATA^ 

GTCTTTTGGATGGAGAGAG TGGTT AAAAGT ACTTAATTGCTTTTATTGAAGAC CTAGACTTGGTT CCCAG CACCCACAT CAAGGT A 
G CTCACAACTGCTTATAACTCCAATTCTAGTGATGGGGTGTAGTGG CACACGT CTG AAAT CTTAG CATT CAGG AAGTGGAGCAGG A 
CAGTTGTAAGTTAGAGACCAGCCTGAACTCCATAAGAAGTTACGGTCCCTAGTGGGAGGA 
1 0 CACACTTGAGCAAATCCTAACACCTGGCCCTCTCCCAAATTGTCCAAATATTTTCCATTCC^ 
TGTGCTTCATAACTGAAGTTAGATGTCATTTCTCATGGCCCTAATATTTAAGTTCCAAT 
TGCCTCTGTACTGACCTTTAGGGAAGTGAACCCAGGCCCCAGGCAACC^ 
CCCTC 



15 . G AGCT AAC CCT CCTTGATAG AT ACACTG ACCACATGGAATTC CATCAT CCCA^ 

ATTATTAGTCTCAAAATTGATAAATTCCTGATAATAGTCACACCTGGTA 
CAGGAGGATCATGAATTCAAGGTCAGAGTGGGCTACATAGTGAGTTC 
CTGGACCAGCAGCATCAGAATTGTTAAAAATGTGAATCCCA^ 

TT AGGGTCTT ACAAG CCTT ATAGGTGATCTT AG CTTAAGCTCT AGTCT C CTAACCACT ACACCAGG TCTC TGGTTTTT AAAGTTGG 
2 0 CAGGTOTTGAAATTACCIX3GGGACCTAGAGTCCATG 

TATGCTCTATTGAATGACACCGTAGTCCCCATGCAAAGGTTT^ 

GTAGCTTTTTTACTCAAAGGATGGTCCCTrGGAACACAA 

GCCCACTGAAGCAAAATACACATTTAATTAGATCTCTGGGTGGCrCTTCT 

GAGGAACAGGGGTGTT ACATTCATACCTGTTCTTGT CCCACAGATGAGC C!AAAGAATCACAGCTTCAC CCTCACTCACCACACTT C 

2 5 ACATATTCCATAGTGCGCATGTGGAAAGCTTTCGGGAGCTGTTTCTCTTTCrrA 

TCATCAGGTTTGGT AGAAGGCCTCT ACCTG CTGAGT CATCT TAG TGTTC1T Cri'CC TT CTl'T AAGGAGAGGGTTGTGTGTAG CTGA 

GGTCTTGAAATTCTATGTCTATAGCCAAG^ATAACCTTGGCT 

GAGCCACCGTGCCAGGTTGATAGGGTTCTAGAGATTAAACCCAGAGCTTTGTG 

C CTGG CCAGAT CATATTCATTATTTATTTATTTATTTATTTC^ C CAT 

3 0 TGCAGATGGTTGCGAGCCACCATGTAGTTGCTGGGAAT^ 

CATCTCTCCCGCCCTCATATTCATTTTGTTAACATGAGAGGTAGCACTGTTGGACAGGT 

GTTTAAAGAGGTGGGTCAGCTGTTAAGAGGGCCTGAATTTGGTTCAGCACCACTATCAGGTGGCTCCCAAATGTCT 

TTC CAAGAG AAT CCTG AGC CT CCAGGCTT CTAGGGT ACCTGCACT C ACATTCCCACACAATAT ACAT AATT AAAAATTAT AAAATT 

AT AACAAGATAAAAAACTT CAGGTCT T AGCTTGAAATCAAAAG CACATT CAT TCAGTCAAATCTTTG AAT AT AT AC CACATAGAGC 

3 5 TCTTAAGCAGAACTCACCCTAC^CTGTAAAATGGACAGTA^ 

GACTGAGGCAGAGCATCAATCGCTCCTGGTCGCTGCAGCCCATC 

GGGCTCCTGTGAGAATGGAGAGGACCTGGTGTGGGCATCAGACCAGTATCCTGATCACTGTTTTAGC 

TGTTGAGACACAGTTGTGTGTCATAGACAATTGGCAGGACTGAGCCAACTCAAGCT 

CCTGACTACAGGTGATTACATCTAAGACACAOTCCACTCT 

4 0 AATGTTTAATTT ATGAATT AT ACATTAT AATAACT AT AAG AG ATTG ACAGTAAAGG CTACAG TAG AAAAATTAT AACAATATGTT A 

TACTGCAAACTTATTTAACCTGGCAAATACCTTTATT^ 

CCCTGGTCTAC^TAATGAGTTCCAGGATAGCCAGGGCTATGTAGAAAAAAA 

TATTTTGTTTTTTTTTTTTTTGTTTTTTTGTTTTTTCGAGA 

AGACCAGGCTGGCCTCGAACT CAGAAATCTGCCTGCCTCTGCCTC CTG AGTGCTGGGATTAAAGGCGTGTGCCACCACGCCCGGCT 

4 5 GAGTTCTGGAACTTTTATGTACTGTTTTTTGCACTTTAGTTGACTC^ 

GGGACTATTCC!AAAGCACATTGGTTTATCTTrCATCTTTAACATCAG 

GAAAGGGCCCACAGCTAGGCTGGAAATGCAGAGGTCCTCAGGAG^ 

TAAGGGGGTGGCIX^CGAGATGGCTCAGTGGGTAAGAGCACTGACT 

CAT CTG ACACC CTCTT CTGGTG CGTCTG AAGACAG CT ACAGTGT AC CT ATTT AT AATAAT AAAT AAATC TTrGGGCTCG AGCAAAC 

5 0 AGGGACTCAGCAAGCAGGGCCAACCGGAGTGAACGGGACT^ 

ATGAAGGCTCACACCG CATG AAG CT CACAACCATCTGTACAT AAGTAAAT T AATT AAT AACTG TTAGCT CTT AAGTGTAACAGT T C 
AACTG CTGAGTGG ACAGTTTTTGGATAGTTT TGAGAAAAACTAAAATTG AGGC CAGAT ACTGTGGCATACTCC CTTAATCCCAACA 
CTCCAGAGACAGAGGCAC!ATAGATCTCTGAGTTCAGGTCCAGACT 
GTTCCAGGACAGCCAGGGCTA<^CAGAGAAATCCTGCTCCCTTAATTATT^ 
55 CATATACTTATATGCACATATATTTATATGTAGGAAAATAATGTCAGCACAGTT^ 

AAGAGACT ATGATC TACTGACTT'rr IT TT lVittTTT CCT TTC TCT TTCT CT CTTT C CTlTlTTTTTTTGTTTG TTTTTGTTTTTT 

AGAAATCCCCCTGCCTCTGCCTCCCAAATGCTGGGATTAAAGGCGTGCGC CACCACTGCCCGGCCTATGCACAG CTTAAAAGCACA 

60 GCAGAGCCCTTGGT 

TCTCCCTCCACCCAGCACTGCTGGCCAGCCTTTATTGACAAGGCAGAGA 

GAGTTTCCCAACACAAGCATTACAATGCTGCCCTGTCCCCATTGAAATGAA 

GCGTAAACTTTACACAGTGTACAAAATCATGCTGTCAGGATTGCT 

GCATGTGTATGTAAATACAATTCCAGCCCTGAGGAAGGAGCCCTGGGGCTTGCIXSGCC^ 

6 5 AGGCCAATGAGAAATGGGATCTCAAATGGTGCCTAAGGAAGAGCTGCACCA^ 

TTCTTGGTCTGAACTCTTGGCTCTGTGATTCTAATCAAGCAAGGTAGAAAGGGCTGCRGA 
CTGTTCTTCCCAGAGGACTCAAGTTGAGTACCCATGACCACCTGTAAGTC 

GAGGGCATCTACACAAGTGT G ATAT ACACACAG CAG AGT AAAAAT AGACACATTTTT ATATAC CC C TG AATATCAGG AACAG AG AC 
TAAGAGGCTCACAAGGTTGGGTTTGCTTX3GTGTTGAGGTCAGGAAGAAAA 

7 0 TAAACAGCATTX3TAGACTGCTTCCTGGCAACATCCATGTTCCTGACAGTCATTAGTTCCTAAGGTA 

TAGTGTCTGTAGGCCATTACAGCAGCACATGGTTCAGTAAACAGTCTCACAG 

ATCAAGCTTCATACTCATTTTATGTGTATGGCTGTTTTCTCTGTATGAGTATCTGTACCAAGT 

AGCATAGGTATCATCTGAGTTTCAGGAAGTTAGGAGCCACTGTGTGGGTGCTGGG 

TC TGTG CCC CACACACATGTCTT AC C CACAGAAGCGT AAGAGGG TAT CATGTT CCCAG CACTGGAGAT ACAGTTGCT AGTCACCAT 
7 5 GGGATTGCTGGGAATTGAACCACAGACCATTGAAAGAG 
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TTTTTTTTTTTTTTGAGAC^ 

CCCGTCTCTCCTTCCTCAGCACTGACCTTACAAGCACACA.CCTCCAC^ 

GTGTGTGTGTGTCTGTGTGTGTGAGAGAGAGAGAGAGAGAGAG^ 
5 GGTTCrCTCAGGCTATCAGGCTTGGTGGCAAGCACCCCCCATC 

AG AG CG AG AATGGGGAGAACTCAAATGAAGACATGTTTG^ GATGTTCTTGAACATCTCTGGAGGGG AA 
GGTGGTC^C^TCTGATGGGGACAGTGACTTTGAGAGAGACTGTGGCTCTGGAGG CG CCCAGGGACATGCCCCGGGTGAGGACCC 
CAGGGTCGTGCCATCGGAAGGAAGGGAAGTTGGCCAGCT 

CCCAGTGTGGGAAAACTTTTAGC CGGAAATC CCACCTTATCAC CCATGAGCGGACC CACACAGGAGAAAAATACTACAAATGTGAT 
10 GAATGTGGGAAGAGCTTTAGTGACXKKn'CGAACTTTAGTAGACAC 

CTG CGGGAAGAG CTITTAGCCGGAGTGCGAACCTTATCACGCAC CAG AGGATC CACAC CGGCGAGAAG CCTTTCCAGTGTGCCGAGT 
GTGGO^AGAGTTTCAGCAGGAGCCCCAACCT 

GGCAAGAGCTTTGGCAACCGGTCCAGCCTTAATACTCACCAGGGC^ CAAGGAATG CGG 

CGAAAGCTTCAGTTACAACTCCAACCrrGATCCGACACCAGAGAATCC^ 
1 5 AGAAGTTCAGCCAGAGCTCCGCGCTCATTACGCACCGGAGAACGCAC^ 
AACTTCAGCCGCAGCTCCAACCTGGCCACTCACCGGCGCACC^ 

CTTCAGCCAGAGCTCCAGCCTGATCGCGCACCAGGGCACGCACACCGGCGAGAAGCCCTAC 
TCAGCTGGAGCTCCAACCTCATCAAGCACCAGCGGACGCACACGGGCGAGAAGCCCT 

AG CCAG CGCTCG CAGCT OGTGGTGCACCAG CGG ACGCACACCG GCGAGAAGC CCTACAAGTGCCTC CTGTGTGGCAAGAGCTTCAG 
GGAACTCCGTTCTCATCATCCACCAGCGCATCCACACGGGAGAGAAGCCCTACAGATGCCCGGAGTG 

AG CT CCAACTTCAT CACACACCAG AGGACG CAC CTGAAAGAG AAG CTT T ACTG AAGTGG CAG AAAAGAGAAGG AAGTG CTG AG CTG 
ACT CTGCAGGG AG ATT GT ATCAGGTCAGGATGATAGAT CT C CC AGTCGGAGT CATCTGT AG G AAGT CGGAG CCCTTCAG AACACAG 
TCTGAGGAAGTATGGCCTGAGACTGATGTCCCGCTGTCT 

2 5 AGATGTGTGTGCCTGTGTGTGGAGCACACITGGAC^ 

TCAGAGTGAAACCGTCGTGOSTGCTGTGCACTGTGTACACAATCACTC 
CCCAACACACACACACAGAGTGTATGTATCACATAGAAC^ 

TCTCATCCTCCTGCCTCCACCTTCTGAGTGCrrGAGATTACCAGGAAGTGCCAACGTG 
CTTTGTGCATGCCAGGGCATCACACTACTGAGCCATATCCT 
30 GTGTGTGGTGTGTTGGTGTGTGTGTGTGTTGGTGTGTGTGTTGGGGTGTtSTC 
GTCATTCCTAAGGCA.CTGTTCACCTGGGAGTTTTTTTGAGACAGAGTCTCTCA 
CTAGACAGCCTCAGGAACTGCCTCTCCTGTGTGAGGGTTCGAAGCATGTACCTCCACACATGCC^ 

AT CTGAGAG CCT CAAACATTTACAGTGTGT ACTTGAC CGATTG^ CTATCTCATAT ATT TTT AACTTTATGTGTGTG GATCTAGCT 
TACrGCCCTGCAGTTTTGTATACCAAGGGTGCTCTTTT^ 

3 5 ATGAAATGGCTCAGCGGTTAAGAGCACTTACTGCTCTTCCAGAGCATCACGTTT 

ATGTGAACCTTCAATTCTAGGAATCTGATGACCTTCTGGCTCTCATTC 
CAAAACACACACACACACACACACACACAAATCCTTCAAAATAGGAA 
GTGAGGTTTAGTCTTGAATCAGGAGTTTGCTCTGCAGAACCAGATGTAATCTTAGT 
CCCJAAGGCGTGAAAGGTAAAGAGAGGAAAGCTGACTGGAGAAGTGGTGTTACT 

4 0 AGAAGTGTGAAGCTATCACCTGATATTTAGGAGGTTTATGCTTC 

TGTTTTTTTGTTTTTCGAGACAGGGTTTCTCTGTGTGGCCCTGACrGTCCT 
CAGAAATCTGCCTGCCTCTCCCTCCCAAGTGCTGGGATTAAA 
GCGTTGACTATTCCCAAGTGGAGCCCCATTCTTTTCCTGACTGTGCCTTTA 
CCTATTTCTTTITTTTTTTTTTAAAGGATTTA 

4 5 ACACCAGAAGAGGGCATCCGATCTCATTACAGATGGTTGTGAGCCAGCATGT^ 

AGCAGTCAGTGCTCTTAACCACTGAGCCATCTCTCTAGCTCCCAGACTAGCCTAT^ 
CTGGGATTACATACATGCTTGGGTTTCCATGGCCAGCTCCrGAGTCTTATCCTC 

TATAGTGGTGTGTGC CTTTAGTGT CAGCACTCAGGAG CCTGGAG CAGGCAGAGCTCCATGAGTT CAAT ACCAT AG AG AACTCACAT 
AGTGAGTTCTAGGATAGCCAGGGTTACATAGTGAGACCCAGGGGAAAAATATAACCTAACC^ 
50 ACTTC CAT ATC C T ATCCAC CT CC CACTGCAGG C CGG AAAT AAC CT AAG ACTTTCAGG AAGG AAAAATT AGTTGTT TCAAAGAC CTG 

GGCTGTGGTGGCACACACCTAATCCTAGCACTTGGGAGGCAGAGGCAGGTGGATTTC 

TGAGCTC CAGGACAG CCAGGGC TAAATAGG AAAAC CTTAT CT AG AAAAAACAAAG CAGAAAG AAATG GAGG ACAGAT AGACGGACT 
GACATATTATTGGG TTAGATTCTT ATCACTG CTGTT TGTGAAC CATC CAT CATG CTTGTCACC CAAG CCAGGTGTG GTGC CT CATG 
CITGTCAACCCAACACTTGGGAAGCAGAGGTAGGAAGATCA^ 

5 5 CACCTGTCTTGGCATGGCACATCTGGCTAAGAAATGAGAGAACACTGCGAATCATCTCTAGTCAGACTC^ 

GGGAAGTTCTTGGCAGTGATGGTGGACTGTGGAGCCAGCTAGGTGGTTAGGCTGGTCAT^ 
AAGGATGCAGTGTCATTGCTGTGCTCAATGTTAGGAAACTATAAAGTCCT^^ 

G G AAG GT GCAGGAG CTCCTGT CAGAAAGGCAG CAT AG CT AGAAT AGTTT C TCAG CTG ACATT AAGTCTC C CGTGT AGC CCACACTG 
GTCTTAAGCAAAGGGACAGAACGTACCGTGCCAAGCCTTCTGCCTCTGCTTCC 

6 0 TGCCATCTCCTTTCACACGTTAATCCACTCTATAACTTTAGCCTGAGA 

GATAAATACATAAAAGTGTG CAG CTT GC CTT AGTTT CACAAGC 
AGCATCGATAACCAGATGTGCCACCTTGTACCTGCCTTA^ 

AACAGACTTGTGTTGCACAG^TCTGCTTTTACTATACAGTTGAGTCAAGTCATAGTGAGG 
TTCTACATTTATGGTTCCCAATGCCATAGTAGCTTCTAGTGTGAGCGAAAGGTACAAAGCTATGCTCT 

6 5 TGACAACTCCGAGGTAAGTAAGCAGCCTCACTCrrAGGGACAGACTGG^ 

TCATCCTGTGGGCACTTAGCTACATTGTACTAGTTTCTTTTAAATTTT^ 

TATAGCCCAGGCTGGCCTCAGATTTGTAGATCTCCTGTCTCAGCCTCTTACACTGGGCATTCCAATT^ 

AAGGACATGGTGGCCAGGCTGTGGTAGTGCATGCCTTTAATGCCAGCATGTGGAAGGTAGAGGTCAGTGGATCTCT 

GCAGAQVACCTCAGCTCAGTCCCrTGGTCCCATATA^ 

7 0 GCACCACTGTAC^CATTCTCTACCCACCATGTAAAATGTTTAAGTTTCAAAAGCCAAA 

GG ATGAAAGG AATGGATAGTAAAAAGTGGT ACATTAG CTAGATTT AAAT CTTTTTCTAG CT AGCAG G ATTGGCAATAACTGATTAG 
TCCTTGCAAGTTAACAACCATCATTTTTCCTGGCCCTTTACCCCTGACTGGGGCT 

GAGGGGTAATCGCT ACCCTCTAACCAGCTCCATAGCGCAGCTGCCT TGTTCAACCCACTCTCTCACGTGGTTGGGGAGAT CTTTAC 
ATGG ATGT CCCTTAAGTAGACCTTTTCAG AT CTTTGGGTAAAAT AT ACAT CT^ TCTTT CTTT CTTTCTTTC 1TTCTTTC TTT 
7 5 CTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTTCTTCCT 
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TTTATTTATTGT ATATGAATACACTGTAGCTGT CTTCATGCGTGC CAG AAGAGGGCATTGGACCCCAT T ACAGATGGTTGTGAG CC 
ACCATGTGGCTGCTGGGAATTGAACTCAGGACCTCTAGAAGAGCAGCCTGTGCGCTO 
TATGCATCTTTCACTTTTGCACTTCTGATAAAATTACCCC^ 
GAGTTGAGACAGAGATGAATTCCCTTTGGTTTTCCACATTCCCTACCCG 
5 CTGAACAAAXSGCTGTCACCCTGGGGCCTGCTTGTCCT^ 

CATCTGCTTGAAGCTGCTGTGGGAGAAAGGCTACTGAGGGA 
CTTGCTGTGAACCCTGTCAAAGGAGAAGOSCTCTGTC^CTGCCCAAATGCCCT 

MOUSE SEQUENCE - mRNA 

10 GTTGTTTCTTGTTGAG ATTGGAGGAAAGACGGC CTTCTCAGAGAG C CTGACTGG AGACAGGTGTTAGG CTTGAAG CCTTCGTGAC C 
ATCCAGGAAGTTGGACAATGGCAGCCGAAGTGC 

GAACAGGC^GAGGTCACCACTATGATCCrGGAGGATGACGCGTGGGTGCAGGAAGCAGTG 
GCCCTTTCCCCAGAGTGCTGGAAAAGGCAGCCCCCAGGAGGAGGACGCA 

AGCTCTGT CGGCG CTGG CTG AGG CCAGAGG TG CACACTAAGG AG CAGATG CTAACTGTGCTG CCAAGAGAAATTCAGGCCTGGCTG 
1 5 CAAGAACATCGGCCTGAGAGCAGTGAGGAGGCAGTGGCCCTGGTG 

ACAG AG CGAGAATGGGG AGAACT CAAATGAAG ACATGTTTGAG GG TGTGGAGTCACATGGGATGTTCTTGAACATCTCTGGAGGGG 

AAGGTGGTCAGCAGTCTGATGGGGACAGTGACTTTGAGAGAGACTGTGGCTCTGGAGG 

CCCAGGGTCGTGCCATCGGAAGGAAGGGAAGTTGGCCAGCTAATAG^ 

TCCCCAGTGTGGGAAAACTTTTAGCCGGAAATCCC^ 

2 0 ATGAATGTGGG AAGAG CTXT AGTG ACGG CT CGAACTTTAGT AGACAC CAAACGACT CACACTGGAGAGAAGC CCT ACAAATG CAGG 

GAJCTGCGGGAAGAGCTTTAGCCGGAGTGCGAACCTTATCACGCACC^GAGGA 
GTGTGGCAAGAGTTTCAGCAGGAGCCCCAACCTCATCGCCCATCAGCGCACGG^ 
GTGGCAT^GAGCTTTGGCAACCGGTCCAGCCTTAATACTCACCAGGG 
GGCGAAAGCTTCAGTTACAACTCCAACCTGATCCGACACCAG^ 
25 CCAGAAGTTCAGCCAGAGCTCOSCGCTCATTACGCACCGGAGAACGCACACCGGGGAGAAGCCCTATC^ 
AGAACTTCAGCCGCAGCTCCAACCTGGCCACT 
AGCTTCAGCCAGAGCTCCAGCCTGATCGCGCACC^ 

CTTCAGCTGGAGCTCCAACCrrCIATCAAGCACCAGCGGACGCACACCGGCGAGAAGCCCT 

TCAGCCAGCGCTCGCAGCTCGTGGTGCACCAG CGGACGCACACCGGCGAGAAGCCCTACAAGTG CCTCATGTGTGGCAAGAG CTTC 

3 0 AGCCGGGGCTCCATTCTGGTGATGGACCAGCGAGCGCACTTGGG^^ 

CTGGAACTCCGTTCTCATCATCCACCAGCGCATCCACACGGGAGAGAAGCCCTAC^ 
AC^GCTCCAACTTCATCAGACACCAG 

AAAAGAACCACTTTTCCTAAATAAAAAAAA 

35 

MOUSE SEQUENCE - CODING 

ATGGCAGCCGAAGTGCCAGCAGTGAGCACTCCCCTCAGCCCTTTGGTTCAGGTACCTCAAGAAGAAG^ 

CTGGAAAAGGCAGCCCCCAGGAGGAGGACGCAGCCGAGGGACCCCAGGGTGCTCTTGTCC^ 

4 0 CTGAGGCCAGAGGTGCACACTAAGGAGCAGATGCTAACTC 

GAGCAGTGAGGAGGCAGTGGCCCTGGTGGAAGACCTGACCCAGACTTTC 

AGAACTGAAATG AAGACATGTTTGAGGGTGTGG AG TCACATGGGATGTT CTT GAACAT CTCTGG AGGGG AAGGT GGT CAG CAGTCT 
GATGGGGACAGTGACTTTGAGAGAGACTGTGGCTCTGGAGGCGCCCAGGGACATGCCCCGGGTGAGGACCCCAGGGTCGTGCCATC 
GGAAGGAAGGGAAGTTGGCCAGCTAATAGGCCTTCAGGGCACCTACCTGGGTGAGAAGCCGTATGAATGTCCCC^GTGTGGGAAAA 

4 5 CTTTTAGCCGGAAATCCCACCTTATCACCCATGAGCGGACCG^ 

TTTAGTGACGGCTCG^CTTTAGTAGACACCAAACGACTCACACT^ 
TAGCCGGAGTGCGAACCTTATCACGCACCAGAGGATCCAC^ 

GCAGGAGCCCCAACCTCATCGCCCATCAGCGCACGCACACIAGGGGAAAAGCCGTACTCGTGCCCCG 
AACCGGTCCAGCCTTAATACTCACCAGGGCATTCA(^ 

5 0 CAACTCCAACCTGATCCGAC!ACCAGAGAATCCAGAC CAGCCAGA 

GCTCCGCGCrCATTACGCACCGGAGAACGCACACCGGGGAGAAGCCCTATCAGTGCGGCGAGTGCGGCAAGAACTTCAGG.CGC^ 
TCCAACCTGGCCACTCACCGGCGC^CCC^CCTGGTGGAGAAGCCGTACAAGTGCGGGCTGTGCGGCAAGAGCTTCMCCAGAGCTC 
GAGCCTGATCGCGCACCAGGGCACGC^CACCGGCGAGAAGCCCTACGAGT^ 
ACCTCATCAAGCACCAGCGGACGCACIACCGGCGAGAAGCCCTACAGATGCX3G 
5 5 CTCGTGGTGCACGAGCGGACGCACACCGGCGAGAAGCCCTACAAGTGCCTCATGTGTGGCAAGAGCTTC^ 

GGTGATGGACCAGCGAGCGCACTTGGGAGACAAGCCTTACAGGTGCCCGGAGTGCGGGAAGGGCTTCAGCTGGAACTCCGTTCTCA 

TCATCCACGAGCGCATCCACIACGGGAGAGAAGCCCTACAGATGCCC 

ACACACCAG AGGACG CAC CTG AAAGAGAAG CTTT ACTGA 

60 HUMAN SEQUENCE - GENOMIC 

TTTGAAGCCAAAGCCAGGCTACACAGAGATGAGGCTCCTGGGAGGGAA 
ACAGCTGGACAAGGGCTCCCTCTGGGTGACAGCCTCT^ 

GATGAGCTCAGCTGGGAAACTGTGTGGTAATGGGAAGAGAATGGTTTTGCTTT^ 
ATACTGGAACTCAGTGTGGAAACAGTGCCTTTGAACAGC 
65 TAGTTCACTCCTTACCCCGCCCATCACTGATACCAGATCCATCCT 
GGCACCTTCAACACIAACAGGAAATGTCTCCAAC^ 

CCACATCTCCCTGTGGGCCCTGCCAAACTGGATCATTTTTTTTCTATCCATGAGCTGTGAACTT 
ATACTGTGTCCn^CCTCTGAATGTTCTCTTCCTCCCTCTCCCCCAGTCCAAATGCTATAGACT 
TACATTGGAGCTCTGATCTGCTATGTGATTGTATTTGGAAATAGAGCTCT 
70 AGATCAAGATGGC!TGAATAGCAGCCTCCACCAATGATTCTCCTTGCAGGAAAATCAAATTTAACAAC 
TCTTCATAAGAACCAAAAATCAGGTGAGGACTCAAAGTACCTGGCTTTAACTTCATATCGCTGAA 
AAAGACAGTCTTGLAAACACTGATGTCACCCCTCCCCCATCCCTC^ 
AGAGGGAGAATGCAGCAACTGTGGGACTTTGCACTGAACTCAGTGCT 

ATGACACCTG C CCIATG GAGGGAG CATTT AGACCAGCC CT ACCCAGAGG ACTT CACCCAT C C CAGTAGTT AGG AG GCTTGAGG C TTG 
7 5 GGTTTTGACAAGCCTTGGCATGGCAGGCTATCATGCTCT 
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ACTCCTAGGCAATT CCTAATGCCATG CTGGGCTCAGAGCCAGTGG^ 

CAAGGGAGGGTTTGCACCACCCCTCCCCCAACCCCAGGCAGCAC^ 

GAGGAGGGGGAAGAGTAAAGAGGACTTTATCTTACATCTTGG^ 

GAGGCCTCCATTCCAGGACGTAGCTCCTGAATTACAT^ 

AACCCAGTCCTGX3CAGGATTCATCATCTGCT 

ATGGGCCTGGAGTAAGACTCTGAGGCATGCrrGGCTTCGGGTGTAGACC 

T CTTCTG C TTGAGAAAAGCAGAGAG AGATGCACAAGGG ACTCTGTCTTGCAG CTTAGGT AC CTG C CTG GCCACAATGTGGTAGAGC 
ATCAAATGGGCTCTTGTGGTC^CTGATTCTAGGCCTTGGCT 



1 0 GCTGTGGGCCTGTGGGGGTGGCCACAGGGTGAGGCTCCTCTG 

GTTTCAGTGC CAGTTCAGCCACCAT AGACTAAAG CACCAGGTAAATTTCTG AG GTTTTTGACTC CAGT CCCTGGCTC CCAGACAG C 

ATCTCTGGACCCGCCAGGGCCTGAGGGAACTCATTGGCCTGAAGGGAAGGACACAAA 

AGAGCCCTAGGCCTTGAGTGAACATAGGGGATAGCCAGGTAGTGTT^ 
^ TAGGTCTGATCCATTGCAGTCCCAGTGGTGGTGGCCATAGGGATG^ 

AC CCAAG ACCACCAAGGCAGTACCT CTATGAGTCTGCAAGAAC CATAGCATTACTGGG TTTGGCGTGT CCC CTAATG CAGAT ACAG 

TTTAGATCACAAGACCCACATCCrTCAAATACCTGGAGAGCCTTCCCAA^ 

ATAAATACATAAATCITGAATGCTAAGGCACTGACAAAC 

AACTAGAGGGGCGAATC CT AGAGAAGCAGAG ATAAGTGAC CTC CAGACA.GATAATTTAAAATAG CAGTTTTGGGG AAACTCAGAG A 
2 0 AATTAAAGAT AACACAGAAAGGATT CAG AATT CT ATAAG ATAAATTT AACAATG AGATTGAAAT AAAAAGAATCAAG CAGAAATTC 
TGGAGTTGAAAAATGCIAATIX3ACATAATGAAGAA 
GCTTGAAGACAAGCTATTTGAAAATATAIIAGTTAGAGAAGACAAAA 
AACAGCCTCAGAAGGGCAAAGCCAAGAGTTATTGGCCTTAAAGAGG 
GAGAACTTTCCAAACCTAGAGAAAGCTATAAATATTCAAGTACAA 

2 5 TACCIGAAGACATTTAATACTCAAACTCCXIAAA^ 

ATACAATGGGACTCCAATACGCCTGGCAGCAGACTTTCTGTGA 

G CTGAAGG AAAAAACTT TTACCCTAG AAT AACATATC CAGTG AAAATGTCCT T TG AACATGAAGGAGAAATAGACTTTCC CAG ACA 
AACAAAAGCTG AGGGATTTC AT CAACACCAG ACCTAT C CT ACAGGAAATG CT AAAGGG ACATCTTCAATCAGAAACGAAAG G ACAT 
TAATGAGCAATAAGACATCATGTGAAGTTACAAAACTCACrGGTAATAGTAAGTACAC^ 

3 0 AATTGTGCTU^CAACTCTTAAGTAGAACGACTAAAAGATGAACCAAACAAAAATAGTAATAAG 

ACAGTGCAATAAGATGTAAGTAGAAATAG CAAAAAGTTAAAAAG CAGG ACAAATTAAAATAGAGTTTT TATT AGTTTT C'TTT TT AT 

TTGTTTGA.TTGTTTATTTGTTTGTTCCTTTGTGCAAGCAGTGCAAAGTTGTCATCAG 

CCAAGCCTCATGGTAATCTCAAATTTAAAAATATGCAATAAATACACAAAACACAAAAA 



3 5 CAATACTAACATTGAATGTTAATGGACTAAACTCTCCAATCAAAACTGTTATGGAAAGAACTGTT 

C CAC CCACAT CATrriTCATCAACTCTAG ATG AGCT AAGGGCC C CCAAATG C CTTAGGCTGAGCATTC CTTG GGGTGAGGTC CCTT 



TCTGCCTTTGTGTGGGGTATTAGTAAGAAAATCTAGGAGCCACATCGAATCCT 
TGACTTAGACTCAACCAACCAAATGCTCTTTCCTTGCATG^ 
4 0 CCACCAGAGGTCATGGGATAGGCAAGCTGAACTTTTTCTGCTCAGAGACTGTTATTGTGGTT 
TGGCTCCTAACTGTTCTCAAGCCrcATTCGTCAGCT 
TTAAGTTAGCCAGATTAGTTCTTAAGCTTTTAACC!AAGAATGC 
ATGK3AGAAAGGACTCTCTTGGG<3GAGCAACCTTGAACTTGGCT^ 
ACTGGCTTGGCTACCAGGGCCCTTCTCAGTAGGATCATTATGCCATGAATTGG 

4 5 GATAGTTGACACTGGCAAGGACCAGCTTCCCAGAATCAGACAC^^ 

CCAGTTCTGTGGTCAAGGGTTGACATGGGTTAGGAACAGAGGTAAATAAACGCT^ 

CAGCCTCCCCTGCATCTTTTCCTTTTTGTGTAAATTTCCATTCCTG 

AATGTCA.TCCCAAAAAAAATCACTTGGAGGGGTGGTGGCCCAGAATTCTGCATC 

TGGAGGTGGTGCTATTTGCATGTGGTCAGTCTGGGATAATAAAGATGCTACCAGGGTATTGGATAAGAATCACTAG 

5 0 CTTTGCATCTCTGTAATTGGATTCAAGAGGTTAATTAGTGAAAAACAGGT^ 

TTCTAACACAAAACTCACAGGCTATAAAAAGATAAGATTAAOy\ATTTG 
CAGCAGAAAACAAAC^AGOVAACTGAGTVAAGAGTAGCIAACTC 

AATAAATCCAATTTAAAAATGGGAAAAG<3 ACCGGGCACGTGGCTCATG CCTGTAATCTCAGCACTTTGGGAGGCCAAGACGG CTGG 
ATCTlCCTGAGGTGAGGAATTCAAGACCAGCCTGATCAAC^ 

5 5 GTGGTGGCGCACGCCGGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGATCCGAGATC^ 

CCTCGGCGACAGAGGAGGAAAAAAAAAAGCAGTCATTTC^ 

TCTAT<3ACAGACCATTAAGGGTGTCTCTCTCGATTTT^ 

GGGTTGGGGCAGCCGTTTCTTG^CAGCX^CCAGCCGTAAAAGk^ 

CCCCATACTGAGATTGACAAGTAAGCAACCCGGTGATCAGTACTAAAATGGAGACGTC^ 

6 0 TTCCAGCACrrrGGGAGGCTGAGGAGGAAGGAT CGCTTGAGCC CAGGGGTTCGAGGT CAGCCTGGGCAACATATGG AAACCCTGTC 



AAAAATAAAATGGGGACGTCCAACACGTGTACTAGGGGCGGTGTCTCTGC^ 
GGAGAAGCTGGAAGAATGGGAGTTCCGCGGCAGAGGAGGCGGGGCAGGGCGTTCCT^ 

AAGGGGTGTGCGAAGACCGGGTGCATGTGTACAGOTGCAAACTGGGGGAAAGGCGCC^GGCTTGGGCGATGAGGATT 

6 5 ATGGATTCCCTTCCTGCCCAGGGTCCGCTCTCGCCGGTCGAGGGTGCCTAACAATACTCCT^ 

GTCCTTCAAGTCCGGCCAT CCAACTTT CAC TTTTT ATAGACAAATAT ACAGAGG CCTTAGATAAGTGAAACGGCTTGTCTCGGGTT 

AC^GCTCTAGCGAACACAGGGTTTCTGACGGTCAGTCT^ 

CCAAACCCTACTCCGGGGACCGCGGTOVGGTTOSTCTCCGGGCGG^ 

CCGCCCX3GCTGGGTGCCGTCTCCACCAACAGAAAGGCAGAATTTCCAGGGCCGTTCTCGGC^ 

7 0 GCCTCTCCCGTCCATTGTTCTCGGTGCCCCACGGGCTTGAGCCGGGGTGAATCTGGAGGGG 

GCACGCGAAGCAACa3CTAGAGCAGGACCTGGTCTCCa3AGAGGTGAGCCGGAGA^ 

GTAGCCCGAGGTGAGGGAATAGTGGGCCTGGAGCCAGCTGCCGGCAGCTCTGCTGGGGGAGGGGCGTCGGGGTCGCGCT 

TCTCCTACCGAGAAGCGCAGGCTTTGGACGGGAAGGGGC^TTCGAGGACATCCCGCGCAG 
7 5 TGAGGTTCAGAGGGGGCGTGTCGTGCCCCAAGTCAGACC^ 
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GAGTCCTCTTACACCATA^ 

TTTTTTGTC CTTTTTTTTTTAAGCTTTTGTCAG CTTTTCT CCCCTT'TTCATTGAGGGAGGGTTG TTAGTTGAGTCTTT ATGATGTA 
GTAATGATTTTTTAAAAAATTTTTTTATTTTTGTAG 
AGCCATC1TCCCTCCTCX3GCCTCCCAAAGTGCTGGAATTACAGGCGTGA 
5 TTTAT AATATTTTAAGTCGTTTG CAATTCCACTGCT 
TTTTTTTTTTTTTTTTTTGAGAaSGAGTTT^ 
TGCCTCCaSGGTTCAAGGGATTCTCCTGCCTCAGCCTCCGGAGTA^ 
GTTTTTAGTAGAGACGAGGTTTCTCCGTGTTGGTCAGQCrGGTCT 
AAAGTGCTAGGATTACAGGTCrGAGCCACCGCGCCCGGCTGTGAGTTCATTTT^ 
1 0 TTTTCTG CCCTGCTGATTTTGTGTTTATCCATT ATTTTCCTAT CAGAAGG ATT ATCTTAAATCATT AATGATTAATCTTTGTCAAT 

ATAATTTTTAAAATATTAAAAACATACAG CX5TAATGTGCAGATCTTAAATTTCAT ATX3TTT13 CT CTTGTGC CGCCATTGCCAAATG 
AAGATATTGAAAATTTCCATCATCCCAGAAGGCTCCrrTTGA 
ACAGTTTCTTTTCTTTTTTTTCCTTTTCTTT^ 

TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCTCCTCCCAGGTTCGGGCAAT^ 
1 5 TACAGGCX3CCCGTCACCATGCCTGGCTAATTTTTGTATTTTTC 

TCCTGACCTC^GTGATCCGCCTGCaSAGGCCTCCOVAAGTGCTGGGAT 
ATACATGCTCCTATATGGGTGTCTCATAGTTTA 

TTAAGATAAAGTACTGAGGGGCCGAGGCTGGATTGGAGAGTATATGCATC 

GTTGAGATATTG CCTTT CTTCCAAAG AAG CT CAGAGTG CTTTGACATCTGAT CCATTCCT GATCCTT AAGATGC CCCCAT AAAAAC 
2 0 AGAAACATIIAGTGTTATTATCTTTATTAGGTGGTCACCGAGTTTG^ 

TCGAAACTGACTTAGGAGAGTCCTCAGTATTTCAGTCATGGGTTTTTCCCGGG 
TTTGGAAGGAACAATCCTGTATTTTTIXjTC 

CTTGGAGTGCAGTGGCGAGATCTCTGCTCACTGCAACCTCCGCCTCCTGGGTTCAAG 
AGGAACTACAGGCCCCCGCC^CC^CGCCCGGCTAATTTTTGTATTTTTAATAGAAATGGG^ 

2 5 CCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGACACTGCACCCAGCCT^ 

TGAGTTAGATTGCTTTAGGACAGTCTACCTATGGATTATCGTTCTCTTTT^ 

AGGGAAGTGTCTTACCTGAGAGCCTOGCTGGAGAAGACTOAGGTCCAAGG 

ATGGCTGCAGACATCCCGAGAGTGACCACTCCGCTGAGCrrCCTTGGTCC^GGTGCCT 

3 0 CACCACCATGATCCTGGAGGATGACTCCTGGGTGCAAGAAG 

GTGCTGGCAAGGGCGGCCCCCAGGAGGAGGTGACCAGGGGACCACAGGGTGCACTCGGCCGCCTCCGAGA^ 
CTGAGACCAGAGGTACACACCAAGGAGCAGATGTTAACCATGCTGCCAAAGGAA^ 
AAGCAGTGAGGAGGCAGCGGCCCTGGTGGAAGACTTt^CCCAGACCCTTCAGGACAGTGG 
GGCGGGAGCACCCTTCCAAGGTAGAGGAGTGTGGTGTTTCGGAGGAGGAGAA^ 
35 ATCCCTCTGCTCTGTCTGCAGGCAGTCAGCGTGTTCATCAGCCTTTTAGTGTCCTCACTGTGTGCAAAGTCAGCT 
AGGAGGGCCTTAGAGCTACATTTGAATTGTAAGCACAGTTTTTACTTCTGT^ 



TGATATTTTCAACATGACTTTTTTTTCTTTT^ 
GCAGTGGCACAATCTTGGCTCACTX3CAACCTC 
4 0 ACAGGCGTGCGCCACCACGCCCGGCTAATTTTITTATTTTTCG 
CCTAACCTCGTGATCCGCCCACCTCAGCCCCCGAAAGTGCT 
TTTTATGAAACCAAGAGAAATGACAAGTAAACCAGCTAATAACTAGGGAC 
CGTAATTTCAAACTTGAGCTCTTTGCAATAGTTACTCTATTTTTTTCCGGTGT^ 

TAACCGCTGTCTCCCTCCTGGTTTTCCATTGTAAAGTAGACTCACTTGATGACCAAATGTTTACTTCAGATGTC 

4 5 T C TCAG GG C TCG AAGGATATGGCTG C CTTTGTGCT GATGAAAACT ACGGAAG TT C AG AG AAGGGAGCGTGGAGAGGAAAGGTCAGG 

. T CAGAGAAGGCTTCCTGG AGGAGG TG ACACCTAAG CAGGAGG TTGTAGAAG GGAGTAGGGTGTGG CAG CTG AGAGGAG CTGTGCTG 
AGCAGAGGCTTGGAAGGGCAGAGCCACGGTGGTTTATACCTTTGGGTGACA^ 

TAGGAAGTGGTAGGAAGTAGAGTTTGGGGGAAGGGTGGGTGAGGGCTTGGACTGAATGTTTTGCCTCAGTGGGTTCT 
CCCCCAAGGGGCATGGAAGGAAGCGGAGTTTGGCAACAGGATGTAGGTTTGATTAGGGTAGTGAGAGGCT 

5 0 CCAGGAAGCTCCCTTGTCTTCAGGTGTGCAGTGATGGGGGTCTGCCCCAGATGGGGCACGTC 

GGCTAGCACTTTCCAAGCTCTTTCTGCAAGC 

GAAGATAAGCAGGATTATCATCACCAGAGATCCCTTAAGATCATGTGCCTTTGTACC^ 
GTGAGGACATAGCCCATGTTTTTAAGCGTTGTC&TATGTATGATT 

AG ATGGAATT AAGACAGTG AC TACAGAGCAT CTGG CCTGGAG GATTCAG AGGACTGGG AAGAG AAGAGAAGGAAACTGAAGGGGAT 
5 5 TTGGGGCAGAGACTATCACAAACTCCAGGGACTGGGTGAATGAGTAGAGCT 

CAGTGACCCAGCCCTGCCCTGGGCACCTTCATGCTTGTCTCACTAAGAATCCTG 

AAATATAAGTGCAGGAAACTCACATGGCTGTTGAGTGGTGGAGCCAGGATTCAAACCCAGACCTGTC 

CAGTTTTGCTTTGAAGCAAAGTGCTTGATTTTCTTTGGCCCTGAA 



6 0 AGATACTATGGAGGAGAGAGTGATGGAGGGCTGAGGAGAGGCCTCAGGTTC 

GTCTCAGCAGGTGTCAGAGGTGATTACAGCCCTTCCCATCCACCTTGCAGCAGGGGCTTTGTGC 
TGAGATGGGAACCAATGGGGACCATCCCAGACACATGGGTTTCTCTGTC 
TGTTTAATAAACGCCCCGACCTTCCTCATGATTGCTCACAG 
ATAAGGCCGCGTGACCTCTATTCTGTTTXTCCCAGCT^^ 

6 5 AGTCAGTGGCGGTTTAATGTTTTTCTGACAGGAACCTCrGGGCACTCAAAATTCAA 

TATATTAATAGATTAAGTCGCAGAGCTTATGTCTCTCTGTGACTGTTTCCAGAGGACATTT 

AGATGCGTGGAGTTTCTCACCTGAATTCTTGGAGCAGCTGCGAA^ 

CCACTTCTACCAGAGTCATGGGGCACTCCTGCTTAAA^ 

TTCTGCTTGGAATCCGTGGCTCTGTGGGACCTGGCCCAAACCTCCCCTGCAGTCTTTTCCC^ 

7 0 CCACTCCATTCAGACCAGATCTTGCTCTTTCTTGAACACAGCCT^ 

ACXSTTCCCCAGGCTGGAGTGCAGTGGTGCCATCTGGGCTCACTGCAACC^ 

CTCCCAAGTAGCTGGGATTACAGGTGCGTGCTACCGTGCCCAGCTAATTTTTGTATTTTTAGTAGAGACA^ 
GTCAGACTGGTCTCGAACTCCTGACCTCAGGTGATCCACCCACCTTGGCTTCCCAA^ 
GCCTGGCCCTTGCTCCTGGATTTTTGGCTTGGAATAGC 
75 CCGTGGAGACCCTCTTCATAAAG<!TGCTCn , TAAACCCTGTCCCTTTTCCCCATCCCCAGTTC 
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CACCTCTGCTAGAAGTCGTCTCCCCCAGAAC^ 

TGAATACnTGTCTTATTTCTTCTAGGAACCTGTAAAATCCTAGAAGACAAGGAACCT 
CITCCCrCTCTTCCACCCTACTCCCTCTTCCTTTCCCTCT 
TCAGACACTTTGAGGGTGGCTGGGCGCAATGGC^ 
5 GCC^GGAGTTCGAGACCAGCCrGGCCAACATAGTGAAACCCTGTOT 

TGTGC CTGTAATCC CAGCTACTTGG GAGGCIX1AGGCAGGAG AATCG CTTGAACCTGGGAGGCGGAGGT^ CTGAGATTG 
CGCCATTGCACTCCAGCCTGGGTGACAGCAAGACTCTGTCTCAA 
TCTAAGTGATCCTGATTCAAACTCCAATTTGAGAACCTCTGCACTAGACCTCTC 
TAGAATTACCTGGGGGCCTGGAGCCTACTCCAGGGAT^ 
1 0 TTTGAAAGTGCCCAGGTAGTTCITATTATGCTGCAAAGTGTAAAGAGCACTGCC^ 

GCCAG CAGCATCAGCTCACCTGGG AGTTTGTTAGACTAGG ACTGT CAGT CT CACCAAGC C CCGCTGAAGCAAATTTG CATTT GACA 

AGATCTCCAGGTGATTCCTATGCTCTTTGAAGTTTGA 

TTGTCCAACAGTTGAGTGAAAGAGTCACAGCTGACCrrCT 

GTGATTTGTTTTGAGCTTG CTG C CCCAGCAAAATGCT CCCTTAATGAC CACAAATGAACACATACATTCATTTGTCTCAGCCAGAA 
1 5 AGGGAGTGGAGAGTGTTACAGCTTATGTCTTT^ 
ACATATATATCTATGATATATATCATATATC 
TCTCACTCTGTCAACCCAGGCTGGAGTGCACTGGTC 

GTTTCAGCCACCCAAAGCACTGCGATAACAGGAATGAGCCACTGTGCCCAGCCCAA^ 

AAAGT AT AC CGGACAGAATTCTTAGGGAG CGGTAACG TTATACCT ATGAAATGTTTAGCATT CATGT ATTTCACCTG AT AACAGTC 
2 0 CATTGCTGTTTTTATCAGCAGTTAGTCTTTCAAGCAGCCAG^ 
AGCGCTCTAATGGGGAGAGCCCTAGTGAGAATGGAGCK 
AGACACAGGGGTAGGACACAATCAGGCAGGAAGGGGCTGGAGAAG 
CCAGTTTATTTCATTGTTACAGCAAAGTCAGGTCTATCCCCT 
TGCGGGAGAGAAAGAGGAGGCCTGTGGCCAGGCGAGGACACGCAGGG 

2 5 GGCCAGAGCTCACGTGGCAAGTGCGAGAAATACAAGGACCGTTGGCTGAGATTGAGCTACGGTGG 

TAAGGGGAGAGGTCCTGTGGCCCTTCAGGGACTGTTCCAGTGAATCAT^ 
TTGGATACCTACTTTGCAGCTTACAAAGAAATA^ 

GTAATAGACAGGATCTGAAATGACATGTGAGAGAATATTTTTCTAATOT 

AACCTCTAGG CACTCACAATTCAAGT ATGAAGTGTG AATAAG CAAT ACAGAAGT CATCAAAG CT AGATTGGACTCAAAACAGAAGT 

3 0 CTTTCrTTCTTTCTTTCTTTOTTTTTTTTTTTT^ 

TCTC^GCACrGCAACCTCCACCTCCAGGGTTCAAGTGATTCTCCT 
C(ZACTCCCGGCTAATTTTTGTATTTTTAGAAGAGATGGGGTTTCA 
ATTCGCCCGCCrCTGCCTCCCAAGGTGCAGGGAT^ 

ACGATACTGTGAAT CAAAAGACAG CATACACATT AGGAGGAAAT AT TTG CAT CCAAGT CT AATGT AGCAGATCATGAGT C AAAATC 
CTT GATCT ACAAGAG CCT CTG CAAATTCATAAG AAAAGT CAACCAAAT AG AATAAAAG GTAAAAG AT AT AAACAATTT AGAAATG A 
AGAACTAAAAATTATGAAAATATtnTCTGGCCAGGTC 
GATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACAT^ 
ATGTGGTGGTGTGCACCTGTAATCCCAGCTACTGGGGAGGCTAA^ 
CTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCGTCTCCAAAAAA 
CACCTATAGTCCCAACTACTTGGGGGCTGAGGTGGGAGGATCACTTGAGCCTGGGAGGT 
CACTGC&CrCCAGCCTGGGAGACAGTGATATACTGTCT 
CACTTTGGGAGGCTKllAGGTGGGTGGATCACCTGAGGTCAGGAGTTTGAG^ 
AAAAATAAAAAAATTAGCTGGGAGTGATGGCAGGCGCCTGTAATC^ 
TGGGAGGCAGAGGTTGCCAAGATCGCGCCACTGCACT^ 

4 5 AGAAAAGCAAGAACAACAAAAAAGTGGATGTCTGGGCCATCCAGCCTCTTACT 

CATCTACTCATATCCACAGAATGTGCTCCTCTTGGCATCCTGGAGTAG^ 
TTCCCAGCTGCCACC^TCTCCTCTCATCTCC^TTCACMCGTGGXAAGAATATTATTT 
TTTTCATTTTCTTAATTGGCCACCAATAAATAAAGGAGGGACTCACCTGCCCTTGAACGTGCTCT 
AGGGGCGAGCCTTTCTCAGGGAGCCTGGAGGGGGCCAGGGATCCCT^ 
50 CCTTGTCAGTCAGATCACCTGAAGGGCCACTCTTTTCTGTCCT 

TGAAGAACTGCCAACCTTCCTTTGGTCTTCGCTGGATAGCAGCGGTGTTTGTAAAGA 
GGGGTTATTGGGCCTGGATCCTACTC^GAGGGACCTCCTT^^ 

TTCT AAAGG CCAGGAG CGGT AG CTCATGC CTGT AAT C CGAGCACTTTG GG AG GC CGAGGCAGGTGGATCACGAGGTCAGG AG ATCG 
AGACTATCCTGGCTAACACGGTGAAACCCTGTCTOTACrAAAAATACAAAAAAATTAGCCGGAC 
CC^GCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGTA^ 

CGGCCTGGAAGACAGAGTGAGACTGCGTCTCAAAAGAAAAAAAAAAGGGGGAAATGACATTCTi^ 
ATAAGGTGGTTTTATAGGATATGTTCTTTGATTGTCTTCTATGTAATACCTGGGAGTTCAA^ 

CC CCATAAATTAAGTATCG CACTTAAAGG CTG C CAG ATTTTAAAGAAAT TC TT TCAT ATGTTCAT CT AATAGTTTGATG CTGCTAT 
TTGGGTTTTTTGTTTTAACATTTAAATCTTAGATCCA.TTTGGGATTTGTC 
60 TGAATTTTTCCAGTTTTCCGTAACTAGTTCTTC 

CGTATTGTATGCTAAATTTTCATAGTCATTATAC^ 

AATT CTG AACT TACATAT CTATTGAATCTATTT CTGGAGTTTTTATTTTGTTTATTTGGTCTGATTGTCTATTAATGTT TTAATTA 

TTGAGATTTCATAATGTTTTTGTTTTTGTTTTTGAGACGGAGTCTC 

TACACTGGAACCTCTGCCTCCTGGGTTCAAGCAATTCT 
65 ACCCAGCTAATTTTTGTATGGGGTTTCACCATGTTGGCCAGGGTAGTCTAGAACTCCTGACCTCAG 

TCCGAAAGTTTTGGGATTACAGGCGTGAGCCACTGCACC^ 

TCCCCTCCAATTTTAGTCAGTAATCTTTCTTTTTT^ 

ACGATCTCTGCTCACTGCAACCTCCACCrCCT 

CAGGCATGTACCACCATGCCCAGCTCATTITO 
7 0 CTGAGCTTGCG ATCCG CCCACCTCTGCCTCCCAAAGTGCTGAGATTACAGGTGTG AG CCACTG CTCTTGGCCTCrrTTTTTri'TT'V 

TTTTTTTTTTTTTCAGAGACAGAGTCTCACTCTGTCA^ 

TCCTGGCCTCAAGCAATCCTCCCACOTCAGGTTCCTGAGTAGCTGGGACTACAGGCAAGTGCC^ 
TTTTTTGTAGAGATGAGGTCTTGGCTGTGTTGCTCAG 

AAAGTGCTGGGATTGCAAG CAAT AAT CTTATCTTTTTT AGTGTTTATTTT TG CAAACCTCTACTT AGCTG CATG CTTTACCAGTTT 
7 5 TAAATGTGAATTCTTTGAACTCCCAGGTTATTACAGATGAG 
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CCAGTTTTGGTTAGAATGTCATTTCTACACTGCCGCATAA 

CTTCATTCrrAAAGTTAJUkTTTATTAGGTGTT CACCC CTTCTAG CAGT CCATTAATTTGTGTATGTTCAAAG CTGTTTG CCTATAGGT 
TTATACTTGAAGGACAGTTTTGTCXLAATATAG^^ CG CCCAGGCTGGAGTGCAGTGG CG 

CGATCTCAGCTCACCACGACCTCCACCTCCTGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCGGAG^ 
TGCCACCAC^CCCAGCCTGAATATAGAGAAATCTGAAACCAGTTGATTTTCTT^ 
GTCCACTGGTCTTACTGTTAGCCACCCTGGGTTAGTTTTTGTTGGCC 

TTT ATCTTTAAGTATTAGTT CTTTTTTTTTTTGAGACGGAGTCT CACTGTGTCG CCCAGG CTGGAGTG CAG TGGTGTGATCTCAGC 



CCCAGCCAATTTTTGTATTTTTTTAGTAGAGACGGGGTTTCAC 
1 0 CCTGCCTCAGCCCCCCAAAGTCCTGGGATTACAGG 

TTGAGAACTCCAGTT ATGTTTACTGATTCTC CTTTTGC CTAACTT CTGTTT 

ACCCGGGCTGGAGTCCAOTGGTGCAAATATGGCTCACTGCAG 

AAGTAGCTGGGTCTACAAGTGTGTACAGCCACACCTGGCTAATTTTTGTAT^ 

GTTGTCCCAAACTGCTGGGCTCIAAGCAATCCACCTGTCTTGGCCTCCCA^ 
1 5 CCCTCTGTCATTCTTTTTAATCCTTTTAACCTATTCCnTTGTTTCCAT 

CATTCTTGGAATGGTGCTTTCCAAGATGCCTATTCCCATTGCGCTCCTTTTCCTGT^ 

CrTTCCTGAGTTCTTCTAGTGTACATTTAATCTCTTC 

ATAAAGGCAGTTGCCTCATTTAATTATTTTTATGGATGGAAATG^ 

GCCTATAATCCCAACACTTTGGGAGGCCGAGGCAGTTGGATCATTTGAGGTCAGGAGTTCGA 
2 0 ACCCCATCTCTATT AAAAATACAAAAACAATGAG CCGGGGGTG GTGGTGTG CACCTGTAATT CCAG CTTG TCAGG AGGC TGAGG CA 
GG AG AATCG CTTG AAC CTGGG AGG CAG AGGTTGCAGTGAGC CAAGATAGTG C CACTGCACTC CAGC CTGGGTG ACAG AGTGAG ACT 
TCATCTCAAAAAAATAAATAAATAAATACCTTGGATTAATGTGCGTGGGTCAAA 
TTCCTGCGTACATGGCTGCCCTGTGTGATTCCAGTTGCAGCCTCACCT 
CCCCAGTTCCCAAACACAGGAAGTGATTTTCTGCCT 

2 5 CATTGCATTTCCTTCCTCATTGCTTTTACCCAGTCT 

TCTGATCTCACTGAAAATAAAATTXSGATTTTACTTGTTTTCCTTGTTGCT^ 

AAAACAGCAAAAACAGCCTTATTAGCTAATGTTACCTATTCIAATTA 

GCTAAAAATTAATCAGGGTC!TTGGCCT 

CTGTCTTCCGGCCAGACATGGCAGCTCACGCCrATAATCCCAGCACT 

3 0 TTCGAGACCTGCCTGACCAACATGGAGAAACCCTGTCTCTACT 

CCCAGTTAATTGGGAGGCTGAAGCAGGAGAATTGCnTGAACCCGGGGJUVCAG 

CCAGCCTGGGCAACAAGAGCTGAAACTCTGTCTCAAAAAAGAAAAAAAAA 

CTCTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCGG^TCACTTGAGGTCAGG 

CCTATCTCTACTAAAAAT ACAAAAATT AG CCAGG CGTG TTGGCGTG CG CCTGTAGT ACCAGCTACTC AGGAG G CTGAGG CAG GAGA 

3 5 ATTGCTTGAACCTGGGAGGCGGAGCTTGCCCTGAGCCAGGATCGCGCC^ 

AAAAAAAAGACTGTCTTGCAGGAAGTTGAGGAAACACAGAATTCCTGGTTCTGCC 
TGTATCTGTCCCCTAGTGATATGAGXSGGTGGGGCTGGiAGGGCAATTCT 

GCATGGAGTTTAATTATATGCCTGTTTTCAACTTTAAAGGTGATCTTATGGCCAGGCATGGTGGCTTATG^ 
TTTGGGAGGCCAAGGCAGGAGGATCACTCAAGCCCAGGACTTTGAGACTAGCCTGGGC^ 
40 ATACAAAAAAATTAGCTGGGTGTGGTGGCATAC^CCTGTGTAATCCCAGCTACCTGCAAGGCTAAGGT^ 
CAGGAAGTTGAGGCTACAGTGAGCTGCGATCTCGCTACT 

AAAAT AAAAAT AAATGT GAT CTTAAATTGACTGAAGG CAAG AG TGGAG CCCAGGTCC TG C CTGAGAGGGG CATGTC CAATTTTATC 

ATGTTCATATGTAAGAGTTTATATTAACTTACAGAAAATT 

AAATGTCACHTIGGAGTTCGAGCGCGATAGCTCACACCTGTAATCCC^ 

4 5 AGGAGTTTGAGACCAGCCTGGCGAACATAGTGAAACCCCAT 

TGTAGTCCCAGCTACTCTGGAGGCTGAGGCXGGAGAATCTCTTGAACTCG^ 
TGCAGTCCAGGCTGGGTGACCAAGTGAGACTCTGTCTTAAAA 

AAATTTCCTCCTCAGGCrCATCAAGAGGACATTGTGTGGTATAAACAATGTCATCIAACA 
CAGCCATTAATTTTTTTTTTTTTTTTCGAGACAGAGTCTTGGCT 
50 CAAAGCGTCC!ACCTCCTGGGTTCTAGTGATTCTCCTGCCTC!AGCCT 
GCTAATTTTTTTGTATTTTTATTAGAGACGGGGTTTC^^ 
CACCTTGGCCTCCCAAAGTGCTAGGATTACAGGCGTGAGCCACCATGCCCG^ 
GTGTCAGCCAGGGCTATGAAGCCAACACACTATTCAGTAA^ 
TCTGCTGCTTTGGCCTCATCCTCGGGTAGATTTTACAGTCCCTCATCT 

5 5 GAT TT CTTT TACTCAAATATTTGG G CACTGG AAG GTTT AACAAGT T CCTGG AAG ACATATTACCAATT AAAG ATCTATTCATGG CT 

GGGCGCGGTGGCTCACGCCIGTAATCCCAGCACTTTGGGAGGCCGAGGC^ 



AGGCTGAGGC^AGAGAATGGCGTGAACTGGGGAGGCGGAGCTTGCAGTGAGCCGAGATCG^ 
ACAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAATCTATTCATAAGCT 

6 0 AGTTAACTTGCTTCTAGTCATTGGTATACACACCTA 

TAATGCGGTC^TAAACACCAGATACATGTTAATACAAGAAATTATTATATAAGCTCCTTTTGAAAG 
TGTCTATGGTACCAATAAAAAAACTCATTCATGCACTCJVrT^ 
GGGTAGGTACCCTGTTTATGTCCTTTAAAATTCCCACCTGACCTCATT^ 
^ AAT CTTTAAGGCTTTGAAATTCTGGGTGATTGC CACAAAGATTATGAGG CCTG ACAAC CAGTCAG AATG AAATAAGC CTGG GTTGG 

CCATGGTGGTTTACTGCACAGGTCATCCCACCACCTAG^ 
CCCTCCCCCAGAGGCCCGAGGGTGTGTTATTTCCCTCCTyrc 
ATGCAGTGTTTGGTTTTCTGTJCCTATGTTAGTTTGCTGAGGATAATGG 
CTCATTCJTTTTTTTTGGCT^ 

7 0 GTTG ATTCTATGTCTTTG CTATTGTGAAT AGTGCAG CATTGAACATGTGTGTG CATGTATCTTT AT AAT AGAATGATTT AT ATTCC 

TTTGGGTATATTAAACCTGATTATTAAAGGCTTTGAGGATTTTCTAGCCTCT 
CCCCCTTCGTTTTCTTTTGTCATCATGACAGGGAGTTGAGTAGACA^ 

AGTGAAAATGGGG AGAACTGTAATCAAGACATGTTTGAG AATG AAT CACGTAAGATATT CTCG G AAATG CCTGAAGGTGAAAGTGC 
TCAGCACTCCGATGGGGAAAGTGACTTTGAGAGAGATGCTGGCATCCA^ 
7 5 AGGTGGTTTCTC!AGGAC!AGGGAAGTTGGCCAGCrCATAGGCCTGCAGGG 
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TXn'GGGAAGACCTTCAGCCGGAAATCCCACCr 

TGGAAAAAG CTTT AGTGATGGTTCAAAI'T'IT AGTAGACACCAAACCACTCACAC CGGGG AGAAGCC CTACAAATG CAGAG ACTGTG 
GGAAGAGCTTTAGCCXSGAGTGCCAACCTCIATAACCCA 
AAGAGCTTCAGCAGGAGTCCCAACCTCATTGC^ 
5 GAGCTTTGGCAACCGATC CAG CCTT AACACG CATCAGGGGAT CCACACTGX3AG AAAAGCC CTACGAATGT AAAG 

GCTTTTAGTTACAACTCCAATCHAATCAG^ 

TTCAGCCAGAGTTCAGCCCrCATCACCCACCGGAGAACCCACACA 

CAGC CG CAGCT CTAAC CTGG CCACACACCGGAGAAC CCACATGGTGGAG AA^ 

GCC^GAGCTCCACTCTGATTGCACACOIG^ 
1 0 TGGAGCTCCAACCTCCTCAAGCACCAGAGGATCCA^ 

GCGCTCCCAGCTCGTAGTGCACCAG03GACCCACACGGGCGAGAAGCCCTACAAATGCCTCATO 

GCTCCATTCTGGTCATGCACCAGAGAGCCCATTTGGGAGACAAGCCCTACAGGTGC 

TCAGTCCTCATTATACATCAGOSAATCCACACTGGGGAGAAGCCCTACAAATGCCCaS 

TAACTTTATCACACATCAGAGAACTCACATGAAAGAGAAACriTTATT 
1 5 GGGAGTTGCCACACTGCCCCAACAGTGATTCCCTTTCA^ 

CCTTGCT CAT C CTCATTT CCAGGACACTGTCATTTT AG TGGT CTGAGT CAAGTC C CG T AT ACATT CAAG AACAGGGCAT AGGCG TG 

GAAGGTCTGGAAAGTTGGGTCTTTTTCCCTTACATTG^ 

CTTTGGTAAAATGGGGGGAAATGTTTCTCCATGTGGAATGGAAGAC^ 

TGGAAATCATTGGTGTGGTTCrrGKSTTGTTrTC 
2 0 TTCCAGGGCACCCCAAGCTGTCAGTTAGAATCTGCTCTTC 

TTCTCCAGTTCTGAGTCACCCACGTGAAGGTAAAGACCCTTTCTATTTCCAGAAAGTGTCAGGA^ 

AGCCIGGAGCCAGTGTCCCAGTGTCCTTTCCATTGGTAAGAGTTGG^ 

TGCTTGCTTTTGTCTCTGCCTACTGTCCTC 

GTGTAGTGAAGTCOTAGAGCCCTAGCTCCCAACCCATt3GTGGATGG?TAACTTCTGTCT 

2 5 GCAG GGTGGGT TTGTG CCTTTGG C C CAACAGGT ACATAGC CCCATAATTT CTGAATTATT CT ATGCACTTGTTTC CCTCTT CTTTT 

ATTTTTTATTTGATATATGCCGAGCTAG 
CGGAGTCTCACTCTGTCACCCAGGCTG^AGTGCAGTGG 

CGGK3GTTTCACCGCGTTAGCCAGGATGGTTTCGATCTCCTGTCCTCG 

3 0 AGGCGTG AGCCAG CGCACCCGGCCAAGAACMT ATTTTT AAAGAAGTGTT AACTT TGAGGAC AT AT CTGTTCCCTGG AGATATTTG 

GGCTTGAATCAGGAGTTTGTCCTACAGGTGTCGCCCTTGATCT 

AG AGTGAAG ACGG G CATGACGGCAGGTGAAGGGGTTTGCTGTGAAGGAAGAG G AG AT AAGG CATTTC CAGG AAATGGGAAACTG CC 

TCCTCCTACACATGGGGCCTGTGCTCAGAATGGGCTTAGTTCTTATAGGATGGATGCrCAGTATTCCT 

TTCTTTTCCTGAGTCTGTCTTITACTGTGTTAAAAACCTGAACTAGGCTGGGCGTGGTGGCTCACACCTGTA 

35 

HUMAN SEQUENCE - mRNA 

CGGGACTACITGTTGATATTTGAGGAGGGAAGTGTCTTACCTGAGAGCCT 
AGTGATTGCCCCAGGACTGTGGATGATGGCTGCAGAC^TCCCGAGAGTGACCACTCCGC 

AGGAAGAT AG ACAGGAGGAGG AGGTCAC CAC CATGAT C CTGG AGG ATGAC TC CT GGGTGCAAG AAGCTG TGCTGCAGGAGG ATGGC 
40 CCTGAGTCTGAGCCCTTTCCCCAGAGTGCrGGCAAGGGCGGCCCCCAGG 
CCTCCGAGAGCTCTGCCGGCGCTGGCTGAGACCAGAGGTACACACCAAG^ 
CTTGGCTGCAAGAGCATCG^CCTGAAAGC^TGAGGAGGCAGC 

TTTGAGATACAGAGTGAAAATGGGGAGAACTGTAATCAAGACATGTTTGAGAATGAATCACGTAA 
. AGGTGAAAGTGCTCAGCACTCCGATGGGGAAAGTGACTTTGAGAGAGATGOT 

4 5 AGGACCACGGGGAG<3TGK3TrTCTCAGGACAGGGAAGTTGGCCAGCTCATA 

GAATGTCCCCAGTGTGGGAAGACCTTCAGCCGGAAATCCCACCTCATCACACACGAGAGGACCCACACAGGAGAGAAATA 
ATGTGATGAATGTGGAAAAAGCTTTAGTGATGGTTCAAATTTTAGTAGACACCAAACC^ 
GCIAGAGACTGTGGK3AAGAGCTTTAGCCGGAGTGCCAACCTCATAACCCACCAGAG 
GCCGAGTGTGGCAAGAGCTTCAGCAGGAGTCCCAACCTCATTGC^ 

5 0 CGAGTGTGGAAAGAGCTTTGGGAACCGATCGAGCCTTAAC^ 

AATGCGGCGAAAGCTTTAGTTACTAACTCCAATCTAATC^ 

TGTGGGCAGAGGT^CAGCCAGAGTTCAGCCCTCATCACCCAC^ 

TGGGAAAAGCTTCAGCCGCAGCTCTAACCTGGCCACACACC^ 

GGAAGAGCI^CAGCCAGAGCTCCAGTCTGATTGCACACCA 
5 5 GAGAGCTTCIAGCTGGAGCrCCAACCTCCTCAAGCACCAGA 

ATGOTTCAGCCAGCGCTCCXIAGCTCGTAGTGCACCAGCGGACCC^CACGGGCGAGAAGC 

GCTTCAGCCGGGGCTCCATrCTGGTCATGCACCAGAGAGCCCATTTG 

TTTAGCTGGAACTCAGTCCTCArTATACATCAGC^ 

CAGCAACAGCTCTAACrrTATCACACATCAGAGAAC 
60 AGGAAGTACAGCCTGGAGCCAGTGTCCCAGTGTCCTTTCCATTGGTGTCGCCCTTGATCTCAGG 

GGGATCCTCGCACCTCGAGAGTGAAGACGGGCATGACGGCAGGTGAAGGGGTTTGCT^ 

GGAAATGGGAAACTGCCTCCTCCTACACATGGGGCCTGTGCTCAGAAT^ 

AATAAAGT AGAG T TC CATTCTTT TCCTGA 

65 HUMAN SEQUENCE - CODING 

ATGATGGCTGCAGAGATCCCGAGAGTGACCACTCCGCTGAGCTCCTTG^ 
GGTCACCACCATG^TCCTGGAGGATGACTCCTGGGTGCAAGAAGCTGTGCTGC^ 

AGAGTGCTGGCAAGGGCGGCCCC CAGGAGGAGGTGACCAGGGGACCACAGGGTGCACT CGGCCGCCTCCGAGAGCTCTGCCGGCGC 

TGGCTGAG AC CAGAGGTACACAC CAAGG AGCAGATGTTAACCATGCTG CCAAAGGAAATTCAGGCTTGGCTGCAAGAGCAT CGGCC 
7 0 TGAAAG CAGTGAGGAGGCAGCGG CCCTGGTGGAAGACTTGACCCAGACCCTTCAGGACAGTGATTTTGAGATACAGAGTGAAAATC 

GGGAGAACTGTAATCAAGACATGTTTGAGAATGAATCACGTAAGATATTCTCGGAAATGCCTC 

GATGGGGAAAGTGACTTTGAC^GAGATGCTGGCATCCAGAGGCTCCAG^ 

TCAGGACAGGGAAGTTGGCCAGCTCATAGGCCTGCAGGGCACCTACCT^ 

CCTTCAGCCGGAAATCCCACCTCATCACACACGAGAGGA 
7 5 TTTAGTGATGGTTCAAATTTTAGTAGACACCAAACCACTCACACCGGGGAGAAGCC 
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TAGCCGGAGTC^CAACCTaiT^ 

G CAGG AGTCC CAACCTCATTG CACATCAGCG CAC CCACACAG GAGAG AAACCCTACT CGTGCCCCGAGTGTGGAAAGAJ^TTTGGC 

AACCGATCCAGCCTTAACACGCATCAGGGGATCCACACTGGAGAAA^ 

CAACTCCAATCTAATCAGACACCAGAGAATCCACACAGGAGAG 

GTTCAGCCCT(^TCACCCACCGGAGAACCCACACAGGAX3AGAAAC^ 

TCTAACCTGGCCACACACCGGAGAACCCACATGGTGGAGAAGCCCTATAAGTGTC 

CAGTCTGATTGCACACCAGGGCATGCACACAGGGGAGAAACCCTAC^^ 

ACCTCCTCAAGCACCAGAGGATCCACACGGGAGAGAAACCCTACAAATC 

CTCGTAGTGCACCAGCGGACCCACACGGGCGAGAAGCCCTACAAArGCCTCA 

GGTCATGCACCAGAGAGCCCATTTGGGAGACAAGCCCTACAGGTGCCCT 

TTATACATCAGCGAATCCACACTGCGGAGAAGCCCTACAAATGCC 

ACACATaU^GAACTCACATGAAAGAGAAACTTTATTGA 
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Table 107 

MOUSE NOMENCLATURE 
ICSGNM Kcnj9 
Celera mCG44B3 

5 

HUMAN NOMENCLATURE 
HGNC KCNJ9 
Celera , hCG39735 

10 MOUSE SEQUENCE - GENOMIC 

NNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 
NNNNNNNNNNNNNNNNNNNNNN 
15 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNN^ 

NNNNNNNNNNNNNNNNNNNCCTCATGAATGCTG 

AGACCAGATTACAAGGTGCTGCACTACAAAGTGAGAAAATGTTAA 

CCCCACAATCATAACCAAAGCAATG^ACCAGGAACTATTTTCCTGCCT 

GAGAGGAACGG CTT ACAGTAAGATGGTC TGAAGACC TGGT AGTTTTTTTTTTTTTi"lTTl u iTTTAAG ATTT ATTTATTT ATTATATG 
2 0 TAAGTACATGTAAGTAAGTACATTGTAGCTGTCCTCAGATACTCCAGAAGAGGGCA.T 
CCATGTGGTTGCTGGGATTTGAACTCGGGACCTTTGGAAAAGCAGTCGGTGCTC^ 
TGGTAGTTTAAGCCTGCAATCTGAGCTGTTTGGGGA 

AAAGC CAGTGTGAATAACTT AGCAGGG CTCACAGT CTTGACATT CAGAG ATGGGGAAG ATTATGGGGCTGAG CT CAGACCACAAT A 
T AAAATGAAGAAGG AACACAG AGGAGAG AAG CCAAG AAC TGT CG GGGTTTATGAAAT CATT ACAAGACAC AAGAATTTATTATTTT 

2 5 TCCAGAATTGTTACCCAAGCATTTGGCATCCATCG CCACCTACATGTCAGTGTCCACCTGGACAGAAATCTCAAACTTAGT CCAGC 

GTAGAACATCTTACCCACAGGAGCGCTCCTCATGGGACTATGTCACCATCATCCAACrA 
TAGTCTTCCTTACAGCAGCAACTCCATCCTCTAACCAAAGCATCTCCCACTGAGC^ 
TTTATCGCTGCTGCAGTCTACAGCAGATGCACCTCTCAGCAGGGATCCTGGAGCAGCC^ 
CT7ACACTCCAATAATGCTTCAGGTCACTAACT 

3 0 ACTTGCTGCCAAACTTGATGACCTGGGGTCAATCTCCAGTACTGATGTGGTAGGAGAGACrrCAAC^ 



ACAGACAGACAGAGACACACACAGAGAGAAATGTAACGTTTAGAGAAGAATCCATCCATATTCT 

TTAATTATAACCTGGG CATCCTGCCCTGTCTTCCTCACATCCAACTCTATAGCTGCTTCCTCCT CTAACACCCAAGGTTGTTAAGT 
CTTGTGTCCCCTTCTGTATCTTGCTCCTTGTTCTTTGGT 

3 5 GACTCCTGTATCTCTCTGGAG CTCTACTTAGG CTCCAGTACCTGCAAGGGATTAATGCCCTCACATGACAGG CCC CAGACAG AACC 

CATCCTCTTTCCCTCTCACCAAGGTTGGGAATGCTCACAGCTC^ 
CATTACATCTTGCTCTTTATGCTTGCCTATGTCCCATTCT^ 
TATATTTCTCTAGGATACACATTTTCATTTCTATGGCCAGAATCATAAAATT 
CTCTTTCCAGTATCAAAGGGAGACAAACTGTTTTTATTAAA^ 
40 GTGGGGTGTGATGGTATACACCTTTAATCCCAGCACTTGGGAGGCAGAGGCAGGTGGATTTCTGAGTTCAAGG 
CAAAGTGAGTTCCAGGAC!AGCCAGGTCrrACACAGAGAAATCCTGTCTC 
AATAAATAAATAAATAAATACAAGGTCTCTGGATAAACrc 

GTGAGGTGGCACGAGATGGAACTGGGCAATTGGAAAGAGTTAGGTATTGTAGAGCCACAG^ 
GGCCCTGTGATGTTCTCACTCAAGAGTGACTTACATCAGGATTCC^TTCTTAAATAAGCACAC 

4 5 ATACATTGTGTTTGCTTTTAATATTTAACTTTTTGTTTTC 

GAAGGATGCTTAGCATGCTCATGTTTC'rGAGTGTTTACTAGGCGTGATAAATTTGAC 
CTGATGCTCCTGCCCCGGTCTCCTAAATGCAGGGATTATAGGTGTGCACCGCC^ 
GAAATCCCAGAGTCAGGGACCTTAGGTGGCTrGACCTATACACAGTGACATGCCCAGGAAGTGTTA 
TGTTTGACCCCAGAGTTTGTCAAAGGGTAATAGTACAGCGCTCTTGCATGACTTAA 

5 0 AGGTTCTAGTGGCCAAATGTCAGTATGAATAAATCTGCTGAGATGCGCTGTGCAGCGTC 

TCCTTTTCCTTTTTGATTCATCGCTCCTTTCAGACTTGA 

GGTCAGTTCCCCCTCAGAACAAAAGAGGAACATGAGGCGAAGACCCTTTGCAGAGGGAAAATCCACAGCTGGGCGTAGGCCG 

AG CTTT CG CTG G GAGAAG CAGG TGAGTTCGGATG AAGGGAAG CAACTGAGAGAGG CAAGGCAGAT C CT CAGACGGGGCGGGTTGGG 

GGGGGGGCGACTCGGAGAGGGAGTTTTCGGGGAGTCATCAGAGCTGGCCA.GGAAGAA 

5 5 CGAGGGACACATTTCTGCTTAGGTCCCACAGTATTAACACGGT^ 

G AGAG C CT AT CAGT ACTCGGTTT AGTCATT ACCTTTT AATACAGATGA.^ 
AATGGCTAATTA(^TTACAGAAGGACTACA.GAAGGCAGAGAGAG<3GAGGGAGGG 
GATGCTCTTTACCTATAAGGTTTATCTAGTATCTTTCTAATTGGTCCTTTTAGTGGC^ 
AGAGGGAAGAAC!AGAAAACCCCCAGATGCCTGGAACTGGGGAAGCTGTCTTAACCCTGACCT 
60 TAAACTAATGATTACTTTAGATG&CTTCTGAATGAC 

AAGATGGACCTGTGCGTAACTCTTCTAGGGCTGATTCCACATGGAATTTACCTACTT^^ 
CTCTGGATGAGCTGGAACTCAC(^CACACACCAGGGTGGCCTCAGACTCAGAGATTTACT 

AAAAAGCGTAAGCCACCACCCACAGACCCCATGAATTC^ CCCTCAG CT 

CACATCCTCAACCGTCCCTGCCTTCCCCTCCAGACTTCCTCCCCCATTTC 

6 5 AGGAAGCTCTCCCAGGTTGGCTGACCTCATAGCTGGCAACAAAGGCAACT 

TCCTGCACCCAAACTCAGAATTGCACC^VAAGTGTGTGTGTGTG 
GTCTGTGTCTGTGTCTGTGTCTGTGTCTGTGTGTCTGTGTGT 

TATGTGGAGACCA.GAGACGAATGAGGAGATCATAGACATACACTGTCACACCCAGCCTTTC 

GGT CCCCAG ATCCATGCAGCAAGTCCTTTGCCCACTAGGCTGTCTCCCGAGCTCTG CAGTGT 

7 0 GGCCTCACTGTCCTCTATTTCCAATCTGTGTTTATTACAACTCC^ 
4 CTCnTGGTGAAATATTTTTCTGTGCACTGATGGCTTGTGAAATT 

TGAGATTAAGCTTAGTAAGTAAAGCCCAGGACCTGGAGGATCTGGAAACTGGGTGAAAG 

TCAGGGCAGCCAGGATGGAGTCAGAGGGGTGGCTGACAAC^ 

G^aCCAGTTCTGTGCAAACATGTTCTTGTCCAGGAGTTTGGTTTCTT 
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CACTTGGCAGGGCAGAGAAAAGCAAGCTAGCAGTCCCCAGGCT 

G<n > TTAACAGCACT ATTGATGC CAAT CTCGGG CTUU^AACCTG ATATTTCCACTTGGAATAACflAGAAACAG CCAAG AGGATTGG AG 
AGAGG TCAGTGGACAAGGAGAGCCCT CTGCAGGT CGTGC TGGGTGATTCCAGAACAGAAGAGGGCAGCCCCTG CTGGACAGGGTCT 
CCTGAGATGATGGTGATGGTGACGGTGATGGTAATAGTGATGGTGAC^ 
5 GAAGAAAGAGAAAAACACAATGTCAGGCTTTAAATAAATAATC CT TATTGTGGTTTTGATATGAAACA 

CCCCTCCCCCAAGGGCTCGGGTATTGGAGATTTGAGCCCCAGCGTGTGGTGCTATTGAGAGOT 
TTGATGAGGAGATAAGAGGTGGACTGTAAGGAGGTGAAAGCTGTCT 
CTCAACCTITCCCAATGCTGTGACGCTTCATTTAACACAGTTCCTT 
CTGCTTCATAACTGTAATTTTACTACTGTTATAAATCATAATGTAAATATTTT^ 
10 CCACAGGTTGAGAACCACTGCTCTAGAAGGAAGTCACCTCCTCTTCTCrrCCT 

TCATATTCCCAAGAAGAATGGAGACAAACAACCCTGGACTGGATTCCTCCTCTTTTAAGTC 

TG AAAAG CTAGCAATATAAGATGG CT AGTCTCATCT CTT AGAT TTAAAAAACTAAGATTTTCCAAACAT AGTGGCT CATGTCTGTA 
GACACAGAGCCAGGGAAGCAGAGG CAGAAGGAT CCACTG GAGGTCCAAGG CTGGCCTGG ACTATGTAACAAGAG AGAGAGAGAGAG 
1 5 AGAGAGAGAGAGTAGAGAGAGAGAGAGGAGAGATTGAAGGAAAGAAAGAT1GAAGAAAAGAAAGATGAAAGAAAG 

GAAAGATTGAAGGAAAGAAAGAGAGATTTAAAGAAAGAAAAAAGAAAGATGAAAGAAAGAAAAGGGAAGAAAGATTGAAGGAAA 
AAAG AACGG AAGG AAGG AAAG AAGG AAGGAAGG AAGG AGTG C AGGGGGAGG AGGGAAG AAAAGAG TC 

AG AAAGAAAAGGAGACT ATATG AAGCTATTTGCTCAAAG CCATGCATCTTCTAT CAG AG AGTAGAATTTGAACT CAAGTCATTGCC 
TCTGAAGCTTGTATTACCCCACAGACCTGTCATAGCTCGTGAGGAC^ 
2 0 TGTTCTTTCTAGTTATGTATTCTTGCAGTGTTAAGG CT T AGGGGATTGG AT AT AAAAT AT CTTGTGCAT AACAAT ATTGG CAATAG 

TAATAACACCAGCTTAAATTTATTTTTTATAGCTTTAGTAATTTAATTTATGTATATGAGTTCATGGTAGCT 
CAGAAGAGGGCATCAGATCCTATTACAGATGGTTGTGAGCCACCATGTGGTTGCT 

G C CAG TGCTCTTAACTG CTGCAAAATGGT ACAGTTACTCTGGAAG ACAGTTTGG CAG TCAC OT T AAAC ATACT CTTT C C 
ATATGATATTGCAACCATACTCCTTGGTATTTACCGCACCCCCAA 

2 5 TTTGTAGCAACTTTATT^GGAATCGGCAAAAACTGGAAATGAAATGACT^ 

CCTGGCCGTGGACCATCATTCAGTACCAAAATGAGATGAGCTGTGGAGCT 
AAGLAAGCCAATCCAAGGAGCTGCIATACTGTATAATTCCAACCCCATGGCATCCT 

AAAAATCAGAGATTGCCAAAGGCTAAGGGGAGAGTGGATGGCTGGGGGCAGCAGAGAGGAAAGCACCCCACAAC 
TACACATCCTCGTGGCCGTTCTGGGTTTACAGCAAGAGAAACCAC^ 
30 TGCTGTGCCAAGTTAGATTTGTAAGTCGTAAACAAGCTACTATTCTCACTGGAGATGTCTAGAGTA^ 
GCAGAAGGCATGTGGAAACTCTTAGTGCCTTCTCTCAGTTT^ 

C^TGTACACGTGCGTACACACAGAAGTCTGAAGTCGATGTTTTCCTATATCACTCTCCACCT 

TGAAACCTGGAATTCACCAGTTTGTTGGGGCTAACTGGCCAGTGAGCT 

CCACGTCTTTGCCACCACATCCTGCATTTACACGGGTGCTGAGAACCCA^ 

3 5 ACTGGGCCATCTTCCCAGGCTCTTCTCTTGCTGTACAATTAAAA 

CACCGAGTAAGTGGAGCTAACCTGGGCTAGACAGTAAGACCCGGTCTTGGGGGTGGGGAACACCTAACAAAAAAATAA 
CAAAACAAAACAAAAAC CAAAAACATT AAAT G AAGAG C CAGGGCAG TGACAAGAC ACGTG ACTC CT CAAT CT CTGT C C AAC T C TGG 
AATTCAATAGGCTACTTTTTCTGTTTTCCTCATCCATAA^ 

GAGATGCTGCTGGATGGATTAGCAGTAGGAGCATGTAGCAGCAGACCTGTGCAACTCTGTGTCTTTCCACTGAT 
40 TACTGCTGGGCAAGGACCTATTCATTTCATAATCGCCTCTAC CTAGCCCAGTATGTGGTGTTTGAGCCCCCTGAGTCTG CTGGGTT 

GATGGTAAGAACTAGCCTAGACTTCTCTCTCTCTCTGTTGGACATTTGAGGGTTTTCTCAAC^ 

TCAAAACCCTTTTATTT ACATCACCTAATTTGATCTGCATCC CAGGTGAAG CCAGCAGAAGAGGGCTGTTTGCCCACGCCCACACT 

CTGAGACAGACAGAATCACTATGGCTCAGAGAAGTGAAGGGACCTCTTCGGGTC^ 

GGAGCCTCTGGCCCTGCTTCTCTAGCCCCTACCTCTGGAGACCTTTT^ 

4 5 AAAATTGCAGCCTTGTCACGCTGGGCCTGGTCCTTCT^ 

TTGTGAACTGCAAACTCCCAGATGGCCTCTGAAATCAAATATTTTATTTCCAATGCCTCT 
GTTCCCCCACACACACACTTGCTTTCGTCCCTATAAAGAGGTGAGGAGATGACTCT 
AATAGCTGTGGCX3TGTGGATTAGCCAGAGTGGTACCCAGGA 
GCCATAGAAACGAGAGAGK3AGCAGGGGAACCTGGGAAGTGGGGAT 

5 0 AGGAGAAACAGCAGTGTCGGCGGCTCCCAATCTCAGAGGGAACCTAGGGTACTGGGGGAGATC 

CCCCAAGGGTCTCTGCTGCTGGCTACTCTTCTCTCCAG^ 
AGCCAAGAGGGGTTTGGGTGGAGCTGCTCCCAGAGGGACTTCTCC^ 

CAGCGCTCTCTAGAGGGAGGTGGAGTACTCCAAGATAATGTGGTG CT CGG ATCTTACTG AAAGGGGTCACAGCATGCCCAAGAACT 
GTGGTCGGAAGAACTGGAGTTATTTGGAGGGAAGAGGAAGAAATGAAGACGTTGCT 

5 5 TGT CCCATGAAGAAGAGAGCTTTTCTCGAGATGGCAATGG CTAGGATGTCATCAGTAGGCTCCCTGGGCAGTCGTGTTCTGGGAAT 

GATCAGACACTGGGAATCCTTCCCCIATTCCIGGCCGTAGATGGAGGTCAGATCACCTTAGACCCT 
ACCTGAAGTTAATACTAGGATGAAAGAGACCTGGGGTCTCGAGGCACTGAAAACTTACAGATGAGGTGCA 
GCAGAGAGGGAAAAAACAAGCCTGCTTGCTGTTGGGGGAGGGGAAGATCTTAATCTGCC^ 
TCCTGACTTCCATGGAAAATAAGTGTGTGGGGTTACAAACCATCTTTTTGGGGTTTTTTCCTTGTGCCTTTCT 

6 0 ACCCTCCAAAGGTCTGCTGGCTACAGAACACTTGGCTCCAAAGTTTAAAAATGGAAT 

TTTCTCCCTAGGATCTGGTCAAAC!ATCCAAACCATCTG 

ACTTCCCC^GCCTTCCTGCCCCACCCCTCACCCTGAATGGGAGGAGATG^ 

TTAATCCTTATTCTACAGACTTCTGTTCATACGGTCCA^ 

TCTCTCCGGGAAGATTATAACCCAGAGTGCTrCTCAGGGGGGAAGAATTTGAAGGA 

6 5 CTTTGTGTGGAATGTGGGCTTTGGGACATATGGCAGGAGTGGGTGGGGTTGCTGGTAGGGTAG 

AGGGGGGGTCGATGTGGGTtnTTGGTGTTTCGATTGGTCTGATTC 
AGCGTAACTCTGGATGGCCTGGCGCCTCCTTCATCC 

CCCTTTTCAGCGTGTCTCCTGTGGGAAAGAGCCCrrGGCGGGAAATGGGCTGG TATCAGAGCATCAGTGAC CACGGTG AAGCAGTTA 
GAATTGCCAGTGGGAAGTTCCCAATGCTGAGGACATCCAACCTTTGCACACTG 

7 0 CTTGGGAAGTCTGGGGTGGAGGGGAAATGTAGCAGGAGAAAGAGTGAGGCCAGGGAGAACACCGAGGGAACAGTCTTC^ 

CTTCTGGCAGGATGCTGAAGAGTGCTGGGGGAAGGGATAATTGCCAGGGAAAGGGCT 

GCATGGAGAAGGAACTGCCMGAAGTCTACCTCTTCAGTACCCTAAATGTCTGATCCGGGGTGCCTGTGAGTT 
GCTTGAGGTAGTGACGCTGAGATCTGTGACATCGAGATGGCTAATGCCTCTTTTCT^^ 
TTATCCTGTGTAATCTGTACAACTCTCTCTCT 
7 5 TTTATTTACTTAATGTATATGATTAC^CTGCCGCTGCCTTCAGACACATCAGAAGAGG 
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AGCCACCATGTGGTTCCTCGGAATTGAACT 
GCCATACAACTTTTTCTTAACCATTGTTTTATTT^ 

AGAGTGCGAGTTACAGCCGGTTGTGAG CCGACTTGTGGGTGCTGGGAATCGAAATCAGATC 
TTTGAGCCATCTCCCCAGCACITGTGCCCCAACTTTCTG 
5 AAAAACTAAGGCTAAAGAGACAGGAAGGGAGATTGTCTCACAGCATT^ 

TTTATT AAGAC CC CG CAGGATCC CCG CTGCGGCCG CCATGG CGCAGGAGAACG C CG CTTTCTCTCCCGGGTCGGAGG AG CCGC CAC 
GC CGCCGCGGTCG CCAGCG CTACGTGGAGAAGGACX3GTCG CTGTAACGTGCAGCAGGG CAACGTCCGCG AG ACCTACCG CTACCTG 
ACCGACXTIXjTTCACCACGCTGGTGGACCrGCAGTCGCGCCT 

1 0 ACCTCAACGGCTTCGTGGCCGCCH^C 

TGTCCCGAGGGCATCGTGCTGCTGCTGCTGCAGGCTATCCTGGGCT 

GATCTCGCAGCCCAACAAGCGCGCCGCCACTCTCGTCTTCTCCTOSC^ 

TGTTTCGCGTGGGCGACCTGCGATCCTQVCACATCGTCGAGGCCTC 

GGCGAtaTTCATCCCTTTGCACCAGACCGAC 
15 CATCAGCCACGAAATCGATGCCGCCAGCCCCITCTGGGAGGCATCGCGCC^ 

TTCTCX5AGGGCATGGTGGJ^GCCACGGGrGCGGGC!AGG CTGGJVGGATGGGAGCAGG^ATGCAGGACAAGGG CAAGJVAAAGCAGCCA 

GGGGAGGCGCAGAAAGATGGACAGAGAATGGAGTGTAGGGTGACAGGCCTGAGGGGTAGCGGGGGCCGGGG 

ACAGGGATGGACAGGGTGACTTTGCAGAGTCAAGAAAAGCTTGa 

CZAGCTATGTCACTTTGAACTACATTTTACATCTCTGAATTCA 

2 0 T CGGTTTTCAG AGAATG AGTAG C CCCAAAGAAAGGT CCCATAAATAG C CCGCTG T CACAAGCCAATAAATAG CACAG C CTGGGTTG 

AACATAGGACATCTATCTTCAGTGTTTCCTG^ 

G^rrCTGGAATTTAACTGACCTAAGTAAAAGGGAGGTAGGTAGGAAAAAGACTAGTAC 
CTAGCCGTCAGGGAGCATAAGGAAGACGCAGGTGAACCAAGAGGCCACrAGGAGGAGCTG 
CCTGCTGCTAGCCTTGAAACCTGGCTCCTGGGCCTAGACAAAAACATCATCTTCT 
25 CTCCTCCTTCACTCCAACTAGGCTGGTTCTAGCCCATGCCCATTCCAC^ 

ACAATCTCGGACAGGTTTCTATCAGGGACTTTTTCATCTGCCTTCTCTTCCCCCT 
ACTCCCCCAAGCCCTACCTCTGCTTCTCAGGCCTTCTCCCTGCAGAGGCCCCGGTGGCCT 
CTTATTCCAGCTTTGCCAAAGAATACCAATGACCCCAAGATGTCTCAGGGCCAGACTTCC 
GAATGCTTACTCCTCTGTTTTTGAGATGGATTCCGGT 

3 0 TG CACAAAC CCAACAGAGAAGACA.GGACACAGGCT CAGGGCAA^ GG GGATGGG AG CTT 

GCTTGACTTTTCTAGAGATAAGGCTGG<3AAGGATGGTAGTATT 

AACCCAACATGATGACATTTAAGGGGAATAAATATAAAATTCTACATTTAGGCTTTAAAAAAATCACT 

AAAGGCTCCGGTGGAGAAAGAACTGGGGGTTTTAGTTGGCCACTGGCTTTGCTGCAGCAACGTGATC 

TGTAATGTAATCATGGGCCCGCTTCACCAAAGCATCTGGGCGAGAAGCAAGAGATAGTAAGCCTTCTTTT^ 

3 5 ACAGTTGAAAAAGCACTTCAGATGAGCCCITACCT 

TCAGATGAAGTAAGAAATGGCCATGAGTGTTCAATCTGAGGAAGAG 

CAGGTGGT AAGGGAATGGAAC CAGGG G C CACGGGT CCTAGG AG ACAG ATTTTAGTTTATGT AAGAGAAAAC C CAG AGC CAAAGAG A 

TGTCTCAGCTTGC1AACCACGCCTGACTACTGACCTGAGTTGAATTACCAGGTCTCACA 

TCCTCTGACCTCCACATAGATACATATGCACGCATATAGACACATAAATGTAA 

4 0 AAAGGACTGGACTTTTAATGAGATGGTGAGCTTTCAATCCTGGGGTGTAATCAGTTCAG 

GGTGGAGGCX3GCTCTGTGGGAAACAGGAAGGTTAGGCTTAAGGTTAAGCTTCTCAA 

ATAGAGAAGAGGGCTAACTAAAAAGAGAGGTGGGACTCTCAGAGAGAGAAGAGGGTTGTGGGATGACAGACAGGAGAA 

CTGTCAGGGGCCCCTTTGACTGATGCCGCTrCTCCTCCCCCCACCCCCCAGGAATGACGTGCCAAG 

GATGAAGTGTTGTGGGGCCACCGGTTCACATC^ 

4 5 AACCTTTGAGGTGCCCACACCCTCGTCCAGTGCT^ 

T CC CCAGCAGG CTGG ATGAGAAG GTGGAGG AAGAAGGGGCTGGGG AGGGGGCAGGTG CGGG AG AT GG AG CTG AC AAGG AG CACAAT 
GGCTGCCTGCCACCCCCAGAGAGTGAGTCCAAGGTGTGACTGGTTTCCTCCC^CCCCC 
TACACAGAAGCTGCGAGTGGAGGTGGAAGAAGAGGAGGCAGGCAXSTGTCCCGAGGAAC^ 
CCZAGGATCGAGTAGGGAAGGCTGAGGTCCTGGTTTGAAGAGAGAGGGTTGCAGGGCGGGGTGAGAGAACA 

5 0 TTTGACCTTCACATCGGTTCATGG^ 

ACAGCCAATGGATAATCGCTOIGGTGGTAAGTGGCTTGGCAGTCGATGATCGTCACCTGCAGCA 
GCATCCTTTTCTTCCAGATATAGGTAGCCTC^ 
CACCTCCCCTCAACTGGCCTCCCTATGTGTGACACGCCTG^ 
GTGCCCTCCCCAGCATCACrcTCCCATG^ 

5 5 TGAAGCAGGAC CTGGGG AGAG CCAAGGACAACT ATTTTC C CACAGAAGCAAGAAGC 

AGAGGAAAGGCCATGGAGAGACTTAATAAAGGGTTTTACAAGGGTACCTGGATCCCAGGGGGAAGTAGTT 

TGGCAGGGCTCATTCAGAACGGTGAGTAAGTGTCAGGTGTGATATT CAAAGACCTGGTTCTTAACACGAGAG CACAGCGAAGGTGG 
AGGTCAGAAATAACTCCCAGCCACTGAAGGAAGTATGGCT^ 

AGC GG TT ATAGC CACAAGTGAG AGGG CATTAGG G ACAGG AAG CT AAG GATTGAGTAAGGCAGTGGGG AATGGTGGGAG CCAGCAGT 

6 0 TACAAAGCTTTACTCACCTGGATGGGCTTGTTAAA^ 

GGAAACCAAAAAAAAAAAAAAAAAATCTGCACGTCTAACTAGTTCCCAGACCT 

C1X3GGGACTGCACTTGGGTGAAGCATCTAAGCGGAAGAGAAGCTGGAGGAAC 

AACAAGAAGGGAAATGTTGGGGAGAGGGGACCCAGGTCCAGACTC^ 

GGCCAGTGGC^CCCGAGAGGGCAAC^GAGATAGGAGAAGGCCATTTAGGGACCCCCAAGGAGGC^ 

6 5 TGGCCTTACCACAAAAGACCAACTCTCTTAAGAGACTCACAAGGCAAGACTGACTAG 

GTCTGCTGTCTCCCACCrGTCCTCCCAGGAC^ 

TCTTGGGATGGTTGAGGGGACAGCAACAGCATGGCAATGGACCTGAGGCTGGCCCCCCTGGAGCT 

GTAACCTGATAGGACTGGCTCAGACTCTGGCCCTGGCTATACCCATCCCrGCCCTCGAATA 

AAGAAGCCAGCATCCAATGGGGCATCGAGGCCTCCCTCCCAGTGCCCAGCTCAGAGTG 

7 0 TGTTGCCTCTCCCCTTCCATCTAGCAATGGCCACAGGTTTCGGGACCAGCTGGGTCACTCT 

CCTGCTGGGATCCGACGAACACATATTGGACCTGAGC^ 

AAGAGGGGTGGGGTGCTTGGTATCCGCACACGAAGCAATGGAATGGAGACATTGAAGCTGTTCCTGGAG^ 
TGTCCAGGGC^CAGCCAGGAGACCTGTGTTCTAGGACGA^ 

ACTTCTCTGGGCCTCAGCTTCcrrATCTGTTAAAAAAAAATGATATGTTGGCAAACT 
7 5 G AATG AAAAAAGGT AGACTGGATGCCGC T CAAAGTAT T AGGACAGCTG AGGCTCTT AGG AC CGGAG AAC CCTTTAGG CGGGG AGTT 
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TGCTGCTGTCCTTAAGTAGGC^GCCGTGTTCTGAAG 

TGAGGCCCCAGGGCCTCTGGGAJGGAGTCTGGTCCAAGACTAGTT^ 

GAGAGCTCAGAAAAGACTCGTCCCITGGAGCTCTC 
5 CTTCCCAAATAGAAGCCCCATTAGGACTTGGCTCAGCACAGACA 

GGTGGGTAACAAGGCCCACXSAAGGGACAGATGGTGTTT^ 

CCCACCCCCACCCCCGATGCTGCCTTTTATA^ 

ACATTTTAGAAAAACTAGTTTGAGGACAGGAACTGGCCTTCCT 

GGGAAGGTGGGGACAGGGGAGCCATGTCTCCCACrCTAGGT^ 
1 0 GGTATCCAGGCAACCAGAGGAGGGGTGGCACTGGCT 

GAGGAAGAGGCTCCCTTTGACCCTTACCCTGCTrATACXLA^ 

GGTACAGCTCAGCACAGGAAGCCTGAGGAAAGGCAGTTCCTTTCCCT^ 

TCATAAAGCAGCAGGTGATGGTAGCCAGGACGCrGGCGCCGGTAACTAGGGCGACT 

GTATCCACAGCTGGCAGAAAAGAAGACAGGCTCTGCTCAGAGAGTACC^ 
1 5 CAGCCCTCCCCAAGGGAGAGCTGAGCGCCAGTCCITGCCTA 

ACCCCCTCCCCTCGGTCTCAGCACTCAGGCCGGCTTGGGGCCCTTCATGCAAA^ 

GAAGATGGAAAGGGGCAAACAGAGCAGGAACAGGTGGGTAGATGGTGGCT 

GCCCGGAGCGCGCACTX3CCTGCCTGGTACCAGCTGTA 

TCCTC^VGGCCCCAAGCCATGCAGTCrrTAGCCGATGACTTCTG 

2 0 TCCCAGCTCTGCCACACan'CCTGACCCACTCCACCCACAAGCT^ 

CCCACCAGCTGGCTGCTAGTCGCAGCCCTGGGGGGCCGCCCCGCACAGAG^ 
ACAGTGCCCCCTGCTAGCCACGCCACG<3CCTCTAGCACCACGCCT 

GGCTAAGAGGTTAGGAAATAAATTCTATAAGTTCTGAACCCCGTCAAGGGCTCAACATCCTCrTA 
GGGAGAGGCCGGGAACGAGGACTGGCCGCTTCACGAAGTCGGGTCCCAGACCCTCGAACAT^ 
25 ACCTGCATCAGCGGGCCTGGCAGCCT CCAGCCGCAGTCGGTAGGTTCTGGATGCTACTTTCTC CATGGCAATGTGC CGGTCCTCAT 

CAGCCCACAGAGTACGCAGCATGACGGCCTGGTGGGGGCAGTGCACCGGAC^ 

AATCCGACGTTCACCAGGTCCCACGGTCACCGCCAGCTGGCTGGC^ 

GGCCCCCCACCIGTAATTCTTCTTTGCAGAAATTTAGAGGCCTCTTATATCTCCCT 

3 0 CCATAGCCTTTGTATCTCCATG CTTGTGCGGCACTCCCG ATG CCCAACTGAGAGACACCCCCCCCCCCCGAGTGGGCTATGCTGCA 

CTCACATAGAGTCTGCACATCAACATGAGCCAGGACT 

ACTCAGCGGCCGTACAGTGGTAGGTG CCCGAGTCTCCAGCCTG^GCACCCCCAACCACCATTCGGTACCGATCAGT CCCTTC CTTG 
CTCAGCCGAAGCTCCCCAGAAGCTAGCCTCTCAGCGTAGGGCGCTCCAGCCTCC^ 

CTGTAGAGTGGCTCGCCCCACTGGCGCCTCCGGAATGGCTCTCCCAAAGGACACCGAC^GGTGTGTGTGTTTCT 
35 GAG CCAGG CAGCCCAGCGCAAGCTCCTGCC CCTCGTGCACTGTGAGG CGTGAGGGGG AGGTGG CAGCCTGGCGCCCTCGGGGCCCT 

GGAGGGGCAGCAGATACCTGCAGCTCATCTGGAAGAACTGGAGAGA 

CAGAGATGCCAAGCTTGGTCTGGGCACATCAAGATATTCAATAACTACTTGCTGAACG 
CTGATGTCTAAGACTGATCC^TTTTAAATACTCAAA 

4 0 GTGCCGGCCTATAATCGCAACTACCCAGGAAGCTAAGGCAGAAGAGTCAA 

GCTGGTCTGGTCAACTTGGAAAGTTTCTGATTCAGCCCATCCCCCCCCCCAAAAAAAAAAAA^ 
TGATGGCACTGGAGCTGACACAATACTGCCTGAGCTACACAGTGAATTGTGGGAG 

AGTTGTAAGCT AG C CTGGGCT ACAAGAGTGAG CC CT CAAAAAACAACACAGG GAATAT AG CTT AGT AGAGTGCT C C CTTGGT ATGT 
TCCATGCTGTAGGTTCAATTTCCAGTACTGAGAAGATGGGGGTGGGGGGAGAAGAGGAGAAGGAAGAAGATGACAGAGAAGGAGGA 
45 AGACGAAGCAAAAATAGATCTGAGCGTGCTGGCTTATACCT 

CCAGTACCTCTCAGCTCGACCTTGGCACTGTAGTTGCCCAGGTACTGCGTATCCGTGG 
GTCCTGGGCCTGCAGGCGAGC^VATCTTGAGCACCACCGAATCTCCCTTCAGGC 
GCCCAAAGACAGCATAGGAGAACTGGCTATCCTTGGTGCIXmCAATGCCCAGGGACGTAGCT 
CACTCGAAGTCTTGCTGGGCAGGGCCCTCATAGTCACTCACGTTC 

GAGTCCCCACTTGATGGCCAGTATCATCAGCACCATTCriTGACTGCTGCCTGTGAA 

TAGCTGGGCAAAGACACAGATGGGAACACAAGATGGGACACGAGACACCAGCATCCCAGCAC^ 

AACAGAGGAATACTCTAGCTAACTAGAGGCAGAGC!ACCTACATGAACAGAGGAGTCCATCCGG 

TAAGTAAAATGCTTTCCATG CAAGCATCAGGACCTGAGTTCAATTCCCGGCAACTATGGAAAAAGCTTGGC ACGGTGG CTGGTGAG 

5 5 TAGCCACATAAGCCTGACAACCTGGGTTTGAGCCCCACAAGGGAAGGAAGGAACTGAATCCTGAAAGTTGTCTCCT 

CTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCrCTCACACACACACACGC^ 
CCTGAGCCATGACCCCTGAGGTTGACCTCTGACCTTTACAC 

ATTGGAAATAACAAATAAAAGAAGCATGCCTGATTCCCAACTCTCCAGCTCGTGGCTGGATCT 
GTGGCGTCTCTCCGGGCTCAGAAAGAACTTCTAGCTAAGGGATTCTGAGCCTTTTGCrrGGAGGGGAGCTGA 

6 0 AGGCTGCTTGGCTTAGCTCTGTCACCTGGGCTAGACAATGGAGCCAGTGGCCCAGGCTGGCTGGGGTACT 

CATGCTGGACAGGGCACGGACCTAGTCAGTCCTCACTACACAATACCOT 
CAGCCCACAAGGCTAC^CAGGCAGCTGTGGCTTCT 

CC C C CAGAATC TGT CAAC CT T CTCG CTAACCCAGATG ATGCACTATGCACAGCCC C C CAAACAAAAG CTTTCATTT ACAT ATG ATT 
TGCCTATCTGCGCAGCATTTGCRTAGACCTCCTTTTAATAAGGAGACC 
65 CCTCCCTCCACCC^TTTTACCCCACCACTGTTCC^CAGAC^^ 
ACTACTTCTAGGTAAGACaSCAGAACCCTAGAATGCAAGAC^ 

GAGAGCZACCTCTAGGGAGCATTGCCACTAGGTTCTAACCACAGATGTGAGTCTAGAGCTCTCTCCCAGCCAA 

CCGTGAGAACCATCTCTCTT'CTAGATGAGAAAAGTAAAGTGAAATGTGACATTC 

TACACACCCCTGAAGAGGGACAAAACCIAAGAAGGCXSGAGATGCT^ 

7 0 TTCCAJGGGGTAAGTCCACAGCTTGCTACTGGCCCTGG 

TCCITCTTCTGGCTACGTCACGCAGAGAACIACATGACCACCGCCTGAAGCAG^ 
CTTTCCTCTCCTGGATCTCAGTTTCTATCTCTCAGAACTAAGAGCTCCCAGTTCT 

T CAG G CACATATCGCTC CAG G CAC^TT TCAT AAT CAGGAGG ATGGTTAAGGCAG C CAAGCTGACACCATC CTGG CTGC CCATGGGT 
AAACTCTGCTGAGAAGGTCCCCTCCCACCTTCCTG^CTCCACACAAGCTGATGCTTGGAGATCCCCATGGG 
7 5 GCCT'CAGGACATAGGCTGCrACAAGGTTCTCAATGGCTGGGCrG 
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GAGTTCAAATCCACCCCCACCCTC^CTC 

GCTAAAATGACATTCAGCTCCTCTCCCGCCCCTGAATAACACTCACT 
GCCCAAAGGAAAAGAAGCAATCGCTAGGC^^CCAAGCCCCA 
CAAAGGAGTTTTCATCCTGGCCCCAGAAGCCPUUSGAACCAT^ 
5 TTGKKSGTGCCCCACTATAOTATCTTCTCCATATGCATATCAjCA 
CTTTCAAGGGCTTCAGTCTCCrGGCAACTGCTCCATGCC^ 
CCAAACATGGTAATGTTAGCAACCTCCACAGGCCTCAACACAC^ 
TGGCCCAGAAAACACACCATCAXAAACACCCACC^GGA 
GTTTCTCCTGAGGCAGGCMGTTGGGACCAACT 
1 0 GGCCCAGGGTGGAGACACGTGAGGGAAGAAGCCTCAGATGGTGACATGT^ 
AGAGATCGTGGT<IAGAATCAGCCCCTGGGCCTCCAGCCAACT 

ACACCCAAAACCAGGCCTTGAGCCGAGAGTGGGGCAGARGGTTGTCACGGTATTTGGTAGCAAC 
CGATGAGAAGTGGTGCCTGCCTCCGGAGGCCCGATC 

CTAGAGCGAGGACACCACCACCCTCCCCATAACTGATTGGGCAGACAGGCGCAAA^ 

TACCCGGCCGGGACAGGGACGCGGCGACCACC&CCTGGCTTACC^ 

GCTAGGGACGCCCATTCTGCGTAGGCGGCTCTGGGGAGACTCCTGGGGGCG^ 

CATGCCCRGGTGGGGGGCAGAAAGCGGAGCAGTGAAGCGTGGGTGCGC^ 

CCACCCTTCTTCCCCTCCTCCCTCCGCrrCTTCCCGCCCTCCGCAGCTCGGGAGACCAGTCCCAGCCGCG 
ACTGGCGGC^CCTGGGCTCCTCTATCCCCATTTCCTCGCTC^^ 



MOUSE SEQUENCE - raRNA 

2 5 CTGAGCTGCCGTTACATTC^GGAGAAACAGCAGTGTCGGCGGCTCCCAAT 

CAGGGACATGGACGCCAACCCCCAAGGGTTTCTGCTGCTGGCTACTCTTCTCrCC^ 

TCCTAGGGGACCCTGAARGCCTAGGAACCGACTCTGGCCATCCATCTCTCCGG 

AAGAATTTGAAGCAAAACCAGACCCCGCAGGATCCCCGCT^ 

AGGAGCCGCCACGCCGCCGCGGTCGCC^GCGCTACGTGGAGAAGGACGGTCGCTGTA^ 

3 0 TACCGCTACCTGACCGACXTGTTCACCACGCTGGTGGACCTGCAGTGG 

CGTGCGTCAAC^CCTCAACGGCHTCGTGGCCGCCTTCCT 
ATCACCGACCAGTGTCCOSAGGGC^TCGTGCTGCTGCTGCTGCA^ 
CATGTTCGTCAAGATCTCGCAGCCCAACAAGCGCX3CCGCCACTCTC 
35 GGCTCTGTCTCATGTITCGCGTGGGCGACCTGCGATCCT 

CAGACGCTCGAGGGCGAGTTCATCCCTTTGCACCAGACCGACCTCAG 
CrCACCTCTCGTCATCAGCCACGAAATCGATGCCGCCAGCCCCTTCTGGG 

AGATCGTAGTCATTCTCGAGGGCATGGTGGAGGCCACGGGAATGACGTGCCAAGCTCGAAGCTCGTAC 
TGGGGACACCGGTTCACATCCGTGCTCACCCTGGAGGATGGTTTCTATGAGGTGGAOT 
40 GCCCACACCCTCGTGCAGTGCTCGGGAACTGGCAGAAGCCGCGGCCCGCCTTGATGC^ 
TGGATGAGAAGGTGGAGGAAGAAGGGGCTGGGGAGGGGGGCAGGTGCGGG^ 
CCAGAGAGTGAGTCCAAGGTGTGACTGGTTTCCTCCCACCCCCTGTGGCAGACCA 
AGTGGAGGTGGAAGAAGAGGAGGCAGGCAGTGTCCCGAGGAACAGCTAAAGTTGGGAGAGGCCCG 
GAAGGCTGAGGTCCTGCTTTGAAGAGAGAGGGTTGCAGGGCGG 

4 5 GGTTCATGGGTGGATGGATGGACAGAAGGATGGGCTCATGGG 

TCGCTCAGGTGGTAAGTGGC^GGCAGTaSATGATCGTCACCTGCAGCACAC 
AGATATAGGTAGCCTCAAACCAGGGAGCGTGGCTTAGGGAGC^^ 

G^CCTCCCTGATGTGTGACACGCCTGCCTAACTAGAGAAGAGAGCACrGGGTAGAGGTGGACA 
ATCACTGTCCCATGGCGAGAGGTCAGAAAGGCAAACAAACAATGGGGG^ 

5 0 GGGACAGCCAAGGACAACTATTTTGTG AG AG AG GAATGAAACCTTG CAGG T C C TGC C ACAG AAG CAAG AAG CAGAGGAAAGGC CAT 

GGAGAGACTTAATAAAGGGTTTTACAAGGGA 

MOUSE SEQUENCE - CODING 

ATGGCGCAGGAGAACGCCGCTTTCTCTCCCGGGTCGGAGGAGCCGCCACGCCGCCGCG^ 
55 TCGCTGTAACGTGCAGCAGGG CAACGTCCGCGAGACCTACCGCTACCTG ACCGACCTGTTCACCACG CTGGTGGACCTG CAGTGGC 

GCCTCAGACTGCTCTTCTTCGTGCTCGCCTACGCGCT 

GGCGACCTGGAGCACCTGGAGGAC^CCGCGTGGACCCCGTGCGTCAACAACCTCAAC^ 

CGAGACGGAGACCACCATCGGCTATGGGCACCGCGTCATCACCGACCAGTGTCCCGAGGGCATCGTG 

TCCTGGGCTCCATGGTGAACGCTTTC^TGGTGGGCrGCATGTTCGTCAAGATCTCGCAGC 

CGAGGCCTCC^TCCGAGCCAAGCTCATCCGOTCCCGTCAGACGCTCGAGGGCGAGTT 
TGGGCTTTGACACGGGGGACGACCG CCTCTTTCTCGTCTCAC CTCTCGTCATCAGCCACGAAATCG^ 
GAGGCATCGCGCCGCGCCCTCGAGAGGGACGACTTa^GATCGTAGTCATC 

CCAAGCTCGAAGCTCGTACCTGGTGGATGAAGTGTTGTGGGGACACCGGTTCACATCCGTGCTCACCCr 
65 AGGTGGACTACG CCAGCTTCCACG AAACCTTTGAGGTGCCCACACCCTCGTG CAGTGCTCGGGAACTGGCAGAAGCCG CGGCCCGC . 

CTTGATGCCCATCTCTACTGGTCGATCCCCAGCAGGC 
GAGATGGAGCTGA 

HUMAN SEQUENCE - GENOMIC 
7 0 GG AGT AGGAGAAAG CTATGG CATTTTAGGAAAATTAATCGGG AGGTGACAAAATAGTTTGAACCAAG TC 

GACG AT AGAAAAT AATTGCAAT AATAT AAG CATGAAGAG ATG ACAGC CCAAAT CAG CGTGGCAATGG TGAAAAGTGGAACACAGAA 
AATGAATTGG AGT ACAGAAAAAT CAAAAGAAAATG AAAAAAGTTTG AAGC CAACTTG ACATGT TG AG C AAAAGAGGGAAGCTTCAG 
AGATCATACTAGAGTCTCAAGTCAGGTGATCAGAACT 

GAGGAGACTGAGGTGGGGGAAGAGGAAGTGATGAGTTCAGAGCTGGAAGCTGTGGAGAGGGGTCAGAACCA 
7 5 T CATTGCTG C CAGG GCAGTGTG AGT TGAAG CTT ATG AGAACAGG GTAGATC CCAACAAAG ACTG CACAGAGAAATG AG AG CCTGGCA 
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CAGAGAGTGAGGAACACCTATGTTTAGGG^ 
GGAGACftAAGAAAGTGAGATGTCATGGAGCTAAGGAAG 
GAAGAGGCATTGGGTTTGX3CAGTGACAAAGTCTC 
ACAAGTTACCACTAGAAAGTGAGAAACrrGl'CAGCAAGTATAGGTTACA 
5 AAC CAGACAATGTAJtTTAGAAACAGGCCAGGCCAGGTC CTCATGCCTGTAAT CCCAGCACTTTGGGAGG CCAAGGTGG GTGGATCA 

CCTGAGGTCAGGAGTTTGAGACCAGCCTGGCC^ACATGGCAAAAACCC^ 
GCACGCGCCTGTAATCCCAGCTACTTGGGAGACTGAGGCA^ 
TCACGCCACTGCACrCCAGCCTGGGGGACAGAGCrAGACTCrG^ 

AGAAACAGT CTTCCAG'i"l"rrr CTTCACACTCCGAATGC CCT CT CTTCCT AAGTCAATATGGATGAGGGGCTGTGGTGAGGTGG 
10 . T CTGAGGGC CAGC CTGCAAG ACTGG TATAAGACCTTTAAGTTTCAAAAAATAGGACAT C CAAAAGAT CCTTAAGGGGGCCACAGTC 

TTGACATT CACAGACAGAGAGGACTTAGG CAGGGGTGTCCAATT TT TTGGTT TC CCTGGG C CACATTGGAAG AAG AAGAATTGTCT 

TGGGC CAGACAT AAAATACACTAACACTAACAGTAGCTGATGAG CTTTAAAAAAAAAAAATCACAAAAAAAACCCT CATGATGTTT 

TAAGAAAGTTTACAAATTTGTGTTGGGCCTCATTCAAAGCTGTCCTGGGCT 

AGCTTGGAGGCTCTGGTGGAACTCCAAAATAAACATGAAGAACACCACAGAA 
15 ATTAACAGACATTTATTGTGCACTTATTATTTTTCCAAAAATGTT 

GTCCAGCCCACCCGGATATCCATTTCAAACTCAACATATTT^ 

T ATTCT CTACCATGGTAGAT ACACTATCATCAC CCAACCAGAAACATGGCAGCCAT C CTAGATTCTT CAAT CTTCCT CACCTCAT C 
TCCCTTATTGAATCAATGCATCTGTATTCTAAATAGCCTCAATATTGTCCCCTTCOT 
GGACACCATTACCTCTCACCAGGTAATAATAGTTTGGATCTTTG^ 
2 0 ACATGGGCCTGGTGGGAGGTGTTTTGATCATGGGGGCAGGTCCCTGAGCGGCT 
GATCTGGGTGTTTAAAAGTATATGGCAACATCCCCCATC^^ 
TGCTTTGCCTTCCACCACGAGTAAAAGCTTCCTGAGGTCTCCTGAGAAGC 
GCAGAACTGTGTGCCAATTAAACCTTTTTTCTTTATAAA 
TTTTTTTGAGACAGGGTTTCACTCTGTCCCCTAAGCTGGAGTGC 

2 5 CTGAAGTAGTCTTCCTGCCTCAACCTCAAACGTAGCTGGAACTA(^GGTGTTCACCATTACACCCAGCT 

ACTTTCAGTAGAGACGAAGAATCGCTATGTAGATCAGGATGGTCTTG 

AAAATGCTG GATT ACAGGCTTGAG C CACCATGG C CT ATCTCAGGT ATTTCATTATAG CAATGCAAGAATGG CCTAATACACCAGGG 
C TACTGCAG CAGCCTTCTAACT AC TCTCCCTGCCTCCAGT CTTCCT CCACTCT 
AAAGTAATTAAGCACCAGTAAAGTACATCTCTCTGAAACACACATOT 
30 CCAACTATCTATAAGTAAAGTCCGAGTTCCTTTCCCTGGAAGAGAAGGCCTATTATAACCTGGACCTGGTGCC^ 
CTTCTTCCACTGCCCCTATACACCCAAAGCTACAGCTACrrTCTTTTAACACT 
GCCCCTTGAGCCTTTGTCATTTACATAGCTCCAACGATTGTCCCT 
TGGAGCTCCATATTTCTAGTTGCTTGCAGGGGATCTCC^^ 

3 5 GTTGCCTTGGGCCCATTCCTGGACATGTCAAATAAGCCAACA^ 

TTTTCATTCCTATGACCACTATCAGAATCAGAAAGATCAGGACCTGACC 
CAACCTGTTTTCATCATGTCGGTCCCTGTGCCCTTGTTTTAAAACCCCAAACAGTTG 

GT CTT CACCACAGAAGACCTAGAAAAAAAAAAAAGAAG AT^ CATCTTT CT CTCT 

GAGGCTGGAAGTGCTCATGACTTCCTTAATTCTATGTTACTTCTGGTG 

4 0 C CT CTACTAACAATCAG AGTG AGGATAG AGTCGGGTGGGACTGGGCAGTTAGAAAGACTTT AT A^ CAGG GGTGG 

GAGCTTGTGGAAAAGTACACAGGTAGCTTCAACAGCACTGTAATGTTCTGAATTTAAAAGAGTGACTTAAATTGAGTT^ 

T AAATTATG CTTTATAACAT ATAG ACAT ATGTC CAC CATCTATATT CTTTTGTACAT ATCAAATGTCAG GTTTT CATTT T T AAATT 

TGTTTGCAAAAGAGAAGTCCTAGGACAGTCTCTAGGAGCCCAGTAGGGAATCAGTAATAAG^ 

GTGTTTACTACATCAGATAGATCAGAAGATGGGGAAACTGAAGTTCTGAGGAGTTAAGTGGTTTGCCTATGGTA^ 

4 5 AGTG TTTTGAGATTTGAATCCACATATATTTGAC C CCAAAGCTTGT CTCAG AATAATGCCAGAGGGAAT TTGCACGTTTTAAACAC 

AATCTTGCCAACCIAGAGGCTATGACCCCTGAGTACATGTTGGTATGAAAAA^ 

AACATGACAGAGGAAACTTCTCTTTTTGAAGACCCCTCTCTCTTCTTTCAGTTTCCC 

TCTCCTTTCAGACTCACTGCTTCCAGCTTTGGCCTCATCT^ 

AGGAGGAGGAGAGTTGGGGCGTGTC^CCTGTTTTAGAAAGAATCCACAAGTGGGCA 

5 0 AGATGTGGACAGAGGGAATCAGGAAAGCTTTGGGTTGGGAGGCATGATAGAG^ 

AGTGGCTAGAAAGAACAGAGACACTGGCATGGCTCACCACAGGATTCAGGATTCCAAG 

G CGGAACAAATTC CATT AGT AGTGGAG CATCTC AT AG CTGAATGACT CAGGC CGCAGAGGAGAAATCCAAGAGAAGGACTGAG CT A 
CATT C CCCTAGTCAC T AACG AATCATT ATGTAAGTAGATCACC CC C TTTAAAT AAATGCAATATACACAAAC CCACAT TTATAAG A 
CATAATT TAGGGAAT ACT TAGTT AC CT ACTAAAG AATT CTTTC CTTT AAAAAAGAAAAACATGGCCAG GCACG G TGG CT CATG C CT 
5 5 GTAATCCCAGCACTTTGGGAAGCCAAGGTGGGTGGATCACCTAAGGTCAGGAGTTCA^ 

CATCTCTACTGAAAATACAAAAAAAAAAAAAAAATTAGCCAGGCATAGTGGTGTGTGCCTGT 

GGCAAGAGAATCACTTGAATCCGGGCGGTGGGGGTTGCAGTGAGCGGAGATCGCGCCAC^^ 

AACTCTGTCTCAAAAAGGAATAAAAAAAAAGGAAAAAAGAAAAAAACAAATTT 

GG GTCCAATTGAT AG AAAT TCCATT AACAT C GAATGCACTGTGAT AGG AGGG AGG CAACTGGG AAT AAAGAAACACGAGGAATCT C 
60 GAGTCGGGTGGCCTGAGTCTTAGTCCTGACTATGTTCTTGGGACCTATTCCTACCT 
TAACCGTCATATAACTTTTAAATCrTTAGCCTATCTCTACCCAGTCCTA 

TCCTGCTCTGCTGAAAGTAGCCAGAAAGGCAGCAAGCTCCTCAGCCTCAGGAACCCAGCCTGAGGCGAGGGGCTG^ 

CTCCGTCTGGCCTGGAGCTGTGCTCTGCTTCTCCCCATTTCACTCTAATCTTCAGCTTCAGTCATT^ 

CCATATOTTTCCTCTGCTCTGAGTTTTCTAGAGCCCCATCCCCCTTGAATTTATA 
65 CTGCTCCACTAAACTCATATCCTOUICTGTCTGCTGTCTT 

CTCATGATCTTGTCCAGGGAAGCCTCCCCAGTTTGGCTGATCTCAGGGCTGCCA 

CTGAACTGCATTCAGCTTGCCAATTCCTGCACCCAGCTCAGAGCTGTGTCTGCTGGAGGAAGGG 

GTATCACCTGTTCCCTGTTCTCCAAGTC^ 

TTTTTTTTTTTTTTTTTTTTTTTTTTTTTATGAGACAAGGTCT 
70 CACTGCAGCCTTGAGCTCCCAGGCTCAGGTGATCCTCCCACCTCAACCTCCAAGGTGGCTC 

CCATCTAATT^TGTATTTTTTGTAGAGACAGGGTTTCGCCATC 

CACCTCAGCCTCCCAAAGTGCTGGGATTATAGGTTCGAGCCACCGTG 

CTGGTGCCTGGGAAAACTTTTTTTCTCCTGCCTCAGTTCAGCTCAGTGATAAA 

AGGACCTG CAGG ATG AGTG AATCAGGTGAGGGAGTGGTAGTCTTCTTCCTTGTCAGCCAGGCTAGGTTCAGGGG 
7 5 GGCGAGGGGCTGGCTGAAAT^GCCTTGTGGAGGGCCCTGCCAGTGATG CCCCCTCCAGCAAATAGGGC CAGCTCTATGCAAATGTG 
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TTCTTGCCCAGGAGTTTGGTTTCTTCTCT 
GCAGGCTAGCAGTCCCAAJU3CTCGGGTGACAGGACC^ 

CCAAT CTCGGGCTGAAAACTCGAT ATTTCCACT CACCAG CAAGAG AGCTGAGGAGAGGGCAGTATACCGGG 

GGCGCCCCCTGCAGGCCTCACAGGGTGGTGCCAGAACAGAGGA^ 
5 GATGGTGATGACGGTGACAATGATGATGACCATGAAGAAGACAATG^ 
TACTAACAATGTGTCAGGCATTGCCTTATC 
GAAGAATCTGAGGTTTAACGGGTTAAATAATTGCTCA^ 
TGCCTCCTGAGCTTATATTATCCAGTACCGAATTTCCCACCTTGCCAGG 

ATGTCTTC CTGCTCCTCTAGCTCAT ATTTTCTTGAT CCAAATT^ CTGGGATATAATA 
1 0 AT ACTGAT AACAACAGCAACAACACTTTG CGTTTGTAAACCACTT T C TTCTCTTCATTATTT CC CTGGGGAAAAATAAACAATAAG 
AT AT TTCTGTTTCTCCAAATTTTGTTCTGATTTTT ATCAG TGT^ CAAT ATAGT CATGATCAATTTCT AAGAAT 
ATTTTTAGGTTCTGCITTTTTATGTAACAGTGTGTTATATACACAT^ 

AC C CATGAT ATT AAT ATT C CACAGAT ATTACATT ACTGAGGCACACT AGG CTAAGG CTGACAACAC CAAATG CTGG CAGGAATGTG 
GAGCAACAGGAACAGGAATTCGTGGCTGATGGGAATGCAAAATGGTACAGCTACrrTr 
1 5 CTAAACATACT CTTACCAT ACGATCCAGG AATCATGCT CCTTGGT AT CTAC CCAAAGG AG ATG AAAACTT ACGTCCACATGAAAAT 

CTGCCGATGGATGTTTATAG CAG CTGTATTCAT CATGG CCAAATC TTGAAAG CAACCGAG ATGTCCTTCAGTAGGT AAATAGATAA 
ATAAACCATGGCCATCCTGAAATGGAATATTATTCMTGCCAAAAAGAAATGAACTACAAAGCCATG 
TAAATTTACATTACTAAGTGAAAGAAGCCAATTTGAAATGGCTACATATTC 

AATT ATGGAAAC3VGG AAAAAGAGCAGTTGTTG CCAGGGGT TAGGG AAGGGGG ATTGACT AGGCAG AGCATAG AGGACTTTTACAG C 
2 0 AATGAGACTATAATGGTGGAT ACi^CATCATTAT ATATTTGTCCAAAC CCAGAG CAAAAGTGAAC CCTAATGT CA 

GCTATGGACTITTGGGCGATTATGATGTCAATGTAGCTTCATCACTTGTAAC7VAATGTGCCAT 

GG CATACGGAAAATCTCTATAACT TTCCTCrCAGTTTTGC TCTGAAT CT AAAACT ACC CTT T AAGAAGTCTTCTTTTAAAACAATT 
TACAAAGCATGAGGTGATACAGATGTGGGAGTTTG^ 

TCTGTTTCAGTTTCCTCATCTATAACTGGGAAAAATAACACCTGTCrrTATAGAGTTGCCATGGG^ 

2 5 GTTCATATCCCATGCTCAGTGAATTAGTAGCAGCAGCCACTGTC 

TGGGCAGGGACTCACCCATTCTGTAACCACCCCATCTAACACACTGCCTGGCACTTGGG 
TTAGTAAGCCCTAACCTAGGCrxrrCTCrCTGGTGGACATTTGGGTTGTTTCTAGGGT 
GCCCTTTGTGGTTTTTTTGGTTTTTGTTTGTTTGTTT^ 
TTGCTCTTGTCCACACCCTGGTACAGATGGAATAACrc 

3 0 GGCAATAATTAGAACCCCIGACCCTGCCTCCCTTCCTTTAGTAGATCTATTTTCCTTCTAGCT 

CTC CAAAACT AGACCATGATGGTCAGC CTG ACCTGAGAG CAGCAC CTG CACGCAG AGACCCATGTTGAAGGTGG TGAG CTGC CAGC 
TACCAGATGGCCCTCTGAAACCCCAGGGAACCTAGCACCTTATTCTCAAATAC 

TGAGTTGTCCAGGAGGAGGGACTnrGGCTAAAAATAGCTATGGCGTGTG 

3 5 GGGGAGGGGGATGCTCTGGAGCTTGTCX3CCAGACTGGTTGCCGTGGAAACAAGAGAGGAGCA 

ACAGATAGCAAGTCCTAGTCAGAGCTGCCGCTACATTTAGGAGAAACAGCGGT 
GGGGGGCGGTOTCAGGGGC^TGGACGCCACCCCCCAGGGGTCTCTGCTGCCGGCTACTCT 

GCGGGGGACTTGGGGTTTGGG CCCCTATTTCCAAGG CAAGT^GGGGTTTGGGAGGAGCTGGTTCTTGGGGGAGTTTTCACCAGGTC 
TCTCCTTCCAAAAAATGAGCCCCCnTACTCCCa^TCTCTAGAGGGAGGAAGAGG 

4 0 GCAAAGGGGTCATAGCATGCACAAGAAATGAGGAGTAGGTTGGAGGAACTC 

TGATCTTGAGATAGAGGTAACAATTTCACACTTTTCCT^ 
CACATTCAAAATAAGGTTTACCTAGATCCCT 

CCATCCCTGGG CAGAGAGATCTGTCTAGGCAAGCCGACTGGGGGT CAG ATTACCTAAG ACCCTGAGAGAACATCTGGAAGCCCACC 
TGGGACTAAAGCTAGGATAATGGGAGCAGGGTCGTTTTCTGCATGACCTGGGGTCT 

4 5 GACATCTGAGCTTCAGGAAAGGAAAAGGAAGCCCATTGTTGGGGGCAGGGGAAACCCTAATCT 

CCCTGTGTCCCCTGACTCCATGGAGAATAAATGCAGGGGGTGCCCCTAAGCTCAAAGCCATTTCATTTTGATTTCTCTTC 
TCTCTACCCCAAGACACACAAACACACACACACACACCCTCT 
CTGGAGTGCTAGGTTTAGAGTCACATACCCAGGCAGTTTCTCCCCA 
CACACTCCTCCATCCCCCACCCACTAGCCAGCCCACGTTTCCGTGGAGTGGGAGGAGAGG 

5 0 GTGGAAGAGTCCCAAATCCTATTCTAAACCTTTCCCITGTATGGTCCATATCTCCrAGAGGACCCT 

TGGACCTCTCTCAGAGCAGATTGCAGCTCAGAGAGCTCCTCAGAGGC^ 

GTGGGCTTTGGGGCATATGGCAGGTGGGGGCGGGGCTGGTGTTAGGATAGTCCATGGGAAGTAA 

AGAGGGAGTGGGGAAATAAATGTGGGTGCTTAGTGCTTCACCTGATCTGATTCCATGTCTCTCATC 

ATACGAGCCTAACTCTTTATAACTCTGGGCTTCCTTTCCCAGGCTTCTGTGTTGGGATCTTCC^ 

5 5 TGTCTCCACTAGGAGAAAAAACCCAAGGGAAATGAGGCTGGCC 

AGTGGGAATTTCCTAATGTTCC^CCCTTGTGCIACTGGAGGGTTTCCACTGACTTTCCACAGCTTTCATTT 
CATGTTGAGGGGAGGGAATGGAGCGGAGTGAGTGAGGTCCAAGGAGGGAAGAATGAGAAAGACTGTGTAT 
TTCAAAACAGCCTGCGAGGAGAGCCATCGGTGGCTGCACTGGCTACAGCTG^ 
G CTCCATTACCCGCCTGCC CC CCTCCCCAAAAAGCCCCCAGTCT^ 

6 0 TTGGCCCCGGGGCTGCCCCTGAGGTGCTGCACACCCCAGCTGGAGGTGATGG^ 

TCATCGTGGAACTTGCIACAGTTCTATCCAGTTCAGGTACATCAT^ 



ATGCCCCGTGCCTCCCCATCTCCCGAGGGGCCACTCATTCGGC^^ 
CGCCCAGCGACGCGCGGC^GTGGOUSCMCTCGGGCCCCC^ 
65 TGGCG^MGJ^AACGCGGCCTTCTCGCCCGGGCaGGAGGAGCCGCCGCGGCGCCGaSGCCGCCAGC 
CGGTGCAACGTGCAGCAGGGCAACGTGCGCGAGACATACCGCTACCTGACG 

CCTCAGCCTGTTGTTCTTCGTCCTGGCCTACGCGCTCACCTGGCTCTTCTTCGGCGCCATCrGGTGGCT 
GCGACCTGGAGCACCTGGAGGACACCGCGTGGACGCCGTGCGTCAACAAC^ 
GAGACCGAGACCAC(^TCGGCTACGGGCACCGCGTCATCACCG^ 
70 CCTCGGCTCCATGGTGAACGCCTTCATGGTGGGCTGGATGTT^ 

TCTCCTCGC^CGCCGTGGTGTCGCrrGCGCGACGGGCGCCTCTGCCTCATGTTCCGCGTGGGCGACTTGCGCTC 
GAGGCXTTCGATCCGCGCCAAGCTCATCCGCTCGCGCCAGACGCT 

GGGCTTCGACACGGGAGACGACCGCCTCTTCCTCGTCTCGCCGCTGGTTATCAGCCACGAGATCGACG 
AGGCGTCGCGCCGTGCCCTCGAGAGGGACGACTTCGAGATCX3TCGTTATCCTCGAGG 
75 CCTGGGGAGGGGAGCGGGGTTGGCAGAGGGTGGGCGGGACCX3AGGAAGGCAGGGGCGAGACTAGG 
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TGGATGGAGGGGCTCGTGGAGGATGAGACAGTGAGGT^ 
(J^AGAGAAAGCTT<K3AG 

TTU^TAAT AGCT AGTG CTCGCCCAGT ATTCA.CCCTGTGT CATG CGCAGTTCCAAAG CACTTT CTACCTCTGAGTCX^TTTAATC CTA 
ACAAGAACCCTCTGAAGGTAACTTCTTGTTATTGTGCTC^ 
5 CACTTTCCTAAGCCAATAAGTGGCCATGCCTGGATTGGACACAGGCAATGT 

GGGTTAGGTTCAGGGGTTTTCTCACTGCAGTCA^ 

CTAGGCCGGATCrTGAGGGGAGTTT AAT ACCTAGT CTCAC1TGTAC CTCGGTTTCC CAATTCATCCATTT CCACTC 
TAGATGATGTTACCTTTTCTAGCTCTTTTCCA^ 
GTGAAGGCTTTGGGAGCAGGTGAAAAAAGACCAGTGTT^ 
1 0 ATGAGGATGAGGAAACTGCGGGTGGAGGACCAAAGGCCCACT^ 
TCCCTGCCGGCITCGCTGACAAGCTGGTGCCTTC^ 
CAGCTCCGCATCACTXSTTCCCATT^ 
GCTTSTCACTCCITTTCTCJCCAGTATCTCA^ 

CCCAGCCCTGCCCTCTCCTCCrCGAAGCCCrTCrrcrCCCCAGTGTCCC^ 
1 5 ATTCCAACTTCACCGAGGAATACCAATGTCTCAGCGCCAGGCTTTCCGAG^ 

TCTGTTTCCCCTTGTTCCGAGATGGATATGGGTTGGGGGCAAGACCCTGTGGCAGAAAGGAG 

CAGCCCAACAGGAAGATAGGACACAAGCCCCGGGCAGGGAGGACCAGGACAGAGG^ 

GAGAGATAAAGCTGGAAAGGATGGTAATATTTTGGGTGAGACAG 

AAACCCAGCAAGATGACATTTAAAATGAATAAATATAAAATTCTACATTTAGGC^ 
2 0 GAAGAG CAC TGGTGAAAAAAGAACTGGG AGTTTTAGTTGGCTACAGTCTTGATGT CGT AG CAATG TGATG CAG CCTC CAAAATGAT 

T ATGTAATGTT ATC CTGGG CC CTATT AGTGAAAGCATCATGGCCAGAAGAGAG AGATGGTG CG CG CT CT CTT ATG CACGGAG CAGG 

GAAAACCAGGTCAAATGAGGAATCGTGGAGGAAGCCAGGGATGTTAAGTCAAGAGAGAAA^ 

TTCAGAT ACTCAGAGGACTGTGAAGTGGGAGGGGAATGAAGCCAAG AGTTGG AAATC C CAGGGT ACAGGTTTT AGCT CTGT ATAAA 

2 5 GAACAACCCAACTATTAGAGCTATCATACAAAGGAGTGGGCCCTTTATGAAGTGGTGAGCT 

TAAGCTAGATGCCCATTGTTAGAAATGCTCCITrGGGGAGCCCTGTATGGAG 
TTCAAGGTTAAGCTTTTTTTGGTTGGCATCACCAAATGAC^GGAGGGGAA 

GAGAAG AC ACG AGGG AG CTGGGTAAG AACAGGAG CT AGGGAGGGGGGG AAATGGACTGG ACCAAAGGG AGGTGG GAG C C CTT AGG A 
AGGAATAGAAGGGAGGGTGCTGGGAGTAGGGTTGTGGAATGAGAAGAGGAGAGGGAAGCCTGGAGCTGAGATTCCCCCTGACCGGT 
30 GCCCCTCCTCCCAGGAATGACATGCCAAGCTCGGAGCT 

TGACTCTGGAGGACGGCTTCTACGAAGTGGACTATGCCAGCTTTCACGAGACT^ 
GAGCTGGCAGAGGCTGCCGCCCGCCTTGATGCCC^TCTCTACTGGT^ 

GGCGGGGGAGGGGGCGGGTGGGG AAGCTGGGG CTGACAAGGAGCAGAATGG CTXSCCTGCCACCCCCAGAGAGTGAGTCCAAGGTGT 
GACCAGCTTCCTCCAGACCCCTGTGGCAGACCGGGGGCCAGACACAGATACATC 

3 5 GGAGGAGGAGGAAGGCAAAGCCCCTGGAAATGTGCTAAAGTTGGAAAGT^ 

GGAAGGGAGGGGTCCTGATTTC^GGGAAATGGAGGGTGGGGCCGGGTGAAAATGCCAGTCTG 

GAGTGGATGGATGGACAGAATGATGGACTTTTGGGGGTTGGATGGGAAGATGGTAG CAGATAAAGACAG CTGACAGATACATAGAT 
GGACCAGTAGACAACTGGTCCACTCAGGGCTGCCACTAA£ 

CAG ACAGATACAGC CC CAAAC CAGGGTGCATGGCTTGGGGAG CAG AGT AT AGG ATGG ATTG CAGT C CC CAGTCAC CTCTT CTG C CA 

4 0 GCCTCCCCACATATGGCACAACTGTCTAATGACACGGTAGGCCAAGCTGA^ 

TGAGGAGGGTGCCCTCCTAGCTCCACCCT CACCAGG ATGAAGGCGTGCAAGGGGCTCAG CAAGGTGTGAATGACCTTAGTCCGCAA 
GTTCAGGGAAGCAGGCAGAGCGGGGAGGTGCCTGAGCTGGGGCCTGGAGAGGGGCCTGGGAAAGGAA 
CTTACAGTGGAGTGAGATCTTACAGGTATCAGGCACAGGCAGGAAGAGAGAGAGAGAGGTTCTGGGGA 
ATCTAGAAAGTGGGTTCACTAGAGCTGGGAAACAGGGAGCCCCTAGGAAAGCAGTGTGTCCTT^ 

4 5 GATTGGGTGCCATGTGGAGTGGACATTCAAAAACCTGGTTCCTGTCCTCAAAATAAGGGGCACCTGGGAAA^ 

TGTGGTGACTGAACGAGGGATAATTCAAACTGACAACCTGTGC^ 
GGCCAGTGTAAGGCATAGGGAATATGTAAGTCAGGAGTTAGAAATCT^ 
ACAGATTTTTGGGCTCCACTCCAAGGGTTTCTGACCCAAGAGG 
CTGCTGTTGCTGCACTGAGGACCACACTTTGAGAACCTGTGCT 

5 0 AACTGAGGAGTAGCTGAGAGGAAGATGAAGAGAAGCTGAGAAGAAGCTGAGGATCCTCACAGGAGCAG^ 

TGGGGTTTTATGTGTGGGAAAGGGACCCGAAGCCCAGGCTGAAGAGT^ 

GGGCAGTGACAAGTGGAGGGGGTGT CTGGAAGCTGAGCAGGCCCGACAGAGAGATG AAG CCATCAG AAGGACTTGAGGGGGCTCCT 
GGGGAGGTCGGGGGGAGGTGGAGCW3GAAGAGTTTTAGGGGCAAAGGACIAGAACCCC 

GGAGAAAATCGGAGAGAAGCGATAGGAGTTAGAACAT CTGGATGTGTCTGCAG CCTGCTGTCAG CCCAATTGGGCCAGGGGGTCCC 
55 AAAGACGCATATTCTCACCCCACCTC CACCTGCTTCCTGATCACATCCCAGTCACCAGCGGCAG CTTC CTGGATAGTG AGGGAGAA 

CAACTGCAAGTTGAGAGAGGCAGAGGGGTGGAAGGGACCTGAAGCTGGCCTGGAGAAAAGC^ 
GACAGCGCCTGTCTCCOVCAC^GCAGCACTGGCCGAGCAAGGACC^ 

TAAG CCCCCTGGGGAAAGCA.CTCCAGTCTTCTCTGTTCCAGG CTGGGCAGATAGGGTCCTATGGGGCACAGCCAGGGTCCTATGGG 
CATAGCCAGGGCCCTATGGGTCCTCTGGAAGCAAGAAAGGGGGC(^TGGAAGCAGCCCAGACA 

6 0 CCAAGTCCC&GTCCCTTCCTTTCAGTCAAAACACGGATATCT 

AGATTCCTGAAGCCTGGCATTCACTGGTCCCCTAAGAACAGATGTTGGGATGGAGAATGGGGAT^ 

GGGGGGT CTGGACAAG AG CG GGTGGG CTACTTGGT AT CCACACACACGCAC T CACACAGGAGCCAACC CATTG CAGCTGAACAAG C 

AGAGAAACTCAGTCTGGAAAGGCCCCTCCTGCCTGCTGAAGTCACTG^ 

GGGCACCACTGTACAGTGCAACAAGTCIAGGAGACCTAGGTCCTACT 

6 5 GCATAT CTGGG CCT CAGTTT CCTCATCTGT AAAAATGACAG CAAACTCGT AATGCTCAAT AAATGTTT AAATAACAACTGAAGG AG 

GCCTGCCAGATGCCTCTTAAC^TGCCGTGCAGGTAAGAATTTTAGGAT 
GGGCACTGGAGGAGGGGTGAAGCTGAAGGGTGGGAGGGAGGAGACCCCAGGGTAGGTACA 
GTAGTGGGTATATGCGGGTAGAGGGTATATGCAGGTACAGCGAGTACATGTGGGTGCAA1X3GCT 
CTGACTGCCTGTTGTCCCAGCCTGAGTATCAGTTGTGTTCTGA 

7 0 AGCCCCAGGGCCTAGTCGGAGGAGTCAGGTCCAAGACTAGCCTGA 

AAGCTCAGAAAAGGC^CTGCCCCTGGAGGCCCATGCCCTTTAACATGGGAGAAGCTGGTGCGGGGGTGACC^ 
CTACCCTCCTTTTCTATGCTTCCCTCCCCAAGTAGGAGTCCAATCAGGAGTTGTCTCAG 
CCCAGGTGTCCCCTCCTGGGGTGGGTGGCATGGCCCATGGAGGCC7VGATGGTGTTTC 
AATAGGTTGTCAATCCCCAACCACCTCCCTACTATGCACCCTGAGCGTTTTACAGTCTCATGGTAG 
7 5 CTTTTTATAAAAC^GTTTATTCACATT^ 
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AGAACAGAAGGCCAGAGGGAGGGGCTTGGGAGGGAAGGAGTGGGG^ 
ATAAATTAAAGGAAGAGTGGACAGAGGGAGAGGGTGTCCAGGC^ 
CTGCAAGACCTGAAAAGGGTGCCCGGTGTGGGCTAAGGACAG^ 
GGCCCTGTGGATCTGAGCTGCCTACTCCTCCrCCAGGTGGGGCCT 
5 CAATGAAAACAAG<3TTG<3GGGGTTCTTTTCCCTCACCT 

TGATGGT ACCAAGG ACJVGTGGCACCAGTX1A.CTAGGGCCACCCC TC CAGGGCTGGG 
GGAGAGAGGATGACTGTTCAGAGAGGATGCCATCATCCTCCACCCATACACTTGC 

GTGCAGGTGGG CACTGCAGGGCTG^jAAGTTG TAGCAGGAGG 
1 0 GTCAGAAGATGGGAAGGGAAGGCCAGTGGCAGAGGCCAGG^ 

TGTAGGGGTAGACTGTAACAGGCCCTGAG CGGG CACTG CCCG C CTGGTACCAG CTGTAGTCGG CATGCTG CACC CAGG CG CTGGGG 

GCACAGTGGTACACGCCTTTCATCCTCGGGCCCCAAGCTGTGTAGTCT 

AGGGCCTCCTCC^GGCCGGACTCCCAGCTCTGCCACACC^TCCTGGCCTACGCCACCCAC 

CTCCGTCCTCTGGTCGCTCC^CCCACCAGCTC 
1 5 GAGGCAGTCTCCCCGCGGTACACTGTGCCTCCTGCTAGCCATGCCACAGCCTCC^GCA 

TCAGAGGGTGCAGGGCCAGGGAGCATGGGGTTAGGGCrrGCCGCCAAGCACCGCCCCAG 

TAGAAGAGGAGCGTGAGGCTGTGGCCTGCAAACAGCTGACGGAG 

CTCGAACCCCGTCCAGGGCCCAGCCCCCTCTCACCTTCCTC^ 

ACGAAGC(XK3GTCCCAGACCCTCGAACATAGGCTTT^ 
2 0 GTAGCCGGTATGTTCTGGATGCCACCTTCTCCATGGCAATGTGTCGGCCCT 



CTGGGGGAAGTGCCCCTGACZACATTGCACAGCAGTTCCAAGGGCTCCCCTGGG 
GCCAGCTGGCTGGCTGAAAGACAGGTAG<3GGAAGAGCT 

2 5 TCACCCCTCAGCATGGG CCTCCTATCT CTATAC CCAATTT CTG AG CAG AGAAAAC CCAT CAAGGGCCGGGGGAG AGAAATGCT AGC 

AAGGCTGCTCACTCTGTGGAAGATGAGTTCCTTGGAGTCAGATGATGGCT 
TACTGTCCCTCCCAGCTCCCACAGTGGGATGTATAAGTGGCACTTACACAGCX3TCTGCAC^ 
TCTCTGCAATCTGGX3CCCAGCTGCCATCAGGATCCTGAATCCACTCAGCGG 
GCACCCCCTACTACCATG^GGTACCGATCGGTCCC!TTCCTTG 
30 TCCAGCCTCCACGGCCAAGTCTGACCGGATTCCC^CCACTTCCTGCAGAGTTGACCGCCCAACTGGTGCCT 
CAAAGGACACTGCCAGGTGTGTGTGCTTCTGTGTGCTTGTCCTCGCCAGGGAGCCCAGTGCCAGCT 
ATGCGTGGGGGTGAGGTTGGGGCCTGGCGGCCTCGGGGCCCTGGGGGGGC^^ 
GAACAGCTGGAGTGAGGGAGGGCTCGGAGCTGGCAG CCCTTGTTACTGTTT CCTGTGTATAGC 
GCTCCCAGAGGGCAAAGATCGCATGTTGTATTATTTCrrTCT 

3 5 TCAATAAATACTTGTAGAATTTCAT AGAACCAG CC CATCG C CTACTCACCCTT ATGTTTGAGACTGACCTCTGTTTGAAATACTGA 

TCTTTCCC&GATCTGGGTCCTGTACTGTCC^ 

ACTGGACAACTTAAGGACATTTCTTCTCCCAGGAGGGGTCTTAATATGATAAGATGAGCACTGGCCTGGGTGAGGAACTCTGGGTT 
TGAGTCCCACZATCAGCCACTGAGTTATTGGGTGACTTTGTGCAAATCACTTAACCTCTTT 

4 0 CCTAAGGGGCAACTAGATAGGTCACTTGTGGCCTTGACTTTCTGCCTTGAGAGGGTG 

ACCTCTCAGCTCCACCTTGCCGCTGTAGCTGCCCAGGTAGCGGGTATCAGTGGAGGG 
GGGCCTGCAGGCGG^CAATCTTGAGCACCACGGCATCACCTTC 
AAGACAGCIATAGGAGAACTGGGTATCCTTGGTACTGACAATGCCCAGT^ 
GAAGTTCTGCTGGGCAGGGCCCTCATAGCCGGTCACATTGCAGGAGATGGAGACA^ 
45 CGGGGACC^C^CCTCCCGGGCCCAGCATCCCATTCCTGTAGGGAAAGGCAGAAGGAG 
CCCTCTGCCGCGACAAGC!ACC!ATTCTTGATCTCTGCCTACAAAAGGAAAGGAGACCTG 
TTCTGAGAATAGGAAAGGGATGCTGTGATATAAGACA.CCTGGATCTCAAGGAGGTGGCATG 
ATGCATCCC^TTTCTGGCGGACTAGAAGCAGAGCACCTGAAGGCAGAAAGGAGT^ 
TCCAACTGAACCTTGATCTGTCCCTGCCACTCACCCACCTCCATGTCTGCCATTCCT 

5 0 GGCTT^GGGGACTCTGAGACTACATGTCCCTCT CCTTTG CTTGAGGGGAGCTGGCAGTCTTGCTCAGAAGTGCTAGTTGGCTCAGCT 

GTGTCACCTGGGCGAGACAATGGAGCC^GTGACCCTAGCTGGAAAGGGCACAGGCC 
CTCTCCAGCTGCGCCATGAGCTCACTGCTTCTCTCACCCC^ 

CTGCCCTGGCCATTGG^GGCAGAAGATCCCCTCCTCCAGTGCCTGCCAACCTTCCGGGCTAGCCC^ 
CCCCAGTTCCTTAACAAAAGCCTTCATTTGCACATGX3TATGCATTCATTTACATATATGGCT 

5 5 TCCCCAGCTGCCCCTTCTCATCTCTCTCCCT^ 

CCCCAGCAGATACTCCTGGCAGACTTAGAGGGCTTAGCTCCTCCCTTCTTTCCTTCCATA^ 

CTCAATGTAAAG AGGGCTAGGC CACC CCTCCCCAC CT CTCCCAATT TT ACAG ATGAG AAAGGTAAG GCAG G AAAAGT ATAATATGT 

TAGCCAAGATCATGCTGTCCCTAGATGX3CTTCCACACACTCCTCCAGAGGGG 

CCAGGCXTTGAAGGGACTGACCTCACCAACCAGAGTGTCACT^ 

6 0 AGGAAAATGCCCAGGAGTCTGTGGTCAAACTCTACCCTCCAGCTTCT 

GTGGGATTCCAGCACCATCCCCATGCCC^GCTGTGTGGCCTGG^ 

GGAGAACTAAGAGCTCCAGCTCTGACCATGTGTGAGTGCGTATGTGACTCAGGAGAGCCCT 
TCAGAGTGACGGTCGAGACAGCCAAGCTGACACCTTCCCTGAC^ 

6 5 GGCTCCACAACrGCTCATGCTTGGAGATGCCCATGGGAAAGTCACCTCCACAGCCTTAGGA 

CCC CT CCTCTGT ATCAGT CG CAG CAAGGAAAGGGACAGCAAAGAGG CC TGCTTTGGAATCAGATCTGTG TT CAAAT CCTAGCCCC A 

ACACTCACTAAATGTGCrCTCTGGGGCAAGTTACrT 

TCAGACCTCTGCGTGCTCCTTTTTCAAACACAGGCCAGC^ 

GGAAAACACACCAAGTGCTAGGCAACCCAGGC^ 

7 0 AGAGGACTCATCCTAACCCTGCAGGGCCCAGGGACCTGCAGCAGGGGAAGGCTTTGCTTGGTGCCACTGTC 

AACAGGCAGCTGGGGCTACCTTCAGCCTCTGCCrTGACmCAGCAGCT 
TTCAAGGGCTTCAGCCTCATCAACATCTGTACTGGCAGT 

AGC CCTCCAGTT CAGGC CAAAAACAATT CCACTGT CATTATC C CCATG CATCCATG CAAGATTGGC CCAGAACACC CCACCATGAA 



75 



ACTAAGCGGTGTGAAAACTGATGGGTGGCTGCAGAGCCAAGTGCCATTTGGGAGAC^ 
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GGGTGGAC^TGGGGGTGAGAG^^ 

TGGGCAG^CTGGCCTCTGGGCCTCCAGCCAACTCTGGGGC^ 

AGGCCAGGCCTGGAGCTGAGTGTGGGG CAGAAAGGAGTCG CAG CATTTGGTGCAGCGACC CCAGT ACGTGGGT ATGCT AGCTGAGA 
TGTGTGGCCTGCCCCGGGAGGCCGAGCAGTGCCTGGGGCAGCACCTTACT 
5 GGGGAGCCTGCACTCTTCCCAAGACTGGCTCGGCGGACAGCCACA^ 
CCTGACGGAGGAGGATGTGAGGAGCCCCGAAATGCTAGGGGGGTC 
CGGGGGCCTGGCTTACCTAGCATTAGCAGCAGa^ 
Q3GCC^CrrCIX3GGGAGGCTCaMGGGATGGCGCGGGTTCTGGGGG^ 

10 TCCAGCCCrrCGGCAGTTCTGAAACCATTCTCGCCCCX3GCCCX3 

CTACAGTCGGACTCCAATCCTGGCTCCTCCCCGGGCCCC^^ 

CTAACGTCAGCCTCCAATCCTGGCTCCGCCCTGGArc 

AGTGAGCTAACTGCACCTCTGCGCATCGAAATTCCCACCCACCCTCGCA^ 

GGAGAGTTGGGTGCAGGGTCCCTGGGATTGGCGAGGTGACTGTGACCACGCAT^ 
1 5 CTCGCTGCAAACCCTGCTGAAGTAGGGGTTGGCGGAAGCC 

ATCCAGAAGGTCACTGGAGACCTGTTTTTCACCCCCTCAGAGGGCA 

HUMAN SEQUENCE - mRNA 

ACATTTAGGAGAAACAGCGGTGTCTGCGGCTCCCACCCTTCGGGGGGCCOT 
20 CCCAGGGGTCTCTGCTGCCGGCTACTCTCCTCT^ ■ 

GGCAGGTGGCAGCAGCTCGGGCCCCCGCCGCACTCCAGGCGCCCGCAGCGCTCGCCCTGACGCGGCCGCCATGG 

CGGCCTTCTCGCCCGGGCAGGAGG AGCCGCCGCGGCGCCGCGG CCG CCAGCGCTACGTGG AGAAGGATGGCCGGTGCAACGTGCAG 

CAGGGCAACGTGCGCGAGACATACCGCTACCTGACGGACCTGTTCACCACGCTGG 

CTTCGTCCTGGCCTACGCGCTCACCTGGCTCTTCTTCGGCGCCATCTGGTGGCTG 
25 TGGAGGACACCGCGTGGACGCCGTGCGTCAACAACCTCAACGGCTTCGTGGCCGCCTTCCTCTTCTCCATCGAGACCGAG 

ATCGGCTACGGGCACCGCGTCATCACCGACCAGTGCCCCGAGGGCATCOTGCTGCT 

GAACGCCTTCATGGTGGGCTGCATGTTCGTCAAGATCTCG^ 

TGGTGTCGCTGCX3CGACGGGCGCCTCTGCCTCATGTTCOT 
^ ^ GCCAAGCTCATCCGGTCGCGCCAGACGCTGGAGGGCGAGTTCATCCCGCTGC^ 

CCCTCGAGAGGGACGACTTCGAGATCGTCGTTATCCTCGAGGGCATGGTGGAAGCCACGGGAATGACATGCC^ 
TACCTGGTAGACGAGGTGCTGTGGGGCCACCG CTTCACGTOVGTGCTGACTCTG 
CTTTCACGAGACTTTTGAGGTGCCCACACCTTCGTGCAGTGCTCGA 

ACTGGTCCATCCCCAGCCGGCTGGATGAGAAGGTGGAGGAGGAGGGGGCGGGGGAGGGGGCGGGTG 

3 5 GAGCAGAATGGCTGCCTGCCACCCCCAGA 

GACACAGATACATGGGGAACrTGCATATCGGAGGTGGTGGAGGAGGAGGAGGAGGAGGAAGGCAAAG 

TTGGAAAGTCCCCGTCCCCCAGAACCTCAAGTCTAGAAACCAGTATGGAAGGGAGGGGTCCT 

GCCGGGTGAAAATGCCAGTCTGTGTTTGACCTTCACATTTGTTCATGA 

G ATGGGAAGATGGTAGC AG ATAAAGACAG CTGACAG AT ACAT AG ATGG ACCAGT AGACAACTGG TCCAC TCAGGGCTG C CACT AAC 

4 0 CTGTAGAACACCCCTCTGCAAATTTTAAAAAGGAACCCTTTTCCrrCCAGACAGATACAGC 

AGCAGAGTATAGGATGGATTGCAGTCCCCAGTCACCTCTTCTGCCAGCCTCCC 
GCCAAGCTGAAGTGIAAGGAGAAAGGAGCCGGACCAAGATGGGCACATGAG 
AGGCGTGCAAGGGGCTCAGCAAGGTGTGAATGACCTTAGTCCGCAAGTTCAGGGAAGCA^ 
GGCCTGGAGAGGGGCCTGGGAAAGGAAAACCAGGGATAGCTATTTT 

4 5 AGGAAGAGAGAGAGAGAGGTTCTGGGGAGGAAGGGCCAGGAGAGAGATCTAGAAAGTGG 

CCCTAGGAAAGCAGTGTGTCCTTGGGGCACAGTCATTCACATCACTGATTGGGTC 
CCTGTCCTTCAAAATAAGGGGCACCTGGGAAAACAGAGGAATCTACCTGTGGTGACTGAACGAGGGA 
TGCAGTCCCGTGGAGGGTAGGGGAGTGTGGGTGATCAGAAGGCTGGGGCCAGTGTAAGGCATAGGGAATA 
GAAATCTCCAGTGTGCGTTGGAATCACCTGGAGGGCTTGGTAAAACACAGATTTTTGGGCTC 

5 0 AGGTGGGGACCAAAACCATGCATTCCTAAGAAGTCCCCAGGTCATC 

GCTCTAAGTGAATACTTGGAAGTCGTTTCAGGACATC^ 
AAGAAGCTGAGGATCCTCACAGGAGCAGACAGAGA 

GAAGAGTTTAACTTTGGGCCCAGAAACTCAACCATCAATGGAAACAGGGCAGTGACAAG 
GCCCGACAGAGAGATGAAG 

55 

HUMAN SEQUENCE - CODING 

ATGGCGCAGGAGAACGCGGCCTTCTCGCCCGGGCAGGAGGAGCCGCCGCGGCGCCGCG^ 
CCGGTGCAACGTGC7VGCAGGGCAACGTGCGCGAGACATACCGCTACCTGACGGACCTGTTCACCACGCT 
GCCTCAGCCTGTTGTTCTTCGTCCTGGCCTACGCGCTCACCTGGCT 
60 GG CGACCTGGAGCAC CTGGAGGACACCGCGTGGACGCCGTGCGT CAACAACCTCAACGGCTTCGTGGCCGCCOT 

CGAGACCGAGACGACCIATCXJGCTACGGGCACCGC^^ 
TCCTGGGCTCCATGGTGAACGCCTTCATGGTGGGCTGCATGTTC^ 

TTCTCCTCG CACGCCGTGGTGTCGCTGCG CGACGGGCGCCTCTG CCTCATGTTCCGCGTGGG CGACTTGCGCTCCTCACACATAGT 
GGAGGCCTCC^TCCGCGCC^AGCTCATCCGCTCGCGCCAGACGCTGGAGGG 
65 TGGGCTTCGACACGGGAGACGACCGCCTCTTCCTCGTCTCGCCGCTGGTTATCAGCCACGAG^ 
GAGGCGTCGCGCCX3TGCCCTCGAGAGGGACGACTTCGAGATCGTCG 
CCAAGCTCGGAGCTCCTACCTGGTAGACGAGGTGCTGTGGGGCCACCGCT 
AAGTGGACTATGCCAGCTTTCACGAGACTTTTGAGGTGCCCACACCTTCGTGCAGTGCTCGAG 
CTTGATGCCGATCTCTACTGGTCCATCCCC^ 

7 0 AGCTGGGGCTGAC^GGAGC^GAATGGCTGCCTGCC^CCCCCAGAGAGTGAGTCCAAGGTGTGA 
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Table 108 



MOUSE NOMENCLATURE 
ICSGNM Ppp3cc 
Celera mCG3861 



HUMAN NOMENCLATURE 
HGNC PPP3CC 
Celera hCG15020 



MOUSE SEQUENCE - GENOMIC 

TTAGGTAGATAGTTGGCTTTATACTTGCTACATAT^ 

CTGCTTCCCAAGTGCTGAGJVTTACAA^ 

TTAACT AGAATGT AAAG AGGCTCCrcTGTTAATCTTATAT ATG CATG CTACCACACAGTG AGATAGAT AGT AGTAGAT AAATA 
TTTACTTAACTATAATGTAATTATTTCTTTGGT^ TTT AGAATTTTTACTTT AAAAACCTTT CTAGCT 
CATTGT CTACCAAAG ATT C CAG AG CTT CACTTT AG ACAAACATC^CTGTT ACTCAX3AAGTCT AGT CTG 
TTAGTATGTAGTGTTCATGTTAGTAAATCTCCCAGCCT^^ 

CTTGCCGT AGGAGATAGAGACTGCGATGT CT AAG CTTCATAAAGAAACG G AGTGTGTGAC CTTG CACAC CAGTCG TCCTTAGCTCTT 



GTCC CACGTAGC CCAGGG TAAC CTCTAG CTATGTAG CTG AGGTGGTTG AGCCAATCCTT CCAG TCTTAACCATGC CTGGTTAAGTT 
TGTTGCTTTTAAAT ACAAT CAAGGAGGTCGG CAGT CTGAGT CTGGAAG TG AG TGACAG AAGAAT AAAATG CC CCGT CGTCG CTT CA 
GGGAAGAGGGCAGTGGGCTTTGGGGCCTGTTGCAC^TAACT^ 



CATCTTGCATCTGTT CTTGT CT ACAT AGTTACTG CTTACCT C CACA 

GCAGAATG CT CTG ATTTGTGT ATTAGGAAGAAGGGAT CTGGGAG ATGGCTCAGTGG GCAAAGTGCTTG CT ATGCAAG CATGAAGG C 
C^GAGTTTGAACCTAGCACTCCTGTGGTAGCCAGTGCCTGTACTTCCAGTGCTGAGGAGGTGGAGGCTG^ 
ATGCACACACCCAAGGAAGCCAGAAGAGGGCGCTGGGTCCTT^ 
TCTGGTCTTCTGAAAAGTGTTCATAACTACTGAATCATCTCTCTATCCCT 



AGGCTACTGAGAAATTTAACTGAAAATCTGAAAGAAAAGGACAAATAGCCC 

AT AAG ATGT ACTTTT AAATATTGTCTAGGTTGAAATG C C TTCTTTTAAAAGGT AT GCTTAG GG AGAG CACTTGT AATGTGTATGAT 
GGGGTTC CAGAT AG CTTT AATATGTGAGTTT AT I"irr AGTCATCTAT ATTT AAAG AACTGT AAAATT AGC CAT ATG CATGAAAAGT 
AG GCAAAAG AT AC CAAAGT CCTCIACTG ATGATTCATGGATGAGCAG AGTAG ACTG G CCGGTAAA 
CATTATTTCTTTATCAGCAAAGCAATCAGCAGGGAGCCTTT^ 

AATTATATTGTTATCAGTATTTATTTGTATAAGAAAATTTGAGTAG CT TG CT ACTAAATTAAAAT ATTTACAGTAT AACTGACTGT 
AAAAACTGT ATTT CAGCAACACAG GTAATTATG CTTATTTTATTATGT TACCAGTGGT CATTTT AAT TGGT AGGCTT CT AAGAG AT 
TTGTGTATTTTTTTATACTCCTTAATTTTAAAAAATAGTTTTCTATTCTC 

G CAGAGC CTTG AAT ATG CT AAGCAAG AGTT CTACCACTGAGCTGT ATC C CT AG CCCAAT AT AT AATAT CTGTG ATTCTAAGAATTC 

TTTAGGGGCTAAAGAGATCGCTCAATAGTTAAAATCTCTTACTGTTCATGC^ 

CACACAC^GTGGTGGTGGTGGTGGTGTTAi"ri"iGAGGCAGCATCTCTTTATATAACCTTGG 

GGGATTAAAGGTGTGTGCTACCACATAGAGCTTTAGTTTTACTTT^ 

CTATGCAAACAGGATAGTATGCTCTGGAACCCTATCTCT 

GGG'i'T Cl'AAGAATTGAACT TGAG ACCTTGTG CT TGAAAGGCACTGAAC CATCTC AC TAG CC C TG TGGTTTGCT TCAGT AAGTTAAA 

GAC CTCAAGCTCACTAC CACCTTGAACTGci' tt AT AAtttt AGT CTTTACAGAAATTGTGAATTTTC CT ATCTTCT GACATCTGT 

CAATATTCTT ATTCAT AGGTAAATT C CAACTG AATTGAAT ATT AAT AG TATT ACTGTTCATTGACAAG CTT ACAAT AGAGTGAAAC 

ACAAGTCTAAAGTAATGTAATATTGACTTAATAATGACTATATTATATAATTTTAAAAGGTATTTAGCAA^ 

ATCAATTG CTT CCTT AAAATATAAAG C C CAC CACTG AAGAACAC C AGG AACATG GTATAGTGAAAAAT AAACTTTAT TT ATTTAG C 

TTATTGTAGAAAGTATTTTCACCCCCTAAAAGCTGTGAGCATG 

AGTCAGGTTAGTAGTCT<^TCAAGGGTGGCTCAATTCTCATTTGAATGGTT 

AGATCTAGTTTTTGGACTt^GATACTGAGTGTOTCTGGCCCTCAAAAAACAG^ 

GTTTGCTTTGGTTCAGTTTCTCTGCTTGCAAGTCAGTGGTTTGTGGCT^ 

TAGTT AAAC CAGGGT ATTTACCACTTAAGGGAGGTAAACCAACGAGATTTTA^ C CATTTGTGAT 

AGATATGTTGACCTGAAGCACTGGCCTATTTCCTGGATGGTGATTTGT^ 

AGGGATCTGGAG ATGACACACTGGGTTAT CCCAG ACAG AATTTGT ATTT 

ATGTATGCATGCATGGGTGTG<3GACATTCATGTGGAAGTCAG^ 

TCCTGAGATAGAGCTCTGGTCACCAAGCAGGAACATGAAGCTCCTTTACCT^ 

TTTGATTTTCATTGTTTACCTTTACAAGTGTCTGAATGATAATTCTAGTAT 

TAACAAT AAAGGAGGGGGTGGGAGC CT (TTTGACATTCTAGAATCITG AC CCAT AGGGAACAGAA 

TTTTTTG CTTG CT CAGTTAT'ITT T CTCT CAAACTCTTAAAAAAGAAAAAAAAGG TT AGACTT ATT TCAC ATGTTT CAACAGTCAG A 
GAGATT CAG AG G AAAGTGGGT ATTG TCTGAGAAGGAAG CAGTT ATAGG CAG G CACT CATCATGTACTGCTG CTAGGAT AATAACAG 
TTGTCAGATTTTAGGTAGACATTTG 

TGTTAAGTTGACTCCAATGCTTACAGTTGCCGTTGACTTA^ 

TTGGGTTTTAATCATCTGTGATGTGTGTTGTGGGTTGCTGTGCTrCACTTTGTTTTATGAATCATCG 
TCTTTTAATTTTTTGAAAAAAAAATTGATGATTTACATTC 

CTGGAGCTGAGGTTACAGATAGTTTCGAGGTGCTATGTGAGi~i\rrGGGAATTGAACCCATTT 
TAACOTTTGAGCCACCTCTCCAGCCCTTAAAGATGTATTTTATTTTA 

AGTGCACCACATGCATGCCTAGTGCAGTGGGTCCCCTGGAGCTAGAGCTACAGACAGTTGTGAGG 

TGAACCCAGGTCCCCTAGGGAAGCAGCC!AACACrrC!TTACCCATTGAGTTATC 

CCAGTGTGAGTTTCTTGTGTATTTGTGGGGTAAGGGGTTCATCTCATGGTAG 

TC T CTGTAGG CTAG CATG CCTGTAACCTTTGGATTGTG GAGGCAGAG AAG GAAACCAATG CAATGTTAAAACTAG CTTGGTCTGTG T v 
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TCTGAGTTTAGGCCATCCAGGGCTATAATATCAGGAGACTCTACTCAAAAAA 
ACAAAACCAAACCAAAGAATAAGTATTTGTAATTATTGAAAAGCTGGAAA^ 
GTATTAGTTTTCTCTCCTTTTTCAGATAGGGGTAGACATTACTGGOTA^ 
ACCTCTGACGGCCTTTTCACTTCCAGTAAATGGCCATASTC 
5 GGGGGAAGCCCCTTTACTTCCCCTTGAACCTGGGAAGCrTGAGA^ 

ATGCTTGTCCTTGGAACTCAGT CACATGAAAGG CCACCTACAGGT ATT 

AACCTATACATAATTATTGl^lT*!' AAGTCACTATGTTCAGGGTGACAAAT CAAAG ATTGTGGGAAAG AAATAT ACAAGGAGTTTAG C 
AGAGGG TGGTCATGCTAGC CTCAG CTTCTTGGG AAGCTAAAAC AGGATTGTCT AG CGITAT AGAGATG CTGTCTCAAAAG CAACAGC 
1 0 AACCCAAGGATGTTGGTAG(ZAATATTAACTTATAACAAT 

TGGAGAGATGGCTCAACGGTTAAGAGCACTGACTGTTCTTCCAGA 
AACTATCTGTCATGGGATCCGATGCACTTTTTTGGTGTGTGTCTGAA 

ATCCTTAAAAAATAAAGAAAAT AAATAAACTG AT ACATTTTTT T CCAT AAAGAAAAG ATCTGTCTT CAAAAGAAAACAATC^ 
TGTTATAACCAATGTCAAAATGGTCCCAGTGTACAC^CATGTAAATAAAGATGATTAAGAACAACA^ 
1 5 TGGGGAATATAGCTCATCAACAGAGGCTTCCTAGAGGGAACAAGGCCITGGAT^ 
AGCAAGTTTAAAAAAAAAAAAAGGAAGGTTCCAGGTGAGCTGAT 

GCTGGTCTGTGAGGGAGGACACCCTCATTATGTAACGTTTAAAATCATGGCTTTAGATGGCTTAAAACCT 

TTG ATTTTGAACAACTAG AAGAAGTGTACAG ATTGAGCACACAGT ATTAAA CTG CCAGCCC CTCCCT AC TG CCTT CAGGGCAGTC 
TCTGGGCTTCTTATTTTCTC7rGTCTCTGTCTCA^ 
2 0 ATGCTCTGCCAG CCCTCTTG GATCATG CTATAC ACTTCATG AAT CTTAACTG CTTTTATGATCTTAT AGGGACTT AAAAGATTGGT 

TTTCCTAAGAAGCACCAACTTGTCACTGTAGGCGCACAGTAACTAAAA^ 

TGTCTCTTCTGTTATGTTAAACTT CAGGTCTG AT TCCTG CT C AG CTGTG CACAT AGACAT ATT AATT ACTTAAGGAG TGT CCATCT 
AAAAACTG CTTAT ATAAAAT1T GTt AGCCTTAAATTTGC CT AAC CTCTCTGTAAAAT ACAAG ARCG AAGACACGGGAATCTAAGGT 
ACTTTTTTT AAGTT C CAGAATT TTGT AG GCAG ATCACACAGT G ATGTT ACT ACTCTGC CAAGG AGG AGCAATTAT CCCATGAAGGA 

2 5 AACT AAGGCAGATT C CTTATG CAAAAGT AAT CT CTTGATGAG AATTCT ATCAC CC T CCT CAAG CAGT AAAGACTCTT ATGTAGCCA 

G CAGGACCTTG CAGG AT CAAC CAAAAT AATGGT CAACTCGAC CATTCT AGTCAAGGTT AGCTGTTT ATCAGT CAGG CAAACACACC 

AGCTGTGCAACACCATCCCTTCTCCTCATTTCTCTAAGACTCAGAGTTCCTTCCAGAAACT 

CATTTCTGTACTTCCTAGTTCAGCCTTGATCTGTCT 

TCAGGT CAGGGATTGTGTCC T AAACTCCT ACAAAAACCTTG CACTTCAGCIT^ 

3 0 TCATCATTTTCTT ATGTT AAAACACATCTCT CAC TT CAAAGG AAAT AAG AGCCAAATATGAGTG AC CATG GT CTTTG CTGTG AC CT 

GACATTTGACAAATAGCAAC T GAAG AAGG AATGGTTTGTTTTGG CTCC TAGGTTG AGGCTGTAGTATATCATGGT AGGAAAG CAAG 
AGTGGCTCCAGCTGCTGTGGCCCTGGGTGCCAGCCCAC^ 

GTGGACAGTTGTTCTGGCAACTCAAGATAAAGCTTGGGTCTCTCCTGCAGATATCGT 
GAAACACTTTCTCTAGGTGGTTGATTTTTGAGTGTGKIAACCCCTTCAGCTG^ 

3 5 CTACTTTAATAAACTGGGGCTATTGAGTCCATTTCCrGTTGGC C CAGT ACAAATGGAAGGTTGT TCT TTAGTGATGCT AAT CT CAT 

ACAGT AG AGAAAT AGTCAGG G T AATAGC CCCACT T CTCAGT AC CAATT TTCATAT AGTCAGGGTTTT AT AGAAGAAG CAAACCAAT 

AAATGTGCATATTTCTGTTATAAAAGGAGATGTATTAGGTTGACACATGTAACAGGGCCCAGTGGTGGCCATCTGTA 

GCCGAGAACCCACTAGTTGCTCTGTCCATGAAGCTAGATGCTTTACACAGTACCAATCTGGCACTGAG 

G AGG CAGTGCTAGGAAGCTGAAGTAG CAAAAGATGCCCCTGCTGAGAAGTCACG AGGGAGGCCGCAGGTG CACACTACCTTTCCCT 

4 0 TGGACCCCTTTATTCTGGATTTCCTGTTGGAGCGTGCCC^CTGACTTT'GAGAATAGG 

AGCACCTTCACAGACATACTGAAAGGTCTGTCTTTCTI^ 
CAATAAC^TTACAGCATGAAAGAGGTTATGTGGACTTTT^ 

CGCCTTTAATCCTAGCACTTGGGAGG CAG AGGCAGGTGGATTT CTGAG TTCGAGGC CAG CCTGGTCTACAGAGTGAGTTCCAGGAC 
AGCCAGGGCTATCCAGAAAAACCCTGTCTCTCAAAAAAAAAATAATC^GTTAGCA 
45 ATATGAATGAGATTCTTAGCTTTGGGGTTGGTGGAATCTAGCAGCCTGCTAAATTAATTCATTTCAAAG^ 
AAAGTTATGAGAGGTCAAGCATTGAAGTGGATATTAATGGCTTCA^ 
TCTGAGGATGACCTTGAGTGTCTGTTCCAGCTGCCTCTAC^^ 

TTATGTGATACTGGGGAT CGAACCCAGG^CTTCTTCTrATTCAAGGGAAGTACTCTACCAACTG CCAGCTGTT 
TCACTTAGACAGATGTATTTGTTGCGTTCATTTCTTCATCTATTCATTCAGTGGA^ 

5 0 AAC CACATAACTTG AACTGTG AC CATTTT AAAATTAG AATG AT CAATT CTGCTACACAG CTTCTTAGACTTTTTCACATTCTAAGA 

CTGTGCAGTTATGATTGCTTATTAGGCTTAGTTTATTTTAAGAAGTATGACTGTGCT 
TGTTTTCTCTTCCTTAGCTGTCCCCTTTCCTCCAACCCGACGGCTAACTTTGAAGGAAGTTTT 
ATCTTTTGAAAAACCATTTAGTAAAAGAAGGTCGGGTGGAAGAGGAGGTG^ 
AAGCAGGAGAAGACCATGATAGAGGTGGAGGCTCCX3ATCACAGGTATC 

5 5 CCCTTAGGTTTGTGATAGTTTGAGAAATGATTAC CAAAGG C CAG CTGAAATG AGT GTG AT AAAT AT TTCT CT AATT TT GAAGCAAT 

CTAGGGATTTTGACTTTTGTTG CTTAAAAAACAACATTAGGGTCTGAAACTTGGTGGATAAGAG CACATCCTGTCTTGCAGAAGAC 

TATGGAC^CTTGTACTCATGTACACACACCCCTCTCCCAGACAGATGCAC^ 

AG AAGCAT AC T CTT AACTGCTT CAT ATTG ATAATAG GGGCAT TG AGTT CTG AT TGAGT AGCTAAATATACAGAAGAG CAGTG AAAA 
60 GGAAAGTGATTAAACTGAAGTCCTGTTGCCCTGATGCTTCTTCCTAACGCACA 

GGTGATGTTCATGGACAATTCTTTGACCTGATGAAGTTGTTTGAAGTTGGGGGATCACCTAGTA^ 

TGACTATGTGGACAGAGGCT ATTT CAGT AT AG AG GTAAAATT AACATGG CTGT ACTG C C CCTATTGT AT CATAT TCT AAAAGAT CT 
GCTCTCCAITCCTTAGTAGAAAAGAITAAGATATGAATGCTGATAGAATTTATCCACTTTATT^ 

TTTTTTTCT C AAAAAAAT CT AT CCTTTT AAC CTG TAG AAAAC CTGT(^TATGG G AGGCTAT ATT AAGCTAATTTTGAAAAT AGATA 

6 5 TTTG CTAAGGTTTT CTC CTTAAAGAT AGCTGT CTGG GAT ATAGGG ACATGCATAG CTTTTCATT CTAAACAGTGTCTG AG CTAAGT 

ACCTTAATTATAATACTGCAGCCAACGCTTAGGTTAACTTCCT^ 
CTGGTCCTGTTGCACACGGCTGGCATTAATGTCCACTAGAAGGCAAAAGATC 

ATG TTTT AAGAATTAAATG ATGG AACATGAGTAG G ATTAAAAACAC^T ATGTGCTT TTGTGGGTC^AACAT AAAATGATTT TAG GG C 
C CAATACAAAGGCTTGAGG ACCATG TAATT ATAG CACATCTG CTTCACAAATAAAAC^TCCTGTGTG AAG AAGTCTTCAT AAT^ 

7 0 GTTTGTAATATGCCCTAGAAAGAACAGC^GGTGACAAATTCTGTCCTGTTTTAGTGC 




CCTCTTCTTCTTCCTCCTCCTCCTCTTCCCCTTCCTCCTTGTTGTCT 
7 5 TCCTCCTCTTCTTCTTCCTTCTCCTCTTCCTCTTCTTCCTCCTTCTCTTCCTCT 
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CTCTTCTTCTTCCTCCTCCTCTTCCrCCTCT 
CTTCCTCCTTCTCTTCCTCITCTTCCT 
CTCAACTGTCTGACTCAGTTTTCAAAGAAGAGTCAGTTCrc 
CAGTTAAGTAGC^GTATAGGGAGCTAGGGTGTAAGTCACT^ 
5 CTCTGAAAAAACATTGTGTGTGTGTGTGTGTGTGTGTGTCT 

TTTTATCHXSTTACTACCC CAGACTCATT AGTCT C CTG CTGA 

GCCTTGTCCTGCTGCTCATGCriCACAGTGAGCACCCGGGGATAGCCTCT 
ATGTGTCACTGAAGACCTTGCCTATTGCAGTCAGGA 
TTAAAAGGTAGAAGATAAAGAATAAAGAGTGTTTTCAATTATCTATT^ 
1 0 GCCTCCTCCTCCTCCTTCCTCCTTCCTC1TCTTC1TCCTCCTTCT 

TCTTCCTCCTTCTTTGTATTATT ATTATTATTTCTTTTG TTTTT AAAG TTGT ATTTAT TTAT ATGAATACACTGTAG CTGTCTTCA 

GACACACCAGAAGAAGGTATOTCATTACAGATGATTGTGAGCCACCATATGGTa 

CAGTCAGTACTCTAACTGCTGAGCCATCTCTCCAGCC 

CTTCCAATCCCTTCGTTTTTGTTTTTGTCTAAGTATTTCATT^ C CTCAGAGTTGCTGTATAT CTG AGGATGACC 

1 5 TTAGTAAACTTCCGATCCTTTTGTGTCCATGCTTCCATG^ 

GGGAG AT ACT C CAC CAACCGAG CTAATT CCTAGCC CCAGGGACAATGTATT AAACATAAACATT AGTGCTGG AGAGATAGCTCAGC 

AGTTAAGAACACAGCCTGCTTTTGCATAGGACCrGGGTTCAACCCCTAGCA 

TCTTCCrCCAGATGATTCTGGCCTCTGTGGGCAAAI^TCT 

CTTGAAACAAATAGGACAAAATATTTTCTAATTATTATGACI^AC^ 
2 0 TTTGTTTACTTTAAGTGCTAGGGATTGTCCCAGTGCGAAATCTACC^ 

AGGTGTCTTTGTGTTCCCCAAGTTGGTTAGTTGATCTTTCTTTCCTTACCCTTACC 

CTGTAGCCCAGACTGGCCrrGGAACTTGGTTGGTCATGAATTTCTGCT 

CCTCTGGCTCTAGTTCCTGGTTTTCAAATAGTTTCAAT^ 

GTACCTCTGTGCTATTTCCAGGAAATTTCATGTTACATGGTTTTCCCTCCT 
2 5 GGCTGTAGAATCCAATCTAGAATGAGATTCAGATAGAGGTAACTCAGCAGTAGAACCCTTGCCT 

ACAAAAAGAG T AAAATT CGTGTTCTCTGGAGG AGCT AG CAGT CCTAT AGT ACAGGCT CT CCT TTCAG CGGGTCTT CTAGT ATAATA 

GAAGTCTTCCTATAATCAGCAGTAGGAATCTTCGTATTAAGACCTTTCCCA 

TATTGTTCTTG C CCAGGAC CCAAGTCTGGT TCTGM CACAGATATTGAAAG 

ATGTCAAATAGCCCAAAGAGTTGTTTGTGAGCTTTGAAACCTGGGGCT 
30 GCCCGTCGTTACATGTCCTGACTGGCCTAGTX3CCTACCTATCT 

TGTAAAACCGCGCCAATTCCTCCCCCAGCCCCACCCCTT^ 

CTCCTGTGTGAGATA€GTTTCGAACCGGAGCTCCGCCATTATGGCTCCACCATGTGGT 
GTGTC^GCCCGGAGCTCCATCATTAAACTACCraiTGCrrTTACA 
ACGACAATTGAGTGGGGGTTTCCCCACTAGGTTCTTTCAATATCAG 
AAATAAGTCTCCrTTTGGGCACTTGCCCTCATGTG(^CATACTTAAAAACACA 
TTAAAGACCTCGCCAAGTCTTCTAATATATAGTAGGCATTCAGTTTT^ 
CITTGTTATTTTTriTACTTGACTGATTTCTCT 

TTCTTTTTAAGATTTTCACTTATTTTTATGTGGATAAATGTTTTCCTGAATG 
GAAGACAGCIACCAGATCCCTGGAACTGGAGTTACAGGAGCATGX3GACCCTCCATC 
4 0 GC^AATAAATGCTTCTCACTGCTGAGCGGTCTCCAGCT 

GCTTGCTAACTAAGGCTGACCTTGGACTCCriTACGTGCCrGTCTCCACTCCCAGGTGCT 

CGTC T AGGTT ATGTTT AG CTTTTGGATGCAAT CATTTTT ATT ATTTTCAAGTTTT ATTTGTGTGGG GGGGGGAGAAG G AGG GG AAG 

GGAAGGAACTCGGGAGGGAGGGAGGGAGGGGAGGGTCCATTTATGCTTTTTGTATGCAAATGCT 

AATCCATTCTCTTGGAGCTAGATTACAACAGTTTTGAGCTGGAGA 

4 5 CAAGCACTCTGAGCACCTCTCCAGCCCCTGTCTTTTC 

GAACTATTCCCTTACTCTGACTTTTTGTTGTTCTTT^ 
TAATCATTTATTTTATGGGTATGGGTGTTTTACCT^ 

TCAGATCCCCTGGGACGGGAACTACCTGTGATTGTCAGCCACCACATGGAACCTGAATCTGCT 

AG CTGTCCT CAT AG CACCTTG AATG CAT TTTTAAAAAGAAAATT AGG ACAAT AATGG ATATT C CTT CTG TG ATTCT TTTC CATTCA 

5 0 ACATCATATTAGAAACATTTTTATCATTAATTTTTCTATGATGAC^ 

TTAGTGTTGGGTAAATAGGCTTATAGAAGGTTGCTCACTGCCTTTCCCCCTCCrrCCCTCAGGAACAAAACCCC 
AAAGTAGGGCTCTGGTGGCAAGGGGAACTTTCAGACTCAAGAAAGGAAGCC^ 

CTGACCTTGTTAAAGTAAATCCTGCCACTTGTGATTAGCCTCAGGATATCAGA.GGCTCATTCK3CCT 
CTCTGGTGCTATTCT TGGTTITTGTTTGCTTGGCTGCCCTATGGGCAGCTTC CTATGT 

5 5 CTTTGGGCTTTGCCI^ATTTTGTTGATGCAGAATCTCACAAAGTCC^ 

GAACACCTACCTTTGTCTTCCCC^GGCAGCGGGTGCAGCCrAAAGCTTTTGGACAT 
GGTGTCCCSU^GGCAGCCATTTTATGGCTGTGAGGACTGTCCTGTGAGTCACAGAGGAATG 

ATTTTAACTTTTAAATTATATTTTCTGTGTGTGTGTTTATATGCATGTCTATAGA^ 
60 CrCCACTATGTGGGTCCTAAGAGTTGAACTCAGGTTGTTGGGCTAGCAG 

GCTAATTAGATTTTGTATCACTATACTGAAAATGGCACAAACTTATTTTCC^ 
TGTAGAATATTCCrTTGAATTTAC^^GATATATT^ 

CTTTGTGGAGTTTGTTGTGACHTTCCACCTTTATGTGAGTACTCrGATTGAAC 

TCATCTTATCCGGATAC^TCGTTGATGGCATGACAGACTTT'AGATCATAGTCTTAAATGTAAGTTCTTGAT 

6 5 TTGTATGCTAGTTTCTATATAATTTGTTTACCATCTATGAATTTGAA 
ATGG ATCACT CAGGGCAGGCCTTTCAGAGATGAG G ATATTTGGGTAGAAACGGGAATG ACCTTCAAG AGAAAGCACAG CTGAGTG G 
GAGTCCCTTGTGCGCAGAGGGCACAGAGAATGAAAGCTCAGTGACAGGGACAGCCCAGAC^ 

GGG AGGAAGAG CTG CTG CAGGCAGACAG AGAGACTGAjCAGATAGG CAG ACCTTG CAAACCATAGTCAGGG AGTTTAGATTTGTTCT 
GGGTGTGATATGAAGCACCTGCGTTGTTATTTTAATTGCTATTTTTTCTG 

TTGTGTCTTACAACAATTACTCTGACTGTTATGTAACCTGGGACTG CTGGAG CCTAAAGACAGGCACAGAAACCCCACTTAGGAAG 
ATTATCTAATAATGCTTGAGACCAGGTGTGCTTCTCCAGAGGTTTTCAA^ 

TG C CTGTCCAAAAGTAAAAT CTTCG AAATGCTCCAG AGGCAAAT CT^ CACAG TGG CATTCACTG AGGTTCAGATT CT 

GCAGT AT CTC AG ATTTTGGAAT TTGG TAATCAGTCTGTGAT ATGGAT AAGAGATACCAG TAACAAATTAGGATGGACAG ATGGAGA 
GACAGACAGATACACATACACAGACACATACAGATACG^ 

7 5 ACCTTATCTGCACTCAAGGGTTTTCACCCTTCCGGTCTCCTCTTTAGTGTCAGGCTTATTA 
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TAAGCATGCATTTCGTATACTCCTCACTTTATAAA^ 

ACATAGTTCTTTCTCATTCCCTTGCTATAGTGTCATGGCGTTGCTTAGCATGTGT 
TGTACACATGTGTTCATTGTCAGTATTTTACTATGATAATGTTG^ 
TTGTCTTTTGACAAGGTTTCTCTGTGTAGC^ 
5 TAAGTGCTGAGATCAAAGGCATGCATACCACCCAGCTTTTTTT^ 
T ATGTATGTAG CCCATGTATGTGCCTC GTAGCC C^GAGGTCA 
GTAAATCACCATGTGGGTGGGTGCTGGGAACTAAGCCCAGGTCCTCTGGA 

CATG CATTCTTGTTAAACTTCATTTTAAC CTGTGGACCCCTCATTCGTT TGTC^CTTGT TGGGTACT ATGT AAGATTAGATCTAAG 
1 0 TTTACTA'rrr r'ri"lGAAATGGCATCTAAC C CCAG AGCTGTTTCATT AAAAAG CT CATTAC C CCAGGGATCTC CTCCACCCTGrriT 

TATGAGATGAATTTTTAT ACTGTAGTGCAGGCAAGT CTCCAATTC^ 

GATGAGCTGCCATGCAC!AGCrGTTACCTGGTGATTAGAGATGCCG C CTT T ATTGTGT ACAGATGTT CATATGAACTTGG ATCTGTT 
TCTGGAXrrACTGGAAGATTGAGGCATGTGG CAGGG CACAAG GATTTT ATTTTATTTT AT TTT ATTTGAAC TGAAGTTTTATT ATAT 
AAC CAGGTTAGTAT AGAAAGTAGTCT AAGCT AGCCITGAATT CATTGCT C CTATGCTA 

1 5 CTT CAT CTGACT CAGGACATTATGTTTTTGTTTCTGT ATTTGG Ti "1'rGGTGTTGTTGTT GGTGGTGG'i'G'l'TTTG'ITri nTGTTTT 

TTGTTTTGGATACAATATGTCATGTGACCTTGAACTAC^ 

TGGGT ATGGGACATTATGT ATGTTTATGGGTCCTAGGATCAAACTC!AGGAC CACTCT ACCAACTGAG 

CTGCAACC C CC^CIUTT TGTTGCTGCTGTTGTAAAT ATT ATTCG TCGGGGTGGGGGTGGTT AIT C T ATTTAGACCATAT AGTTCT 
CAAAGACACAAAAACCT TT AG TTTTAAAATAAGCCTTAAAG GACTAG AGCTGGGCAGAT AT CAG C COT CT ATG CTCTTTTGTCTAC 
20 TCCCCTCTCAATAACCCTGAGATATCACTTGTGTGTTGTG 

TCACCATTCATCTCCCCCACAGCGACTTCTGCCTTCTCCCCCTCCTCCrTG 

CCCT CTLT'rri'GTACAGCCCAGGCT GTAGGCAT CTTTATTAAC CAATCAGAGATAACTTGGAAG CAAGTTTT ATACAACTAAAGCT 
GGCATATGTAAGGATCTGTTCTTCTt^GAGCAGTGAGATCTTGGGGTATAGAG 

CTCCCATACATGTTTATTTATOTATTCAATTGT ATATCCTATCTTG CTG AATTCTT TCAC T AAGOT ACTTTTT ATCATTG ATTTTT 

2 5 CAGGGGTGTTTTTT AGTTATACAATTATATTGT CAG CTTTTAATTTTTC CTTT AC CCATTCTAC CT CAATATTTAGATACTTAGAG 

GTTTTTTGTTTTGTTTTGGATACTTAGCAGATTTTAAAGAAGGTTATTAATTTTACCAAATTTAA^ 

AGATTACAT TTACAACTTGGG ACT T ACAGTGAGATAT ACCAT AT AGCACAT CATTT AG ATTTAGAT C AAATGTGTTTATGTATTGT 

CATGGTCATGAAGAACACTTATCCTCTAATTGTGTCTCCTAGTGTGTC 

ATTGTTTCTGCTTCGAGGAAATCATGAATGCAGGCATCT 

3 0 AGAACAGTCAGTTATCTCAGGTGCTGCCACAOT 

TGCCTGTAGCTCAGCGCTCGGGAAGCTGAGGCAGAAGGATCA^ 

TAAAAGAAGATGGAGAAGAAAAGTTCAATGCTTTTACATTGCTTATC^GAGAATGAGTTTGTTT 

CACATGGTAGTCTCC^TTTGAGAAGGTTATAATTGAGTCGCTTCACT 

GAAAGGAAAGGTCAGAATTATTAAAGAAAATTGTAATCCAGGCTTC 

3 5 AAAAGGATTAGGAATTCAAAGCCAGCCTTGGCTATGTGGTGAATTCA 

CAACAAAAGAGACAGGATCTT CAGATCATT CCAGC C C CAGAAACTTC CTG AAAT AAACACAC CAT C AGAT AAT AACCAGGAAGGGT 
TTCAAAGT AGATTT ACT CTTGGTAGGGATTT AAATTGGTGTCAAGT ATTTGGAC AG CAAT TTTG AAAT ATTTG TGAAGATTGGTTT 
TGTATTTGTGAAGTATACATTCTCTTCTCT(^GTATTTTGACTGCTA^ 
TGTCACAGCAGAAGTCAGGATTCTCTGTAGGCTTTGCTCAGC^ 

4 0 GCTAGTAGTCGTTTAGAGTCCTTGCAATGATGGATTAAAGTCCTTTCTTCTAGGTCCCAAACTGGGGGTGAACGGTGG 

ACTCTCACTCTCAGTTTTTAAAGATTGCITTCAAGT 
CTGGCCCTGCCrcGATGAGTCTCTGACTITCTGCAGTCAGAGCTGGA 

GTTCATCCAGAAGACTCTTCCCITTCCGTCCGGTGAAGTGTAACAGTGAGAGGCTCATCGTA^ 

CT ATACAG AGTGG AAGCCATTGG AAGTC ATGT T ACAATTTGT CT AGCACAG ATGGACAG AAAACAG TAG ATATTTGCTT CTT AAGG 

4 5 ATGCTCAT TG CAGTGTTAATTATAAC!AACA7VAAAAAATTAGATGT C AAT AAAGGGC CAATT AAGTACAAT AT ATGAG TAT CAG AAA 

GGGGTGTCGGAATACAGAGTTAGGAAACTTTATGGAGTCCAGTCTTTGCTTCACCT 
GGTCTGCAGGCTTACACAGCAAGTGCTTTACCTGCK3AGTCATCTTCCT 

TGTTT AAAT ATTAAGGG AT CTGTGTGTGCTATATGGAGAGATGCTTAGATGTGOTATTAGCTGAGAAAAACCATT 

5 0 ACTTTTTGCCACTATTTTGAACTAOTTCTGTAATTTTTTTCAAGG 

ATGGCTTTTATTTGTTTTCTTTTTTCTTCTTTTCT^ 
CAGTGTTTCTCTGTAAAACATGGCTATCCTAGAACTCAAGCT 

CTCTG C CT CCTGAGTG CTGGGATTAAAGGTGTGTG CC C CCACT GC CTGGCTTAAAAAATAAAT AAAT AAAT AAATAAATAAATAAA 
TAAATAAATAAAT ACAGGGGCTACACAG G AGGAAACC CTGTCT CAAAAAAC CT AAAT AAATAAATAAAT AAAT AAATAAATAAAT A 

5 5 AATAAATAAATAAATAAATACAGGGCTACACAGGAGGAAACCCTC 

AGATAGATAG&TAGATAGATAGATAGATAGGACTGCAGGACTAGTGAGATGCTGAACACTTGCTACT 

GTTCCCAGGAAGCAAAGATCTGATTCCTGCACATTGTTGTCTGCCTTCCACA 

CAAGGTATGAAAGAAAAGCATAACTGTGCCAGTCTCCCTGAGAACTCCCAACTCCT^ 

AGTTT CTCT AACATAT ACTG GTTT TG CCTAAAAAATTACAT AAT ATAT ATAAAACAGGCAACAACGGAC CTGGGATCTGCTGGATG 

6 0 GAAGTCAGCTACTTGCACTTAGAGTTAAGTGAGCAGGAA 

TTTCCATTTGACAGAGGGTTTTCTGCATTG CCCTGGCTATGCACAGCTCATTGGTTTAGGCTCCTTGTTTGG CTTGGTTTTTAAGG 
GACATTTACTGTAATTTCCCGCCCTGTT CT CCCCTTTTAAGCCACAATTTTTATTTGTTCTTTTATGGGTTTGGTGCTGGAGAGTG 
AAGCAGATTTTGTTTGTGATCAGTATAATTGGTCCTGAGTATTTTTAGAAGATAGAACAAT^ 
TTTATTTTGCCCAGAATTCCTCACCTCATGTCTCTTGTCTGCTGGCGCTCAGTGTGCTCOTAGCTGTACTCCAGCCT 

6 5 GTCTGCTGACATACTAATGTACACATATAAAATGTTATTCCTATATGAGGAAAGAGCACGTATGTGTC 

TCGTCTGTGGGCCAOTGTGGCTTTTAGAATCAGAACCATATAATTACTGCTAAGTGATGCT 
CTTTTAAGGAGTGTGGTTTTCTTGTCTCATATCAGTTTTATTTATTT^ 
AAGAAAGAGTCTGAGTTCCCCAGGTTGGCCTTGAATTTTTGGGC^ 
ACCAGGGTCACCAGGCCTGTATCACATTTATTCTTACTTATACT 

7 0 AAGAAAATTTGAGGGOTGGAGAGATGGOTCAGTGGGCACTGACT 

ATGGCTCACAACCACCCGTAATGAGATCTGATCCCCTCTTCTGGTGTATCT 

AATAAATCTTTAAAAAAAAGAAAATTTGAATAACAAACAAACAAAAAAAGAATAACTGATATCT 

GGGGCTTGTTAGACTACCCAGGATGGCCTTGAACTACCTC^GTACCT^ 

CCTTCTGAAATTAC!AGGCATGTGTTACAGAGCCrGGCTAATAGTTAACTATCrGCCTACTGTTT 
75 GCAAGAAGTGTAGCGAGTGTCTTGACCCCGTCTCIATAGTTTTGGGTCCAGA^ 
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TCTTTTTGAT AG CTTACTGACAAGTT CCAGAAACTTTTGTAATTTTCTAAAG G ACACAG CACTGTTCTAAAGAAGAATACTGTT CG 
TCACAGTATTTGGAGTAGCAGATTGGTGTGGTTAGATTATGGCATATCCATAAGT 

T AC CAAGCTGTTATCATAAGAAAGACAGAATTTTAT CTTATCC CTGG GGATTTCTCAGCTAGG CTAAAGAAATTGGTTTGCTAATG 
GCCTGTACCATTAGTCCCCAGCACAGGTGACAGTGGACCCT 
5 AATCTTTAGTGAATGCAAATGAGAATTAAGAAAATATATG 
AACTTCCTGGAATTCTTTTACAATTTTGTTTTATTGCTG^ 
GAGTTAAGATCAGATTGGACTATATGAAACCTTTT^ 
ATAAATGTCTATAATATAGACATTAATTTTTCATGAAAATAAAGC^ 

AGTATTT AAATTCTATTTAATTT AAACTTAA CTAGAAG CTG 

1 0 CCAGTTCAAGGCCAGCCTGAGCET 

TTCTCTGCCACTG AAGCTTGCAGTTATGAG CACTGCXJ CC CAGTTTT CTCCCCTTTTATGTAAT TGTGTGTT AT ATTTAT ATGTATG 
CTTATCTGTGTGTCTGTGCCTGCTGCTCTCAGAGACCAAAC^ 
TCACCATGTGGGTGGGTGCTGGGAACTAAACCCAGGTCCTCTGG 
1 5 AGCCCCAGTGCTCCTTCAATCTTACATGGTAGAAACATC 

TTGAGGATTGAACCCTGGCTCTATGGTAGAGCAGACAGTGCTCTAACAGCTGAATCACCTCTCCAGTTGCCT^ 

ATGAACATGG GAATGTG CCTACTT CATAGAGATT CTG CGAAATG ATCACTTAACACAT AGT T ATAACCATTGTG TGCCAGGTGT AA 

ATAC^CCTCTATCCTCATAGCATCATGACTT^^ 

TGC TCAAGATTCCAAG CTAGAAAGCT AACAAC CAATCTAAGGAC CAAACTTAAGAG AGTT AGGTTACAG GTCATTTATCT 
2 0 GACTGGCTCAGGCATTTAGCTCACGGATATATAC^ 

CATCAGAGAACATT AGAAAAAAT AGGACTCGC CAGGTGTGGTGG CA.CACG CCTTTAATCC CAG CACITGGGAGGCAGAGGCAGGCG 
GATTTCTGAGTTCGAGGACAGTCTCGTCTACAA^^ 

CCCCCCCCAAAAAAAAAAAAAGAAAAAAGAAAGAAAAAAAGAAAAAATAGGACTATGGTTACTGT 
AATGGTAGCATAGGACt^CCTGTTTGCGCACATACT 

2 5 TGGTTTACAGAGTCGCTCGGTACTCTGTGCATACTTGCTGTCT 

TCTGAGTGATTATTAGGATGAAAGGCATACTAGATGAT 
GIGATTCTTATAAAGGAATTTTAATTGACATTGGTAACAA 

CACATGTT AAAGATTCTTTGTTTG LTl" l-rTAl u ri u rGAACAGATGTTAT ACT CACAG CrTGGATCTGCTTTTGAAATAG CAAAGTGAGG 
CTGTAACTGTAAGGAGGTCCATCCCTCCACAAACTGAGCTCATGTTAAAGTTAGA 

3 0 GTGCCA.GTGTGAAGAGAGCCCTCAGATTCCATTAAACCCAAACCrrG 

TATGAGAGGGAGGCCTTCTCACCTTCGTTTATACTTTCCTTTG 
CATAGCTTATCOTGGAAC!TCACTTTGTAAACCAGGCTGGCCTCT 
TGTGTGCCACCATGCCTGGCTCCAGTTTACAGTTTCCTATATC 
TAGCATTCACTCTAAAGGCACAGCTTGACTTTAAATCCAAGGTCAACT^ 

3 5 AGCTGTGAACCTTATAAAGAAAAGAGCTTTGTTTAG C TT ATAGTTTTGG AGliTTT CAGGGT ATGAT AC GAGCATTGGTTTTGAATT 

GATGAGGACTTCATGTTGGGAGAACACAITAAAGAGAGAGAGAAGA.CA 

AGGTTTAC CCTGTGTGGTAACATCC CACACTGAAACCCACACATCCCTTGAGAGATACATTCT CATAGCCACACCATAACACC 
AAGAAGGAAAATGCTGGTTTTTTTTTTT^ 

4 0 TTGTGGCIACACCTACAA.TCTGCCCACTTAGGAGTATTTCCTTGTAC 

AGGAACTCGAGAGATGTCTCAGAGGATAAGAGCCCrGGATTCCCTAGAACAGAATTTAAGGAGAATTGTAAG 
TACAGCTAGGTAAAGCTTCCAGCACCCACATAACAACAGGTAG CTCCCAAGTG CCTGTCATCCAGGTCCAGGGTCTACTGCTGCTT 
CTGAGGGCACCTGCACAGTGCACATAACTTAGGTAGAC^ 

TGT AAAGT ACACTAG AGT AGTAAT AATTTTC CTACACATTTTTTAATG CTT AGTGG CTAAGAGC 

4 5 TGCTCTTCAAGAGAGGACCCAGATGGCAGCTCACATCTCTCCTGAACrrCCAGTTCCAGGGGATCCAGC^ 

CCAGGCATGTG CT CAGTG CATGAACACACACACAGG C!AAACTACTC ATACACGTAAAAATAATCTTTTT^ AA?UUVCAG AAAAAAAT 

GAGCCCAAAG^TTTCTGATACAGAAATAGTTTTCAGTACTCT 

GGAAGCCCCAGGGATCAGCAATGCCAGCATTGTTTGTAAACGCTTAAT^ 

TAAAGCCGTAACCTTCCTCCAGGTAAATGATGCTCACTAATAAGACATGGGCAGCCACCTGCCTTATCAGAT^ 

5 0 AGCTAGAAAGTTAATCTAAGGTC!ATACTTATCTTTAGAAAGGGAAGA 

TTTCTCTGGGACAGGGTCTTCAGACTTGCCTGT AACCAGTGG CTTCCGAGCTGGCTTCACCTGGTTAGT CTCTT CAGTATGG CACT 

GTTCATCCCCTGACTTTGTCCTCTTTGGGTAGGTCGGATCAAG 

GTCTTCCTCTTGCTGCCCTCTTAAACCAGCAGTT7CTCTGTC 

AGGAAAGT AAGTAACTTTTTACTATTTT CATAGAGTGTG CTTTT AAATTTTTGTTC TTT ATCTTAG CCAGTG ACT ACCTATAT AAC 
55 AT AT AGCAATTG C TG AGGTTAAATG AAAATACAGTCAG AG GT AG AAATACGGGACTGACT CATAAT CTTGTCACTGTCAGAGAGCT 
GTTAGTAACACTTGGAGAAACATTCTTTCTCT1T1TAAAAATTAAAGTTTTGTTTCGTTTTTTG 
TCTCGCTGGAATTCACTCTGTAGATCAGGCTGGCCTT<3AACTCTCA^ 
GTATGCGCCACCMTGCTTGGCCTTAAAATTAAATCTT^ 

AATCATATTTGTATATTTTAAACAAGGTCTCTCTGCAAAGCCCAGACTAGCCTTGAACT 

6 0 T GGT AT AATT ACAGGAC TTGTG CCAC CATGCCCAGCTTTAAAAAATT AAATT ATATTC CAGCTTG CAAAG AAAGAACATGAATGTG 

AAGCCTTGGCCAGGCCCCTTACCCTTGGAGTACATTCrTTCTGTAA^ 
TGAGGCGCACTGCCGTTTGCATCCTAAACAGGATGGTTGGTTGCTCCATGTGCCTTACA 
ACTGTGTCTTCATGACTGAGGGAAAGGAGCAGCCTTGTGCACGTGCACTTCC^ 
ACTCAGGATTTCCTTGAGTTCACAGCAGTCTCCCAGTC 

6 5 TGTCG CCTTTAGTAGATTTTGTATTTAACCACTGTGATCAGCTGTGGTAAAAAG CTCCCCTCrCACTCAGCAGAAGCCCTCTTCTT 

GACTCTCAGATGTTCTGGTGTCTTCAAAAACAATTATTTATATTCTT^ 
AACTG AAAAAGTAAGTT AATGACAAATATTAAAAGTAAGTTAATG ACACCACT 
AAAGAAGACATCTCTATTAGAACGAAGTGAGCAGTGGACAGGATGCCTGCCTG^ 

G AAAATAGAATTT AACTTTAGTCGTCACAATTTTTTGTGGAG G C TGATTTCACATTTGAGGTAAAT TTCACTAGATTAG CATGGGA 

7 0 GGAGAGGCCCTTTTCTACAGGAACAGAGATAAGATACTTG 

GGGCTCACGGCCTCATCTGCAATACTTTCCTGTAGGGAGAAGGGGACTTTTAATCTGC^ 
TTTTTTTTTCTCrTCTTTTGGCCCTTGTGTGTCrTCTAOT 

T CTTCAGGTAAG CTGAGAACACAGTGAAACAAACTGGGC C CCAT^ CAGTTTTT CTTT AATTACATATGTGTCTATAC 

7 5 ACATGAATGCAGGTC CAAGAG AGAG ACT AG AG ACATAAG ATC C C CTG CAG CTGGAAAT AT AGGCAGTTGTGAACCACCCAGTGTGG 
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GTG CTGGGAACIT AGGTCTTCTATAAGAAGAG CACATACTCTTAACCACAGJ^GCATCTTT CTAGCCCACTGTG AT CCATATAATG 

GAATTATTCTTTTCTAAATTAATriTlT^ 

GCGCCCTCTGAAGTTAGAGGAGGGTATTATATCCACTAGAACT^ 

GTCCTCAATAAGAGTGGTAAGTGTTCTTAACCACC^ CGCCATCTCTCA^ C CCTAGTTTT ATGAAATTAATCTTG C CAC CACACCT 

TCTTTTTTCTCTGTCCCCTCTCTCACCTCCTTC^ 

TGGGCAAGCACTTGTTCAATCCCTTAATAGCA^ 

ACACAGTTAAAACCTCTCCCAGTATCTCACTGTTATAAGCTCT 

T AACGAT CIXLACTTTTAAAAGATTTTATTT CITT ATTTT ATT ATTTCTAT ATTTT AT AG CTGTCTT CAG ACACACCAGAAG AGAGC 
1 0 ATTGGATCCTATTACAGATGGTTGTGAGCCACCATGTGGTTGCTGGGAAT^ 

TAACCACTGAGCCATCTCTCC71GCCTCTTATTTTTATGTGTGTCGGTGC 

TGGTGCTTGTGGAGACCAGAAGTAGGCATCGGATCCCITGGATC 

GTGGGGGCTGCAAACGAAACCTGGGTGGGTCTTCTGCAAGAACAGGAAGTGCT 

CTCTCTTTCCCTCTCCCTCCCCCCCTCTTTTTTTTCTGTC^ 
1 5 TGTATGTATGTATGTACGTACATACGTACGTACGTACGTACATAATCCCAGTATTTGAGAACAGGCAGGCTGCACAAT 

AAG G CCAGCTTCATCTACATAGTGGGTGC CAAG CCAG CCAGG C CTACG CT CTTAGGGTTACTG TT ACTGTGACAAAACACT ATGAC 

CAAAAAG CAAGTTTTTTTGGGGGGGGGGCAGGGAGGGTT CATTTGGCTTATCAGATCACAAT CCAT CATTGG AGT AAGTCAAGACA 

GGAACTGAACTAGGGCTGGAACCTGGAGGC^GCTGATGGAGAGACCATGGAGGGTGCTGCTTACTGGC 

ACTTCTGCTATTGACCAAAACAAACCAAAACCAAAACCCCAGAACACT 
2 0 AAAGTTCACATGTG AG CTCTTGGTATGGTTTG C CAAGCCTAAATAAATAAAT AAATAAAGTAG AGAAAAT AAAGACATTGAAATCT 

AAACT AATTTT AATTC CACTGT TTTATT ATGACAAATTT AGTT CTTTAAAAATCTT TG CT AACTGGTT AAG ATAATACAGTAATT AA 

ATGATACATATGTATAAAAGAAGCCTTAATTTGCCCCTAAACATTTTTAAAATAATCGT^ 

TCTAGTGACTTATTTACTGAAACAATTTAACTTTTCCCTCTTATGA 

GTTACAGTTCCTGTGGTCACCTGGCAACACCrrCTTACAA 

2 5 CTATCCCCATACTCACCTGAGTTCCCTGCATCCCAGTCCGTC^ 

ACAGATGCACCTGAGCCAGCCCATACGGCGCAGGCTTACAGCTGACA 

TCTCTTTTGTTTTACGTGTAGGAGTGTTTTGCTCACATGTATGTTTCTGTACCACATGTCTTC 

CCCTAGAACTGGAATTACAGATGTGGGTGCTAAGAACCAAACCCAGATCT 

CCACCCCCACCTCACAGTTTTGTTTCTGTTTCTGCACTTTTCT^ 

3 0 GCTGTCCAGTGTCCXSTGAGCAAAGCAGACTGTAGTGTAACTTACAGAGATTTGTCAAGTGTGA^ 

AATGTTAATGATTCAATCTTATATTTACATAAAATATCATTATGCAAG^ 
GCCAGAGGCTCCTAGGAATCTTAAACCTCATACGGAACCTA^ 

GAATTCCACAGCCTCCTACCTCATTCCTAAGGCCTTCTCAAAACTGCTTCACAGTCTATGAAT 
CTATGCCAGTCAAGAGGGGGACTTCCTGTTCCGA 

3 5 AGCATAGTAAAAAGAAAATTGCCCCCTTTTTCTCA^ 

G ACAGCCTTGAACTT CTG CTT CTTCTGTCTCACCT C CCAAGTACTGG GGTTACATG CG CAGCATCCCTAA7VATATGGTGTATAGGG 
GTTG G ACCCCGGAATTTGT ATG CTGCACAAG CAGTAGACCAG CTT AGTTATAT CCGAATTCAAG AAAG AAAT ACTTTCTAAAAGAT 
AAGTAAGTAACAG CTACAAATTCTGATGGTAATTTTATATCCTAGTTCTT CCTGGTTGTCAAACAGAAGTTCAAAAAAAAAAAGAC 
CATTTTAGGAGTATATGCAGATATC!ATTAGAGTATGGTGGTCTATAATATTTATATCTACTCCTC 

4 0 T CTTG CTATTG AT ATT ACAT AT AAAG TT AT AAGT AG ATTCT TT TAATTTC 

ATTTTGTAG<^CCTACTGTATAC^C^TACA^ 

AATACTAGTAGTGTTAAGGTAAAATAAATTAAAAGATGGAGAAACTTTCT^ 

GAAATAG AAAT CACTGGTTT CTGAG CTlTTlTTTi w lTTTTTT TT AATCGAAATT G AAT AT AAT AT ACT CGACAG ACAG ACAT AG AG 
TGTTTTTCCTCTGGGCTGGAGGAGTTACrrCATTAGACCrrGCTTCCTGTC 

4 5 CTCTGAGGAGGGAGGTTGTGAGGGAGGAGCTGGTTTCTGAGCTTTTAATAAAAGTTCAOT 

GTTGAGAGCACTGGCTGTTTTTCCAGAAGGCCTGGGTGGAGGCA.CAATGTTATTOT 
CAAGTGCTGATTTCTCTGCCTCCACGTCTGGTTGCAATCCrrGACATGGCATTGTAA 
GTGATGTGATGCTCCTTATTGATTCTCrrGGTCTGTGAATAAAAGCTGATCA 

CTTCCGATGCCAGCCAGGGGGGAGGGTCGGGGGAGGGGAGAAAGGGCGATGGGAAGTGGGATTCACT 

5 0 AGGGGACCTTGAGAACACACCTGGAGCTGAGCGACCC^GAGCAGTGCTAAAATACAAGGATCT^ 

C CAG AAT AT TTTAGAGGAT TAAAACAGATT AATATTGACCAG C TATTGTGGTGTAAAGCTTG ATT AAACAAG ACTTACAGTAT CAT 
CTCATTGATTTGGAAGCTAGTCAGGGTAAAGAAAAAAATTATCAC^ 

GGTGGTTCGTAATTGTCTGTAACTCCAGAGTC!ATGGGATCTGCCACCCTCCrrTTGGCCTCT 

CAT AT C CAT ACAAGTAGGCAAAAG CCTCAT AGAAT AAAGTAAATCTT TAAAATTGCAGAGTC CAT AGTC TCTGTAATCTTAAAACA 

5 5 ACTCTCAGATTGGGTGTTCTGTTTCTTCCTATTAAAACATAACTGC^ 

TTACCCTGCAGTTTGTGAATTTTTACAGAACAACAGTTTATTATCA^ 
CACTTAGACCTCACCTTCCTGCAGATTTTACTTTTAGTGTAAAAATGCCTGAGTTC^ 
GGAAGAACCAAG CAACTGGCITTCCGTCACTTATTACGATTTTCTCTGCCCCT 

GAAGTC CAGCAAT AT CTTAGTGTG AATTGTT AGTAACTGTG AG CT CTACTTT AATT^ITG TGTTTT ATTTT ATT ATTGTGTTCATTT 

ACACCTATTTTTAAGTACAAAATTCAGTCAG 

CAACTCCTCTTCTGATTTGTTCTGTTTTGAGAOU^^ 

AT CTCCO^AGTXSTT AGGGTTACAGGTATGCACT C CCATTCCT AG ATGAAGAAGAG ATCTT ATTGGG AAAAAAAGGT CAGTAAGTTT 

6 5 AGAATGTAAGCTGCTGCTTGT CC TT CAT CT CCTG AG AG ATT AGGC TCTTTGG GAGCATATTTG TTTAGTATTTTAAAATGGGA 

CACTGTACAACCCAGACCAGCITCAAACTCCCACCCTAC 

TCTCCATTAGGTGGGTGGTTGGTTGGTTTGGAGACAGGGTTTCTCTGTGTAGCCCT 

GCTGGCCTAAATTTGAAGATTCACCTGCCTCTGCCTCCTAAGTGCTGGGGTTAAAGGTGTGAGCCACGACTG 
CATTTAAATCTGAGACTGAGAGCTGGAGAGATGGCCCTGTGTTTTAAGAGC^ 

7 0 T CAGGGGACC CAACACCAT CACACAGACATG CATG CAAGCAAAACAACAATATG CAT AAAATTAAAAATAAGTTGTTT TTTAAAAA 

TCrTGAGACTATCCTTCAGTGGTTTTTCrTTCTCTTTTAAT 
TGGGCGCTGAGTTCCACTCCCAGCCCTGATCACTCACT 

TTTCATGGGTATTAGCAGCTTGGGTTTTTTTTTTTTTTTTTCTAGTTTCCTT^ 
75 CTCAAGGATAGTTGTGCCCCTATGCCTCGTTTTGTGAGGCACTCTCrTCC 



2184 



WO 03/008583 PCT/US01/51291 



GGAAAGCCITGGGGCTCCCAGTCTGCTTAG 
TTTATTTTGTTTTATACATATTTTTTGAGACAGA^ 
GCCTCAGACTCACAGAAAGCTGCrGCTGCTAACTTCCAAGTGCTC 
GCCTCTTTAGTTCTGAGAAGAACTCTCGCACCTTCTTCCTCCTATGGCTC 
5 CT AGGCCAGATGACCTTTGAATTTGCTTTGTT 
ATTACAGGCIATGTGCCATGTCTGACATCT^ 

AG ATCTTT AGGCTCATTGGTACC CAGGCTGTAT CTri^l*TC C CTGATGACC TAAAACATCCTTAAATTGGGG'i' rrrrATTCTTATT 
TCTGTCTC CCCAC CCCCTAT CCCCTGGGAACAT ACTCCAAAAATTT CTr AAGCTTC CATTTTTAATCATCCACTGTTGAGACTAAC 
TTCTTTTTCCTGTG ACAG AATTACTGATATATTCATGTCAGCTATATAAAATG CACACAG CATGAAAATAATGGAAGTGTACATGT 
1 0 CAGGTAAAGTTAGGCTTTCAACAACTGGAA^ 

TTTCATGT CTCAGTAAGAATAATTGAAAGAAAGCCAGGTGTAGTGGCACATAC CTTTATAATC CCAGAACTTGGGAGGCAGGAGG C 
AAGTGGATTGCTTAAATTGAAGACC^GTCAGGGAACT^ 

AAGAGAAGAGAAAGAGAGAGAGTGTGTGTAAAAACAAACCAACAAAAAACCCTAACCAAACAAAAA^ 
1 5 ACAAATAACAACAAACGAGACAGAATAAGGAATTTTTAAAAAATA^ 
GGCTGCCCCTCACCTAAGGCAGGAAGGTGGTGGCTGCAAAGACACAA 

CTCAGG AGG CAG AGG CAGAGGCAG GAGGATCTCTGTGAATTTGAGG CCAT CCTGGTCT ACAGAGCAAGTTCCAGG AT AT CCAGGAG 
TACACACTGAAACCCTGTCTCACCCACAGTCACAGGTCT 
GTACACGCTGAGGTACAGACTCTGAAGATGATCAGACAG 
2 0 j^TCTGAAATAGAATACTATGCCATGTTGGCCAAAACTGGTGC^ 

CAGAAAATACT ACAGAGATGCATG CTGG GTATT ATTGAC CCTCGTG ATGCTG ATATT ATCTGAGGTACCAGAAC^ 
AATGAACAAGGAAAGCTTTGCCAGAGCTCATTTAAAAGAAAGTA 

GT CAG CTGTTT ATAGACACTCAGAATGCTGAG AATAGAAG ACACTTTT TT A^ C T ACCCTT AAACATT 

AATTATAAATTATTTTGCCATTTATTGCACTGTGAGTTGATAACATCTTGGAGTGTTTATAA 

2 5 TTGCAAAGGAGTCTCCGTCTGTTCGATGGGTAATAGTGCTATTTGCCCTTAGAAATTGTGACTGTTAA 

TTATAGGGAGTTTTCTATGTTGAATCTGAAGTTTCITCAGTAGCTA^ 
CTGTGTTTTGATGATGTGAATTCCCCAGGTGTCTTTGTTCCCGGTGATAAT^ 

AAACTTGTCAGGCTGTCCTGAATTTACAAATGGAAAATAAGTCATTGAAAAGTCACTGTAGTAGGCTC^ 

G CTG CT ATG CTGGG GTTCTT CTGTGCACACATG TCTTG TGTGTTTAC T ATG AAAACAATG TCATGA 

3 0 ACATCAGGCAGTTGAACTGTTCCCCACACCCCTACrrGGCTCCCAAACTTC^ 

GAGAAAGGTAAGAGAATCCCTGTGTGCATCTGAACACCAGCTCT 
TGTGTGCTAATTTTGTTTGTTCTCTACAAAGATTTTA 

ATGTGTGCAGTACTGGGTGCAGGATGTGTGTGTGTGCAGTACTGGGTGCAGGATGTOT 
TATGCGTGCAGTACTGGGCGCAGGATGTGTATGTGTGCAGT^ 

3 5 GTGTGTGTGTGCCTGTGTGCATCTGAACACCAGCTCTCTTCT^ 

TAATTTTGTTTGTTCTCTACAAAGATTTTAAAAATTTTTACTTATAAAA 

GTGTGTGT G CAGT ACTGG GTG CAGGATGTGTGTGTG 
TGTGGAGAGGGAAGAGGACTACTTCGTGGAGTGGGTTCTCTCCTTTCACC^ 
40 GATTTACATAGGAACACCTTGATCCAGTGAGCCCTATACAGAGATTTTTAAATGGCATTAATCTGTTCAT 
TGGTTTTTCTTTATGTGAAATATAAATTTAAAACAACTGTCITC 
TTCAATGAATTATCAGTGCTAACAGAATGTAGACTGAA 
CATAAAACACACTTACACAACTGAGCTGCGCAACGCAAACAGACCC^ 
GAGGCCCGTTGTGATTTCTTTGTAAAATGGAAAGTTTTATATGCATATCTGAGCC^^ 

4 5 AGTAGTTCTGTGATAGACTCCACAGGTTCTTATCCAGCCAGGACAAGTGTCGACAAAAATTCTCTCTGGGG 

AGTGGGTAAGAGCACTG ACTGCT CTTCCG AAGGT C CTGAGTT CAAATC CTAGCAACCACATGGTGGCTC CCAACAT C CAT AATG AG 
ATCTGACACCCCrCTTCTGGAGTGTCTGAAGACAGCTACAGTGTACT^ 

CTCTGAACCTTATGGTCTACCCTTGGTTCTATTTTCATAGCrAATTATAATGTGATAAGTTGT^ 

ATTTTAACTT ATG CACAT CCAGAAGTGTC CTATTAATTGTGCACACACTGTGTTATTTGCCACTTAGTGACAGAGATGCTGGTCAA 

5 0 TATTCTCAACATATGCTCGGATGAAGAAATGAACGTAACC^ 

GATTTCTTTTTATTGGTTTTGGTCTTACTAGAA 

TGGTGCATGCCTTTAATACCAACACTTGGGAGGCAGAAGCIATTC^ 

CC^GGGCTATACAGAGAAACCCTGTTTCAGGAAAAAAAAAGTTC 

TTGTTCTGCCATCTCAAATGTCTGCCATCAATGCCTATGTGTAAACTGTCT CTTTACATAGATCTTTTTTCTT CTCATTTTT 

5 5 GTCACTTACAAAAAT GAGGTG ACATG CT ATCATGT CTTT ATATCT ATTTCTTGTTTTCCTTT AT AGTCT AGTTTTT C^^ 

TAGrrGTCACATTCATCAATCACTGTGCTAACCAATATTATAGATGCTGCTGGGTATTATATTTT 

AAACTG CT ACAATTTGT CAC CAG ATGGAAGAGG CACACACCATAAT CC CAGCATT TAGG AGG C AGAGGT AGATGGAT CTCTG TG AA 
TTCAAAGCTAGTCTTGGTCTACAGAGTAAGTTCCAGAGCTACACAGAGAAGCCCTGTCT^ 

GTTT CAAATTTGT ATAG ATAAATCACTGCT CATTCTAT CAACAG C CAG ACTTTGTT ATATG CCTGATGTGTGCCAAATATTGTTTA 

6 0 TGCCTTTCTTCTTAATAAACATACATTGTACTATGGAAGACAAACAAGAAATAAATACAATGCAGCTA 

GAAAGTAGTGCTGTATTAGTAACTCTGAGAGTAACATTTACTTGTTTACTTTTATAGGTTAAATGTCT 

AGTATTTAATGCTTAATCTGAT ACAG ATAG CATTTTTG'rri'G'iTTG T TTGTTTTGTTTTGTTT ACACAGGGTTTCTCTGTGT AG C C 
CTGGCTGTCCTGGAACTCACrCTGTAGACCaGGCTGGCCTCAAACTCAGAAATGCGCCTGCCTCTC 
AAGGCATGCGGCACCACCACCCCATCCCCACCCCACCCCACCCCCGCATAGATAGCCTTTTAAAGTC 
"6 5 TTGTGTGTGCTACATAGTTCATAATGCTTTrCCTAGTACTG 

ACACACACACACACACACAGAGTGCTGTAGCTAGAACCCAGACATTGTGTAAGCTAGACT 

AG CCCTAAATTTTTAATTTGTACTTTTATGATG ACTAATAATTG CATT TTT'CTT ATGT (IT ATTGACCATATT ATTTT CTGTTTTGT 

GAATGTAAATTCATATATTACTAGTTTTTCAACTGGATCCTATGCATGTTTTGCTTCCT 

GTGTCTTAGGATAGTAATGCTGGTTATATGTGTAGCTAAGTGCTTTTTTTCCCTTCTGATTTCT 

7 0 CCTTTAGAAGAATTAAGTTTAAATTAATTTGTAGCCAGGTGCTTCT 

CTTGTCTGAGGATACTIGGAGTTTTTGGGGTATGGTGTC^ 

CACTTAACTATTGCATAAAGTGTTGTATGGTGGAGAACTCATAGCTCCCACTGGGAGTCTA 
ATTAGATTGTACTGGGAGATTATAACAGATGGATO 

AGGAAATCTGAAAGGAACTTTAAAAAGAACATAGGGAAAGTAAAATATTT^ 
7 5 ATAGAAATTGAAATGGTTGTAAACTATGATCTGTGTTAAGAAAAACTTTTTGGGCT 
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GCTGATGGGGCTCATGGCCTGAGCCC 
TACACAGGCTGCACAGTGTGCACTTGGTTCTCT 
ACCCCTACCCACTGAGCrGAAAAT^TGGGTTT^ 
CTAGAAAAGTTGGAaU^AATGTTTTTTGAAGTTGGTTGGTGGCAOT 
5 GAATCTCTTAAGTTCGAGGCCAGCCTGGTCrcCAGAATTCCAGGACA^ 
AAACAAAACAAAGCAAAACAAAACAAAACACAAAGTTTTATG^ 
TACAAGGTGAGAAATGAAAAGGTCAGAAACGCAGACAGATGAGGT^ 

GTGGGTAAAAGGGTTAAGAGGAATTTTTATAACTTTGTGTACTAATCTTAAAAGAATGGGCT 

TGAATGT AAGTTT CTGTTCTTGGGACCTCAGAGGGTGACTCrrGCAAGAGCAAGGGTGAGC CAGGTGGAGATATACT ATACAGT CTC 
1 0 TCGTCATCATCTCAGCCTTTGGCATCAAGTACTCCCTGGTAGAAAGATCTCTGTGAAA 
TCTGTCAAATACAGTGCMTGGTGACAAGCAAAAATAAGCAGG 
CAGAACAGGGCTGGCAGGATGGCTCAGTGGTAAAGGAACTTAC^ 

T ATAAGTCTCTGT CCACTGTGTT CACACTGTGGCATG ATCCTCC C CATTACGCAT ACTCAGG CATG CGTGCGCATGTACAT ATACA 
T AAGCGCAAT AACAACAACAGAAATAAG ATTTT AAAAAGTTCCAG C CAA^ CTTTAATT ACACCACTTAGGA 
15 GG CAG AGGCAGGTGACTGTGTGTGTATGGAATAT AGAAGTT CTGG GCAGTC C CAAGAT AGAT AATGAGACTGTCACAAACAACAAA 

GAATTTTCACAGG^CGTGGTGGCGCACGCCTTTAATTCCAGCACT 

CTGGTCTACAGAGTGAGTTCTAGGGCAGCCAGGGCTACACAGAGAAACCCTX3TCTCAAAAAAACAAAAACAAA 
CAACAACAACAAAGAATTTTCTGAAG CAGCAGAAATT AAAT ACAG AC CCT CTCT CAGG CATTGGCTTTGTT AAGCCAAAC 

TTAAAGCAAAGCAAAATCATAATTTGTTTTTTTGGGAGATTGACTGG C CTGGAACT CACTGGCTTGAAACATCACAGAGATCCTC C 
2 0 TGCCTCAGCCTTTTGAGCCCTACGATTAAGGGTATCCCTGGCTTAAT 
AATTGTGAAGTAAAAGAAAGTGTAGCATATTTGATAAAGCATCAAAA^ 

AAGTTTGATG CAGTGAGACAGGG AC TAAGTGACCTG GAAGGTAGGTCAGAG CAAAG CTAACTG AAG CATGGTGAGAAGGGAGTGCA 
GAGTCAGCCAAGAG CCT CAGAACACGCGCTGCCAGC CC AG CACATGGTG CCTG GAG TAC CAGAGAAAG CGGGAAAGGCAGTCT CTG 
AAAGGTCCACATCAAAGCAACTTCCAGAACTGAGAAGACACGGAGCTGTAACTGAAGAA 

2 5 ACAACAGCTTCTAGCCTCTTGTAGTGAGGTCGACAAGATG 

AAGCCGCCGCCGAAAAGCAGGTTCTTCATGGAGATAGGGG^ 

ACTCCAGAGTCCAGCACACAGCAAATGTGTAAACAAGAAAGGCCAAATAAATATTTGTAGATACCT 

AAG AAAATT AGCAGGAAAT ATATACT AAAT AAAATTCT AAAGG ACAGTATTT^ C CAAG AAAGAAGTTT AGAA 

ACACAGGAAGGAATGAAGAGAAGTAGAGAGCXSCATGCAC^^ 

3 0 GTGCCAGTACTATATAAGCAATACTGGGATGAATGGATTCAAAATGTATAGGCCTGAAA^ 

ACATGATAAGCCAGTGTTCTGAACTATAGCCCCAGCCGTTAGCCTTATACI^ 
TTC^GCTGGCTTACAGGTTCAGAGGTTCAGTCCATTATGATTCTAGTGG^ 
GAGCTGAGTGTTCTATCTCCTGTTCCTAAGGCCACTAGGAGACTGACTTCCAGGGAGCT 
ACAATGACACACTTCCTCCAACAAGGCCACACCTAATAGTGCCACTCCCTGGGCCAA^ 

3 5 CACTAGCTTGTATCAAGGTGACATAAAACTAGT 

CTTAAAGAACTCCACAGGAGGGCTTGCTCTATAGATGTCAAGATTTAATATAATGCTAC!AGCAATTAAGATGA 
AGCTAAGCATTGAAACACAATAGAGAATCCAGAATTAAGCTACATGTATATGG 
TGGCTTTGTTTGTGTGAGACCrCAGTGTATAGCTCTGGCTATGCTGGAACT 
GATCCACCAGCrrCTGGGATTAAAGGTATGTGCCACCATACCTGGCC^TAATGGCATTTAA 

4 0 AAAAAAGTAAATCCTCGTGGATTTATTTTGTGGATCCAAAATACCTATTTTTGTGACTATATT^ 

AGGCTGGGGGTATGGCTCTGCAGTGCAGTACTTGCCTAGCATC^ 
CAAACAGGCCAGTAACAGCCAAGCAGTGATGAATTCCTCT 

AGTGTGGGTTATGTAGTGAGTTGGAGGTTAGCCTGAGCTTCATAGAGAGTCCCTGTCTCA^ 
AACTAAAATGTAGGGAGTG^GGAGATGGCTCAGTGGTTAAGAACACTGGCTTCTCTTTC^GAGATCTTG 

AATGAATAATCTTTAAAAAAAAATCTGAAATGTAAAACAGTGGATTTTGTAGTATGTATAATACAGA^ 

AATGGAAAAAAATCCTTTAAACGGGATGCAAAAAACACTAGCCACCAAGAAAATTATTAAT 

CTGGGACITGGAGAGATGGCTCAGTGACTAAGAGCACATGCAGCTTTTGGATTC^ 

TTAAT CTG AAGCCT TCTCTGGTCTC TGAGG GACCTG AATG CACTCACATGCATGCAGGTAAAACACTCATG CACATATATTAAAAC 

5 0 TAAAATAAGCTAGGTGTGGTGGCGCACGCCTTTAATCCCAGCAGTTGG^ 

CTGGT CTACCAAGTTCCAGGACAGCCAAGG CTTACACAGAGAAACCCTGTCTCAGAAACAACAAG^CAAC^ 

AAACC CAAAAAACAAAAAAAAT AAAAAACTAAAATG AAGGTAAG AG CATGTG AAGTGG 

CAAGACATCCAGAAGGATGGGATATTTGCAGTAC^VAGCAACAAACIAGCCCATTGACAA^ 

TG TGG C!ATGAAT AGGCACCT CAG AG AATTT C CT AATGG CT AACAAAAAT AT AAAAAT AT ATT CAGCCC CATTAGTAATCATG AAAA 

5 5 TGCAAATTAAAATTATAATAGGATAATACTCTACTTATCAGATTAGTCAGACrTAGA 

CATGTAGTAACGGAGTGTCCACATACAGAAGTAAAATGTTATAGATCCCACT^ 
ACATACCCATGCTCTAGAGATGAGCATTCTAATGTCCT 

AGAAATACATACAGAGCCTCAC^TGCAATAGCCAAAAGTTAGAAGCAACACAAATGATTACTG 
TAGTTGACCAGGCCTGATACAACTTTTAACACrrTGGGAGGCAGAAGCAGGCAC^ 

6 0 TAGTGAATTCTGGGACAGTCAGGGCTATGGAAAGAGACTGTCTCAAAAACCAA 

CTATGGTGGGGCAGTGTGTGATAATGAAGATGGGCCATGGTGAAAATAAGAGTGTGTCTTACTGAGATAT^ 
AATTGTCCAACTATATGAAACCCAAAACCTAAAACCTGGGCCACTGAGATTATTAATTCA^ 
TGAGCACCTTAGTTCAGTGTCTGGGACCCACATGGTGGAAGAAAAGAAACAA 
TACACACAAAGAGAGGATAAATAAATATAGCTTGTTAAAGAAAACA^ 

6 5 ACCCCAGTACTTGGGAGGCAGAGGCAGGTGAATTTCTGAGTTCGAGGCCAGCCTGGTCTTO 

CTACACAGAGAAACCCTGTCTTGGAAAACCAAAAAAAAAAAAAAAAAAAAGTAAGTGC^ 
AGAAGATAGTTCTAGTGAAAAGCAAGGGACTTATGTCCrrTGGACCGGAGAGGCTACA^ 
CACTGGTATTGCCCTGGGTGACTAGTTTTACAGGTATGTGCCTTTTAACAAATC^ 

G AAAACGTG TG CTT CAT AGT AAAAAGGTAAAAATTACATGTCT AC AAATAGAGG AGATTO AACTTTTGG AA C 

7 0 ATTGGCAGTGGTCAGTGGGGAGCGAAGCAATTTAGAAGATACATAA 

AAAG CA.G ATTT AAAAAAT ATTAAAAAAAAAAAAG AACC CCAGAAGAT CTT C CAACGTTAAGAT 

CTGTCTAGGGATGGAACAAGGTAACAGGTGACTTTTAAATGTGTTAGTGTGCT 

TTAGTG CT AGGGAATGCTGAATAT CAGT ACTAG TTG CT AGAAC TG AG AAT AAAAG AG CAGGCAGGGACACTGCG CT CACT CAAGTA 
7 5 CAAACGTAAAGGCATTCTGGACTGTAGATGACAAAGGAGAACTC 
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AGCTGTCTTTTTCCCCCGCTCTGGAAGTTTGTGATTCCCTC^^ 
GCAATGGCCTTACAAGAC^GAAGCCACAGAAAj^^ 

ACTGACAAGTGAGGCATGCTAG CAAGTCAAACAGCAGCACCGTGTT CCTGCTGAGAG AT CTGTGAGCTATTCTGTC CCACAAACAG 
TATCTTTGTGC T ATTTTCATGCTGGGGGTGGGGTG GAAAATAGTAAATGGGAATT AATGTGAAGGGT AAACTATTAAG CAATGGGA 
5 AGATAGGACAAAACTGGGAATCTGTATTTACTCTTGGGGAGCAAGC^ 
TGGATTTTTTTCTCTrCCATGTAA^ 

TGATTGGTTAGTCX3TCTTGACTTGGTAGTG 

GAC CTCTTGTGTGATTTAGCTGAGTGGAGAGGT AGGAG CTTTGT CTGACTTTTGGATGCTATTGC 

1 0 CTGGTATATGCCTG AATCTTATCTTTCATGTTTGGG CT ATTTC TAACTT AC 
ATTAACAGTGTTGCTTTTTAAAATACCT^ 

TGGCTGGT AT AAGGGAGGAAG ACTGTAGAAAGAGCGTGG C CTT CCTAAGTTCAGGACTGCTGCrG CCTTTGGCAGAC CAGAC CCCG 
CATCCAAAGGTGGCGCGTAGCATTCTCAAAGGAGATCTTGGCTTCTAG 

TTGTGTTTCATTCCTCAC CATGTTT CAG AATATGAAAC CT CCTG CTC C CCATAGATGACTCGGAG TCTT TCTGTG AC CTCTT CATT 
1 5 CTCCCCTTCATTTCTTCTCTCCTGCTGAT^ 

CTTTGTITCT TT CTCTTTTC T I ^ 

GTCCTGTTAATCATATCACTTTAACCCACTrGT^ 

AAATCCGAGCCATTGGGAAAATGGCCCGGGTCTTTACGGTTCTTC 

AAAGGAGGTTACAAOTTAtSCTTCACAGCAGTGTTTTCA 
2 0 TT AAACAGACTCCCTG TG AAAGTGACAG CATTG GGAACATCCATGTGGTAGTGGTGTAG ATGGAG AGGTTTAGTGTGGATTTT AGA 

GGTAGATCCCCATGTTCAAACTCTGGCTTTCCITAGCTGTGT^ 

G CT AG AGATT AAAAT AGAAC CT AGCT CACATGTTGTTAGTGAGTCAAATGGG AGAAAGTGTGGGG ACGTTG C CTGGCTACCAGTG C 
CTCTGGTTGAAAAAGGACTGGTTAATTTTTGTGTCTGAAGTGTATGTA^ 

2 5 CAGGGAAGAGAGTGAGAATGTGCTGACCCTCAAGGGCCT 

AGAC CATTG AG ACTGGTGAGTATGAAGATGTCCCTTCCTAAGAGGTG TG CCCC CATT ACCAGG AGTTGGTGAL"r rrTGTTTAGAAC 
CTGGTG CT AGAGCAATGGATATTTAC CG AGGG AAAGAGGAATCATTG TC C TT AT C CTT CTTGT ACACAAAAT AATT CAAAGTAG AT 
AGTAATTTAGGGGAAAAAGTGTGAAGAGTAGGTATTAGAGTAAGCAAAGTCTTCA^ 

ATGCTCTTAAAAGAAAATAGAGGTTATTAAAATTAAATTCCCCTTATGAAAATACATAATTAAGAGAAGGAAG 

3 0 AAAAGATTTCTGGACTACCTATGACTGACAAAGGCCT^ 

AG CAAACATGTATG AAGATGCC CAG CAT AGTTACT CTCCTGG AAG ATGAAT ATG AAAG CATAGTG AACCACTTCTGCAT ACCC ATA 
AGAATGGCTGGAATGTAGAAATCTGACGAGGTCAAGAGTTGGTGTGTCTATGGAGGCCTG 
ACATAGACTCAACACATTGGCCAGCTGGTAGCCAATAA 
CCAAGAGGAATTAGCATGCATGTCCACCCGAGG 
35 GAAGCAGCTCCCCTGCCTGTTGGTAGTGAATACATTATATTATATTGATACAGTAGGCTACT 
TGAGGTCAAAGCACAAGCTTAGCTCCTGTCC^TCAGTAAGAAAGAG 

TGCTTTGG AAATAAC AT AT AAG T AAT C C CT AATT AAAGCACTGATTCAAAC GAG AAATGAAGAGACAT AAAAT AT AC ^ CAA 
GCCCAAAAAGGAGTGAA^3AAAAAAAAATATGTATATACATACATATATATATATCCCAAGTTACT 

T AC C CTTG ATTTGT AGATTTGCT CCTATGAG AT CT AGCAAAAAAATTATATTAAAGCAACTTTGTGG AT CAATAAATTTTTAAAAC 

4 0 TTCAAGTAAGG AAGATAGGT AGG CATTT CTCITTTTTTCTTAAT C CATG TT AC TT CCTATT AACCTAAAACACCCTCAT CAG AACT 

GTT CACAGAAAATCTCATATGACT AATT AACAAGT AAAAT AAAAGTAGTTAAGTATCTTATAl"!' 1' 1'T AAAAG CCTGT ACATTTCTT 
TAGAGATATGGCCTCTAGGAGGTCGACCATCCTTCAGAGGATAGTTCACAGCCCTG^ 
TTAAAGAACAAAAAAGGAGAGGACATGGAGATGGTGGGAGGTGGGTATGATC^ 
CAACCATGGAGTTGAGGGTGCATTGGAAATACTTTATGGACOT 

4 5 TTGATGACCTGAGTTTGGTCCCACATGGTGGAAGAAGAGAACTATATAAAAG CAG AAAAACC AG C CAT ATACAT AAAAT AAAAAT C 

AACCTAAAAAAAGTTAACCACAGTCAGTGGTAACTAAGATAGGTTTGATGATGATGAT^ 

CATG CT TGTAAT CT CAGCACTTG AG TGG CAG ACATGGGAGG ATCAGGAAGGAG TTTGATATTCT AG CTACACAGCAAATTTCAGGG 
ATGGC C TGGCCTAT ATATGAC C CTG CTT AAAAAAAATCGTTTTAAGGATTAATGAGAGCTTCAG TGTCTG 
TGGTGTGCATGAAACTTTTGGTTTGATCCTTAGGACTGGGGAAATGACAGAA 

5 0 TAGAGAAATGAGGTCATCCTGTGCTGCTTGAGGGAATAGAGCTACTGTAGAGAGAACATTGTATCAGTTCT^ 

GAGTCACCATAGATTCAGCACTGCCACCCTACACG^ 
AATGGTCATACATGACACTTTCAAAAGAGCCTCACATGGCATTCTTCT^ 

TTGG AT AACAAAGTAAGGTCTAT ACAGTGGGATT CCACAT AGCC^ CATCAAGG CTAGG CT A 

TACTGCATAGTGAGTTCCACGCCAGCCTAGCCTACGTAGTGAGAGCCTGTCT 

5 5 ACAATGTTCTAG GAT AGGAATT ATATGTAAACAGTTTCAT AATT TTTTTTCTT AC CATGT CCC TTG GT ACATACAGAC TG CAT CTT 

AGTAACCAATGCAAAAAACAGTTCAGTGAGGGCCCTAAAATTTTAATACT 
TTATGGTCTTTGCAGGGGACCTGAATTCGGTTCCCAAAACCACATAATAGCT 

GGATGCTGGTATATACTTTACCACACACATTCAAACTCCAAGTAAACCACCACT 
60 CACCTTCTCACAGCTGCAGAGGCAGCACCCTGCTCT^ 

AGAGAGACTACAGCAGTGCTCTAACAGGAGCCTTCCTAGGGGCC^ 
ATTGCAGTGGGAGGTGAAGGAGCTTCGCAGCTCTCTCTCTCTCATAGTGATGAAGGA 
CCITCCTTCCTTCCTTAGGAAAGTTGCCCTTGGAGGAGCTGGCAGC^ 
ATGTGTTACAGCTTGACTGCCTGACAGAAAGTGGCAGGGGGAAGTTTCCCTATTGGTC 

6 5 C CGGAAGGGAGAAG ATT G CTTT CTT AGTAATTGACCTGCTTTCTTGCACAAG GAAAAG AAG CCAT AT GAGGTGTTAGCGTG AGC CT 

CTG CCATGTCCTGCATCTCAGATGGGTACAAAT CAATT AGAGTT CAAAT CAATG CCTAGCTGTTCAC CT CT AGACCAG AAG T AACT 
CTTCACTTATAACCTATTTTTCTCCCCTTCCAAATGCTAGAGCT 

TGG TTAAGGACAGTT AG GTCATGCGTAAGTCATT TAAC AATACTAAG AAG CAAG ATT AAATAG G ATTTT TAAAT AGGACTG AGCAA 
CAG CTCG ATTGATACAAT CTGAGCAGTT ACAAG CTCAAACAT AGTTGTGATATT AAATT AT CTG ATT TAACAAG T CATTTT GATGT 

7 0 GTGATGATCACAAG ACACAACCAGCAGTTG ATGATTT CAAATT CCCATG C CAACAGG AG TGTT AGAACTGT T AGTGC CTC CC CACC 

TCATGCGCATGCTCCTGAGGTAATCCCCTCCCTTCTCTCT 
TCTGTCTTCATTTTTGTATGCACTGCTCCTGCCTTCTTACAGCCAAACAAG^ 
CTCAGCCGGAACACCCCTGTGTCTGTTCACCAGCCTCCAAGCTTCATTGGCATGTCGGAACT 
AGGAAAGGAGAGGATGGCCCTCAGGGTGGCCACATGCCAGCCTGTTTCTCAGATGGC 
7 5 TGTCAGTTTTATGTAATG ACTG AAGCTTGTG CAGCTGGTAATG ACAGAG GTC AAATGATTT AAGTGGTT C CTTT CTTTATTAAGAG 
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AGTT AC CGC CGGGCAGTGGTGGTGCACGCCTTTAAT C CCAGAGG CAGAGG CAGAGAGG CAGAGAG AG AGGAGAGG CAGAGACAGGC 
GGATTTCTGAGTTTGAGGCCAGCCTXSGTCTACACAGT^ 
CAAAACAGACAGAOUIACAGAGAGAGAGAGAGAGAGAGAG^^ 
ACCTGGGGCTGGAGAGATGGCTCAGGGATTAAGAGC^ 
5 TGGCTCACAGCCATCTGTAATGGGATCCGATGCCCT(^^ 

CTI^AAAAAATAAAAATAAATTTTAAAAAG AGTTAC CCAATAACT CAAAGGTTAGCAGGAGTTTCCCTAAGACA.TTTCAG CTTAGG 
GTATTTTGTTT CT ATGGAG ACATTTTAAAT CAT ACTGCTT ATGAATTTTTAGAT AAAAACC^TTGGAATTCACAAAT CTG CAATGT 
AGATTTAGTAATCTCATTAGATAAjGGTGAT^ 

GTGGTAGAATGTTTATCTAGAGTATGTAAGGCCCAGGGTTCAAACCCTGGT^ 
10 AGTATCTAGAAGCTATGTTGATGTTTTTCAXCT^ 

TG T CACTTGTGCATGGCT AATGC C^TGTTTGTGCTTAAAG C^GCT AT CTGTGTGATAGGATG TGCTT TCCTGGTGTTGCCTTTTCT 

AAAAGTGAGCAGGGTITTCATAAGAAACITAAGTTC 

CTTGTTATACTTTTCTCTTCTCTTCTCTTTTTATGA 

CC CCAGAAG AGGGCATCGGATT C CATTACAG ATGGTTGTGAG CCATAATGTGGTTGCTGGG AATTG^ G ACCTCT AGAAGA 
1 5 GCAGCCAGTGCTCTTAACCACTGTACCATCT 

TAGAAGCAGTGTTTGTGAACAGAGCTGATTTAGCOVCCTGTTGC 

C CTAGGAGATTGCTGTG TG CTGG G AAGCTTGGTTTAAGCTG CT AACTTTGCCAC CTCT CTGAG CACT TC CTACCCCCTCCTGTCTT 
TCTGC^GCCATGAGAGGTTTTACAATTGCACACAGGATCCG^ 

GCCACC CCGAAAAGAGG CTTCAT ATCATCATC ATGCAGG GAG GATG CACTCACACTCGCATCCG CCACAC CCACAGGCGTCAAG AA 
2 0 GGACCGACCATGGAAAGAAAGCCCTGTAATGACTCAGGGCCCT 

AAAATGAC CATTTTAT AAATTCTT C T AATTTATGTT CAAT ATATATAT AT AT AT AAAATACT^TTGTTTTGTTT C CCTCC CCTTGT 

CCTAATTTTAGGAACGAATCTGATTGGTTGGGTGTGTGTGT^ 

ACCTTCCCCTAATAATAAGGGCCraGGAAACCTTCACCCTAGATT^ 

2 5 GGTG AT'ITT TUTTriTAATTT ATGAGATAACTCGAAAATGTCTTT CAATAGC CAGG CG TGGTGGTGCACACCTTTAATCC CAGCAC 

TTGGGAGG CAGAGG CAGGTGAATTT CTCAGTTCGAGGTCAG CCTGGTCTACAAAGTGAGTTGT C CAGGACAGC CAGGGCT ACACAG 

AGAAAAAAAAAATTGTCTTTCAAAATTTCCCTTCTGCTCAGTACCA^ 

GGACTTCCAAGAGAACAGCCCAAACCTAGTGATTTTTAAGATCC^ 

3 0 GCTGTGAAAGCCCTCTACTGCCCCGCTGGGCCTCCACTCCTGGGTTCATGTTAAAAA^ 

CTTAGCCTGCTTACTTAAGTCTGCTCCCCAGGAAGAAGAAAAC 

AACAAACAAAATAAACAAGTAAATGCAATAAACCTTTTAAAATAATAATA 

•CTCTTCAGCTGTTGCCAGATTCITTGCriTTCAGTTTCT 

GACGTGAGCGCTGAGATCTGT CCTTGTCCCTCTGTGTCAGCACGT CAAGCATGG CAGCTGCTTTCCCAGCATTCTTCTGTTTTCAC 

3 5 TGTC^AGTGTGTCTGAAAAAAAATTCT(ZAGGT<3TTGGAACG<5G 

CCTCTGCTCCTACAGTTAATCAGGAAACTCCTTCCCAGTGTCAGGTCATTAGGC 

AACCC^^AATTAGGGATTCATATTTAATTGCCCCTTAGATTTTTT 

ACTGGCCTGGAATTCTCTCTATAGACCAGGCTGGCCTTGAAOTCACAGAGCT 

AAC CACACCCG G CTTTGTTTGT AGTTTTTG AGTCAGGGT CTCAT AT AGT C CAGG CCAG CC CT ACACTT ATGAAG CTGAGG CTGTCT 

4 0 TTGACCTCCTGATTTTCTCCTTTCCATCCTAAGTGTGGGCACCACTCCTTGTTGG^ 

GTCTTATGTGTCTGTCACCTTTGACCCJIGTCTGGTTC 
GTCTGGGTACCCTTGATCAAAGCTTGGAAAGGGTAGATTTGTro 

AAACTAAAAG CAAGTAT CT CAG ACAATAAT AT AACAGGTTGAG AAGTT CAGG G<JAAGGAGAAAAAAACAACAGT ATAACT^ 
l"i"l"iAAAAAACTGAi"rr"i' CATGAGGAAGATGAAAGGTTAGCTTG C CT ATCTG TGGGGTTTTTTGGGGGGGG AGTGTGT CAGGGCTA 

CGAGTGCTCTGAACTGCCAAGCCATCTCTCTAGCCCCATCTGGTC 
CAGTTTGTCCATGACAGATCAGAGAGTGTTTTGGAACCTTCCTGCG 
ACGGCCCACAGTCCCAGCACTGGGGAGGTCGAGGCAGGAAGGTC^ 
GGGAGAAATACTTGTCACACAAGCITGATGACCCAAGTGTG^ 

5 0 G GAG C CAT AGCAT ATGCATGCT TGCACTCACGAAC^CTAAGA CAGGGGGAGA 

ACITGTCTTCATGTGGGGATGATCCCAAATCCTGGAACT 
CTGCCTCCCGGGTGTGCCACCACGCCCGGCAAGAGAAAGGGTCITA^^ 
GAGGAAGGTCGTCTTTATTGTCIATTTCTTTGTAGCTTATGCTTTCT 
AAACCTCACCTATGAAAGTCACAGTAGATCGTATTGTGAC^ 

5 5 GCAGAAGTTCTGAAAGTCCCTAGGAACCAGAAATAGTTCAGACTT^ 

TAAACTTCATTAGAAGTTTTTCACATCTTTTCAGATT^ 

AG ACCAG C TAT ITCTGCTTAGGGCCATAACT C Cl - r 1TAAAGT AGTTGGTTT1T1TGT1TTT T1TGTTTTTGTTTTTTGGTGTTT 

TGTTTTGTTTTAATGTGCATTGGTGTTTTGCTTGCATGTATATCTGTGCC^ 

AACTGTGAGGTGCCATGTGGATGCTATGAATTGAGCTAGGTCTTC^ 

TCTTTCTTTCCTTCCTTCCGTCTGCCCGCCTTTCTTTCTTTCTTTCTTTCTTTCTTTCT 
TCTCTCTTTCT ATGGTTTTAGAGCTTTATTGTAGAAAGG CAG^ 

GT GG AGAG AAGGGG AAAGGGAGG AAGGTAGGG CT AG AG ATGAGAATAAG AAAGGTG AG AG AG CTAAAG A^ 
CAGACTCCCTAATGTGTGCATCCATGGTATGCATGTGCACGGGTG^ 

6 5 TTCTCTGAAATCTCAAGGTGAC!ACCTCATTC 

TGCCTAGGAAGAGGGGTGTCAGACTGTCCAGGAATGTCAGCAT^GCTCCT 
CAGAGTAGCTTGCTAGCCAGTGAACAAGTTCCTCTCACTAGTGAACATTTG 
TCGGGCATGGTGGCGCACGCCTTTAATCCCAGCACTCX3GGAGGCAGAGGCA 
AGAGTGAGTTCCAGGACAGACAGGGCTACACAGTGAAACCCTGTCTCTAAAAACAA 

7 0 CAGAGAACCCCCTATTGGTTCAGTCAATAGAAAAATC^TTTGCTATCATTAAGAGC 

TCAAC^CTTCTTCAGCCTTGGCATTATCCCTTAGTTCCTTTTC^ 
TATAAGAAAAATACTOVAGTCATGTGAGATGCAGCTGGGTGCCCCGCTTTATAGC 
GCGGAAGTAAAAATCCCTTCTCTCTGCCTTATCTA^ 
TGTAAAGTTCACACTCTAGAGAAACAAGCGCATGCTGGCCAATCrCA 
7 5 CACATTGTT TT C TT CACATTAATGATCATTGGATT CCTCTG CCCAACGTTTGTG CTGAAA^ C C TAAC CAGT AAAGACTTG 



2188 



WO 03/008583 PCT/US01/51291 



CTGAGGTAGCACTCTAGAACCCCkGTGTTT^ 
ACCCAGGCAGAAAAGAATTCCATAGCAGGAGTGGACCTAGJ^ 
TTACCTTATCTGACACCCXSTCCTGTCTTTCCTGTAGCCrGTTA^ 
GTTTGGGTCACACAGAGGAAT CTCTTTCAGTTCTTGC 
5 GAGAAGAAAGCCAGTTTGCCGAAGGAC^GGAGCAGCGCTCrCAT 
CCATTGGCAGTTTGAGAGAGGGAAACTACM 

GTCAGTGTTAACGGCCTTCATCTATGCAGTCACAAAGGTAAAGATGA^^ 

CGCCTCAGCAACTGTGG CAAGACGTGCTAAAT AACAC CTTTGTTTGAGCCAGGGTTATG CATCGTCAACAATTTCATG CCAAGTCT 
AGGTGGTTTCAGATCAAGCAGCAGGGACGTGGACATGGAC CATCCCTCTGTGCAGCT CTGAAGTAGAG C CAAGCACACTGCCTGCC 
1 0 TGTAAGTCCCTGCCCACATTCCCTTTACCCCCTTTCACAGTTCTCCAAGTACCCT 

ATGACGAAGCAAAAGAAC CACTG CAAATTTGAGGGTTGC CTGGTGT CACAGT AAATGGCAGGCT AATCAGGGCT 

C CTGTGTTTAAAACAAGAGAAAAGGGAAATGTGTTTTTGT CTGT CCCGGAGCCTT TG CAAAGATG C CATCAGGAGCCC CTTATGTT 

CATTCCTCCCAGCATCCTAAGCCAGAGCTCTTTTCCCCATCACCCGACCCTTCCT 
1 5 TGATGAAGCAGT CAGACATGCCGATGTAG ATGGTGG CTCCCTTGAATGATGGCCTATTCG CTCGTACTTAG AGGTTACCTGCGTTC 

T CCTG CTTTCTGCTATGTTAAAG ACAATC CT AG AAATGTTAC CTCT CAACAGGAGACATT AGTCTTTGTGAATAAGTTTTATAAG A 

ACTTTCTTACTATACACTAGAAGTCCTCTTAAGATTAAAAGAAGCAAA 

TGTCTTTATTTCCrrTGAATTCTTAGTAGAGGGACreATCTTTAG 

GTTTCTGATGTCTGGTCTX3ATTTCTGACAATGCCAGTAAGT 
2 0 GCCTCIXIATGAGTTTCTGCTTCCCAACCCTCGCCCGCCTGCCC^ 

TCCTGAAATTATGCCCACCACGATCTCCTGCCCTAGACCCTCTGCGGCCTTTCGAGCAGA 

TGATAACCCAGAAAACAGTCTTTACAAAGCACAAAAATACATTTTACG^ 

TCAGAAGCTCAAAGGGAGTTGATGAAAAGCTTGTACTCCTAAGCT 

AGATCCO^GGGGGCTGGGAATTCACAAGAGCAGATAAGCATGGCTC 

2 5 G ACAGGG ACTTGGAAAGATT ATGGCAACAGAGGTG GAAAGGCT AAAT C CAGGT CTGCTG C ATCCAAATAGCAACATTGACAT CAAA 

TACAGCCACAGTGTGCAATTTTAAGACCAGAAAAATGAGCCCGGATTTTTTAA 
TACCCC^GTGACATCCTGAGTTAACAGTGGCCTTATCCCT 
AGTCGTGGGAAGCTGTTTCAGAGCCTAGGAAAGAAGGATGAGAC^ 
ACTTTGTK3TCTGOTCTGGGTAATTAGATGGCTTCACTCC^ 

3 0 CCTCCTCCTCCTCCTCTTCCTCCTCTTCTGCTTCCTCCTCTTC^CT 

TT ACTCTTGAT AGTGCAT TGGTCACT TAG AGGTAAATTGAGTCIT ACTGCTAATACAGTGGAG ACAGACAGTTT ATCA^ C 
AAGGAGGCCAGAAGAGAGGCTAGCCTTTGTTTGGATAGCAGCACT 

CCCTTATCTCAGTTTCCCTGAGCTGGGATGGAAGGCTGAGGCCGACCCAATTTTCCAGGCCCT 
TGACTTGACTTCCATTCCCTCTGAGGGCATCACTCTGCAAAGTCTGC^ 

AGGGACGCATGGTGGATGTGATGTCAGTCAGTGTCCAGGCCT 

CACTGCCTTTGGGGTTTAGATGCCAGGAAACATGAGATGCCCATGATATGTGTGCTATGAAAATATT 

ATCTCCAGGCTCTTGTTCTTGTGGGGTTTTACATACCT 

TTTTTTATTGTTGTTGCTTCCCTTAGAGTTCCTTCACT 

4 0 TCCAGATAGACCAGTGTGTAAGAGGGCCCTTGGAGCCCGAAAGTGGGTCATTTCACTTCCCTCCCT 

AGTGATTGGCATAGGTGGGCCCATTGATAGAGCrrTAGTTGGCAGTACTGCCAGACCTGGAATCAGGAGATGGCCC 
C1AGAGTCGAGATCTGTGCTAGTATATCTATGGTGACTTGGCTTCAGCT 
GTGTGTTCTTGGCTACTCCCTTCTGATCTGTAGAAAGAGAACACTCTTCTAAGAGC^ 
CAAGAATGTACTGAAGTTTGTCCTAGGGAAGGAACTAGCCGGTCCCTTTCTACCGAACAG 

4 5 CAGTGTCTCATCCAAATGCTAGACTGTCCTGGACACTAATTGTGATTGAGACT 

CTCCTGGTGACTGGGATCAGATAGCTAGAAAAAACAGAGCAGTTACGTA^ 

TGGCAGTCCTTGACCCAAGGCCATGCTGGAGGGGTGCAGGCAGCCGACTACTAGGGAAGGAGGTCCCTCT 

C AGGAGACAG AG CATG GCAT ACTG AC C AGGG CTG CACTATCTGGG ACAAT GTT AC CTGG CT CT AATGTGCCACTGG AAG CTACT CA 

5 0 GGGTGAGGCAGGCAGGCAGCAGCCAGCTACAGCTAGGGGGAGG 

ATTCTTG CT ACATTTTTCT CCCT AAT CTT CTTAAAAGAG G AAGAATTCTC CCT AAATCTGGGCAGAAAAG AGGGCAG GAGATTATT 

TAAGTTTCCTTGGACZTTACCTGCCCTGCGGTCCTAGCGCCT 

GTGAGCTCCAACCTGAGAGGTGGCGAGCC!AGCCCCG^ 

TCCCCGTTCCTCCCIAACCTACATCCTCCAAGACTGTTTA 

MOUSE SEQUENCE - mRNA 

GTTGTGTCCCTCAGCCGCGGTCGCCGTAGCTCTCGACX3CGGTCGTCCCCGCCGAGCCGCGGCGCCGCCGCAGGGAGGGAGCGGCGC 
TGCGGCCCGCCGCGTCCGAGGACGCCCCGTATTCCGGGGCCGGCACGTGGCT^ 
60 GCGGCGGCTTGCGTCCAGCGGGCCGCTOGAGCCGGGAGGAGACCATGTCCGTGAGGCGCCCTCIAGTTCTCC^ 
ATCAAAGCTGTCCCCTTTCCTCCAACCCGACGGCTAACTTTGAAGGAAG 

AAACCATTTAGTAAAAGAAGGTCGGGTGGAAGAGGAGGTGGCCTTAAAGATCATCAATGATGGGGCTGCCATCCT 

AGACCATGATAGAGGTGGAGGCTCCGATCAC^GTGTGTGGTGATGTTC^TGGACAATTCTTTGACCTGATGAAGTTGTTTGAAGTT 

GGGGGATCACCTAGTAATACTCGCTACCTCTTCCTGGGTGACTATGTGGACAGAGGCTATTTCZAGTATAGAGTG 

6 5 ATGGAGCTTAAAGATT AACCAT C CTAAAACATTGTTT CTGCTT CGAGGAAAT CATGAATGCAGG CAT CTTACAGAGTACTT CACCT 

TGAAACAGGAATGTCGGATCAAGTATTCAGAGATGGTC 
TTAAACCAGCAGTTTCTCTGTGTACATGGAGGAATCTCTC 

TGAGCCTCCTGCTTTTGGGCCAGTGTGTGACCTGCrGTGGTCTGATCCCTTAGAGGACT 
ATACCCACIAACACTGTCCGAGGCTGCTCCTACTTCriTCAGTTACCCTGC^ 

7 0 ATAATCAGAGCCCATGAAGCCCIAGGATCCGGGGTACCGAATGTATA^ 

TTTCTCTG C CC CTAAT TACCTAG ATGT CTATAACAAT AAAGCTGCAGTGTTGAAGT ATGAAAAC AATGT CATGAAC^ CAGT 
TCAACTGTTCCCCIACACCCCTACTGGCTCCCZAAACTTCATGGATGTTTTCACGT^ 
GAGATGCTCGTCAATATTCTCAACATATGCTCGGATGAAGAAATGAACGTAACCGATGAA 
AGTCATC^GAATAAAATCCGAGCCATTGGGAAAATGGCCCGGGTCT^ 
7 5 TCAAGGGCCr^CTCCCACAGGCACACTCCCACTGGGGGTCCT 
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GCAGAGGAGCGGGAAGCCATCAGAGGTTTTACAATTGCA 

TGAGAGAATGCCACCCCG AAAAGAGG CTTCATATCAT CATGATGCAG GG AGGATG CACTCACACTCGCATC CG CCACACCCACAGG 
CGTCAAGAAGGACCGACCATGGAAAGAAflGCCCTGTAATGAOT 

ATCACATTTATTTATACTGGAAATGAAACAGGAAC^ CAGT CGCATTTAT 

5 TCTGTAAGAAAAATGACCATTTTATAAATTCTTCTAATTTATGTTCAATATAAAA 

MOUSE SEQUENCE - CODING 

ATGTCCGTGAGGCXJCCCTCAGTTCTCCACGACCGA^ 

GGAAGTTTTTGAGAATGGGAAACCTAAAATGGATCTTTTGA 
1 0 TAAAGATCATCAATGATGGGGCTGCCATCCTGAAGCAGGAGAAGACC^TGATAGAGGTGGAGGCT 

GTTCATGGACAATT CTTTGACCTGATGAAGTTGTTTGAAGTTGGGGGATCACCT AGT AATACT CT CTTCCTGGGTGACTA 

TGTGGACAGAGGCTATTTCAGTATAGAGTGTGTC 

GAGGAAATCATGAATGCAGGCATCTTACAGAGTACTTCACCTTCA^ 

GCX3TGCATGCACACTTTCGACTGTCTTCCTCTTGCTGCCCTCTTAAACC^ 
1 5 AATTACTTGTTTAGAGGACATTAGGAAATTA^ 

AT CC CTT AGAGGACTACGG CAGCGAGAAGACC CTGGAGCACT ATACCCACAACACTGTCCGAGG CTGC T C CTACTTCTTCAGTTAC 

CCTG CAGTTTGTG AATTTTT ACAGAACAACAGTTTAT^ C C CATG AAG CC CAGGATGCGGGGTACCGAATGTA 

TAGGAAGAACCAAGC!AACTGGCTTTCCGTCACTTATTACGA 

CAGTGTTGAAGTATGAAAACAATGTCATGAACATCAGGCAGTTCA^ 
2 0 GTTTTCACGTGGTCTTTGCCTTTTGT 

GAACGTAACCGATGAAGAAGGAGCTACTACAGGTCGAAAAGAAGTCATCAAG^ 

TCHTTACGGTTCTTCGGGAAGAGAGTGAGAATGTGCTG 

TCTGGAGGAAAGGAGACCATTGAGACTGCCAAACAAGAAGCCGCAGAGGAGCGGGAA^ 

GATC CXyU^GTTTTGAAG AAGC CCGAGGTCTAGACCGAATTAATG AG AGAATG CCACCCCGAAAAGAGG CTTCATAT C ATCATGATG 

2 5 CAGGGAGGATGCACTCACACTCGCATCCGCC^ 

HUMAN SEQUENCE - GENOMIC 

GGG ACTACAGG CACGTG C CACCACACC TGGCTAATTTTTTGT ATTTTT^ CCAG G ATAGT C 

TCGATCGCCTGACCTC^TGATCCACCTGCCTCAGCCT 

3 0 GTGCTTJ.T CAT ATAAAATTG CAGATTATACATTATAAT AT AC CTTTTGTTTTGTGT AG cr attttagaaatgctcatttt actatg 

GTTTTTAAAATATTCTACAGTTCTCTTACCTGCATTTAACAC^ 
CAGGTTACl'TTC^CACAACAGTGGAAACAAGGGTTC^ 

TGGT CATT AATCAAGGTTT AGGAAGCGCTTAGGGAGTTTCATGAG GAAATACACCAAAG CTCACAAGCCC TGT AAG AAGG AAACC C 
AGATTTGAAGAAATGTGTAGAAATGATAGTTAAACATGAATC^ 

3 5 AAATAAAAAAAAGTGGGGGGATACAAAATTCACCAAGTGGACATCTGGCCAGGTGGAGAGATCCACCAG 

• CITCCAATGATTCTTGAGAATTGTTAATGTCTAAAAACACC^ 
TGTTTGGGAAGGGAGCTCAATTAGAGCTTTCTTCTTTTTAAATTTTCATGGA 
TCAGAAAAGACATTGAGAAAGCAATGAATGTTTAACTCTATTGGAACTAGGCCAOT 
ATGCCTCTAATCCCAACA.CTTTGTGAGGCCAAGGTGGGAGGATCGCTTTAGTCC^ 

4 0 AGACCCCCCATCTCTAAAAAAAGAAAAAAATAGCTGGACATGG CAGTGG CATGCCTGTGGTCCCAGCTACTTGGGAGG CTGAGGTT 

AGAGGATCACTTG AG CCCAGGGAGATAGAGGCTG CAGTG AG CTATG ATCATGCCACTGCACT C CAGCTTGGGCGACAGAG CAAGAC 

CCTGTCTCAATAAATAAACATATAAATAAATAGGTGAAGATAAA 

TCTCTCTGCTCCTTCCCTCCCTTCTCrCTCrCTTOTCCCT 

TTAGAACAGGGTCTTGCTCTGCC^CCCAGGCTGTAGTGTGGTGGTGTGATC!ATGGTTCACTT 
45 TGATCCTCCITACCTCGGCCTCCCGAGTAGCTGGGACCZACAGGTGTTCACCACCGTGCCCGGCrrAA 
GATGGAGTCCTGCTCTTGTTGCCCACGCTGGAGTGCAATGGCATGA 

GATGGGTTTTACCATGTTGGACAGGTTGGTCT 

TATAGGCATTAACCATCGTGCTCGGCCACCTGGCTAATTTTTAAAAAAAATTTTTG 
50 CAGGCCGGTCTG<3AACrrCCTGGGCTCAAGCGATCCTTGTTCCTCAGCCTCCCAAAGTGCTG 
CTGGCCTTATTTTCACCTATTTATAAAGCAATACTTTCrrCACCACA 
CAAGTGATGGCTACTGTTCACATTTTTTTCCT 

T ATCCTT AACACAGTGCCTGGCACAC AG T C CACACCHAATTCATACITGTTGAATGAG T ATATTAATATTTT CTTG AGGTCTTTT A 
TTTTT AAAGAAATGG AG T CTTGCT ATGTTGTCCAGG ATG GAGTG CAGTG G CTATTCACAGGCAAG AAAATAGTGT AATATG ACCT C 

5 5 AAACTCCTGGGCTC1AATAGATCCTCCTGCCT(ZAGCCTCCCAAGCAGCTGGG 

TTTTATTTTATGCTTTTTTTTTTTTTTTTTTTTTGAGAC^ 

GCTCACTGCAGCCTTAACCTCAGAGACCGCCCACCTCAGCCTCCCAAGCAATCOT 

CAGGCGCATGCCACCACACCTAG CTAATTTTTCTGTATTTTTTATAGAGACGCGGTTTCACCATATTC CTGGTCTCAACC 
TCCTGAGCTCAAGTGATCTTCCCGCCTTGGCCTCCAAAGTGCTGAG^ 

6 0 TTTGTTTTTGTTTTTTTAGATGGAGTTTTGCTCTGTTGCCCAGGC 

CGCTTGGGTTGAAGCGATTATCCTGTCTAAGCCTCCCGAGTAACTGAGCCAGCCATGTTTGTTAATAAA 
ATATATATGGTTATATTTTAGCTTTTTCACTTTTC^ 
TATGGTATTTCCTAACTGCATGATGTTCCIATTCAATCA^ 
ATCATTTTGTGGCTTGCnTCTTTTGCTCAGCGTAATGTATTTTAGAGG^ 

6 5 TTTTATTAGCTGAATAGTATTCCATTCTATGGAATTTTACAATTAACATCTAAAAAA 

TGGGTGCTCTTTTCCACAGCACACATACrAAAAT^ 

ATTTG TGAAG CATT AAAAGAAAAAAAAAGAAGGAAACTG AGGG CAGGGAAATAAGTGAG AATT T ACTTTTCTTCT ACATACTTTTG 
CAGTATTTAATTTTTTTCTTTTCTTITTTTl^ 

CAATCTCGGCTCACCGCAACCTCCGCCTCCTGGGTTCAAGCAATTCTCC^ 

7 0 CGCCACCACGCCTGGCTAATTTTTTGTATTTTTAGCAGAGATGGGGGTCT 

TCAGGTGATCCACCCACCTCGGCTTCCCAAAGTGCTGGGATTAC^ 

GGGCATGTGTTACTTTTAAAAATATATATTTAAAGCATTTAAAAAATTCTGAGAGCAGAGTTGGG 

CCTGTGTGACAGTGGGTAAGTGGCTTGATGTCTCTGGACTCATTTCCCTCACCrrGTAATATAATAATGTTGGTCCATATGA^ 
TGGTCTTTGGCCTCTTAGATACTCCAATTGGCCAGTT^ 
7 5 TAAGGAATGGAGGTGATTAGTGATTCATTATATCCACAA^ 
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AAAT ATAGGCTGG GTGCAGTGG CTCATGCCTGTAAAT CC C1AG CACTTTAGGAGGCAGGCCGATGTGGAAGGACCACTTGAGG CCAG 
GAGTTCCAGACAGACTGGGCAACATAGCAAGGCCCCATCT 
GTAGTCCCAGCTACTa\GGAGGCTGAAGTGG<JAGGA 
GCACTCrAGCCTGGGOVACAGAGAAGATCCTGTCTCAA 
5 TCTCTGCCTTAATAGTCTAGGTGGGGAAAACAGCCACATGTGAATAAATAATATCCTCC^ 

AAACGCTCTTGGT C CACAGAGGAAGGCTTTAC CTCTACCTGGATGGATTAAGAAAGGCTT CAGAGAGGTGAGT ATAACAGGGCTGG 
ATCTTAAGGGCATGGCTGAACTTGTGAGTCG^ 

GAGC CCTGAACAT ATCCACTAT ATTCCAAC CCCATG C CTCTGTACAGCTTTG C CCAACCTAGGACACCrTCCCTTC CTCCTTGCTGT 

TCACATCC CATCCTTCCTTAG AGGG CTG CAGCCCTGAAAAAACATAGCAAATGGTTTCAAGTGGCCGGC CTGAAAGATCT CATGGG 
1 0 GCT ACGGACAGAGAGAAG CTGTAGAGGTGAGTCAGGGTCAGGCTGGGAAGGAACT 

CCCAAGGGCGCCAGGGAGCCACTGAAGGGAGATGAGGAGCAGA^ 

TATGGTGGCTCACACCrrGTAGTCCCAGTACTTTC^ 

GCAACATTGCAAGACCGAGTCTCTAOUU^AATTTAAAAAATT^^ 

AGGCrTAAGGTGGAAGGATTGCTTGAGCCTGGGAGGTTGAGGCT 
1 5 AGC CTGGGGGACAGAGTG AGAC CTT ATCTCAAAAAAAAAAAAAGTTGATT CTGG GGTCAATGTGG ACAATGGATGAAGATAAG AGG 

TCAAGACCAAAGACT CAGGGAC CAGTT AAGAAGCTGATGCAG CGG AGGGTTAGCCCCTATTCACT CC CAG C CTTTATG CCCTG AGA 

AT AGCTGATTCTATT ACAGAATC AGTTGAGATGGATTCCCG CAGAG AGCAAG ACACGTTC C CTGTGT CTGC CC CAAACAACCCAAA 

GTC!TTTTGAAACCTCTACTAGGATG<3GAAAGCAC^ 

TTTCCAACCTGGTTTCCTCCCGGAGAAATCCATGGATCTAACGCCCTCTTCCAA^ 
2 0 GAG CAGGGCTGCAAT CC CAG CTATCGGG AGGC TG AGG CAGGGAG AATTGCITGAACCX3GGG AGGCGGAGGTTG CAGTGAAC 

TTGTGCCACTGC!ACTCCAGCCTGGGCAGCCTGGGCXjACA.GAGTGAG 

AGAAAAAGAAAAGAAAACATTGCAAAC CAATG GGAAGGG ATGT ATT ATT CAAT AAT AGT AATGGAAAAATT AAC CGACTACTGTAT 

GCAAGCTCTGGAGCCTGAGTTGTACCACAAAGATATTCTT^^ 

2 5 CCCCTAGGACTGGGGGAGGGATGGTCATAACTCCTTAGGGG 

GCAATGAACTGTCAGCCACCAGCCCTCAAGAAACAGGCAGAGTACGGTAGCTCAT 
GCAGGAGGATCACTTGAGCCCAGCAGTTCAAGGCTTCA^ 

AC CTCATCTCTCT CTT AG AACAAAAACAAAAACAAAAACAAAAACAG CAGCAACCAGAAG ATGGTT ATCC CT CATGGTGAAGGGGA 
TCrGGGCAGGGCATCCCCGCCTCTGCTACAGGGAGGGCCTATTAATATTATC^ 

3 0 AGAAGTGAAG^AACTTACCCATGCCCACAGCTGGAACT 

AGCAGCTATTG^AAACAGAGGACCAGCAAGGAACTTGGAGAAAGGC^ 
CCGAGGCGGGCAGATCACGAGGTCAGGAAATCX3AGACCACCCTGGCCAACATC 
TACAAAAAAATTT^CTGGGTGTGGTGGCATGCGACTGTAGTCCCA^ 
GAGGTGGAGTTTGCAGTGAGCTGAGATCACGCCACTGCn'CTCCAGCCTGGTGA 

3 5 AAAAAAGTGGGGGTGGGGACACGAAGGGACAGGGCTGCAAGTTGCTCAOVAT^ 

GG ATTCTGTAAATT CATGT AT AC CATG CTTGAGGACT CAT G CCGCATATTCCACGTGGTG CHT^TAGAAAAAGTT CGTGGTG CT AT A 
GAAAAAGTTCAGGTACTGCACGTCTCTTGACGATGATGGTGTTCACTAAG<^ 

CACATCTCTGTGCCTCAGTTTCCTTATCTGTAAAATGGGGATAATCATATAAATGGGGTTGGTGTGAGGATTGAAC 
TTGTGGGGTTTTTTGGTTTTGTGTTTGAGATAGATGGAGTCTGTCTCTGTGCCCAGGCTGGAATGCGGTGG 

4 0 CTGCAACCrrCCGCCTCCCXSGGTTCGAGCGATTCTCCTGCCTCAGCGTCCCGAGTAGCT 

AGCTAATTTTTGTATTTTTAGCAGAGATGAGTTTTTGCCACGTTGGCCAGGCTG^ 
ACCTOIGCCTTCCAAAGTGCTGGGATTACAGGC^TTM 

AGACAGTCTCGCTCTGTCACCCAGGCTGGAGTGCCGGGGCGTAATCTTGGCTCAC 

CTCCTCCCTCAGCCTCTCGAGTAGCTGGGATTACAGGCATGTGCCACCACGCTAGGCTAATTTTTGTATTTT^ 

4 5 TTTCACCACGTTGGCCAGGCTGTTTTC!AAAC^ 

CGTGAGCCACCTTGCCCGGCCCGCAATGTTTTTTCTTTCTTTCCT^ 
CGCTCTATTGCCCAGGCTGGAGTGGTGCAGTGGCGCGATCrCGGCrCACT 

CCTTAGTCTTCCAAGTAGCTAGGACTACAGGCGCCTGCCACCACGCCCGGCTAATTTTTTGTATTTTTAGTAGA 
CCGTGTTAGCCAGGATGGCCTCGATCTCCTGACCrC^ 

5 0 CACCCCACCCGGTCITTTTTTTTTTCTTTTTTGCCGGCCCGCAATGT^ 

CTTCCMATGAATTTTAGCCTCTGACCCAGGGGTATGATTTGATAGGCGTGGAGTTCrG 

AGGA.TATCTGAACAATTCTCTTGGGTGGAAGACTCAAGGATCATGAAAACGAAAACCAAATTCTAGTCGGC 

GCCTCCTGCGGTTACCATCAAAGAGCTATGGTCAAAATTCCAAGG 

CT ATACAT AAAT CCAT ATTATTG GAG ATTT ACG CTT AAT CCTAT AATCTTTTACCTAACAGCTGTCCAG T AG AATTTTCTGGATGT 

5 5 CCAATAAGGTAGCCGCGTGTAGCTCTTGAACACTTGAAATGTGGCTAGTCCCCTAGAGGAACG 

TACAACTACTTATCGGTCACTGCG^TAATTGTTAAGACTGGTTTTGCCTGTAACTTGCATATGTTTT^ 
ATTTC^TTTGCATTGCTAAAAGTCAAAATCCTTTTGACTTTGCCAGTT 

GGCTT ACAGTTTTAAGGACAG AT AAAGGTTAGTTTTTAACCTCG AG ATG^ CATGGGG 
TTCCACAGTGAGCCTGGGCCGGGGAGGCAGGTACTGTGTCAGGGGGAAGAAAAAAGGGG 

6 0 GCATAGGAAGAGGTCAGTTCTTGTGGCCGAGGAGGTCCGGCCCGGGGGTCCCAGGAGCAGAGATCTCCCTTCTCTTCG^ 

AGTGAGGAGGGAGCAGAGCCTTGCTC^GAAACGGAGCTCCCCCAATCCCCCACCGCCCTAGOT 

ACCCACAAAAACTGAAGGAGCGCGGCCGGACTGATCACACTCCTTGAGACAAAGCGGGTGGGAGA 
CGCCTGTGGGCGAGGACTGGGAGGCCAGGGGTTCTCGGCGAGGCGGTCCCAGGAGGCGGCGCGGAGC 
65 CGGGTrcCAAGATGGCCGGGCCGGCCTCCCGCGCGGGTCGCGCGGGCra^ 
GCTGGCAGGAGAX3AAGGGGCCGGCTGCGGX3GAGGGCT 
GAGGAGCCGAGAGC^GCGCGTGCGCCGAGCTCTACTGCCTCGC^ 
CCCGGCCGGGTCCGKTTAGGACAGCGGGGCCGCTGGGAAGTTGTGAGAGCGGC^ 
GCCGCGAGC^AGCCGCGGCCXrTCCCGGTCXaCCACCCTTAGC^ 

CTCGGACACCGCTGAGGAGCCGGGGCCGGGCIACGGCTGGCTGACGGCTCC^ 
GCGGCGTAGGCGCACGTCOXSCGGGCTCCTGGAGCCri^AGGAGG 
ACCGACCGCGTGATCAAAGGTGCCTGGCGGGCCGGGCCITCCT 
GG CGGAGGGAGGCTGGGGCCGGGCTGCGCCCACCCTAGGAGGGCT 

7 5 AGCCACCCGGACCCGGCGTTTTCTGCTGCACCTGGTCAG^ 
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AGTGGGGACTT CTTCT CCCACCGCCCCGAAGGG CC CX3GATCCCTGTCTTTl"riTCT CTTTAAGAAACG ACTCGGGQGAAG CCAT CG 
GGGGTGGTGTGAGCAGGGAGTCGACTC^CCAAGTAAGATATTTAAGATTA 
GGACGTATACCATGTGCCTTTTGACCCGCTTTCTCTA^ 
TCAGATAAATCCATTATATGATGTTTAAAATAAGTTTACGGAGCTCTT^ 
5 AGACTTGGAGAATGACGTTTTCTGATTCATATGTGTACCTG 
TTTGCTCTCTAAATCTTGTAGAGTTTTAGGTTATAGTTC 

ATGTGCTGCTTTTATTCAGC CGTAAGTCTTAGGC^ CTATTTTGT AAG 
AAGGGAATAGACTTAG ACCAGCTGTGTTAGTGG CCTCTG GATGCAGGGGTTAGCXSTGAGAGTGGGG AAAATT ATT AAGATACT^ 
CTTCTTTTTCATTTTGGTATTAGAAGGAGAAAGGC^ 

1 0 CATGTTCAjGGGAGGGGGAAGAATTTTAGTC 

TGTACTACAT AATT ATAATGGGATTTTTGAGATTTATCTGCTT AGCAT TTGT ATGCTATCG TTAG AGGTTTAACTGAATTGTTCAT 
CTAACTGACATTAT C CCCTGAACAG CTAAATTATTGTG ACCATTTAAATGTGT CTTCATAT TATTAGTGT ATAGCCTATAAGAAGT 
AT ATAGCCCTCT CTT AAGATTCAGAGTG TACTTAACATAAC C TTTTTTGAGGAATC^UU5GTCATTATGTA(^TTAA1"1TC1X3TACT 
GTGCTGTTAATATGGCCGGTGCGTTCrcCTGTATAAAGTTGTTAGCAGTTTCGTCCTGTA 

1 5 AGTTCAGTGTTTTGTTTGCTTTTAGTTTCTCTCTCCAG 

GTCTTAGCTGAAGTAG<3CAGGGAAACCAGATAAGCCAGAACTGTGGGTC^ 
AGCCCTCAGGAATCTCACAATCAAATCACCATTGTGCCATGTAA 

TCTTAAATGACCCTTAAATATAGTAATAAATATTGTAAAATAAAATGTAATAAAATAAAAAAGTTAA 
GGCATGGTGGCTCACGCCTGTAACCCCAGCACTTTGGAAG^C^ 
2 0 GCCAACATGGCGAAACTCCATCTCTACTAAACATAGAAAAATTA 

GTCT^GGCACGAGAATTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAAATGGTG 

G AGTG AAACT CT CTCTAAAAAAAAAAAAAAAAAAAAGCAGGGGCTGGTTTATTG CTACGCATG CTT ATT ATT AAAT AAACACTGTC 
ATCATATCCTGCAGGATGCAAATTTGTTCTTTACCAAAATTAATT^ 

AAAT AGAAATAATTT CAGGT CCTTAG AATGT ATGGATTi' CT LTCCT ATATAAAAACTTCGG CATAAG AGCTTGACCACAG CATTTG 

2 5 TGGCTTCATGATGACTTATGGTGCTTTTTTCAGGGTTTTCCAGA 

CTAAGGTCCATTGTAACGGGTACTAAACCCTGGTCTGATGCTATCTTGTTT^ 

G CITf TTCAGAGGTGATACCACAC CC CAG CTAITTI'TGTTT CTGTCATTTCC TGTTAACTTT AAAGT GT ACCTAATTGTTCT ATTT 
G AGCAAATTTTGG AGAAGAG TGGGT ATCTGGGAG AGG TTTGTATTTT AGGTGTC CTTT AAATAG CAG CTTTCTT AAATTGAAGG AT 
GTGGCTGGGCGTGGTGGCTCACACCTGTAATGCCAGCTACT 

3 0 CTATAGTGAGCCAAGATTGTGCCACTGTACTGTAGCCTGGGCGACAAGAGTGAAACTC^ 

TGATCTG T AAAAT AGGG TT AAT AATACCTTC CCTGTCAAG C CTG GGTCATGCAAGTGACATAAAG CATG TGAAAAGGT CAAGTTAA 

GGTAGTACTTTGAGACATGATAATAGGTTGGTAATAGATCGCTTACTACAGAAGTAGTTAG 

TATCAGAAAGTATGTGAAATTATAAGCTACAAAACTGTT^C 

AAATCTCGTCGGTAACAATGTGAAGAGATCAGCACAGTTCTGTCTGGTACTCCCTAGGCAGCAGATTT 

3 5 GTGTTTATTTTATGTTTAAACCTTCTTATGATTTTACTCATTC^ 

TTGCTCATTCATTTCTTCTTTGCTTTAAAAGTAGCATTTCTGTGAGCCTTTAGAGCAGTTTATA 
TATCTAAGCCAGAGAACTAGGCACCCAAATGATTAAATTGTGATATTCTGTCTCCTTTATTT^ 

T TC CTGTTT AAGG ACATGTTTT TTCTTTG ACAGTG AT1TTT ATAAAAGCAGTTT ATTT CAG AAG CAAC 
AAGGAAATTTCAGTAGGAATTACATGACAAATATAGAGAATCCATTGTCTTATTT^ 
40 AAAAGTCAGATTTCTTTTTAGTAAGGTGCTGATACATTTCTCCCTTTCTCTTAGAATTCA 
GTACCATTTTTACCACACATGGCAGGCTACTGTTTCriTTAATATCG 

rrTAATATTAGTGTGGACAAAAATTGCTATTTCTACAAATATACCCTAAACCCGATTTCAATCTCT 

TTCCACAACTCTCCC CCGACTCTCCCl u rI l l^TATATCCATAATGGATTTACCGAGGGTTTAAAATTTTTTCTOTTAAATATAAG CT 
TAATCCTGCTACITATACAATTTGAAAATCAGAAAG^GAAGAAAAGA 

4 5 AAT ACCTTTGTGTGTTT CCATCTG CAGG T AT ATTTTGAATT CAAAGGAAAATGG AT CACACAT ATT AAAGCTTTTGTT ATG CAAAT 

ATTTCAT ATT ATTTTAACAT AAATATTG T AATTTTAC CTAAAT ATTTCCAT AATGGTTT CAAT CAGCATGGAAATG GTTTT AG AGT 

GCTTATATTAATATTTTGGTTCTTTATATTTTCAGTGG^ 

AAGTATAAATTTTCAATGTGAATACGri X lAAAGTTGATATCTCCTGGAi r 

C CATTGTG ATTAACAAG CTG CCAAACTTTTG AAACATT CAAAATCAGTG CT AAT CGAG ATTCAT ACATCTGAGT AGAAATCTGGAA 

5 0 CCTCTGAAAGGGCATCTTAATTTAAACACTTAATTTTTTTCTTTTTGAGCAAGGATAAA 

GTTGTGATCAAAGCTTTATTTATGGTCTGACATTTGATAATTAAGCT^ 
TTTTTTGCCTTACGTAATTTTTAGTGTACCATAACTTTTACTGGAGrc 
GTGAGTTTTTACAAATTTAGCACCCACATACCCATGTAACCACTCAACCA 

GAGTTC^CGTTCAGTCCACTGGTTTCCATCCCTAAATGGGCAACAGTGGGGAAAATATGACACTTGTAAC 

5 5 AACGTCATATTTGTAAAGTAACATACTAATCACATCTTACTGTAAAGTGAAAAAA^ . 

CTCATTACTTAAAGTGAAATAATTAATTAAAGACCTTTTCCTA 
AACATTTTTGGAAAAAGCTAATTTTTAGGCCAGGCGTGGTGGCrrCA 

ATCACCTGAGGTCCGGCCAACGTGGCGAAACCCAGTCCCTATTAAAAATACAAAAATTAACTAGG(^ 

TC C CAG CT ACITAGGAGGCTG AGGT AGGAGAATCGT TTGAAT CTGGG AGG CAGAAGTTG CAGTG AACTGAG AT TG CG C CACTGCAC 

6 0 TCTAGTCTGGGTGACAGAGCAAGACTCTGTCTCAAAAAAAAAAAAAAAAAAAA 

GAGACAAAATTATTCATTTAAT TCCG GGATT TG TAGT CATTTTGG AAC T TTG CTGCTACTTAATTGCTAAATGACTAATGGGAATA 
AAGGG ATGT CTTAGAAT CAT T ACCATCCTTTGG TGCAAT CATTACT AG ACTAT AATG CC CGTTT ATAGTTTTTTTTGGTGGG GGAC 
AGGGTCTCGCTCTGTTGCCCAGGCTGGAGTACAGTGGCGCGATC^^ 
CCTCAGCCTCCCGAGTAGCTGGAACCACAGGCATAGCATACGCCACCGTGTCCAACT 

6 5 AGAGTCTCGCTCTGTCGTCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACT 

CCTGCCTCAGCCTCCTGACTAGCTGGGACTACAGGCACCTGCCACCATGCCCGGCTAATTTTTTTT^ 
GGTTTCACTGTGTTAGCCAGGATGCTCTCGGTCTCCTGACCTCGTGATCCGCCCGTCTCGGCCTCCCAAAGTGCT 
CGTGAGCCACCGAGCCTGGCCTAATTTTGGTATTTTTTATAGAGGTGGAGT^ 
GGCTCAAGCGATCCTCTTGCCITTGTCTCCCAAAGTGCT 

7 0 TTAAAATTTAGCAGCATTTTACATTTCAAAACTGAGGCCTA^ 

TATTCTATCTGGTTATCCAGATGAATGTAGAGAGAACTTTTAGTTCCCCATATTCAATTTTGTTTTGTTGGC^ 
TGAAGTCTTTTAGTTTCTTGAGTCTTCGTTTGTTGTTTGAAGAGGG 

G AT AT ACTGG ATGG CTG GACAAG AT AATGT ATTT CTTTGATGATCATTAC TAG ATTT ACT AATTGCT AACAGT T CATGAAGGTT TT 
TTTGCACTGTCTAGACAATGTCAGCCCAAGAGACTGAAATTCTAAATGvAACAAA 
7 5 ATT ACCT ATGT AACATT CTTGCT AAAGGTGTTT AATTT AAGCTGAGCTT AG TT ATAGTAG ACAATCAGACAAAT C CAGTT TGTGGG 
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ACTTTCTACAAGACAGTTGGCCTGAA 
AGATGAAAGAAGTTGAAAGAGAAATGACAGTCAAGT^ 
GGGATAACTGGGAAAATTTGAATATGGTGTATATATTAGACGGTAT^ 
TTCGCTCTTGTTGCCCAGGCTGGAATGCAATAGCGTGATCT 
5 CCICAGCCTCCCGAGTAGCTGGGATTACAGGCATGCACCACCGCACTGGGCT 
GTATTGGTCAGGCTGGTCTCGAGCTCCCGACCTCAGGTGATCTGCCCACCT 

CACCGTG CCTGG CCTATATCATTGTTAAATGT CTTGAATGTGACT ATATGGT ATT ATGGTT ATATAGG AAAATGATC TTATTT TTA 
GGCAATACCTGATGAAGTATTAGTATGTTXIAAATGTCGTG^ TG CAATTTG CT TTTAAATTGTT CAAAAAAAAATCATTTGTA 
GAGAAAAAAGCXIAAAGTGGCAAAACATTTACTATGGTGAATCTAGGTAAGAGCATATGG^ 
1 0 TTAAATAGATTTCAGTT TTTCAAAATGTAAAGTTGGGGACAGCCAGG CATGGTGGCTCT CGCCT AT AGTCC CAG CTGCTTGGGAGG 

CTGAAGCGGGAGGATTGCTTGAGGC CAGGAGTTCCAAGG CTTCAGTGAACT ATGAT TGCACT ACTGTGTTC CAG CTTGGGTGACAC 
AGTGAGCACCCATCTCTATTTAAAAAAGAAAAAAAAGTTAGGGAGAAAAAAAGTACAATGAAGAAATG^ 
GCTCATGCCTGTAATCCCAGTACTTTGGGAGGCCAAGGCGGCTC^ 

GTGAAAC CCCGTGTCT ACAAAAAATATAAAAAATTAG CCAGGTGTGGTGGC CTGCACCTG TGG ACCCAG CT ACTTGGG AGGCTG AG 
1 5 GTGGGAGGATCACCTGAGCCAGGAGGTGGAGGTTGCAGGTTGCAGTGGGCCAAGATAGCG CCACACTCCAG CCTGGGTGACTGAGA 

CCC CATCTCAAAAAAAAAAAAAAAAAAAAAAAAAGGAAATG CC TCAAAACAG CT ACAGCTAGGTTTAGGAAAGGCTGTCATCATAT 
TTGAGGATTCCATAGTXSTCTTAATGGGGGAAAGCTCAA^ 

G CAG CTTGTT TAG CAG AAAAAGTAGTG GTTTCIAG CTGCATG ATGGGGGTTAG AAAAACAT ACAGAAGTC^ 
GGTACTTAACTAAGAAGTAGAGAGAAGATGGGCAAACTAGAAGG^ 
2 0 ATAGTTGAAAACTGGTGGAGCAGTTGACATCAATTTCATTAATG^ 
TTTTTTTTTTTTTTTTCTGAGACAGAGTCTOSCTG 
CTCCCGGGTTCAAGTGATTATCCTGCCTCAGCCTCCTG 
TTGTATTTTTAGTAGAGAAGGGGTTTCACCATGTTGGT 

C CAAAGTG CTGGGATTACAGGCGTG AGCCACCGCG CCCGG CTGTG ATG ACCATGTTTTAAAAACC CAAAAGAAATAGACTTAT AAA 

2 5 CCTATTCTGAAATCATAGAAAATCCAAATATTACCCAGACAGTTGATGTGGAAAGTGGAATAGCT 

AGATTGAAGTGAAAAAAG GGGTGGTAGTG GGATAAAGT AAATATT AACACT AATG AGT AG AG G AGAAAAAAAGG ATTTTCT AAGTG 
TATTTTTCATCCCTGTTAATTGCACCACTCACCCATT^^ 

TCTCCATTTTACTGCATGTCCAGTCCCACCAGACTAAATGCTGTGAACCTTTGGTTTGTACATATCC 

ACACCTTTTG CTTGTGCCATTGCAGGAGCCTCTTG CTX1AGACT ATGT AGTCTGTC C ACG CTGTCATCGT GATCTGTGAACAATATC 

3 0 AAGGGTCATTCITGTGCTCAAAATCCTTCATGAGCACrre 

TAAACCTCTCTAGGCTTGTGTTCCATTTCTTCCTCCTCATTTC^ 

ATTTATTT ATTTATTTATTT ATT T ATTTATTT ATTTATTG AGACGGAGT CTCG CT CTGT C ACCCAGGCTAGAGTGCAGTGGCG CCA 

CCACGGCTCACTGAAAGCTCCGCCTCCGGATTCACGCCATTCTCCTC 

ACCACGCCCGGCTAATTTTTCTTTTTTTTTTTTTAATAGAGATG 

3 5 TCATGATCTGCCCACCACGGCCTCCCAAAGTGCTGGGATT^^ 

AGAGTCTCACTCTCTCTCCCAGGCTGGAGTGCAGTCGCGCCATCTTGGCTCACTGCAACCTCCGCCT 
CCTGCCCCAGCCTCCCGAGTAGCTGGGATTACAGGTGCATGCCACCACGCCCAGCTAGTTTTTG 
TCACCATGTTGGCCAGGCTGGTCTCAAACTTCTGACCTCAGGTGATCTGCCCrr 
TGAGCCACTGTGTCCGGCCCCGAGCCTCCrTTAATACTACAGGTTGAGTGTTCCrAATCCAAAAT 

4 0 AGCTTTTTGAGTGCCAGATGACACCACAAGTGGACACTTTCTTT^ 

AAC CCAAAGTTTTTT CAGTGTCC CCAAGGG AAAAAGAC CCTC C CAGCC CCCTTCGGTTGTGATATAT CTTTTC CATG CACAG CATG 
ATGGTGATGT CCAGGCAACCACAGATTGAC CACGTAGGTGG CTAAGGGTAGTGACACATTTG CTTTCT AATTCAGTGTACGCAAAT 
TTATTTTGTGCACAAAATTATTAAAAATATTTATAAAATTATCTT(ZACATTC 
GACTTGGGTTCCATTCCCAACACATCTCATTATGTATATGCAAATAT^ 

4 5 T CCTAAG CACTTTGAATAAGAAGTC ATAC TTGT CGT C CTAGAATGTTTTATATCTGTGAATG GG CTCTT TGCTTCTGTGCATAATG 

GTTTTTTTTTTTTTTAACAGTTTTAGAGATAAGGCCTCAC^ 
GGCOTCAAACAAACTCCTGGCCTCAATTGATCGTCCrcCCTCATCCCT 
GGCTAAATGTTTTATTTTTATTACTTTTTTGTAGTGACAGGGTCTTGCrrATGTTGCCCA 
TGTTCCTCCCACCTCCCAGAGAGCTGGGATTAOVGGCATGAGCC^ 

5 0 CAGCTCCTGTTTGTTTTTCAAGATTTAGCCAAGGGGACTTCTGAGAAG 

TTTGAGAACATGGGCTTCAGTCAGACATACCJTGGATTCAAAT^ 

AGCCGGCTGAAGAAGTGCAGTGGCACCATCATAG CTCACTGCAGC CTCAAACTCOTGGGCTCAAGTGATTCTCTTGCCTCATTCTC 
ATGCCTGGCACTACAAGCATGTGCrACTATGCCCAGCTCAAATTCTTTTAGTGATGCATGACCT^ 
TTTACCTGCATTTCTTTATGTGTAGAAGAGGGATAATGAAACTTGCAGTGGCAAGCAC^ 
55 AGGTGCTG CTGCCGCCATCGCTATCATCATCATCATCATAAGAATTAOTGTCTCCTATGAG CTCCCAT AAACAATCTCATTGCTTT 

GCAAATCGCTTTAAATATTTTCTTGTCCTTTCTTC^CAATAAACT 

CAGTG CTTAACACACGCTTAGGTGCTT AGT AAATATTTAAGTATG TGCAAG CAT AAGAAG CTTATTTT ATGGATG ACAGTG T AAGA 

AAGAGTACAGACTCTGCTGATTACCAGCCGTGTGGCCTTTGAGCT 

AGTGCCTACCTCAGAATTTAAAGATTACATCTCATATTGTATGTAAAGT^ 

6 0 TTAGCT ATTAT TTAAAAAAT AAATCAAT CTTG G ATCTTGGCGAGATTTG ACAGTAG AT CTATTTTACTTTAAGTGACATGTG AAAT 

GAAGATTTAAGAGTACTGTATTTTATATAGTCACGCTTACAT^ 

ATTTCATCATTATGCAAACATTGTATAGTGTACAGTTATCCCT 

AAATCCCTGAATATTCAAGTCCAGAAGTTGGCCCTGTGGAACCTGAGAATTCAA 

65 T AC AAACCTAGATGTTATATAT ATC T ACAAACGTAG ATGTTACACAC CT AGG CTGTATGGT AT ATAG CCT ATTG CTCCT AATCTAC 
AAACCTGTGCAGCATGTTACTCTATTTAATACTGTAGGCAACTGTAACACAACAGT 

CTAGAAAAGGT A(^GTAAAAGT ACAGT ATTAAAG ATAAAAAATGG TC CAC CTGTGG AGGGCACTT AACATG AAT AG GGC TTGCAGG 
ACTGGAAATT'GCTCTGGGTGAGTCAGGGAGTATGTGGTGAGTGAATGTGAAGGCCTAGGG 

ACAAACACTGCACACTTAGG CTATGCT AAATTT AT AAAAT ATTTTTCTT CAATAGT AAATTAGC CTTAGCTTACTGT AACTTTT AC 

7 0 TTTATAAACTTAAAAATTTTTAAACTTTTTAACTCTTGTACTCATACTTAGCTC 

CTTTCATATCCTTATTCTTATAAGCTTTTACGATTTCAAACATTTCTTATTTTTAAAACT 

CATACATTAGCTGAGGCCTCCAGAGAGTCAGGATGATCAATGTCACTGT CTTCTACCTCCACCTCCTGTC CCACTGGAAGGTCTTC 
AGGGGAATTAACAGGCATGGAGCAGTCACCTCCTATGACAACAATGCCTTCT 
GTTTTACAGTTACCTTTTTTTTGTTGTTT^ 
7 5 TAGTGAATGTATAAGCCAGTAACATATTTGTTATCGTTATATGACTGGCAGTGCAGTAGGT 
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TGTGAGTACTAGTGCCTTATGCCGTGA^ 
TACAGGATCTCTGTCIATATATATATAGTCTTGTAGACTGA^ 

TATTG AAAGGATTT AAGGCTGGGCG CGGTGG CTCATOCCTGT AATC CCAG CACTTTGGGAGGCCGAGGCGGGTGGATCATG AGTTC 

AGGAGATCGAGACCACGGTGAAACCCCATCTCTACTAAAAATACAAAAAATTAGCCXSGGCGCG^ 
5 CTACTCGG45AGGCTAAGGCAGGAGAAT13GCATC 

CIXX3GCGACAGAGTGAGACTCCATCTCAAAAAAAAAAAAAAAAA 

TGATGAAA.TTTTCCITTTCTATAGTATAGATAGAATTTTGGAA 

CACATTTCTGGTTATGCTTAGAAGGGTTAAATACTACTTGAAAAG 

ACAGAAATTTAGAACTATATAGTGACCrrATGACTAATCCCACTGCAAAAGGAGCAGGTC 
1 0 AAATGTTTGTTCCCTTGATCTATAAATTGATAATCCIT^ 

GTGTTTTTTTATGTGTGTG CCT ACGT ATG'lTri'Trr Cl^GTG ACCTAAGGCTTTGTGGAAGTTTAAAATAGAGTGATTTT AATG AA 

TGGCTGGAGTAATAAAGATAGTTGGTGGGCTTCTCCATGGAAGCAAGTTGGTTATTTTTAAC 

CATATGCCACTATAGAAGTTATTGTTTCTGTTGA 

GCTAATACTTTGAAAGTGAGTTCTGTGACrrATTATTACTTAGTTGGT^ 
1 5 GTT CTTGTGACTATTATT ACTTAGTTGGTTCTAGGAGGG C CAGTTTT T AGCTAAT ACTITGAAAGTGAGTTCTGTGACTATCAAAGA 

C C^GAGGAATAATCTCAAAG AAG CTGATCAGATCATGGAACT AAAG^ 

ACTTTTGGTAATAATATTTAATGTAACTTATTTAGCAGTGCGTGTTC^ 

TGAAAAAGTAGGGCAGGACreCTGGCTTAAAATAGGAGTGAAAGCAGTGAACAGACATAGCT 

TGAGG ATAAG ACT AC CACAAT AATCTAAAAATAATAGCTTATAGG C CAATAACCTTTAAGGTTTATGTTTCAT ACT CAC CGCAGGT 
2 0 CAACCCJATGTGGCTGTTTCTCATTTATGCAGATCTACATCW 

AATGTATATTGTAGTAGXIATACAGAATCTGATCTTCLAAAAGGTTC 

ATTTATTTAATTTTT ATT TTG AT AATT ATAG ATTCACAGGAGGTTGTAAATAAAGAAATGTAAAGG^ CTCT C C 

CCTAACTCCCITCCCTCTCriTTGCCCGTGT 

TACAG TCCAT AG AG CTTGTTCAG ATTT CACCAGTTATACATG CATT C CTGT CTCTGTATGTGTGTTTAG CTTT ACACAATTTCACA 

2 5 TAT AG GTAGCAT AACT ACCAG CACAAT AAGGGTACT CAACAGT AC CGTAAC CACAAG ATTT AGTTACTTCC CCTGT AT AG CTACAC 

CAGTCCCCCTTCTCCACGGGAAACTCAGTTTTATAGTTTGGCrATTGTTTT^ 
GCTTCTGATTTGGAAGCATTGTTAAATAATTTTTAATGAAATACAGGTAATCTGAAAGCAGCT 
TGGGAACACTGAAAGACTTCTTGGATGCATAGGCTTAGAATGATTCTCAGCTGTGATTTATT^ 
CCCTCCCTCCCTTCCTCCCTTCCTCCCTTCCTCCCTCCCTTCCTTCCCCT 

3 0 CCCTCCTTCCCCCTCCATCCCrTCCCCTCTCTCAATATTCTGTAATATTGAAATATl^ 

ATATCTGAAATTTTCTCCTTTCTTCCITTTAGTATAC^ 

TAAAAGTGTGGAAGTGGCTTTGGAACTTCTGAAAGACCCAGCCCAAGTCCCATCTTCATTTATTTA^ 
TTTTTTCCAAGGCTGGATCTCGGCTCACTGCAACCTCCGCCT 

GGACTACAGGTGCACACCACCATGCCCGACTAATTTTTGTGGTTTTTAGTAGAGATA^ 

3 5 CTCTTGAACTCCAGGCCTCAAGCGATCCATCTGCTTCATCCT 

AAGTCTCAACTTTCTGATACACTTAATTGATTTCTGTAGCT^ 
CCATACAATCTACTGCTTAACTATTCTGTGTTTATATCTTGTTTTATGCGAATTCT 
AGAATAAATGTGTTAAACITGTATATTTCGGCCGGGCGCAGTGGCrTACGCCTGT 
GGATCACCTCAGGTCAGGAGTTCGAGATCAGCCTGGGCAACACGGTGAAACCCCG 
40 TGGTGGCACATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGA^ 
CXSAGATCGTGCCATTGCACTCCAGCCTGGGC^CAAG^ 

AAAACAACTTGTATATTTCACACATTACCCATACAATGTGAGCATGTAGTAGG 

AT AATTACTG CTCTAGT ATTTTTT CC CTAGCACITTCAAACGTT AC T CAAAG AATCCTTCTAGCTCTTTGATACAAATTTTAG AAC 
TTCAOTATTAACAAATATCAGCTGGGCTCGGTGGCTCACX3CCTGTAA 

4 5 AGGTCAGGAATTTGAGATCAGCCTGACCAACATGGTGAAACCTTGTCTCT 

TCAACTGTAATCCTAGCTACTTGGGAGGCTGAGACAGGAGAATTGOTTGAACCTG^ 

G CCACTGCAT TCCAGCCTGGGC AACAG AGCGAG ACACCGT CT CAAAAAAAATAAAAATAAAAAT AAAAAT AAATAAAT AAAATAAA 
GTAAAATAAAAATATATATATATCACTACTAGCTAGGATTTTAGGAAGG CTAC CTAT AGAAACTT AG ATATATTT AGT TATGG AAT 
GTTTTGTTTTTCAGTTGGTAATGTACACTCTTTCAGTGTTTCAAAGGAGGAT^ 

5 0 AACTGGTGTTGGAGAAACCAGAGGAGTTGATGATTTTATAC^CAGTCrrTTCCT 

GGTCAATGAATTTCTTCAAAGTTTGCTGCTTTTAAATCAA^ 

TTCATTGTTATGAAGAAGTGTTAGTAGCACGTGGCATTATGTGTTTCAC^TATTGAAGTTGACACCA 

TTTTTTTTTTTTCTGAGACAGAGTCTAGCACTGCTGCCCAGGCT 

CCTGGGCCCAAGTGATCGTTCCGCCTCCACCTACACCTCCC^G™ 

5 5 TAGTGATTTGTTATTTTTTAAATCTCTATTCTC TTTTTAAT ACAGAG CTG CAAATTT CTT ACTTTTCTTTAGTGCAATCCATGTGT 

GAAATATGTGTTATAATTTGTAACTAACAATATTTGAAGTCCTGAAGTTTTTGAATGATCTTGTATGTA^ 
TGGTTCCCACTAATAACCACTTCATTATTATCACTT^ 
TTTATCTAGAATTGTGTAAAATACTTTCATTTTCATGAGAGAGTATCT^ 
TGTTGGTTGGACTTAAACTTTAGTGACCTCTCAGAGA 

6 0 AGAAGTACAAGAGATTATTTAATCCTATTGTCrGAATGTTTGTGACCCCCCAGAATTT^ 

ATGGTATTAGGAGGTATGGTCTTGGGGAGGTGAAGTTTTTATAAATTTCTAAAAGAAAGTATAGG 

GTAGACTAT'l"l"rClT AGATAGGTCAT AGAAG GCATG AACCAT AAACGCTGACAATTGGACTTT TGAAAAATTG TTCTT CTAAAGAT 

GTTTAAGAAAATGAAGAGGC^GCCACAGAATGGGAGAAAATATCTATACTGCATTAGTG 

TTAGAAGATTGTCTATATAGTATAAOTGATTATAAAATAATGTATTTACATAT1TAAAATATTAAGATATA 

6 5 AGTAGTTGTCTTTGGCTGGTGAGATTCTAAGAGTTTTGTTTTTGTGGTC^ 

TTTTGCCTCAAGTGTATAGTATCTTTGATTTATAAAAATCTTTAAATAAAGATA 

AT CAATACTTG ATACT ACATGGAT AAT ACTTTC CTATT CT ATTTTTGT A CHTG AAAATTTCAAAG AATTTTTTGTGT 
' GTGAGTGTGTGTTTTGAGGTGGAGTCTCGCTATGTTGCCCAGGCTGGTCTCAAACTCCT 
CCCCTGTAGCATTGAGATTACAAGCATGAGCCACCATTCCCAGGC 

7 0 TTTTAGTCC^GGTGCAATGGCTCAGATCTGTAAACCTG^ 

AGACCAGCCTGGGCAACATGATGACACCTGTCTCTAC^GAAAATTTAAAAATTAGTCAGGrc 
GCTACTCTGGAAGCTGAGGTGAGAGGATTGCTTGAGGAGTTCC^GGCTA^ 

CTG AGTGACAGAGTG AG AGAC CCTGTGTCTAAAAATGAAAAAAGAAAATCTTT AC TTT ATTT AT TTATTTTTTG AGAT AG AG TCTT 
GCTCAGTTGCCCIAGGCTGGAGTGCAGTGGTGTGATCTCGGrCTCACTGC^ 
7 5 AGCCTC CCG AGTAG CTGG AACTACAGGTGTCTAC CACCATG CC CGG CTAATTTTTGTGTTTTT AATAGAAACAGG ATT T CACCAT A 
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TTGGCCAGGCTGGTCACGAACTCCTGACCTK5TTATCOT 

CACC CAG CTGAAAAT CITTACTTTATTTACAG CACTTTGTGAGCCAT ATTTAG CAAACATAGTTC CACATATGAAGTTTG TTTTTA 
ATGATAAGGGGTTTTTAAACTTCATTTTTAGTA^ 
ACCCCTAACTGCTGGCTGGTTTTCTAGAATTGTAGAAGCAGGTGOT 
5 TGGGT AAGTTCCAGCAGT ATTGG ATGTTGGT AATATTACTGTTG^ CTCT AAAGT CT 
AATTCTATGT A1TTTT TAAGTAAATAGTAGAATAACATGGCATAACATTTTAA 
CTGTCAGCCACCAGTTTCCTGCCIATGGGACTGTTAAATCTCACC^ 

ACACTGATTTAGCTTGC TGC^VGG AAGGAATAACATACAT AAGAGCTGTGGGACCTCTCRAAAGGATAGAGTTAGGAAAGAGGA 
CATACAGGATTTTAGGAGCTGTTAGTCACAGAGTAATGTTAGGATAA 
10 TTGAGTGAGTTGCrGGAACAGCCAGTAGTTTTGCCTCTAGGACAX^T 

TAGAACATACGGGTATGAGTGAGCTGATATTAATAAAGGTTTGAGGCTGGGCACAOT 

GAGTCTAGGATGGGAGGATCGCATGAGGCCAGGAGTTTGAGACTAGCCTGTTTAACTTAG 

TTTTAAAATTAAAATTAACTGAGTGTGGTGGCACACACCT 

CAAG AG TTGG AGACTG CAGTGAGCTATGAT CATGC CACTGCACC CCAG CCTGGGTG ACAGAGGGAGACCCT CT CTCTTAAAAAGAA 
1 5 AAAAAATAAGGTTTGAGCTTAGATGGCCTGTGATGCAC 

AAGTGCnTTTCTGTGCATCTGCAAGCAATTTGTAGTGTAAT^ 
CGCTAAACTGGAACATTTACCATTTAGGGGAGGTGATCAATC^ 

CTTTGATGGT AGACATTTTGGCCT CTACAGGCAC CAGGCCATTTTGTT CTTTTTiTT CT1" ITTTCTU'T TTTl u l u rrTTT GAG ACGGA 
GTCTTGCTCTGTTGCCCAGGCTGGAGTGCGGTGGCACGATCTCGGCT 
2 0 GCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACCCGCCACCATC 

CCATGTTAGCCAGG ATGGC CT CGATGTTCT TTTTCT T AGATG ATAATTTGTTGAAT CAG CTGGTGTGACAGTGGCTGCACAGCAGC 
ATT T AAGACTCTAAAGATCAC&CACTTGGCATCTC CAG ACAGTT ATT ATT AAAGTT TAAAAT AATGTCCCA 

TATATCCCATAATCTATTTGX^TTrTCCTTTGGGCAACCAAATATAT^ 

CAACTGTC C CATATTTGGTTTATATTTTGC CAAGG AT AAGCATCTTG ATAGTGCT AA TATCTGTTAGCAG CC 

2 5 TAAAATCTTACAACAGTGGAGAGAGAAGAAAATGCCAAGAG^ 

GGCATCTGGGAGGTTGTGTTCTGATTATCTTGGACATAAATAGTCTATTCATTTTA 

AAATCTTGGCTCIACCCACTGTTTGTCCATGGCAATGATGTCGGCAGTATG CATCATTAATTTATT CTCTGTAGTGTCAGTAATGGG 
TG CATGATACAAACATTT ACAAAAGTT CTTTTGATAAGTCTTCCACAAGGCTCAATTCCTATCTTTAG CTTATAG CCAGG AACTTT 
AGATGAATCTAAGGAAGAAGCGGAAACCACCTGTGGATCATGGGACTATAG 

3 0 AGAATGGAAGGGAGTGGAATCTTCTATTGCAGAAAGTACATAATGAATTTTATGAGTAGTCACT^ 

TCT CAC AAACAGGGAGG ACATTGACACTCT CTAGG ACl^ TGTATTATGAGTTTCACCATTCCAAG 
TTTAOTGAAACCrAGAAATAAAATCCTTTTTTAGATAGCTATTGTGGC^GTCTTTGTACTCGAAT 
TTTGAGACGAAGTTTCTCTCTTGTTTCCCAGGCTGGAGTGCAATGGC^ 
AGCAATTCTCCTGCCTCAGCCITCCAAGTAGCTGGGATT 

3 5 AGATGGGGTTTCACCATGTTGGTCAGGCTGGTCTCGAACTCCTG 

TAAGC CG AAAGATACTT AAATGTTG AAGT AT ATTT ATTTT AAC CACATTT TAT T ATACATGATTAAATTGGG AAGGCT TAACTTT C 
CTTATGATTTGTGAGGTTTTTTTTTCCCTTTAACCCAAATTTAATAA 

TCAGAGT ATG C CAAACAAAG CTAATAATTTTTAT AATCCTT TAAAAAATTTTT AAT AAAGTAAAGGACT AAAG CTTTGTTTTAC CT 
AGTGG CGATTT ATT AT AAAC CCAAAGATGTAGTAGGTGTAAAAG 

4 0 AGGGTCTCGCTCAGCTGTCTAGGCTGGAGTGCAGTGGCTGGATC^ 

CCTGCCTCAGCCTCCTGAGCAGCTGGGACTATAGGTGTGGACC^ 
TTTCGCCATGTTGCGCAGGCTGGTCTCGAACTCCTGAGCTCAA^ 
TATAGGTGTGAGTCATCATACCTGGACTACTTTTTGCTTTTTGACTG^ 
TAGATAGTTGTAGTCTTTACTTAAGTTTGAATGAAGTGAGTTATGCGTGGGCT 

4 5 TTG ACCATATACC CACTTT ACTGG AG AATGAG T ATGCCAAG AG ACCGAGG CAG AAT CTG CAAAATCACTTCTG CTG CAATGT ATTT 

GTTAAAGCAGGTCACTAAGGCCAGTTCAGATTCTACGGGAGAGA 
TGGAATCCAGTGGTGCTATCTTGGCTCACTGCAGCC^^ 

GGG ACTCCAGGCTTGTG ACAC CACATTCGGCTAA'l' J l' 1"1TGT TGTGTTTTTAGT AG AG ACGGGGTTTCAC CATGTTGCCCAGG CTGG 
ACTCAAACTCCTGAGCTC^GCATTCTGCCTGGCTCAGCCTGCCAAAGTGC 
50 AGACTCTACTTCTTGATAGGATAGGGTCAGGTCACTGCTATGCTTTAAATATGATCCCCAGAAAGC^TATGTTGGCTGCCTGGGTG 
ATGGGTATACTAGAAGCCCAAACTTTACCATTATGCAGTATATCCATGTAAA^ 

AAAAAT AAAAT AAGTATGTATTAGAAAATT TATC CCAAATGCAGT AGTGT CG^GAGGTGGGGC CT AATGAG AGGTGTTTAGG TCAT 
GAAGCCACCTCTAATGAATGCATTAATGTTGATTATAAAAGGG CTT AAGGCTGCAAGTTCTATCTCTTG CTCTCTCTTATCCCTCT 
TTGCCCTTCCACTATGGGATGATGCAGCAAGAAGACCCATGCCAGA^ 

5 5 TATGAATGAGTAAATTTCTATTCATTATAAATTAT^ 

ATT ACAGAAG AGT ATGTAGAGTG AG AAAT ATT AT TG T AG C CATT TTTTTTT AGTTTT TG CTTT TTTT AAAAAATTATT CTTT AAGT 
TCTAGGGTACATGTGCTCAACATGCAGGTTTGCTACATAGGTATACATGTGCCATGTTGGTT^ 
TACATTAGGTATTTCTCCTAATGCTCTCCCTCCCCCAGCCCCCCATCCTCCGACCAGCCCCAG 
TCCATGTGATCTCATTGTTCAATTCCCACCCAAGAGTGAGAACATGTGGTGTTTGGT^ 
60 TGATX3GTTTCCAGCTTCATCCATGTCCCTGCAAAGGACATGAACTCAT 
GCCACGTTTTCTTTATCCAGTCTATt^TTGATGGACATTTGGGTTC 

ATACX5TGCACATGTGTCTTT ATAGT AGAATGATT TATAATCCTTTG AGTATATG CCTAGT AATGGGATTGCTG GGT CAAATGGT AT 
TTCTAGTTCTAGATCCTTGAGGAATCACCACACTGTCTTCCACAGTGGTTG 

CACATCCTCTCCAGCATCTGTTGTTTCCTGACTTTTTAATGATCACCATTCTAACTGGTGTGAGATGGTATCTTAT 

65 • ATTTGCATTTCTCTGATGACCAGTGATGATGAGCATTTTTGCATATG^ 
GTTCATATCCTTTGCCCACTTTCTGATGGGGTTGTTTTTTTCTTGTA 
TTTGTCAGATGGGTAGATTGCAAAAATTTTTTCTGTTTTTGCTT 
AGCTGACTGCAGCCTCAACCTATTGGGTTCCAGAGATCCTTCTGTGTCAGCCT 
ACACCTGGATAATTTAAAAAAAAAATTTTGTAGAGATGGGGGTCTC^ 

7 0 CAGTCTCCCCAACTTGGATTCCCAAAGTGCTGGGATTACAGGAGTGAGTC^ 

AAAATGTG CTACAG ACATTGT AAGGGCAAAAT AT TTAACAAT AACTGAGATTTCT AAAAATTG CAG ATTGG CTAAATGTTAATAAA 
TT ATG ATCTAAG T AAATTAT AT CTAC CT AG GAG AAAAT AAAAAAAAATTTATGG TAT TTT AAGGT ATTT ATTTATGTACTTATG CA 
TTG AGATATTTGCC^tTCAAAATTTT TACT AGTGCGT ATAAG TACCT AACACATAGT AG CAATT CAATAAAATT ATTTGAATGAATT 
AATTGAGGATACTTTTCGATCATAGTTTTTATGGTGAGAGTTTTCATTTTGTTCTGTCTTATTT 

75 TTCG^TTTAGTCTATTCAAGATGTGTTTTTTGT^GTTTGT^ 
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GTGGTGCGATCTCAGCTCACTGCAACCTCCACTTCCTGGGTTCAAGTGATTCT 
CCACACCCGGCTAATTTTTTATTTTTAGTAGAGA 
CCTGCCTCCGCCTCCCAATGTTCTGGGATTACAGGTGTGAGCCACC 
ACTGGGTAATTTGTGAACAACAGAAATTTATTTCT^^ 
5 TGTCTGGTGAGGGCCTCTTCCTTATAAGTGGCACCTTCTATGTGTCCT 
TTTTGTAATGGCACAAATGCCATTCATGAGGGCA^ 
TGTTGGGGATTAGGTTTTACCTATGAATTTTTGGAACATTCAAATC^ 
CATTCrTTTTTTTGTTTTTTTGA^ 

TCTGCCTCCCGGGTTCAAGTGACTACCCTGTCTCAGCCTCCCGAGTAGCTGGGATTACAGGCGC 
10 TTTGTATTTTTGGTAGAGGCAAGAGTGTCACCA^^ 

CCTCCCGAAGTGCTGTGATTACAGGCGTGAGCCACTGCACCTGGC 

ATTTTGAATCAAGCATTTT AAAAATGT CTGCTTTT CAT AAAT AGTTT ATTTTT ACAT ATTT ATCCCTTT CATT^ 
TCATGAGTrTTTTTTTTAATCAGCTGGCA 
TGGGAATATATCTTTTGTAATX3TTTTGCTTGTAATAATTGA 
1 5 CATTTAAAATGGACTCTGATTTAAAGGTACTGTTGGAGGCATTGGTGACGOT 
TCTATTTTCAAGAAACTGCCATAGCAATATTTTTAGTCTCAT^ 
TATTTCCACTCTTCTTGAACCTGGGCAGGCITGTGACT 

C CATGAT ATAAATAGGATACAACTGATC CAGCAATCG C ACTACTGAGTATCTACCCAAAGGAAAATAAAT CAACATGTTTAT TG CA 
GTACTATTCACAACAGCAAAGATACAGAATCAGCGTTGAGTATCTAT^ 

2 0 CAATGGAATATTGTTCAACCATAAAAAAAO^AAACAAAATAATGT 

AGTTAAATAAGCTAGGCACAAAAAGACAAATATCATATA^ 

GAATGG AAAAT AGAT AAG AG AG ACTGG GAAGGGTGAG TGGGGGAAAGGGGAAG G ATGAAGAGAAGTGAGTT AAAGGGTACAAACAT 
* ACATTAAG ATAG GAGTAGATTCAATGTTTGATAGCAG AG CAGG ATGACT AT ACC T AACAAAAATGTATTG T ACCTGGTG ATGAACA 
TC CTGAAGACCCTGG CTTAATACT ATGCATAATATACATGTAACATAATT CCT CATGTAAT CTIX3TAAATTTG CACAAATAAAAAGA 

CATGTGGGTATTCTCTTCAAAAACCTGAGCTAAGCTGCAGCCAACA^ 

AATTCCAGCCCCAGGTTGTCGAGTATCCTGCTGAGGCCCCAGAAGTTGCGGAGCAGAGACAAGCCAGCCCCACTGTGCCCTGTCTG 
AATTTATGGTCCACAGAACCTCTGATAGATAATAAGTGATTATTGCCTTAAGCCACT 
TAGATAATTCATATAATTTCTAGAAACTTATAGGGAAATAAAGATTTGCTC 
30 ATGATCCATGTATATATATATGATATGGGTGATATATA^ 

CACACACATACAT ATAT AT ATGTT CT CTGCTGTATCATTTATTTATGTTT TCTTTTTTTTAG AG G CAG AATCTCACTCTGT CAC CC 
AGGCTAGAGTACAGTGGCTCAGTCATAGGTCACTGCAGCCTTGAACTCCT 

CACTGGGATTATAGGTGTGAGCCACGATG CCTGGC CCTG CAGTATAATTTACAATTTTTTGAAATAGTCTAGCT ATCCAACAATAA 
GAAAATGTTTAAATAAATTATCATAAGCACACTATTTTCCTAAGTAAAATTTCTACATTTrCTATATT 

3 5 AATTGGTATATAGTCAATTATGTCATAATGATAAACCTAATTGA 

GCTT TACAT CTGATAAAG GACATGT AC CC AGAACATACAAGAACTT TCGT CAGTCAATAAG AAAAGATACT AAATGTTTTAAATGG 
ACAAAAGAATTGAACAAGCAC ATC CCCAAAGAAGAT AT TCAAATGG CCCTTAAGCAAGTAGAAAGGTCCT CAACATCACTACTCAT 
CAGG GAAATG CAAAGTAG AGAACAAAG AG AT ACCGGT AGACG TTCACTGT ACTGGAATAAAATTT AG CAG ATG AGCAATATTGAGT 
GTTGACAAGGATATGAAGT AACTGGAAC C CTT ACACCTTG AT GGTGTGAGGGT AAATGGT AT CACTTTGT GAACTGTTGGGT AAAA 

4 0 CCAATGTTGACCCCATAAACTTTO^CCCATTCrTGGGTATATACCT 

GAATTATATTCATAGCAACTGTATTCATAATGGCTCAAATTGGAAAGCTGCTCTAGTCATACATTGTCTGCC 

CG CAGTAGTG CTT AG CATGT AGCT ACCTGTGTTT AGTTT AATATGTATTATTAATG AAAATTTAT TTAGGGG ATT T C ACCCC CAAT 

TTTTTAACTTTATGGTTTCAAATTTCTTAAAACCTTTCTCTTTACATCAGTTTGGTT 

ACAGATGGCCCAAAGCACAAATTATTAAAATGCTAACTTTTATCTAC^ 

4 5 TTTTTITTTTTTTTTTGAGACTGAGTCrGTGTC^ 

GTCATGCATCCGAGAAATGTGTCATGAGATGACTTTGT<^ 

ACTGCTATACACCTAGATGGTGTAACTACTACTATGCATAGTCTATATGATATAGCCTATTACTTCTAGGCCACAAAC 
CAGGTTACTGTACTGGATACTGTAGGCAGTTATAACACAGTGCT 

CAATAAAAACACAG TATAAAAGGCAAAAACTGGT ACAC CTGTAT AGG GCACTT AACCATG AATGGGG CTTACAGG ACTGGAAGTTG 

5 0 CTCTGGGTGAGTCGGAGGTGAATGATGAGTGAATGTGAGGGCCTAG 

AACACTGAAGACTTAGGCTACACTAAATTTA 

GGGATGCTGAGGTCGGCAGATCACTTGAGTCCAGGAGTTCAAGACC^TCTGGG 

ACAAAAATTAGC CAGGCATGG TGG CGCACAC CTGT AG T C C CAGCTGAGG ACTGGGAG GCTGAGGTGAG AG GAT CATTTGAGC CAGG 
AGGTTGAAGGTGCAGTGAGCCATGGTAGTGCCATTGCACTCCCATCT 

5 5 TTTCTTTAATAATATATTAAACTTAGCTTACTGTTAAGTTTTTACT 

GTAATAACACCTAGCTTAAAAGACAAGTACATTGTATAGCT 
AAAAAGTTTAAATTTTTTTTTAACTTTTAGAATATTTTGGTAAA 
CACACACIX1AGCCTAGGTCTAGACAGG 
CAGGAGCACTAACAGGGATGGAGCTGTTATCTCCTAGG 

6 0 T(^TTGAGGAGATGTCACTCTTTTCAGAAATTTGCCCATGGTGTTTTGCTTGGTTTATTT 

AGCAGTTAATGCAGCATGAACATTCCTGTGTAT^AATGAAAACCTTTCTGTTTTTGGGGGTC 
TTTGTTGAGGTGTGCAAAAGCTTCTCCCAAACCCTTCACTGTGAGTTTTGTTGGGG^ 
CTCTCTTGCTGCTTCTTCAGCTTTGCATTCCTATTCCAGTTCCCACT 
TACT AGATGACAGGAGTTTTT CAG CTCCATTTAACTACAGTCCATTGTTGA 

6 5 GCTATTTTAGTGGTTGGGTAGCTGCTATCGTTTTTTC^^ 

CGAAACTTATAATAGGACTTTCTTTACTTGTAAATGATACAATATGGTATGGCTAG 

CAACTTG GTT CCAT AAAGCTG GAAGTG GAAACAT TGG ACAT ATAT CAG AAATTCAATTGAACG C CT C CT TTT AAATTTATGG TTT A 
CAG ACTTGTGGCT CTT AGATC C CAGGGGAT CTAAAGT AG ACTGTGTT ATAAAGT AGGCACAT AACAAG ACAACAGCCA^ 
ATATTAAATGAACTTGCTAATTCTCCATGATATTATATATATAAACTTTGTACCT 

7 0 CAGGTTAGAAAACATAACTGGACTTCCCTT^^ 

AGTGTCACAATCTCGGCTCACTGCAACCTCC^TCTCCCAGATC 
AGGTGTGTGCCATCACACCCXSGCTAATTTTTGTATTTTTAG 
CGACCTCAGGTGATCCACCTACCTTGGCCTCCCAAGGT^ 
CTTTTATGCTCTGTTACTACAGCAAATTATTCTCAGA 
7 5 ACATTAAGCTACTGCCTTTTGTTTTATTCTGTTTTGTTTTGTTT^ 



2196 



WO 03/008583 PCT/USO 1/5 1291 



CAGTGGCGCGATCTCAGCTCACTGCAACCTCTGCCTC^ 
TGGGCACirGTCACTACACCTGGCTAA 

CTGACCTCAAGTGAT CTG CCTC CCTCAGCCT (2CCAAAGTGCTGGGATT ACAGGCGTGAGC CATGGTG CCCG G CCGAGACTTCTAAA 
ACATAGGTGTCATTAGGACAGTTCTATAGGAGAAATAAATGCCATTTTAGTATTAT 
5 AGTTTCATTTTTAACACCTGTTCTATAGATAGAA 

CTGAATCAGCTTCAAGTCTTTCATGTTTATGTGTAAAACATAG 
AGGCCGAGGCAGGTGGATCACCTGAGCTC^GAGTTCGAG^ 
' AAAATTAGCCAGGTGTGGTGGCATGCTTCTGTAA 
GAGGTTGCAGTGAGCTGAGATCGTGTCACTGCATTCC 
10 AAC CATT ATAATTTATGCACACACAAAT ATTTAAAATG ACT CATTTATGATACATAGT AT 

CTTAGAATTTTTTCC CCACGTACTGGTGC TGTGGAT^ 

GTTCTGTTTGTTCT CTATACATTTAAATATTTTAAATTATTATTATTAT TTTTTTGTAG CTGTCCCCTTTC CTC CAACCCAACGGC 
TTACTTTCAAGGAAGT ATTTGAGAATGGGAAAC CTAAAGTTGATGTTTT AAAAAACCATTTCGT AAAGGAAGGATO 
G AAGTAGC CTTAAAGAT AAT CAATGATGGGG CTGCCATCCTGAGG GAAG AGAAG ACT ATGAT AGAAG TAGATGCTCCAAT CACAGG 
1 5 TATAAAAAGTCTTTGCATGATACTTTTTTAOVGTATAGATTTGCATGAGCAGTT^ 
GTGGTGTGGTAATTTTTCTAGAAATTATGAGACAGT 

GTCTCATATTTCCAGTAGAACTATTTAGTATGAGTAGATTGACTACAGTTT^ 
TTAGAAGTTAAACTGTGATCCCTTTTGGTGTAATGCCCTT 

T AGTATGTGGTGATATT CATGGACAATTCTTTGACCT AATG AAGTT ATTTGAAGTTG CAC CTAGT AACACACGCTACCT C 

20 TTTCTGGGTG ACT ATGTGGACAGAGG CTATTTCAGT AT AGAGGTAAAAATT AA^ T ATATTG T CTTTC 
AAAAAAGATG ATTTC CATTCTT CAGTAGAAGAAATT AAAATG AGAACT CTGGCAG AATT TAT AAAC TTTGTT ACTTTCAAAATT AA 
AGTAGTAGAAATGTTTTAATTGATAGAACAGTTATTCCTTTTTTAAAACCAGTCGAA 

CTGTTCACCCT AAATGGGGTTT AGATTAGCTGAC CTTAATTATAAG AC CAT AAG CTG CTGAGCT ACAATTAACACTTAGATTAACT 

2 5 GCCAGTGCTAAATAAACACTAAGCTTTTTGTAGG 

AGGGGAAAAATGAGTTGT AGT T AATTAAAACTTATACHTTC CTTTT ATAT ATTTTGAAAATGGAAAGAT AGGAACA 

T AAAT AG AAG CTTT AATGTGT AAAAGAT AAAACT ACTTT AGAAC TTT ATG CAAG G ACTTGAGT ACAGTGGAACCACAGCACATT CC 

TAGTTAATGTTCATTGATTCAACT CT ACATG CTGGCAAAT AAAAG AGAAAAAAACTTTTGAAATAG CGTGGGTAAT AGGATTTG T A 

ATGTGACCAAGAATAAGCATGAGCTGAGAAATTGAGTCTGCITTCT 

TGATGCCCAAGCAAAGGAGCAGCAGTTGCTCT^ 
GCAATCTAAATTCATGAATCAGAAAAGTTCAATGTTGGCCAGGC^ 

G GCAGG CAG ATT ACGAAGTCAGG AG ATCAAGACCAT C CITGGTAACATCX5TG AAACGC CGT CTCT ACTAAAAATACAAAAAATTAG 
CCGGGTGTGTAGGCAGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAG^ 

3 5 AGTGAGCTGAGATCACGCCTCTGCACTCCAGCCTGGGCGACAGAG 

TAGTTGCCAGTGCACAAAATTCATTTAGTGTGA^ 

TTCCTTCTTGCTCCX3TAAATGTCCAAC4GTCTCTAAATAGGTACTGGGAATAGGCGGAGATTA 
GCTTCCCTGTTGCCACTCATATCATGCATGTTTCAGTAGACATAAGTCAAATGTATGGATGTGGGGCTGCGCG 
CTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACCrGAGGTCAGOAGTTCGAGACCAGCCTGG 
40 CCCATCTOTACTAAAAATACGAAAGTTTACCAGGCMGGTGGTGGGCGCCTGTAATCCCMCTA 
ATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGGGCTATTGTGCCGCTACAC 
AAAAAAAAAAAGAAAGAAAGAAAGAAAGTATGGATGTTAGAAAGAC^ 
G AG AAGGTTTCAGACGGTAAAGATT AGAAG AAT ACAAAAATTACTGAATTTCTA^ 
AGATGGGGTCCTGCCATGTTGCCTAGGCTGGAGTGCAGTGGCTATACACAGGTGCGATCATTTCG 

4 5 G ATTCAAG C CAT CCT C CTG CCTCAGCTT C CTG AG TAT ATG GAACT AT AG ATGCACAC CACTGTG C CTGG CTAGT CATTGTATTTAA 

TAAAACAACITTTAAAATATTTATTTATTTATT 

CAATGGCACGATCTCGGCCCACCGCAACGTCCGCCTCCCAGGTTCAAAT^ 

CAGGCACCTGCCACCACGTCTGGCTAATTTTTTGTATT^ 

GCTGACCTCGTGGTCCACCCACCTTGGCCTCCCAAAGTGCTAGGATTACAGGTG 

5 0 TTTTTTTGAGATGGAGTCrrTGCTCTGTTGCCCTGGCTCGAGTGCAGTGG 

TTTAAGCAGTTCTCTGCCTCrcCCTCCTGAGTAGCTGGGATTACAGGTGCTTGCCACCACA 
AGAGATGGGGTTTCACCATCTTGGCClAGGCrGGTCTTGGACTCCTGAA 

ATT ACAGGCATGAGCCACCGCG CCCAG CCTAAAACAACTTTTTAGTGATAAATTATTATGCCAAATTATCTTCTGATTAACTCGTA 
TAACTTAAAATAGCTTTAAAATAGTTTTAATTCATTAATACATTCAATAAAGCAC^ 

5 5 GACATCATGTTCAGAATATTTTGTGTTATGCA1TTTTTT 

TGTGAGAATAAAATAAAG AAT AAAATTTATG CT CTGT AGGAGCTT C CAGT CTAAT AGAAAAGATGCACT T ACGAAT AGAC TTTTGT 
TATAATATAAAATGTGCTTCTTGAATTAACAGCGTTTTTCATATTAGGACTTAGCA 
AATATTTGTTGAATGAATGAATGTTAATAAATCATAGTTTGTATTTAGCTTTATTCTGTTAATTAGAT 
TTTTTTCTGCAATAACTTCAGAGATAAATGAATCAAAACTT^ 

6 0 AGAT CT ATCG AGT ATTCATTC^AAGTCACTTATTTAT AAACCACT AAG AT TAT AAACCTCT ATAGGC T ATAGCTTACT CT AACATG 

TTGO^GTATACACTTGAAAAATAATTTTTTAATAAA^ 
AGGTCATGGAATTCTTATTAAAATTTTTTGTTATAAATAATTTATTTTT 

AAACTTTCTACAAATCAGAGATAATCACTGTTAATGTTTTATT AT ACTTT CTTG CAG CCTT TTTTCTTCiT XT ACATTTGTAT ATA 
TTTTGTTTTCAAGAAAATTGAGAGCATTTTGTACGCTCCTTTTCTGATCTTCT 

6 5 CTGT ATTGTTTTCTC CAGAAGTGTAGTGT CAAAGT CAAAT AAAAT AGAGAC AAATCT CTG AAATT AAAACATTTTATC TGAGAAG A 

AAAAAGACCAGCCTGGCGAACATGGTGAAACCTCGT(TTGTACTAAAAATACAAAAATTAGCC^ 

TCTCAGCTACTTGGGAGGCTC^GGCGGGAGAATGGCTTGAGCCTGG 

TCCAGCCTGGGTGACTTAGCAAGACTCTGTCTCAAAAAAAAAAA 

ACAG ACTG AG CGGGCTTTGGT ATGTCTGAAG AAAAGAG AAGGTTAGGTTT T ATGAGAAAGAAAAATG G CTG TTTG AG 

70 AAAGTTCATTGACACTAGTAAGGTTCTGGAGGGCTGACAAGTTTTACrTAATCTCATG 
GTTAATAGTTTCATGAGTCCGTCAGTTTCTTTATTAGAGTTGTGGAAATTTTTATTTAG 
CT AT ATTTAAGAGTAC TTGTTGAAGT CTITTTCCATGAATCTG ATTGTGCATG 

AAGCTTAGAACAG CCAT GGTT AAAAATCTG ATGAAAG TTT ACGATTGATAAG G AATTTTGT TAT T TCT ATTAT AG AT AGCATTTT A 
GGATACAATAATCAGAATTAAGACTGAGTTATGAGTTATACAATTTTTATGAGT^ 

7 5 ATACCC CTAAATG CAACT AAAATAAGAT CT AGTATCACTTAC CATTT AACAG TTTC TTATAT AATTT ACTAAATGAGCCT GATCAT 
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TTCATATCTCTGTAAGATGAGAGAT AT AGATCTTTTGAGGCTTTCCAGGGG CTGAACTGGAAAATC CCAAAGTTAATTTAATTTAA 

TTTTATTTTTTTQJVSACGQAGTCTCGCTCTGTCA 
CAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGG 

TTAATAGAGATGGAGTTTCACCATATTGGCCAG^ c CACCTGCCTCAGCCTCCCAAAG 

5 TGC CAGGAT T ACAGGTGTGAGCCACTGCAGC CAGCC CCAAAGT T AATTTT AG^ CTGGTCACAGTGG 

CITAAGCCTGTAATCCCACTGCTTTGGGAG^ 

TTGTTTCTATAAAAATAAAAAAC^UU\AAAAGACTTAATTGAGGATT^ 

AAC^TCTGATCAAAACAGAATCACACAGGTCACTAT AAAATAATGGT CATTTGG CTGTGCG CAGTCGCTCATGCXTTG T AATCC CAC 
ACTTGGGGAGGCCXiAGGCAGGTGGATCACCTGAGGTCAGGTGTT^ 
10 ' AAAATACAAAAAGTTAGCCGGGTOTGGTGGTGGGTGCCTGTAA 

TGGGAGGCAGAGGTTGCAGTGAGCCAAGATCGCGCCACTGCACT^ 

AAATAATAATAAT AATAATAATAAT AATGATAAT ATTCATTCAG CCAGTGATAAAAG ATTT CCAAAGTAAT ACAGAAAGTTACATG 

GATGTAAAAACGTTAGTTTCCTTAAAGTTTAGTTTTCCTAAGTTAT 

AATGTGGAAGTTTTGTGTTCTTTTTTTT^ 
15 TATACATGTGCCATGGTGGTTTGCTGCATCCATCAACCTGTCATOT 

CCCACCTACCAATAGGCCCCAGTGTATGATGTTCCCCTCCCTGTGTTCTCATTGTTCAACT 

TGTTTGGTTTTCTGTTCCTGTGTTAGTTTGTTGAGAGTG^ 

TTTTTTTTGGCTCCATAGTATTCCATGGTGTATATGTGCC^ 

CCAAGTCTTTACTATTGTGAATAGTGCTGCAGTAGAC!ATACGTC 
2 0 ATATACGCGATAATGGGATTGCTGGGTCAGATGGTATTTCTGGTTCTAATCCTTGAGGAATCA 

GAACTAATTTACACTCCCACCAACAGTGTAAAAGCATTCCTATTTCTC 

ATGAT CGCCATT CTAACTGGCATGAG AGGG TATC TCATTGTGGTTTTG ATTTGCAT TT CT CTAATG ACCAGTGATGATGAG CTTTT 

TTTCATATATTTGTTGGCCGCATAAATGTCTTCTT^ 

TTTTTTTCrTGTAAATTTGTTTO 

2 5 CATTCTGTAGGTTACCTGTTCACTCTG 

ATTGGCTTTTGTTGCCATTGCTTTTGGTGTTTTAGTCTTGAAGTCTT^ 
TGTTTTTTAAACAGGCTACCAAAAAGGTGAAGAAAAACCTTCTGTAGTATGAT^ 
CTGGAAATCAAACCGGATGAAAAGAGTACTTGAATTAATCA.GACAGAG^ 
TACGTCAAGAAAAGCCAAGATACAGTAGAGGAAAACATAGCTTTTT^ 

3 0 GTCGAG TT AGAAT AGG AGAAAAAAAGTT ATAGGAGCAG ATG AAAAAG TTGG AGAG AGT CACCTT CT AGCCAAGCAAAAAG AT ACGC 

TTTTTCAAGTGGAGAAAGAACAGAAGGGAATAATATATGACTTGCAATC^ 
GTATACATTTGAGAAGTTTCAAAAAGAAACAGATTTCAGAATT7UVGTATCAAAACCT 
CTTTAAGAAAAGCCCTGTTCTACAATAGGAGACCAAATTTTATAGTTTTG 
GGAAGACCACATGTCATTTTATTTTTTTATTTTTCAATTTTTATT^ 

3 5 GCTGGAGTGCAGTGGCGCAATCTCGACTCACTGCAGGCTCCXjCCTCCAGGGTT 

CTGGGACTACAGGCATGTGCCGCTACGCCCGGCTAATTTTTTGTATTTTTAGTAGAGACTGTGTT^ 
TCACGATOTCCTGACCTTGTGATCTGCCTGCCTGGGCCTCCCAAAGTGCTGGGATTATAGGTGTGAGCC^ 

ATGT CATTTT ATTTTAATT ATAG C CAGCT TG AT CACAT ACAAAATTC CATTTATAAATT CCGTTTTCACAAAC C CTATTACAAT TT 
CACTCAGACCATTAATGACATGCTTGGACTTTCTGCTTTGTCCTGTATTTC^ 

4 0 TTTG CCACATAAGATTC TTTGTCATACAAAATT ATTCT ATTTCTC AAT AAAGGAC^ C CT ATCTTGTAATT AAAAAA 

AAl'TlTT'lT AAGAGACAG AGT CTTG CTCTGTCACCCAG ACTGGAGTG CAG TGGTGCAATCATAGCTCACT ATAAC CCTGAACTCC T 
GGGCTCAAGCAGTCCTCCTGCCTC^GCCACCCCAATTAGCTAGGACTACAGGTGTGTGCTACC^ 
TTAATTTTTTTAGAGTTGGGTCTCACrGTGTTGTTC!AGGCTGGC 
CAAAGTGCTAGGATTACAGGCATGTGCTACCACAGCCAGCITGTATC^ 

4 5 AAGTCACAAGAACTGAAAAAACATTTGGGTTAATTACTACATATTTTATATG^ 

TTTTTAGGGATTTCTGGCTGACTATATGAGATTTTATTATGTAGAC^ 
CTACACAGATATACATACTTATAAATAAATT(TTTATGGCTTTTGTTTTAGAGTTTTGGTCA 
TATAAAAAACAGTAGGATCCAGATTATCTTTCTGACAAAATGGAATAAGGCCAAACTTAATGTTCTTTTTTA 
ATAAGGGTTGTGAACTAAATTTTGAGTAAAGTAGTTTGAGTCAGTTACCAATAAACTGGAT^ 
50 AATATGACTGAATTATCTTCTAACAAGGTCrGTTCAGTATTTTATTGGTCTCAATTCT^ 
TCCTTTTACTGTGGGCAGGCTATCTTTACATGGGAATTTCATTTTTGCAAAG 
TATCTIXKTGGTTTTAAGTGTTTTACTTAAATGGTCAGTATGCCIAGAATAGT^ 
AGAGGAGGCAGAGCGGGAGGGGAAAGCTCTAGAGAGGAAAGTGATTCAGCTAGCTTTC 
TGAGTCCTGAATCCTTTTTTTTAGCCTCAAAATATCAATGAGATACGCATATCT 

5 5 TTTTAGTTTAAGGAAGAAC^GACTGTTTGAATGATC 

CATGCCTTTGAATTTTGAGAGCCCATTTTATTGTGGGTGCTCTGGTTTAATCCCAAATACACAA 

GAATTTG CAAAGG CCAAAT AAACGCAAGCACAGATG GGAAAATAAAG T ACCCACTT AC CAAGAAGGACGAT AAGAAGCCTTCT CCA 
ACTGAAAGAAAAAAGAGAGATCC CAAGGC CTCAACC CAAAGGG AAAAATCC CTT C C CTG AG AC C C CACAG CCAG GGT CT ACAAGG A 
GGAAAAGAGTCTTCCTTAATAGGAGATCCCTCAGCCAGATCAGACGGGAAGGGGAAAGAC^ 

6 0 AC!AGAACTCAACGAAAATCGGGAGTTCAGTCCAAG 

GGGTGACTTTCAGAGGTCCCCAGTGTGGCTACCTTGTACTATAGTTCCAATGGCT 

ACT CATACT TGACCAAGT ATGTC AAAGT CAAATG AAACACAGAG ATG AAT CT CTGAAATT AAG ATGT TT T ATTTGTGAAGAAAGAA 
T TG CAATTT AG AG CATG CATGCAG ACAGAATGGT ATGT C CG AAGAACAAAGAGAAGGTTAG AGGTT TT ATGAAAAAG CGAAATGTT 
ATGTGTTGCTCTTTGAGAAAGTTC!ATTGGCACTAGTAA 

6 5 AAAATTAGCCTTAGAATTTCAGCAGATCATTTCAGTAGCCATTAGATAAAACTGGTTTC 

ATGCTTGCAGAJGAATTACATTTTTGGGTCAATGTTATATGTCCTGAGTGCTTCCCC 

ATGACAGG AAT AACC CAGTT CAT ATGAT CAACTTT CACAGTAGTT AATGAT ATAAAT AC CTTAATTT ATTTCACCATT CCT CT ACT 

GTTGGAAAAATAGGTTTATAGCAGGTCATGCTATTGCATACATTTTCCCTCTGTCITCCT 

GATAGATGGGATTGGGGTTGCAGGGAGAAGCTTTGATCTCAGGGAAGGAAGGCC^ 

7 0 GGTCTGAACTACTCAAGCCAATCAATTTCCTGTCACTAATGATTGGTCT 

ATAAAGGGAACTCTGCTGGGTTTTTGTTTGTTTGTTTGCT 
CTTTTGGACCCCCATTCTACCATTTCACTTCATATGTGGATGTGTGTACT 
ACTAAGGAGGAAAAGCAGACCTGTGAATCGCTGATCACAGAG^ACAGAGGGCCT 
CCAAACCTGGACATCTCATCTCCAGAGTTGATGTG l " ! ^ ^ ^ 
7 5 CAGGCTGGAGTACAGTGGCACCATCTCGGCTCACTGC^CCTCTGCCTCCCAGGTT^ 
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TAGCTGGGATTACAGteCACCCACCACCACGCCT 
AGTCTCAAACTCCTGJVCCTCAGGAGATCTG^ 
CAGACTTTGTGTTTTATGAGGTATTTAAATATCTTACAJCA^ 
TGTCTTCCAATTATTTCCACTCTGCACATTGCTAAAGTTAA^ 
5 AGATTCTCACACATGAAGTAACTGATCAAAGAGTAGAACATTTT^ 

TTATCTTTG AGACTTTTGGTGTT C CCT CTTCATTTGTAC CTATT ACAAAATG CTGGCTGGGCGCCCTGG CTCIACAC CGGTAATCCT 
AGCACTTTGGGAGGCCAAGGAGGGTGGOT 

CTAAAAATA(MCAAATTAACCAGGCGTGGTGGCGTGTGCTTGTAGTC 

ACCCTGGTGG CAGAGGTTGCAGTGAG CCTAGATTATGCCACTGCATTCTGG C CTAGGTGACTGAGTGAGACTCTGTCTAAAGAG AA 
1 0 AAAAAAAATAAAAACAAAG TG CTTCCTCTTC CT CTGTGAG AG AAGAAGCTAAATTTTAGT'rr CTTT AATTGATTTTGTGAGAAAAG 

AAAAGATTAAAGGG AACATTTT CAGCAAAGAT ATACTGT ATTAAAAG CACAG AAG C TGCCAAAGAGAAAAATGTT AT CAGATGAGC 
AGGTGATATAAATGTTATTAGATCACCAATTGATGAACAGGTA 

T CACACAGAAGTAAAAAGTrAATTGAGTGGTTGAACAGATCT CTTGT AATTGAATTATACAAGACAATTGT A 

CAGTTG CATCCTGATGTATAAAAT AAGGAAAT AAAACTTG AAATTAGAGG CCAAG AAG CACATCAC CAAATGACATTAT ACT ATGG 
1 5 GTGGATTGTG CCTTT ATAT AT CT CGTTACCTCAAAAAGGTGAGTTAGAAG GCACAAAT GT ATAC CCAGAT AAATCAAAAGAT ATAT 

TGTGTTCTTGGTTGGGAAAGGTAGCTATAGAAAAGAGAA^ 

TATTTTGTGAAAATTGTTATCTTGGTAATGAAATTTAACCCCT 

GTTTAAGCTTTATGATTATTTTGCAATGCCTGTTTC^ 

TTTAGATGCACTCATAATTACGTACAATGATGTTTGAATTGTTTGT 
2 0 CTTCAGGGATACAAAATACCCTCCAATGTGTAGGAGACACCrrAAA 

TGTTAGATTCTGGGGATCTAACAGGGAGATGCACAAGCAAAGTCCTTTACTC 

GTTT ACCAT CCATGAGAAC CAAATCAGAGG AAGGAG CTAAAG CGG G ACAGGAG CAGCT AACTT AGATGGAGTG AT CAGGGCAGGTT 
CTTTTGAAGAGGCAATTGTTGAACAGAATGACCTGTGGGAAC^ 

AACCAGTGT AAAAG CTT ACAGGCAG AAG CAAGTCT AGGGCAGT CCATGT ATAGT AGTG AGG CTATCGAGG CTAGACAG AAT AATGA 

2 5 GTAAGGGAGAGATGGAAGAGGTGCAGAGAGTGG<3GCAGATAGAT<aATC 

TT CTGAATG TGCTAGG AAGTTATTGG AGGAATTTTTTTTTTTTGA^ G AGTGACGTATTCAGATTT ATGTTTTAAAACAATT 
ACTTTGGCTGTTAGAAGGTCAGTTGACTTTTGGAGCCCIAAAAGCAGGTGCA 
TAAGAGATGTCAGTAATTAGATTAGGC^TACTGAGAAACATTTGGATTCM 
AGTCTCAGTTGAATTGATCGAGCTCCTCAGGTCATGGCACATTT 

3 0 TTTGTG CTT ATCIX3TGATCTCTGAGTTCCTT ACT CC CTCT CCCCAGTAT ACACATGCTT CT ACTGTATTT AAT ATATGATTCAGAA 

TA(^TCTAATGTATATATGAATGAATGATTCTCAGATTGTCAACAGTGCCT 
AAGCAAAGTCCTTTACTGTATTTTCTAGTGAGAGGTACA 
ATATATAATATATAATAAGTATATATTACATAT^ 
TTATATATTATATATAAGTATATATTATATATTATATATATAA^ 
35 TAGTAAGTATATATTATATACATACATAACAAATAAATATA 
GCTt^AGTGCAATGGCACGATCrCAGCTCACTGCAACCTCT 

CTGGG AT TAGAGG CG CATGCT ACCACG CC CAGCAATTTl"ri M rri'GT ATTTTT AGT AG AG ACG AGGTTTCACCATGTTAG CCAGGGT 
GGTCTGGATCTCCTGACCTCGTGATCTACCCACCTTGGCCTCCCAAAGTGCTGGGATTA™ 

T ACATCT AAT ATATT AATTTTTTAATGTAT AG AG CT TTTACAG AT AAAATATT TT AATTTT ACATAAATGAT ATGG TG CTGTCCAG 
40 TGGTCTTGAAATGTTAACATACACCATTATTACCTGGAAAGCT^ 

ATCAGGCGTGGGGTGGAACCTGGCAATCTGTATTCTGTCGGGTTCCCAGGTGATCCT 
ATACTTTGAGAACCACTGTTGTAGTGCTATTTATACTGTTGACTTTTTTCACTTACCA 
ATACTTTAAGTTCTAGTGTACATGTGCACAACGTGCAGGTTTGTTACATATGTATA 
AACTCGTCATTTACATTAGGTATATCTCCTAATGCTATGCCT 

4 5 CCCTTCCTGTGTCCAAGTGTTCTCATTGTTCAATTCCCACCTATG AGTGAGAACATGCGGTGGTTGGTTTTTTTGT CCTTGCGATA 

GTTTGCTGAGAATGATGGTTTCCAGCTTCATCCATGT^^ 
ATGGTGTATATGTGCCACATTTTCTTAATCCAGTCTATCATTGATGGACAT^ 
TGCCGCAATAAACATATGTGTGCATGTGTCTTTATAGCAGCATGACTTATAATCCTTT 
ATATCCAGGTTACTACTATACGTGCCTCTAGTTGATTCCTTTCTGCCT 
50 AGTTTTCTCTT1TGCTTGGTGGACACCTAGTTTACCTCCAGTTTCTTTCTAACATGAACACTGCT 
ATCTACCATGATTTAATGTCTCTAGGTTTTATGAACAAAATCGGATTG 
CATTGGTTGCa^GATGGCTCTTCAGAATGGCTCTCCA 

CCTC CTTC CAGAAT ATG ATTTT ATCT CAT ATTAT AAT ACTTG C CAATATAATGGG T AT AAAGT AG T AT TCT ACTG TTTT AATT AAT 

GTTTCTCTAATTACTGATGAAATTAAGCATCTCATATGCCAGCCATTCAGCTCT 
55 TTTCAGAGATAGGTTAAAQATATACAAGCATATTTGCAAATCTAATGAGAAAATAGTGTACT^ . 

TGCTGCTTTATCTGAGTCCCACAGATTTGGACATGTATTGCTTTC^ 

CCTAAAATAATGAAGTAGTATGTTTTTTCTTTCCAGTCATATG 

ATGTAATTTGATTAAAGAAAATGATTGGAATAATACTCATTCT 

TTTTGTGATTCTCCCATATATTCTTGGAAAGAACGTATATT 
6 0 GCTTGAACTATCAGTTACTTTCAGCATTGTGTCAA 

TTCTGAGAAATGGGTGCCGATTTTGCTGTTGTGAAATCATCATAGAGTGTACTTAC^ 

CCTAGGCTATATGGTATAGCCTGTTGCTCCCAGGCTACAAACCTGTAC^CATGTTATTGTGCTGAATACT 

AC!AAGGGTAAGTATTGTGTGTCTAAACATATCTAGATGTAGAAAAGATACAGTAAAAA 

TAAAGTACTITATCATGAATGGATTGTACAjSGACTC 

6 5 ACATTGCTGTATACTACCAGATGCTTTATAAACACTGT^ 

TAATAAATTAGCCTTAGCTTATTGTAACTTTTTACTTTATAAGATTTTTAATTTTGAGCTG 
ACCTAATCT(^CTGGTGGTC<^CCCGAGACCCC<^GCACC^ 
GAAACAAACAGCTATGAGCCAGGAAAGACTTGCCAAACTGCAG 
AGAAGGCGGTTCATAGAACAGCGACAGCAGATGATAAAA 

7 0 ATTGAAGAGGTGAATATGTTTACAAACCAAGGAACAGTGATCCACTTTAACAG 

GCCATGCTGAGACAAAGCAGCTCATGGAAATGCTACCCAGCAT^ 

CT<K3CTGAAGCTCTGCCCAAACAGTCTGTGAATGGAAAAGCA 

GGAGAATTTTGATGAGGCTTCCAAGATGGAGAATTTTGATGAGGCTTCCAAGAATC 

ATAAAACTTGAAGAAGTT ATTGG G AGCTGCT ATTTTATATT ATGACTGCTTTTT AAGAAATTTT TGTTT ATGG ATC^ C 
7 5 T AGATCTCT AATATTTTT AAGCC CAAGCTCCTTGG ACACTGCAG CT CTTTT CAGTT TTTG CTT AT ATACAATTCATTCTTTG CAGC 
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T AATT AAG CTGAAG AATCCTGGGAAT CAAGTTTGAAAjCAAAGGTT AAT AAAGTTCTTTGCCTAGT ATAC CAAAAAAAAAAAAAGAT 
TTTTAATTTTTTTAACTTTTTGACTCTT^ 
ATGCAGCTGTACAAAAATA Tl^LTlT CCrre 
AAGAATGAAGACACACACftCACACAT^^ 
5 CCTCCACATCTTtn , CCCACTGGAAGGCCTT 

ATCCCTCCTGAAGGACCTGCCTGCGGATGTTTTACAATGAG 
TAAAAGTATGGTATAGTAAATACATAAACCAGTACCATTTAT^ 
TTTTATATGACTGGCAjGTGCAGTAGGTTTTTTTACATC^ 
GGCTACATCACTAGGGTGTAGGAATTTTTCAGTT 
1 0 GCAGC^TTATGTGGTGCATGACTATATTGATTTATCTGTTTCT 

AGAOVTACCCAAAATTGGGAACAAAAAGAGGTTTA^ CTGGGG AGGCCTCAGAATCATGG CAGG 
AGACAAAAGGC^WVAGGTACTTCTT ACATGG CGGTGG CAAGAATG AGG AAGAAGCAAAAGTGGAAAC CCCTGATAAAC CCAT CAG A 
TCTCATGAGACTTTTTCACTATTACAAGAATAGC^ 

TCTGGTAGCTACAATTCAAGTTGAGATTTGGCTGGGGACACTGCCAAACC^^ 
1 5 TGTGAAAT AT ATACAGAATATTTCATAAGTGGTGTATACAA CGTTCTGTATTTGTGGATATGC CTAT 

GTCTCCTITGCTAAAATTGATTTGTAAACTCA^ 
CTTTGAGTCTCCCAATGCACATGTTCCCACCT 

AAGTATTCTTTTCATAGTCTATTTAGTGCCTGAATGCCTGCTATAGGTAACCAATCT 
TTTTTATGTACAAAGTAGATACATACATATGAATTTCATATATCT 

2 0 ACTTTTTATTTTTTTT AAG AG ACAAG AGTCT CACTCCGTCACCCAGG TTGGAGTG CAGTGGCACG AT CATG AT CAT AG CT CACTAC 

AGCCTCAAACTCCTCGCCTCAAGGGATCCTCCTGCCTCAGCCTCCTGAATAGCTGG 
TAATTTTTAAATGTTTTGTAGAGATGGGGTCTTGCTGT^ 
TTGGCCTCCTAAAGCACTGGGATTGCATTTGTGAGCCATT^ 
AATCACTCTAAATCAGTTTATAGAGATCTTTCTCATCCTTGTGC^ 
25 ATGGTCATTTAGGTTGTTCCTAATATTTTACTATTACACACATCCT 

ATCTTCAGGGTAGATTCCTAGAAGTGGGTCAAAAGTTAAATGAATGTGTAGTTTTGTCAGTTTC 

CAAATTAT ATTATTATTATT ATTATT ATTGTTATTATTTTTGGG ATGGAG TTT CTCT CCAGG CTGGAGTG CAATGGCGCTATCTCG 
GCTTACTTCAACCTCTACCTCCCAGGTTCAAGCGATTCTTCTGCCTCAGCCTCC^ 
GGCCTAGCAAATGTTTGTATTTTTAGTAGAG^ 
• 3 0 TGCCTGCCTCTGCCTCCCAAAGTGCTGGGATTGCAGGCGTGAGCCACCATGCCTGGCCTAT^ 
CCCATCTTTTATTTTATTGCCTCTGGATTT^ 
AAGAAGAACTCACTCATGTTTTCTTCTAGAACTTGTC 

TGCTTTGTGTGAGATGTGG AT CT AACTTT ATG TlTi"l M IT'iTTTAAATGG CAAC CACTTGTCC CAGAGC CATTTATTT AAAAGTTCA 
TCATTATTACTGTGATTTGAGATGCCACCTTTATTATTTACCAAATTTTTGTA^ 

3 5 CCACTGGTATGTTTATCTACATAAGGACC^AGTACCA 

AGTCTG CCCT CAT AGTGT CT CTTTTTAGTGT TTTC CTGGTT A'l' T EJTTT LTTG CATGTTCATTTTT CCATG TT AATT TTAGTGT CAA 
CTTGTCTAACTCCATAAAATAGCTTGTTTGTATTrrTCTTGGAATTT 
TTTTTTTTTTTTTTTTTTTTTTTTTTTGAGACAGAGTCT 
GCAACCTCTGCCTCCCAGGTTCCAGTGATTCTTTTGCCTCAGC 
40 CTAATTTTTGTATTTTTAGTGAGAOSGGGTTTCACCATATTGGCCAGGCrrG 
TCGGCCTCCCCAAGTGCTAGTATTACAGGTGTAAGCCA^ 
TAAGAACAAG^GATGTCTTTCCATTATTTCATGTCT 
CACATTTCTTGTTAAATTTATTCCTGTGTCTTTAGGCCTCT^ 
GATTTTTTCTGTATAAAAGGTCTATTTTTGTTTGTTTGTm 

4 5 ATGATCTTGGCTTATGGCAACCTCCGCTTCCCGGGTTCAAGCGATTCT 

GTGCCACCACACCCAGCTAATTTTGTATTTTTAGTAGACGCGGGGTTTCT 

OSGTGATCTGCCCGCCrTGGCCTCCCAAAGTGCTGAGATTACAGGTATGAGCCACTGCACCrGGC 

GATGGAGTCTCCCTATGTTGCCCAGGTTAGCCTTGAACTCCTAGGCTCAAGTGAGCC 

CCATATGTGTGCCGCTGTGCCCCAGGGCTATTGATTTT^ 

5 0 GAGTTAGTTTTATCATTGATTTTGGGGGGTTATATCTGCAAA 

TTGAGGTGTAATCCCATTGGCTGATACCCTTAC^CCAGTTTTGAACAGTAG 
TTAGTGAAAATGCCGTTAGTGTTTTGGAATTAAGTAAAATACAGGTTTT^ 

TAG ATATTT AGATAG AACATGGG AAC AGAGATTAGAG AGAGAT TT CT ATGAAG GCTTGT CAAATT CT AGGG CAATAAGTATATCAC 
ATTCAAAAG GTTTCATTTTT CAATTTGGG AGTAAC AATGAGATATGC CATT ACAG C^AAGTCATTTGAATT ATATAAATTGTGTTT A 

5 5 TGCATTATAATGGTCACAAAGAAAATTTTATGCT 

C C CAAAACATTGTTTCTGCTT CGGGGAAAT CATG AAT G CAGGCAT CTT AC AG ACT ATTT CACCTT CAAACAGG AATGT AAGT ATAA 
TCACTCCTCTAGAACCTTTTTAGTTACCCTTTAACCTGTCCGAAGTAAATCAAACAGA 
TGCTTCCTGTCCTGTTCTTATTTTTTTCACCTGCAAAGAACTAAGA^ 
GTTCCCCCTTGTTTATAAAATAAATAATTCGGGATATTTGGAAAGC^^ 

6 0 TTGCAGT GAGCCAAG ATCACG C CACTCT ACT C CAGCCTGGGCG ACAAAGCAAG AT TCCAT CT CAAAAAAT AAAAT AAAAT AAAAGT 

GTCCATAATCATGCCACTTAGAGGTAATCACTGTTATCACAC^^ 

GTTCAGGTATACATATATCTGTATATATTTTACAAATAAAATTGGGCTCATATC 

TAGTAGCATTTTTTCCATATCATTAAGTATTTTTGACAACATGGTTTGTAAGTGTT^ 

CG AACACACAAT CATAT ATGT AGC CAATTT AG TAACAATCAT AGGATATTTAGG TTGTTTCT AAC TGTG ACT ACTGCTGTAATAGT 

6 5 AAAAATAAGTTTAATATAAAAGTAGAATTAT^GGCCGG 

GGTGGAAC^CAAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGATGAAACCCCT 
GTGTGGTCGTGCGTGCCTCTAATCCCAGCTACTC^GGACt3CTGAGGCAGGAGAATC 
AGATCCTGCCACTGTACTCCAGCCTGC^CAACAGAGCGAGACTCCGACTCAAAAAAAAAA 
ACAGGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACITTGGGAGGCCGAGGCG 
70 C^TCCTGGCTAACAAGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCG^ 

TACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAAG CGGAGCTTG CAGTGAGCCGAGATTGCGCCACTGCAGTCCGCAG 
T CCGG C CTGGG CAACAG AG CAAGAC TC CGTCTCAAAAAAAAAAAG AAAAAAAAAAGAAAGTCAAAACATT ACTAG CTG ATTTTAAA 
AAACAGATAACATGGTGGGAGGCTGAGGCTAGACGATCCCTTCAGCCCAGGAGTTCTACACCAGCCT 

CAT C TCAAAAAAACAAACTCGT AGATTT C CTTGACTAG CATCTAG TACTCATAAT TGG CTG CT CACTAACAAACAGT ATGATGCTG 

7 5 ACTATGTTGGGGATGTTTGGGAGATCCTGCCCCAGGGCATTTGCATTTTTAGTTCT 
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ACATACATTCTATTTCATTCATGTCATTTATTCCGAGATTTTCT 

TCATAC CCTGTTT CTTTT CATAGTGTTT ATCAAC ACCTGAGATTTATGTGTATT TATTTGTTGACTGTCTTTCTCTGCCACCAG AA 
TGTAAGCTCCATGCTAGCAGGGGTGTGTTTTGTTC^^ 
CTTAGTAAATATTGGTTGAGTTCATTTGAATGAATGAGCATC^ 
5 GGTTAGGGGGCAGGCACATTTTAAAGACTTTTGGCATACT 
AAAATTTTTTATTTTTGTGATGAAAGTACTGAAATA^ 

T AAAATAAGGAAGCT AAACT AAAT ATCTTGGGTCCCATCTGGCTCTAAA^ C CTGAATATATAAATAT 

ACTT T AGTGTTCAT ATGGTTGATATTTTG ATCA CTG CAGATTATTATCTTAGAAAAAT A 

TATTATAAAATATTTAAACTTTTATTC CAAAAAT AATTTT CAAAGTTAAAAAAACTTTTACATTTGTTTTCAG AG ACT CAAAGTTT 
1 0 ATTAATCTGCCACAGATCCCTTTACTGATAGGTAGATTC 

CTTCTTACATTGGAGATAGTAGTTGTAATGAAAACATCCCTCCTAGTAGAGC CTCAAAGAAAAGGT CTTTGTTATTAAAAGAAATT 

TTAG AGGATAATCACTTCTT AAAAAGTCCCTCTGT CAAAAGTCAGCTGCAAG AGGG CAAGAT ACAAAT CCTT CCAG CCCTTGG TG A 

CITACTGGAATAAACACTTTAAATCAGCAGCTAATTAC 

TGAGCATGGTGGCTCATGCCTGTAATCCCAGTACTTTTGGAGGCTCA 
1 5 CTGGT CAACATGTTGAAAT CCCATCTCT ACAAAATAAAACACAAAAAATTAG CTGTGCATAG AGG CACTT ACCTATAGT CCCAGCT 

ACTCAGGAGGCTGATTGAGGTGGGAGGATCACCTGAGCTCAGGAGTTTGTC 

CTGGGAGACATAGTGAGACCCTGTCTCAACAACAACAACAACAACAACAAAGATOT 

TTTATAAATTTTAAGTGCATGTCCCCTTGATCTAGCAATT^ 

TG CAT AAAAAATT AT CACAAATTT AGTGGCTAAAAAG AACACATTT ATT ATCTCATA^ CCAGG CCACA 

2 0 GTGTGGCTAGATTCTCn'GTATAGGATCTCACTGAAATCAAGGTG 

CCMGGTCACTTGTTGGCAGTCCTTGTAACTGTAGGATTAAGGTCCCTGTT 

TTCCAGATACTCCITGCAGTTCGGTAACATGTCGCCACC^ 

AATGCATCTCTCTGACCTTCTCTTCTGTGATCAGCTAGCTGGAAA 

G CATGGTGG CTCATGC CTGTAATTTCS^CACTTTG GGATGCCAAAGGGG AG AGG ATCACTTGAGG CCAGGAGTTCAAGACCAGC CT 

2 5 GGGCAACAT AGTGAGAC CT C CT CTCTG CCAAAAAGAAAATT AAAAAACTAG AATT CAGT CAGCCTGCATGGTGGT ACAAGC CTGT A . 

GTCCCGGCCAGTTAAGAAGCTGAGACTGAAGGATAGATTGAGCCTGGGTGTTTGA^ 
CTCTAGCCTGGGTGACAGAGTGAGACCCTGTCCCTAAAAATAAATAAATAAGTAA^ 

CCAAGATAAT CTTTCTTTTG C CATATAATGGGACAAAAT AATGGGAGT CT CATCATACTT ACAGTTTTCAT CCACTGAAAGAGG AG 
TGGATTATACATACAAAGATGTAGGTCAATGCAGGTCATTTTAG^ 

3 o atatggatgcttaaggatgttcactgcagtgttaattgtaacaacaac^ 

GTGATGAAATAAGTGATAATATGCTTTTGAACAATG^AATG 
TGCTCATAATCCTACACTTTGGGAGGCCGAGATGGG^ 

AAGACTCTCTCTCTACIAAAAAATTTAAAAATTAGCTGAGTGTGGTGGCATGTGCCTATAG 

GGAGG ATTGCATGAGCCCAGGAATT CAAGG CTG CAGTCAGCTATG ATCACCACACT CCAGCCTG GGTGAC 

3 5 TCCIAAAAACAAAAGACAGGCAGACCAAGGTAGACCCATAATATGTACTGATATGGAGAGA 

CAGAGCATGTAAATGCTCATCTCCTTTGTAAATTA 

AGGACTTAGAGGGTTTTTTTGTTTTGTTTTTGTTTTGTTTTTG 

AACTOVrGCAATCCTCCAGCCTTAGCCTCCTAAA^ 

ATCCT CTCT AGTATTGTTTGAATT ATTTTTT ACCAC CAC T ATGT ACT ACT TTTATAATT ATTAT T ATTTTT CAAATTGGT AATTT A 

4 0 AAAATTACCCAGTTGGGGCCCAAGCATTTGAAAAGCTTTCT 

GGCTCTGGAGTCTTTAGGAAGCTTCTACTATCTGTCGTTTGG^ 
ATTATTTTTCTAAACCAAATTTTAGGGTATTCCTTAACACCTTCATTTCTTA 

AATGC CCTGG AATGTAAT ACTCG AT AAAAG CTAATTTT ATGAATT AGGAGT AGAT ACGT AGTGAAG AAAG CTGCCTATG G C TG C CT 
TTGTTTTG AATTAAGTTTCAG AGGGTTTTTTAGGTGACATT TT T CTT AATT ATT CTTTTGT CTT ATCAGATTTTTTTTATT TTT AA 

4 5 TTTTTATGGGTATATAGTAGGTGTATCTATTTATGGAATAC^TGAGATATTTTGATACAGGGATGCAAT 

GGG AT TTTTTTGACAGACT ATTTTT AGAAGATGGAACAAT AG T ATCTC CTTTTGC CCAGAATTT C TT 
ACCTTAGGTTTTTATTTGCTT'TTCAC 

TT CTTTT T TATTATAATG CTGCATCTGAAATGT AAAT AAAT CTTG G AACAAGCACACAG ATTCAAGT ATAAGT T TAAAATTTT ATT 
CTTGTATAAGGAGAGAGCATGTGTGTTGCTCTGCATGGGTGTTTTTACTGATAAGCTGATCTCTCTT 
50 ' T AG AACT AGCATCTTATCATTAATAAGGGAGGCTTAGTTT CAG ATTGTGCAGCTGT AAT CTTGTT CATG T TTTT AAGAGGGAGGTG 
G GTGAAGG CTT TTCTTGTAT (ZAATCTTAAATTG CTTT AT AATGCAG AG ATTTTTTAA^ CCTTTCTTTATGGAAAAGT AA 
TAATGTTTTTTTTTAAATTGATGTATAATAGTTGTACGTATTTTTGAGGTACATGTGATAT 
ATTATTATCAAGTCAGGGTAACTGCAGTATCC^TCGCCTCAAACATTTATTTTTTCTTTGTCT 
TTTAAGAGACAGGGTCTCATTCTGTTGTCCAGGCTTGAGTAOUVTTG 

5 5 AAGTGATTCTCTTGCCTC^GCTTCCCGAGTAGCTGGGACTGTAGGTGTGTGCCAC 

AGACTGAGTCTCGCCrrATGTTGCCAAGGCTGAACTTGAACTCCCCAAAGGATCCTTCCG 
CAGATGTGAGCTGCCATGCCTAGCCCCTAACAGTTAATTTTTGACATATCATTTGTT^ 
ACTTAGAAAGC^^GAAACATAGTGCCTGCCTCCCTCCCT^ 
TGCTCTGTTCCCTAGGTTGGAACACACTGGTAAAATCATTGC^ 

6 0 CAGCCT CCTG AGT AGC TGGGACTACAGGCACG AGCTCTCATG C CCAG CCT ATT TTT ATGTTTTTATGTATTT ATTT ATTTATTTT C 

TTTGAAGGAACCTCGTGTTTATATTATTCCACAAATACTCTGGGTGCA 

ATTT CTTATTT C CACG AAC CT AT 1 TT 1TT1TT T TT TT TTTTTG ATAG GG AGTCTCGCTTCGT CACC CAGGCTGG AG TGCAGTGG CA 

CAATCTCGGCTCACTGCAACCrrCCGCCTCCCAAGTTCAAGTGATTCTCCTGCTTC^ 

CACCACCTACGCCTGGCTAATTTTTATATTTTTAGT 

6 5 GTG ATC CACCTGC CT CGGC CT CCCAAAGTGCTGGG ATTACAGG CGTGAG C CAC CATG CCCGGC C CTT CCATGAAC CATTT TTTT AA 

ATGTATAGTTAAATAGCTTACTGACAAGTTCCAGAAACTCCTGTAATTTTACTAAT 

TTT AAAAAAAT AAATACTGTTTGTCATTGCTCATTG CAG CATT TTTTAG AGT AG CAAAAGATTGAT CT AATT AGAT TGTGGTATAT 
CCATAAAGCGAAATACTAGGTAAATGTT^AAAAAAATTACAAAG CTCTCCTGTAGGGGAGGC^UW^TTTT ACCTTTACTCT CAGGGT 
TTTTCAGCTGGGCCTGAGAATTAAATTGACATGAGATAAATTAATAGGAGAAGAGCATATAAATTTATTTAATATAAGTTCTACAT 

7 0 GACATAGGAGCCGCTATAAGGAAATGAAGATCCAGAGACTCAGAGTTGAACACTTACATGCTGAATCGG 

GTGAAAAAGT AACTAAATGATGTG AGG AGGCT AGAAG ATAAGAGTT AT TT G AAT TCAGT CTCAACT AT TAAT C CTTG GTG ATAACA 
ATGTTATTTTCCTTCTAGTAC!AGGGAGGGCATCTTTTCTTTTTTTTTTTT 
TTGTTATATAGGCAAACTTGTGTCATAGGGGTTTGCTATAC^ 
TTTTTTCTGATCCTCTCCCTCCTCCTACCCTCCATTTTTAAGTAAGT^ 
7 5 GAATTTCATCTCCTGCTTTTAAAAAGCAGCATGAGCO^CACTGTGGCAC^ 
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GGGAAGATCGCTTGAGCCCAAGAGTTTGAGACCATCCTGGGCATC 

AG<3CCAGAGTGATCTTCTTGTACGTGTGGTTTTTCAAG 

TGGCATATTCTTAACTCO^CTCTTCATCTGAAGA^ 

AGT ACAT ATATGCTAT CATTTATGTAAAAAC CAAAAGGATGTTTTGGAAAGAGAAGAAACATATATGTCTGGATGTGTATC 
5 AACATCTTTGTAAAGACACCXIAGGAAACTXSGTAAT^ 

TTCTAAAATTTGJWSCTATGTTGGGGCCAGGCACAGTGGCTCA^ 

C^UiGGCCAGGAGATTGAGACCAT CCTGGCGAACACAGTGAAAC CC CGTCT CTACT AAAAATAAAAAAAAATTAGCCGGGTGTGGTG 
GTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCTGGA^ 

TTGTGCCACTGCACTCCAGTCT CTCAAATAAAT AAATTAATT AATT AAT TAATTAAATTAAATAAAT AAAAAATAAAAAATTTGAG 
10 CTATGTCT CACGCCTGT AATCCCAG CACATTGGG AGG CIX1AGGTGGG AGGATCTCTTG AGGCCAGGAGTTCT AGGTTG CAGT AAGC 

T ATGATAATGC CACTGCATTCCAG C CTGGGTGACAGAATGAGAC C CTGTCTCAAACAATAAT AATAAAAAATAAATAAAGCAAATA 

AAAATTGAAACAAAT AT ATCAT CAGTG ACTAGACCAGATTGTG ATTT CTTGCAAGT AGTGCTG ACTCTCAAAATTG CTTTCAGATA 

ATTTTGTTGAAAATTTTGTTTTGTTAATG1TGTATTATTGAT A TATCAGTGTCATCCTGTGG CTGTAATGCA 

TTTTCCCTTCACTATAGAAATAACTGTTTTATGGGATATATCCTAAAATT^ 
1 5 ACCTCTGGGAATTATGTATTTGGCTATTTAGATTCrCCTATTCAAAATCTTAT^ 

TCATTAAG AG CATG AACAGGT TTAAGT CT CAGTTCTGCCAGTTACTTGCTTGCAGCTTTG GGCAGGTAAGCCTTAAT^ C CTCAGTT 

TTTAAATGAAGACAGTAATAATACCTACTTCACIAGGGTTTCTCTGAGTAT^ 

TG CCAGCATT ATTCTGCAT ATTTTACATAAATGAAT TCAT TTAATC CTCACAG CAAC CCTG AATCTACTGTTTAGGGTGTTATT AT 
T ATTATGACCAT AGATAAGGAAACCAAGG CACAG AG AGTT AAAATTTTT ATCCAAGGTCC CAT AG CT AG TATGTGGAGGGGGCTGC 
2 0 AGTTCTTCAAATCAGGCAACAAGTTCCAGACTCCATTCTCTTC^ 

CATG AGCCTTCAATACATTTAAATAAT AAAGGT AATATAGG CC^^ CAG T ACCTCATG C CTGTAATCC CAGCACTTTG 

GGAGGCTGGGGC^GGAGGATTGCTTGAGGCCAGGAATTGAAGACCAACCTGGGC^ 

TTT AAAAATTAGCT GGGTGTGGTGGTGTGCG CTTGTAGT C C CAG CTACTTAG AAGATTAGACAG GAG GACCACTTGATC CTAGGAG 
CTTGAGGTTAC AGTG AG CT ATG ATTG CACTACTG CACTCCAGCCT AGGCAACAGT AAG ATC CTGT CT CTAAAAAAATTTTTTT AAA 

2 5 AGGTGGTATAATAATAAATGTT ATTAT ATAAAGATTACAT ATTCAT CATAAAAAAGAAAG AACAG CT AAACTAAGAAAAATATAGA 

TACCATTGTCAGTCT AACC CCT CAG AAAATTAAATACTGGT AATT ATAT ATG (ZAAT TCATAT ACTTTG AAAAAATAAACATG GTGA 
TATTTCATAGCCTATTTT AT TCACAT ATT AAT ATAACATG AAL" 

AAAGGCATTTATCAATGTAAGATGTTATGGAGCATAAATTAATAAGTAATTGAAATTCCTACCT 

3 0 TGG AGGAATTC CTTACTC ACACTGG CAACAG AGGG AACTCTGGT^ CACTG ACCACAAA 

TTAAAGAGTTTTCTGfTTTGTTTTTTACTT 

AAGCAGAGTATAAGGATGT AACTGTAAG GAG AGG CAATGATG CATAAGACATGGTTCTTG TGAACTTTGAGCTCCTAATAAAGTTA 
GC!ATATGTTGTGGAATAGATTATGCrrGACTGCTACAACAGCAATACAAAATGTTCTGAA 
TAATGAACTCAAACCCTAATCACCCAGTATTGGCGTTAAAGAGAAATCTTCACACTCAGGGATGGAAGG 
3 5 TGGTCTACTGGTTTACACTTTCCTTAGGGATCTCTCT 

GACCCATCATTCTTCACTTTATTCCCCAAAACCTTGCCAGGACGAGGCrCT 

CACAAAAAGGAAATAAACACAGAAATAATTACCAACTAGATAGTCTGACTCACTCGATACA.GCT 

CACCTCATCTTCCCAGAATCCTGGAAGAGGAGAAGGAGCTCACGGTTCTGC^ 

CACACACAT C C CTG AGCACAGCT TG ACITTTTG AAAC CAAGGC CAAGGAAGAAGAAAATGTT CCTTTT TCT CTTT AAACACCCAT^ 
40 TTCCAGTTTCCC^CCTATTATTTCATGACTTTTCCTTTCCAGAGATTTGTAGAGA 

ATTCTGT AGCC CCAAGTTGACAG CCACATCAAAAT ATAAT TT AT T AAGCCAAACATTTATTTTTTATTGTT T AGAGACAGAGG CTC 

ACTCTGTTGCTCAGGCTGGAGTGCAGTGACACTATCGTAGTTCACTGCAGCCT 

AGCCTCCCAAGTAGCTTGGAGAGCAGGTGTGTGCCACCACAGCCAG^ 

GCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAGGCGGATTGC^ 

GGGAGGCTGAGGCAGAAGAATTGCAAGAACCCGGAAGGCAGAGCTGCAGTGAGCTAAGATCCCX3CT 
ACAGAGCAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAGAGGCCT 
GCAATCCCCCGACCTCAGCCTTCCAAAACATTGGGATTAC^ 
ATAGAATCAAGACATAATTTGTTTACAGACCTTAAAAATTATGAGCAAAAACAT^ 
5 0 CAGTTTCCACTTATATATCTTC!AACACCCCTAAGAGATCTTTAAGACCCAATAAAATATT 

TAGTTGTCATCCATATACTTAAAAGGATATAGGTCAGGAAGGGCAG CACTGGG CATTCCTTTTCAGTGG AGTGGTCCTCACAGGAG 

TGCATACCGCAGTCCAGCAGCTTTCCATCTCTCTTGCCCCCTCCCATCT^ 

GAAGATTAGGGAGCCATCCITGGTATAGAAGCCrrCGTGCATC 

AGCTATAGCTTCCCACCAGGTAACTGGGTAATTTTTACCACAAGCCATGTTGCCTAATA^ 

5 5 TTC CAAG CAACAG AG CTGGAAAGTTAAT C CAAGGTCACAC TTGT CTTTAAG AG G AGAGAG GTATTGTTAAC CTATTAT C CACAACT 

TTTCAAAAGTATCTTTACAAAGATAATTGTGCTTTCAAATCATCTAAAATAAATGGCTC 

ATATCCTTTTTCAAATGTGATACGTTTTAGTTCTTCCATTGACT 

GTGTATGATGCCTGTATGGAGACATTTGACTGTCTTCCTCTTGCTGCCCTCTTA 

GTCACCTG AAATTACTTCTTTAG ATGACATT AGGAAAGTAAGT AAT CTTTT ATTATT CT CACAGGGAATAT TTTTAAAATGTGTTG 

6 0 GGTTTTTTTGTTGTTGTTGTTTGCTTTGAGACAGAGTCTCTCT 

GCAACCTCCGCCTCCCTGGTTCAGCCTCCCGAGTAGCT 

AGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAACTGA 
GGATTACAGGTGTGAGCCACCGCACGCCTGGCCTAAAATTTGTT^ 

AAAT ATAGATAATGAAAAGTAAG AAAATGGAAGTT AC CTCT G ATTTCAT CACTI ATAACTACTGTTAACATTTTGATATGTAC CTAC 

6 5 CACAAATT ATAATTTTTAATGAGCTCTATGT TTTTAC CAACTT AGAAAGCT AACAT TCTTGT AGTATAGT CCTG AAACTTTGAATT A 

TATCGCAGTGCAAAAGGAAGGAAACCAGGGTTCTAATCACAGTCCTGCCATTAACT 

TAG T CACATTT CTTCTC T ATGAAATGAGG AAGTTGGAAGGGATG GT CTG AG GACAC TG CCAG CT CCG AAATGCATTGG CTTTGACA 

TTTTAGATGATAACGGTTTCATTGTTTTACTTGCCTTCCTTGGT 

ATGACCAATTTTTATTGTACATGGTTGTCTTATCTTCTCTACCT 

7 0 GATAGATACTCATCCTCTCTTACAGGTACTGAGACT 

GGTAAGAGGGTGTACCTATATGTACTGGCTTTCCTCTTGCCTTACTGTCTT^ 

TATGAATTTTATAATT CTGAAT CAATTGTATCAATAAAAAG CT CTT TCACG CAGT AGAAATT CT CTTCTTGGCATTCAGCT ATT C T 
ATTACTTGTTTTCTCTTGGTTTTTATGCT'TTCAACTAAAGCAGCA 
CAACTGCATAGACTGGAAAGGATAAATGCCCTTACAAAA.TTAGCATTCTTTA 
7 5 AT GAACAGAAC CATTGT AAAACATTT CTG^GTCATCGT AACATTGGCTCACCT AG ACAAA CAT AG 
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TCTTTCTAAAATAGTATATCTCTATAGTGGTTGAT^ 

CTTTTACTACAGTATGGTTAGATGACAGGGATGTTTAT^ 

CACTTTGGGAGGCCGAGGAGGGCGGATCACCTGAGGTCAGGAGTTC^ 

AAAAATACAAAATT AGC CGGG rGTGGTGG CACATG CCTGTAATCCCAGCTACTAGGGAGGGTGAGGCAGG^ 
5 GGGAGGCGGAGTTTGCGGTGAGCCGAGATCGTGCCATTGCACTCCAGCCTGGGCAA 
AAAATTTATTTCAGAAAAATAATAAAAATTATGTATCTAAAATTGTTA 

ACTGGCTT CT CTC CTAATACATTCTTGTGGGT^ TTii CTCTCCT 

GATTTTriTCTlTTGGCTTTTGTGTGTCTTCTAGTTAGACAG 

GGTCTGATCCCTCAGAGGATTATGGCAATGAGAAGACCTTGGAGCACT 
1 0 AGG T AAGCTAGTCCTTGAGGTCGAAAATTATGAAAGGAAACTGTAATT CATTTTAT CAG ATGATTTTTCAGCATTT TATATTTCAA 

ATCT ATGTAGTAT AAGCACT CCTGTTTAATTTTTCGATTAAT AGGAGG CT^GAAACTTGTTTGTTTGTTGG CTTTTTAT ATTTTCT 

TAGGTAT AT ATCCT AAG AAGTGTAAT ACAC CAAACATGATTGG ACAT C CTCCAGTG CTTCTG ATATTTTTGT CTG ACTCTAAGTGAT 

CT AG ACTATTTGTT CCTAAACATTATCTATACACTTTTTTAAAAAAAATTAAAG CTTTCTTG GCTCrrfCC CCAGAGATI' CTAAT A 

CACTAGGTCTGAGAGGAATCCTATTGTATTTTCAA^ 
15 AGGACATTGTGAACT AAG CC CAGT ACATTT CCTGGAAGTTGG CrGTTGTTTTAAATTT CACAGGAGCTT TCATTATGATTAGACAC 

TCAAAAATAGGAAGACAGGCAGGGCGTGGTGGCTCACGCCTAT^ 

GCCAGGAGTTCGAGACCAGCCTGGCCAACATGGCAAAACCCCATCTCT 

CGCCTGTAATCCCAGCTAATTGGGAGACTTCAGGCAAGAGAATCACTTGAACCT 

G CCACT ACACAC CAACCTGTGCAACAGAGCAAGACT CTGTCTCCAAAAAAAAAAAAAAAAAAAAAAAG G AAG ACATAAGAGAAG TT 
2 0 GGAAATCTAAATTATTTTTAATCCCAGTATTTT^^ 

TAAGATAATGTGTTCATTAGAATAGACACATGCACACACACIATC 
CCTTTTTTTCTTTAAAAAAATATTTTTTAA 

ATATAGTCATACAATGTATAATAATCACATCAAGGTAAATGAGTTATCCATCACCTCAAGCA 
CCAATTATTACTCTTTTATTTATTTTATTTTT^^ 

2 5 TGCTCGGTC!ACCCAGGCTGGTGTGATCTCAGCTGACCGCATCCT 

AGTACTGGGATTAC^GGCACCTACCACCIATGCCTGGCT 

GTTTGGAACTCCTAACCTCAAGTGATCCACC^CCTCAGCCTCCCAAAGTGCTGGGATT^ 
CTTCTAGTTATTTTAAAATTTACAATTAAAITATTATTCACT 

3 0 CATTCTACTCTCATCTCCATGAGT^ 

GCCTG GCTTATTT CAG TT AACATAATG ACCTCCAGTT CCATT CATGTTGTTG CAAATGACAGGATCTCATT CTTTTCAATG GCTGA 

ATAATATTCCACrGTGTATATGTACCACATTrrClTlGTCCTTTCATCTGTTAA 

TTGTGAACAGTGCTGCAATAAACATGAAAGTGCAGATGTCT 

GAGATTGCTGAATTACATGGTAGTTCTGTTTTTAGTTTT^ 

3 5 CACCAATAGGATATGAGGGTTCCCTTTTCTCGACATCCT 

TGGGTGAGATGATATCTCATTGAAGTTTTGATTTGCATTTCTCTGATC 
CATTTTATGTCTCCTTTTAAGAAATGTCTATTCAGATATT^ 

TTGAGCTCCTTACATATTCTGGTTATTAATCTCTTCTCAGATGGATAGTAAAAACT 

4 0 GCCTCAACCTTCTGGACTCAAGCAGTCCTCTTGCCTC^ 

CTTTTTTTATTTTCTGTAGAAACAAGGTCTTGGCTGTTC 

TGGCCTCCCAAAGTGC!TGGGATTACAGGGATGAGCCCCACCCAGCCAAC!AACCATTTTTC^ 
ACTAATGTGATGAAATCATTTAACTTTTATTTCTTTTTAC^ 

AGAAATTGGTACAATTTC TG TGGACATCAACTTGG CAATATCTTT C AAAGGTAT TTGTACATGG C CTTTGATCCAGAAATTCT ACT 

4 5 " TCTAAAACTTAATCCTGTAGATATATAATTGATTCTCA^ 

TTTGTAACACCCAAATCAGTACTTACAGAGGTTTCATAATCATTCATGGGAATGT^ 
CTTCACTGACATGTTCCTAGCTGAGGTTGAACAAAATGAT 
TGAGTTTTATTTAGTGAAAGATGTTTTATGTTTTTGTG^ 
TGAAGTGCTGCTGTCTAGGGTCCCTAAGCAOkAAAAGGCTGTGGTATGTCT^ 

5 0 TGTTTAGGCATGAGTTATGGTGGTTTTGGCCATGAGTTC^TGTTAATGAATCAACAGTGTAAATGAC CTAAAGTATTATT ACACA 

GAAA(ZACATCTGAAACAAGGTTATATATTAATTGGTGGGTGAAAATGTTATGACCGGAGCCTC^ 

CCTTAGCAGTAGTGTTTCATTATTTGCTAATTCTGTTTTTGCAGCA^ 

ATTTGACTCACCCAGCACACATGTTCCCAGCTGAGGCTGAACAAAG 

TAG CCACAGT ATAT AAT AC T TTT AGTATGT TGTTGAG TTTG G CTTT CTAGTG TTTT ATGG AG GAT TT AAAAAT CT ATGTT CATCAG 
55 AGATACTCACCTGTACTTTTCTTTCTTCTGGTG 

ATTCTCT CTTCT ATTTTTTGGAAGGT TTAAGAAAGATTG GT ATT AATTCTTTTT TTT AATGATT AATAGAATT CACC CATGAAACC 

ATCTGGTCCTGTGCTTTTCTTTGTTGAGAGCrcTTTGATTACTGATTCCATCT 

TTTCTTCATGATTCAGTTTTGGCAGGTTGTATATTTCC^ 

ATTCGTAATAGTCCGTTATGATCCCTTTTATTTCTGAGGCATCTATTGTAATGTCT 

6 0 GTATCTCTTTTTTTCTTAGTCTTGCTAAGGGTTTGTCAATTTTGTTTAGTTTTTCAAAAACCAACTCT 

TCTGTTGTTTTTCTATTCTCTATTTGATTCATTTCTGCCCTAAACTTTCCTTTCTTCTGCT^ 

TTTT CT AGTTTCTTGAAGTGT AAAAT AAGGTTGTT TAATTCAG ATCTTTTTTGT '1TT"1T AAATCGG CATTTATT ACT AT AAACT TT 
C CTGTAGGT ACTG CTTTTGCT G CAT CCCATACATT TTG ATATGTAT ATTTTTGTTTTCAT T TAT C TTGAGG TTTTTTT T TTTTTAG 
ATAGGATGTTTTGCTCTGTCACCCAGGCTCAATGCAGTGG 
65 CCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGATGTACCGCCATGCCCAGCTAATTGTTTT^ 
GAGGTCTTGCTGTGTTGCTTAGGCTGGTCTCGAACTCCTGACCT^ 
ACACAATAGTTGTTCAAAAGTGTGGTGTGTAGTTTCTATGTATTTGTGAATTTTCCTC 
ATTCCATTATGGTGGGAAAAGATACTTAGGACGATTTACATCTTCTTAAGTTTGTTGAT^ 
AATATTCTGTGTACACTTGGGAAGAATTCTATTCTTCTGTTATTTG^ 

7 0 ATAGTGTTGTTCAGTTCTGTTGTTTGCTTATTGATTTCTGACTGGATGATTTATCCATTACAGAGAATGATG 

GCTATTATTATTATTATTTTGAGATGGAGTCTCGCTCTGTCGCCCM 
CCGCTTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCAAGTAG 
TTGTATTTTTAGTAGAGACGGAGTTTC^CCATGTTGGCCAGGCTGGTTTCGAACTCCT 



7 5 CTGTTGTTGCTTTATATATTTAGGTGCTTTGATGTTGGGCACATATATATTTAT 
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TTATCATTATATAATGACTTTTGTTCCCTCTTC 

GTTCTCTTCCGGTTACT TACTTG CATGGAAT AC CTTTCTCTACCCTTTT ACTTTCAG CTT ATGTGTG CTAAAGTGAG 
TCTTTTGTAGACA6CATGTAGTTGGGTCTTGTTTTCATCCA 
GATAGGTAAGGACTTACTATTACCATTTT 
5 GATTTTT ATTTTTGT AGTGGGGTATTTTGGTTCTTTT^ CTTTTTTGTTCAT CGGGTTTG CAT CAA 

ACATTTTATAGTTATAGCACTCTGTCTTACGCTG 
CACACACCTAATOTAATTAATTTCACXSTT^^ 
CGATCTTGGCTCACTGCAGCCTCTGCCTCCCGGGTTCAA 

CACCAC CACACCTGG CTAATTTTTTATATTTTTAGTAGAGATGG GGTTTCACCATGTTGG CCAGGCTGGTCTCTAACTCCTGGCCT 
1 0 CZAAGTGATCTGCCTTCCTCAGCITCCGAGAGTGCTGGGA 
TGAACTTCCTTTAGTTGCTCTCATAAGGCTGGTGTACT 
TTCATTCTTAAAGGGCAATTTTGATGAGTAAAGTATTCCTGGT^ 

AGGTTTTCTCTTGCTGCTTTCAGATTT^ 
1 5 GACCTTTGGGCTTCATGAATTGAAATGTCCCTTTTCCTCCCT 

TTTCTGCCCCTTTTTCTTCrCCTTAACTCCCATATGCCro 

CAT T XTTTTTTATTCT TTTTT C TTTTT GTCCCTCTAATC 

CTGCCTGAGTCTGCTGTTGAAGCTGTCTCTTGAGCTTTTCAGTTC 

TTTTTATGGTTTCTGTTTCrrTATTAAACAT 
20 CTTCTTTAAGATGATTATTTTGGATTCTTTGTCATC 

TTTCCTTTGGTGGTGTC^TTTTTGCCTGGTTCTTT 

CTTCCAGTTTTTATGGACTCGTTCGCCAAGTAAATACCTTCTCCTATTG 

TTAAGCAGGGCTGAAGCTAGATTATGGGGCTACATTA 

CATGGATGGGGAGAGCTGTCTCTGGAACTCAATAGGGTGGGACCCAAGCTGGGTCCCAG^ 

2 5 ATCAGCAGCCCTTTTACCAGAGGAATGGACAGGCATGATTTCCACAA^ 

AG CAGGG CTGG AG CCAGGTCAGAGTG CTG CT TTGAGTCTGCAGTTG GGTCTGGTTGGTGG ATCTGTTACTAGGG GCATCAACAGAC 
ATGGTTCCTT CC CAGGATGG ATGAG G CTGGTGGTAGGACTIGCAGACAAGTGGGACTGG AGCCGGGTT^ 
CTGGGTCTGTAGCTGAAAGGTCAGTGG^CTGCCTTCrCCAAGCA^ 
CTACCTCAATCCTAAGGCTCTTACAATGGCACTTATGGCCATGGATGGCTT^ 

GATTTTAAAAATAAACTAAATAATTAATGCCTAAOTAATTCCZAGGTTAATTTTGTA 
ATTCAGAGAGGTAATAGTATGAAAAGAATATTTTAGGAGTATATGAGAATATTGTATCT 
TTTACATCTGCTCCTAAAATATTCCTGCATAAGCTTGCAGTGAGCCGAGAT^ 
GACTCCGTCTCAAAAAAAAAAACAAACCTATATATATAC^TATATATGTATATATGTAT^ 

3 5 TATTACCTGTGAAGCTGTAAGTATATTCTTTTA 

TACTAC^CACAC^CGCACACAC^CACAC^^ 

GTATTGTTATGGATTAAGACAAAAAAATTAAAAGATGAGATGGAG^ 

GAT CTGTAGTG AAGT ATATTAGTG AAAT AGAG AAACAG AATG ATGAGTACT AAG CT ATT CAT AAAATTTCAGT AGTGTGTAAT ACT 
TGTAGTTTTGAAACAAAATCAGCTAATATCACCTTGACTTTTCCCCATCrTCCT 

4 0 GTTTTCTAATTTTCTTTTCCAGTTACCCTGCAGTTT 

GC CCAAG ATG CTGG GTAAGTT ACATT AAAATTG T AC CAAATAT CG CTGCAG AGTCTTTGCATTT AAT ATG CAGACAG ATGGACTTT 
CATCTCTTTITCAGGTATCGAATCTACAGGAAGAGCCAAGCCACAGGCTT^ 

CC T AGATG TCT ATAACAAT AAAGGTAAAGGAAT C CAG CAATATTTGAGTTTGAATT T ATGAGT AAACGTGAG CTCTGGTTTAATTG 
TATGTACGTATGTGTGTTTGTGTCATTTTAAAACATAAATTTTAAATAAGAAAAAAAGGGGTCTTAGTTG 

4 5 ATTTAGAATTTATGCTGCTGCTTGGTCrrTTATCTCCrAAGGATTACTGGCTTCTTTGTGAGTATA 

GATTGATACTACAGAATCAAAGTATAAGAAGAATAAATTGCTTCAGGCTCATGCCAGAATAGTTAGCATAAA 
GAATTTAATGTGTTTATAACAGTATCTCAAAGTGACCATTGGTATTTTAATCTTTC 
AGATATGGTCTAGGTAGAGTGACCTAGTGTACAAAGTCAAAAAAAAACCTTCATTTT^AG 
GTGACTTTGACTGAGTCACATTATCTTTCTAATCTTCAGTT^ 

5 0 GAGAAATTTTACTTCTAAGACTTTATCCTGTTAA/^ 

T CGAGG CAGTGGT CCT ACT AAAAACTTCAT ATTTCTGT7UVCTATTTT TATGACTAT AAATTAATAGGACTTTG 
AAAAT AAACATG C CAG AACAAACATATC CAAATGAGTATTGT CCCTT C AAAGTAGT T ACC TTGG G AGG C TACACACCTATT CCAAC 
AATGCTGTCCTTGTTCAAAAACATTTTTGGAACT CCTCTTTTGG CATTG CCTTCAG AGTCTCCG ACACATTCTTTTG ATTATCCTTC 
AATGATGGGAAATOTTCGTCCTTTGAGGGTGGATTTGATTTTTGGACAGCCAAAAGTCATTCGG 

5 5 GAGTGATCAAG C TAG ATG AT ACCATTTGGGGT CAG AAACAAGGT ATGAT T AT AAGG T AAT GAG ACTGGTT TTT CT CGTGTTG CTGA 

TACAGTGGCTCTGGAGGCACTTCCAAAAGAGGAGTTCCAAAAACGTTTTGAGTAATGGC^ 

GGCAACAAGTTGTGCTG AAAAGAACACAAAG CAAGG AGT CAGT CT AG TC CTGCTTCACTTTTAAGTGTG TCACATTT AACAAG ATA 
CTTCT CTG AGC CT CATCTTTAAAAT AT CTGT C ATATGGG GTTGTT GAAG ATAAGTTG G AATAAGGAAAATGGAT ATTAAAGTGTTG 
TTCAAAAT AT AAATT ACT ATAATAAT ACAGATTTTACTGTT ATTGT AC CT CAAAGGACCACACT CATC TG AT A 

6 0 AAAAGGTCTGGTGTGTTTGT AG CTG GTGGTTT CTT AAGTCTCATT ACTTT ATAGTT ATACCTTGr ATATGCAGTTGGGG AGATAAG 

TTT CAAAG ATGGCCAAAATAT AATT AATATTT ACT CT ATTT AAAAGT AAAC CATGGAAT GG CACAT CTCATTTAAATCCGATTTG G 
GT(^GTCATTTTAGTGTGTCCTCCAGTGACTTGTCTATTTTAA 

CATTTAACTTAGCATTCTTCTCTGAGCCTTATCTGTCCTACAATGCTCTGCTTCTCCCTAA 
GATATTTCCAGTATATTAGCAACTTAGGTTTTATT 
6 5 TCAAGGACTCCTGTGGCCCTAGATCTCCTATTTGTAATGAAGGTGTTTCTCTCT 

TCTATTGATTGCTCAATGGATTTATCAGTGTGCCAAGCTGAAACACGCCACCTAAGTCAGC^ 

TAGCTTGTTTTTTGTCTCAATCTAACCTGGTTTTGTTCTTGAT^ 

CAGTGGCTC^TGCCTGTAATCTGAGCACTTTGGGAG^ 

CAAAATGTGACCCCATCTTTACAAAGAAAAATTTAAAAGTTAGCCAGGCAA^ 
70 GCTGAGGGAGTAGGATCCTTTGAGTCCAGTAGTTCGAGGCTC 

GCAAGACCCTGTCTAAAAAATAAAAAAAAAAGTAAAAGCCACAGTTCATTCTACCTGA^ 

AACTTTTATCTCAAAACAGTCTTATTTTCATTGGTTGTTTCTTAAG 

CAGTCCCTGGGGGTACTCAGTTACTCCCCATACAAGTGCCTTCITTTCTGAAAAAGGG 

TTCTTCTACTGTTGCTCTTCTTTCTCTTTCTCTCTTT^ 
75 GCTTCCTTTTCTTCrrTCTTTCTTTCTTTCTTTCTTTCT^ 
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CTC CCACCT CAGCCT CCCAACTAG CTGGGACCACAGGTG CATGCCG CTACAC CTGG CTAATTTTAAAATATTTTTTATAGAGATAG 
GGTCTCGTTATGTTGCCCAGACTGGTCTCAAACTCCTGAGC^ 

AGTGTGTGCCACCACACCTGGCCTCTTTCATATTA'ri TTTriTT TTTCAGAGACAG AGT CTCACTGTGTTG CC CAGG CTGGAGTGC 
5 AGTGGTGCGGTCTTGGCTCACTACAACCTCTGCCrCCCGGGTTOU^ 
AGGTGCACGTCGCCACGCCrrcGCTAATTTTTTTTATTT^ 
TGAGCTCAGGCAATCTGCCCACCTCGGCCTCCC^AAGTGCTAGGAGTGCT 
ATTTCCTAGTTCCACTAGTTACACGAAAGACCTTTAATCATGA^ 
CACACCCAAAAGCAGTTTTATAAGCTCTTGTTTC 
1 0 TGAGGGAGTTGTTGAATAATGATATATCA.CMATTTAC 

CATAGCCAGTGTTAGGCTTTTTAAGTGCTAOAAACTATATTTTAATTTA 

AAT AACITAAATTTTAAGAACACTAG ATT AT CCTGT AGCAAAGAAACAG GAATGAGGACG TG ACCTATAT CTGT CATTTCTGGAGA 
ACTGTTACACCCTGTTTTATTTTATGACCACAGAATAAT^ 
TGX3ATTTTAAGACTTTCAAAATATAAAGAAAATCAG 
1 5 AGCATGCTGGCAATACACTGCTGTGTATTGGAGAAA^ 

TTTTGAACATTTAATTGTAAATTGCTCCAAGTTTGCCATTTGTATCT 
AGAGTGTATACAATCTTTTGACTCTGGCTAATGACTAGC^ 

GTTGTAATT ATAACCATTTGG CCTATAAAAGGAAAT T ATGGGGAAAT TTTCCT CACGTGGATCTTGAGGTTTATATAGATAGTTAC 
TGTGCAGTGACTCCAAAGTTTCAGTTTCTTTTGTACTTT^ 
2 0 AGAGAATGAACTGAATAAAAGGCTTTAAAATTTTCCATCTGCCCT^ 

TTTTTAGCTGCTGTGX^SAAATATGAAAACAATGTCATGAATATCA 
CTTTATGGATGTTTTCACATGGTCTTTGCCrTTTGTTG 

GACTGAGCATAC CTTAT ATTTG CATGAGC CCCTCCATGATTCAGAAATGTTTTCr AATTTTC 

2 5 TTTTTTTTTCrGAGATGGAGTCTCXSCTCTC 

AGGTTCAAGCAATTCTC 

HUMAN SEQUENCE - mRNA 

GGGCCACCCTTAGCAGCGGTCGCGGTCGGTGCCGAAGCGGTGTTCCCCGC 
' 3 0 GCCCTCGTCCTGTCAGTGGCGTCGGAGGCCGGCCTGCGGTGGCCGCGCCCTTCTC 

GGCACGGCTGGCTGACGGCTCCGGGCAGCTAAGGCTGCCCGAGGAGAAGGCGGCGGCCG 
CTGGAGCCTGGAGGAGGCCGAGGGGACCATGTCCGGGAGGCGCTTCC^ 
TTCCTCCAACCCAACGGCTTAOTTTCAAGGAAGTATTTGAGAATGGGAA 
GAAGGA03ACTGGAAGAGGAAGTAGCCTTAAAGATAATCAATGATGGGGCTGCCAT 

3 5 AGATGCTCCAATCACAGTATGTGGTGATATTCATGGACAATTCTTT 

ACACACGCTACCTCTTTCTGGGTGACTATGTGGACAGAGGCTATTTCAGTATAGAGTGT 
AATCATCCCAAAACATTGTTTCTGCTTCGGGGAAATCATGAATGCAGGCA 
AATCAAATATTCGGAACAGGTGTATGATGCCTGTATGGAGACATTTGACTGTCTTC 
TCTGTGTACATGGAGG7VATGTCACCTGAAATTACTTCTTTAGATGACATTAGGAAATTA 

4 0 GGACCTGTGTGTGACCTGCTTTGGTCTGATCCCTCAGAGGATTATGGCAATGA 

CCGAGGGTGCTCTTATTTCrrACA.GTTACCCTGCAGTTTGTGAATTTT^ 
AAGCCCAAGATGCTGGGTATCGAATGTACAGGAAGAGCCAA 

TACCTAGATGTCTATAACAATAAAGCTCCTGTGTTGAAATATGAAAACAATGTCATGAATATCAGGCAGTTTAACT 
CCCCTACTGGCTTCCAAACTTTATGGATGTTTTCACATGGTCTTTGCCTTI^ 

4 5 TGCTCAAC^TATGCTCTGATGAaSAACTGATTTCTGATGATGAAG 

AAGATCAGAGCCATTGGGAAGATGGCACGGGTCITTTCAATTCTTCGGCAAG 

AGATCCGGAGTTTTGAAGAAGCGCGAGGTCTGGACCXIAATTAATGAGCGAATGCCACCCCGAAAG^ 
CCAATGAAATCTGTAACCTCAGCaCACTCACATGCTGCGCACA^ 

5 0 GCCGTGCTCAGGTGGATCTAAAACTCAAGAACAAATTCTATTTATTTATTATTGGAAAATGAAA 

CTGGAGGTGCATTTATAATTCAGTCTGCATTTATTCTGTAAAAAGGTGACTGT^ 
ATAAAAAGTGCATCTGTTTTGTTTTTCCCTTTTTTCTCC^ 

CTTGTGCTATAAAGGGGAACTTCCCCTAATAAAAGGGCCTTGGAAACCTCAAACCTGGGTTTCTGACCCC 

55 HUMAN SEQUENCE - CODING 

ATGTCCGGGAGGCGCTTCCACCTCTCCACCACCGA^ 

GGAAGTATTTGAGAATGGGAAACCTAAAGTTGATGTTTTAAAAAACCATTTGGTAAAGG 

T AAAG AT AAT CAATG ATG GGG CTG CCATC CTG AGG CAAGAGAAGAC T ATG ATAG AAGT AGATGCT CCAAT CACAGT ATGTGGTG AT 
ATTCATGGACAATTCTTTGACCTAATCAAGTTATTTGAAGTTGGAG^ 

6 0 TGTGGACAGAGGCTATTTCAGTATAGAGTGTGTGCTGTATTTATGGAGTTTA 

GGGGAAATCATGAATGCAGGCATCTTACAGACTATTTGACCT^ 

GCCTGTATGGAGACATTTGACTGTCTTCCTCTTGCTGCCCTCTTAAACCAGCAGTTTOT 
AATTACTTCTTTAGATGACATTAGGAAATTAGACAGGTTTACGGAA^ 

ATCCCTCAGAGGATTATGGCAATGAGAAGACCTTGGAGCACTATACCCACAACACTGTCCGAGGGTGCTC 

6 5 CCTGCAGTTTGTGAATTTTTGCAGAACAATAATTC^ 

CAGG AAGAG C CAAG C CACAGG CTTTCCAT CACTT ATT ACAATT TTCT CTGCCCC CAATT ACCT AGATGTCT AT AACAAT A^ 
CTGTGTTGAAATATGAAAACAATGTCATGAATATCAGGCAGTTTAACTGTTCrC 
GTTTTCACATGGTCTTTGCCTTTTGTTGGGGAAAAAGTCACAGAGATGCTGGTAAATGTGCT 
GATTTCTGATGATGAAGCAGAAGGAAGCACTACAGTTCGTAAGGAGATCATCAGGAATAA 

7 0 GGGTCTTTTCAATTCITCGGCAAGAAAGTGAGAGTGT 

CT CTCAGGAG GCAAGCAG ACTATCG AG ACAG C CATCAG AGGGTTCTCG CTT CAG C^CIAAGATCCGGAGTTTTGAAG AAG CGCG AGG 

TCTGGACCGAATTAATGAGCGAATGCCACCCCGAAAGGATAGCATATACCCT 

CACATGCTGCG CACAG GAG CG AC CAAGGG AAG AAAG CC CATT CATGA 
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Table 109 

MOUSE NOMENCLATURE 
ICSGNM N/A 
Celera mCG9110 

5 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG1641650 

10 MOUSE SEQUENCE - GENOMIC 

TGATAGGAGAAAGGGTCTCTTGGTTACCATGTGGCATG^ 

CCATCCCTCCAGTATGCCAGTCTTCTCCTGGTTCIACrGC^ 

TTTGTTTGAAAAATACTTATTTTTATG 

CATCTCTCCAATGCCCOTACAACACCCCACTCrrGTCTGCACCCAA 
15 TTGCCC^TATGGACGTGGTATGTGACATCATCATACATTTTGGTGTGGGGACT 
TTTCGGCAGCCATCAATTGCCAGTAGCTTACTCACCAAGGTGG 

CACGG CTACHX3TGAG ATGACACATTG CCTCATTTTTAAAGAT AG AATCGTACACT AT A^ 
GTGTATGTGTGTCTGTGTGTATACACTCATATAGCTAGACTTGAACT 

AG AACGTACTACATATTGCAGGCTGAGCTTG AACTAT CAACTATC CT CTAGTCT CAG CCTCCCAAGTGCTGG CATT ACACACATGA 
2 0 GCCGTCATGACTAGCTTCCCTTCGTTCTTAGCACATCCCATGA 

ATACCTCA1TTGTTTCCCTGAGTATAATGAGGTCTTAAGTTGAACCT 

TCTCACACTATTAGACAAGGAAATCTGTGCAGACAAATTTTGTGTCTATGATATGAAGAATGTCCACCTGC^ 

TGTG ACT ATTGATTGTGAAGTAGTTCGCCTAAGAACAG AG GG CAGCAG AC^ CAG CAAGG AAGGGAACAAG AGCAGACAA 

AGAATGAAAGCAAAGATGCCCTTTCTCCAGTCCTCCCXSCCCTCGAGCCCCTTG^ 

2 5 TGTTTCTTCTTCTCTTTGTTTTTGTTTIX3TTT 

TATGCCAGGAGGTTTTTTGTTTTGTTGTCGTTTC 
TTTGGTTTCTTTGTTTTTGTTTGTTTGTTTTTGT 
TCAGCTCTAGAGTCTCCrcGACAGCTCTGTTCTCATTO 
GAAGCTAACAATGAAGAGGAGCAGTGGCGAGTGAGAGACACCCAG^ 

3 0 GG CACCAGTT CC CCTACT TGGAAATGAGAGGG ATGAGGG CTCT AT CTGT CTGGG TTTT AACAGT AGAATGGTGG AT ATGTTCAAAG 

CAACTTGCTAACACTACTGCTCTGGGATGGTC^ 

AGG CAGG AGGAAGTGGATGGAG ACAAGCTG ACCATTGGTTTGAAGGT C CAAG CTTTT AAT CC CCTT C C CT ACCTG TGAGG AG AGCC 
AGGCCAAACTGACCAGGCCTTGATAGCCATTCCCCAGCCAGCCGTGCCCTCCCTG^ 

. ~ TCAGGAATAAGGAGAATTTGGGGTCTGAGGAAAATCATCCCA 

AGTTCCTTCAAAAATGACATGACGTTTTTCTGCCAA.TTA7LA.CTTCCCT 

AAGG CACCAGAGGC CG AC CAGATCACT AGATTCT C CCCTTT CAGC CTTGG AAAAAT CATC CACTTT ATTAAGGGCAT CTTT AAGAC 
CCATGGACAGGAAAAGCCTCCAGACATTTGCCTACCCATGAACTGGCCATCAGCCGACAGGGTTTCCCACAG 

4 0 AGTGCAG CAGCCACAGCCTTCCCTGTGCCATCCTACCTG CTGCCAAGTCAGTCTGCAGACACACAGCCTGGCAGGATGG ACTGGT C 

AGAGGTTCAAAGTTGCTTGTGCAGAGTX3TTTC(^GCrrGCCATTGATTTGTTAG 
CTTCTAGTAGTAGATTAAATAAATTGTGGTGTTCCCAGACAGGAAAACCTCT 
CTCACAGGGATCTATGTCCACAATGTATTTGCTATTGGTTCCCTAAAGTGCCTTTTAACA 
CAGATATGTCCGGATAGGCACAGAGTGCCTGCCAACTATGCCTGCCAAAAGCCAG^ 

4 5 GTCTTCTTTATAATATACTCTTTTTGTATCITAAAATATTTTTTA 

GGAGCTATACTTTCAGTTTTAAAATTTGAGTTAAGTCAATGAGCT 
CTATAAGACTAAAAGCCACATAGAGGACTTCTTGTGTGCC^ 

T ACAGT AGCTC CAGGAAGCAAGTCTTTT ATGAT C CACATTTGACAGATG AT CAAATGGGGAATT AGAG AAGTCATTTGT CT AACAT 
CTCTGGCTGACAAC^GGTGAAACAGAGCTCTATCTGGTCAAAGGCCT^ 

5 0 ACAGCATCAATTCACCGGTTCTCCATGCATGGGGCACGGAAGGGACCGAGAGCTC 

GGGAGAAGGCAGAAGTGAAGGCTGTGGATTGAAACAGATGACCCTGCGG 
CCCCCCACTTGTGCTTCCTCCGTCGTTAACAGGGCATCGTTTCAGGCACTCCATTTAGAATT 
TAGTAGTGAGTCATGTGGTACCACTTCACrGCCCTACTGTGTCCACTTWl 
TTCCTATGTCAGGGAAGTTCTGAGTGAATGTGTGTTTCATTATGGGCATAG 
5 5 AAACTGGCAAGGCTTTCCTCGGGAGTCTAGGTATGAGTGTTTATGACGAGAGGTCAGT 

TTCCTCCTAGACGTTTACAGCCTGAGGCTAGTCATCTGCAGAGACCTTCTACCAGACTAGCCAGCACCrC 

ATAAAGGTTG(^GGTACTGGCCA.TAGTGGTGGATTAGAGAACCCCGCCTGATCCCAGCTTCACAGTACACATATGTACTTTCTTCA 
TTCCCTGGGTGCCCTTAGTCAGAGTTCTGCGACCCAAGCTGAGCCCATCACCCCCACACTGAGACACACAGACCCCAGGGTC 
GGTAGGGAAGCTGTCCCAGAAGGCTGGAAGGTACACAAGCGTCCTCATCTCT 
60 GTCTATTGGGCATAAGCCCCTGAGGCTGACACAGTTGCCTTGTGGAGGCCTGTATCCCCAGCCTTGGAAGAGAACACTCAGTAAGA 
GAACATCCTCTGTTGATAATTGGAAGTTATTCACAAGAGGCCTTGCAATTAAAGTC 
AAAAATTTTTTTTCTGTTTTCTACTTACCTGGCrCAGAGTTC 

AGGGGCCAGTAG TAGGCCCTATTT TATACTAGATACTCG AGTGAAG AACTT CTAGGTGTT CT ATG T ACATT AT AGGCT CA 
CAAAGCCCTGTGTTGGAGGCCAGGGCTCTGAACAGTCAGGCCACACAGCTAG 
65 CCACACTAATTTCCACCCTGCTCCCTCCTCTTGAGTTGTTATAGCTTTGA^ 
ATGAGAGAGCAATTTAAATTTAAACGGCATTCTCCTCTTCTATTGCrrG 
GCTACTTCAGTCT/ITCAACTAGAGGAGATGGGTGAAGGAAGGGGAGAGAGAGAG 

AGAGAGAGAACTG CAATCAG AGTGTGGCCTGTACTGCAATCAAATTG CAAACATTT TCAT C CAAAAATGGAGC CAGGTG CCTTTTA 
ACTGCCTCCCCC^^TGCTCTCCGACACTAAGCTGTAGTCCGGTATCCT^ 
7 0 AG ATACGGGATTATATAACGCATG G CTT AGTG C CTGG CATTGTG AC CATAGAATGCTTTCAAGGTG CATTGACATCT CACG GTAT C 
AT AT CATCACATCATATGC CG CTGTGCCACTTAT C CTGAGG ACTCAATT ATATTCCAT CAT ACGGAGTACTGTGCTTTATG CAGTG 
CTGTTTGTGGTACTGTTTGTGGGTACTTAGGTTGTTTCCACTTTTGGACT^ 
TGTGTGTGGATATACATTTTCACTTCTCTTGTCATGTGTACATTGCTATGCCTAGAATCTTAGGT 

GGTCTGAGG AC CTGACACAGTCAGTTCCAGCTAAAATTCCAATCCTGTATCTATTCAACTCATG CCTO CTAAG 
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CTCCTGGCTTTCCCGCCTCAGAGCAGAAAAGAGACOT 

CrTCCTATGCCACAGGCGTCCAGCCTTTCTGCATCTCCGTGATCCAGACCTGGGA 

GAACACCACCATCTCCCATTACACG CATGCG CAGGGTGGT AAACTTCCCACCTTTAG AAACTCCATG CAC CAAACCTTTGTAATGC 
ATCCAGTGTCAATGCATCCAGCATCCTGTGTC 
5 CCAGGAAGAGGGGGAAAGAGGATGACACATGTCTTCT 
TTCATTGTTGAAGTCCAGGAGTGAAATGCCTGACTT^ 
CTGAAGAGGCCTTTCTGGGGGAAGGAGATCTGACTTAAATT^ 
GCCTTAGI TA TTCTCTGTTCAGGGTGCA^ 

T CTTGTCTGTGAATGAGCT AAAACCTCT ACTTGG^ CAGT CTACCCTTC 

10 CCCCCACCTCCCTCCTTCAGTGGGGATCCTGATGGAATTCCAAGCTTCTCTGTATCCTG 
TGTAATACAAGGGGCCAGTTTCCAGGCAGATTTCCCTGTCCCCA^ 
AGAGGAGGCTAGCCATTACTCCCTTCTATOVSGTTCCAGATAA^ 
GGACCTGTTCTAAAGCAGCTGCAGGCTTCTCrTAGGTCTCAAGC 

CAGTCAGAGT AGGCACAACACAACAGT CACCACAGTG CATTT AGAAG AAAATGT CT TAGGG AG AGACAG AAATGGGCGTGACATAG 
1 5 CCTCTCTTTACTCTGTCTCrGTTCATAATGGAAGCCCTTGGTAATTTTGTACACTTAATC 
GTATGTTTTCTGACCTAATTTGTTGTTTTCCATTACCAATAT^ 
TTCTGTCTGTCTGTCTGTCTGTCTGTCTCGCTCCCTCCCTCC^^ 

ATAATGTAAG AGATTTCACAGTTATTTGT AGTTT AAT AAAT AAAGACAT AGGAG CC CTTTGCT AAGTT CTTAAAATGTT TT CT CGG 
TTTTGTTTGAT ATCTAAC CAT AATCTC CATTACTC TTCTTTT AAAAGTT ACTT CT ATAGATTT AAAAG AGTAAT ATGGGTTCAT AG 
2 0 CAGGCAGAAAATGCAAT AT AAGGTGGAAAAAAAGAG CAACTGCATACCAAC CACAAAG C CAC CTTCTTGTCC CCCATC CCG AGAAA 
CACTCTGTGGAGAGTTCTGTAGCKKZCCCTCTGGCATTTTTTT 
ACGGCAGCrTGAGTTGCTTTGGCTGTGTTGTAA 

T AAAATCCCAGAGCCACACAGAGACTGCATCTCAG AG CCACG AGTGGGAGG AGGTCTG CAGAGGAAGGCCTGAAGAC CT CTT CCCA 

2 5 GACAGAGTCACCCTTATGAG C CCT CCTT 'i'lTTGT AGT CG AGGCAAACACTCCTTTTTG CAATCATTACTT CTTG CTCAAGTGAGAG 

AGT AAAAT AAT ATAT ATTTTT AAACAT TTT AAAAG CAAGCAAACAAGT ATAG AAGAAGGG CCCTG CAAACCAGGAGTCAGAGAGT C 
AGGGAGGCAAAAGCAGAGAGCAAGAAAAGCCAAGACCTAAATCTCTCACATCCCOT 

CTTACCCTCCG CATCTG CACTTACTGTAGCCTTTATTTTAATCACCTTGTTTGAATTTTGCCCTGCTCAGCT 
GGCC^CATGGGCCIXIAGTTCCAATTTAGCCATGCAGCT 

3 0 TGT AG ACTGAC TTTCTAG ATTAGAAGTG AAAATG AG AT AAG AT ACTG CAGG GAATTAGTGCCAGG T CATGCCCAC CT AGATGCTGT 

AGTAAACTAATGCCCACCCCCACCCTCTCACCCCAAACCCTAGTCAT 

CAATTT AT AGAT ACAATT ATAATTGATCAG GC TG G ACACAAATGCAATG GCTG ATG C CTTTATAGAGATAGGAAAGGGACAGAGAT 

ATATGTACAAGGGATGGGGCCATGTAAAAATGGGGTGGTGCTCTACAAAGCAAGGG 

GGGAAGCCATGGAGAAGTTGGGAAGTGTTCTTGTTGTATTTATTTTCTCTATTTCTCTGTC 

3 5 GTGTGATGCACTTGTGAGTGTGAATATGGAGACCCGAGGGTAATGTCAGA 

CTCAGTCAGACCCGGAGCTTGTGATAATGGCTAGTCTCATTAGCC^ 

ACAG CT AAAG G GCAG C TGACAAGAACAATGGCTCTAGG GTGC CT GG CAGGTAAG CGG CTTG AGAGAG CCT AAAG ATAAAACAG AAC 

TGTTGAAAACAAAACCAAAGTATGCTTCCAGGCAGCATGCT 

CCGCTTTTTCTTTTTAAC^GGAAAACIAGCTTCAAGGAT 

4 0 AATGTGGCATATCTTTGTTGTCTCCTTCrrGTCTCCCAACT AGAGAG AACACACTTACG 

GGTGTGATTGGCTTCCAGGGAACCTGATACAAGGAGCAACTGTGTGCT 

GCTGATGTGAGTATGAAGTACATTGGGAACTTCTCTTACTTTCTGGGTTTGTGTTGTTGAGATGGG 

AACACTTTTTAAAAATCCTACTCTGAGAAATTGTGCTTACAATGAAATCATTCCTACCTTCC^ 

TTATTCITTTTTTTATTAGGTATTrrCCTCGTTTAC^ 

4 5 CTACCCACCCACTCCCCCTTTTTGGCCCTGGGGTTCCCCTGTACTGGGGCATATAAAGTTTG 

AGGATGGCCGACAAGGCCAT CTTTTG AT ACAT ATG CAGCTAGAGACAAG AGCT C CGGG GT ACTGG TTAGTTCATATTGTTGTTC CA 
CCTATGGGGTT<3CAGTTCCCTTTAGCTCCTTGGGTAATTTCTCTAGCrrCCT 
TGTGATCIATCCACTTCTGTGTTTGCTAGGCCCTGGCATAGTCTCAC^ 
TAGTGTATACAATGGTGTCAGCGTTrGGAAGCTGATTATGGGATGGATCCCrG 

5 0 TCACAGCTCCAAATTTTGTCTCTGTAACTCCTTCCATGGGTGTTTTGTTCCC^ 

TCTTCGTTCTTCTTGAATTTCATGCGTTTAGCAAATTGTATCTTATATCTT 
AGCGAGTACATATTGTGTGAGTTCCnTTGTGATTGGGTTACCT^ 

TTTC^TAAATTCAT^C TTTTTAATAGCTGAGTAGTATTCCAT TGTGT AAATGTATCACATTTTCTGTATCCAT TCCTCTGTTGAGG 
GGCATCTGG^TTCTTTCCAGCTTCTGGCTATT^ 

5 5 TCTT CTGG ATAT ATG CCCAGG AG AGGTATTG CGGG ATCC T C CAGT AGT ACT ATGTC CAATTTTCTGAGGAACC GC CAG ACTGATTT 

CCAGAGTGGTTGTACAAGCTTGCAATCCCACCAACAATGGAGGA 

CTGAATTTTTGATCTTAGCCATTCTGACrrGGAGTGAAGTGGAATCTCAGGGTTGTTTTTTAT^ 
AAAGAACTTTCTGCATTAAAAAAAAGTTTGACTATAGA 
• CAGTGGTGGTG^TGOTGTGTGCCATAGCACATGTGTTGACCrrCAAAGGACAAOT 

6 0 TTTGTAGGGATGGAACTTAGACTTGGAAGGCCTTGACCCTTACCACCAAGCCATCTCAACAG 

AACATTG CCTC CCT OTGAGTTTTG\AAGTTAAC!AGTTT^ T TTT CTACAT CT GACAT CTGTGTG TGGTTA 

TCCTTTTGGACAG CAGCCAATATTCCCATCTGGCCTGTGTTTACAAAGCAAGACTC TGAGAGTAC ACTAT TAGAGAAAGAAAGAAA 
CATGTTTAAATGATAATATTCCATGGTTCTAAACAATTTTTTATTTTTCTT^ 
ACATCATTTCCCCCCCCCTTCCCCAAAATCCTGTACATCTTCTGCCT^ 
65 TGGTTATATATATTTCTAAATACAACCTGCTGTCT 

TGGATAACCAGTTGGCACAGTCCCAGCTTTCCGTAGTTGCGTATAGTTCATGG 
CTCCTTTGTCTCTTGTTATTGTCTTTCTTCAGCTC^ 

ACTAGGAAATACAGTCTCACAGCAAACACCCTGGTCATCTGG CTTTTGTGATCTTTCTG CTCCATTTTCTCCCCTTCCAAGTATGA 
GGGCCCATTAACAATGAACATTGCTCTTGXZAATGGAAAACT 

CTGAGCCCTGAGTGCACTGGATAGTGCCTGGCTGC^^ 

CTTAAAATGGAGGGATTTTTGCTTACTTTGAAGTGGGAAGATCTACTTCT 

TCCAGATTTTAATTCTCATCAAACCTGGGCCATGCCTTCTCT 

CTTTGC CTGCTTG CTCT CACC TCTACTGGCAACCT AT ATAAG GACATGG AAGAAGG AAGTTTT C CTAGCAATT CT A'XTTCTT CACT 

7 5 AGCATTAGGGCCTACTTCTTCAGAATTCCAAGGTGTACTGAAAACCATCTAAGTCAACC^ 
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ACATTCCATTTTTATCCAGACATTGTTGGATTATCTG^ C CAT AT AT AT AT ATATGGAGAGAG AGGGA 

GGGAGAGAGAGAAAGAGAGGGAGAGAGAGAAAGAGAGGGAGAGAGAGAAAGAGAGGGAG^ 

GTTCTGTTACTTTAGAGAACATTGACTAATACATTGGCCAT^ 

TTTT3TTCAGTAAAATX3GTAGGCATACCACCAGAAAT 
5 CTGGAAATGTCCCCAGAT ATTACTGAGGTACCACAAACTGAGGTACC^ T AGCCTT ACAAGATAGTTTCAGT 

GTGTTCTTTT ACT AAGACAGT ATCTT CAGTCCC CAATATTAG CTAACACTTCCACGGTGGAACCGACAT CTTTGTCTAGC CCTAAA 

ACCTTCTGACCTTAAAG CCTCAGTCAT CCTTGATGCTTCT CTTTTTCT CACCT AT ATGG CTGATCAGTTGCCAATTGTTT CTTCAA 

GTGTCCTGTGAATGGTCITACTGTCTCTCCTTTCrCTCT 

TAGAAGACATTGATTCACTCAGTGGTAAACCrCCATTAT^ 
1 0 ACACACACACACTGACAC^CATACTGACACACACACAC^^ 

ACTGACACACAC^CACACTGACAC^CATAC^ 

AC^CACACACACACACTGA(^CACACACA^ 

ACACACACTTATACACACrrTACACACACACACTTTTCACA 

CATTATCCCTCTGCCTGG(^CTAAGCACATTGTTGTTTGAAT^ 
15 CTTTCCCATGCCTTTGCATTTCTGAGGCTTTT^ 

GCT ATAAAAG CCAC TTTAAGT CCCC CATCC CT AGGAAGTC TCCAT ATC TGTAGTCAGTT CACATG ATG CTGGCTCTCTTAACACTC 

ATGTTACACCAGCTTCTAC CTGAGTGGGGTTITTG^ C TTTAG CTTG CACGGGT CTTTTCCTAAGT 

GGATCCTCTTACAAGTTCTCCACAGTCTGATAGAAAGACTC^^ 

GGACAGTTGAGATCTAGTTTGAACTTAAGTATTCCATCCCATTAATAGA 
2 0 TATAATATCTGTCACCACAGATGATGGGCAGATATAA 

ATGG CAGTCAATGTGAAGAATAATTGATCAT TTACAATGTATCATCCAAG CAC CCACAAG CACTT CTGT ACACCTCCAGACTCCCC 

CATCACTCCATACACCCCAGACTCCCraTCACT 

TCTTC(HTCTATTCACTGTACTTGACTTTTGCm 

TT CT CACT AAGAACTC CAAGGACAAAGGT CATATTTG CATTG CAAAG AGT TTTGAG ACAAG TTT ATGATGGAGG AGCTTAAGGTAT 

2 5 CTGTGCTGGCAAGCTTGCAAGCATTGAAAAGGAATCACTCTC 

AAGTCTTAGGATAATGATTGTCTACTTTGTGACTAGCTCATCCTG 
TTTATCTAAAAGAAAAGGAATCCAACX5GGGAGCCAGACTCATGGGTAAAA 

T CTG AG ACCTT ACATG CCAGT AATGAG CTGT AAGTGATCATT AT AG GTG ACT AGG ATACACTCAGGAT ACAGGAAAC CACATAAGG 
CAACTAGAAAAGGGTTGTAATGAATGTAAAAGAGTTTAAGGCTTTTAACAAG 

3 0 ACAAAG AAACTG AGGT AAAGTTAAC TTACAAAGAAACTGAG AT AAAGTTAAAAAAATTTATTTTT ACJ^ TAG A 

TTTTCGCAACCACCATGAGTTCCTGTTCATGGAAAGGGGATGGGTCATTATGT 
GCATTAAAGGCTGTCCTAAGAAAAACTTGACITGAAAACAATAGCACTTGTGTTT^ 

AT CAAATGTCAGAT AATG AGATT AGGTAGGGTCTTGGAAT C CTACT AT CAG AAAGGTGCAG AAAT CAC C ACAGG AG ATGAT AATGT 

3 5 AATCAGACGCAGGGTGAGAGAAGAGTGAAGAATGGTTCCAGGATTC^ 

AAAGGGGG AG C CAG CAGTGAGGAAGATGCACT AGGGT AGAAT GAG AAG CAATG AG AAT CAGAATT ATGGTGATG C CAC AG G AAAGC 
AGAAGGGACAGGGACCATCTAAGATGCACAGTGGGCGTAGAAGCAA 

GCATGTGGTGGTGGTGGTGGTAGTAGTAGTAGAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGTAGT 
AGTAGTAGTAGAGTAGTCCCCTTGGGTGGCTAAAGAAGAAGGGAAGCTGAGCCAGTGG 

4 0 TAGAACTCTGGCTCTTGTCACCCTGTCACTCTCCAAAAGTGAGAAC 

TTCAAGTTATGGATTTTGTAAGGAAGACTTACATGGTC 

CTTCCCATCTCTCATCCCTGATGCCATGCAGGCTCTGGCCTACGTGACCATTAAG 

GTCTGCCCTTCCACATCTTGAGGAACTTCTCTGGGCACTCTAGCAGGC^ 

CTAATGACTCCATCCCTAAGCCCAGGAGGTCrcTATTTCCTTTTA^ 

4 5 TTTTTTTTTTGTTTTTGTTGTTGTTTTGTTTTC 

TATTGCTCACACACTTCGCTGGGTTCTCTCCTCAACCTTATCTGAAATACCGGATGTGGCTGACATCCTGTAG 

AGTGCCACCAACAGGCAGTGAGCGCTCAGC 

GGGTTGGACAAAGTTTAAAAGACTCGGTTCTCCCITTCACAGCGGTAACAAT 

5 0 TCACATCCACAGGTCTGACTGGCAGGAAAGAGAAAGAACGTCCTGCTGGTGAGCGAAGAAGCT 

GGAGTAAAATCCAAGGTCACCCTCTGTGTCCTGCAGGCTGCAGCGTGTCCTATAATCAGAGTGCCTT 

ATTATTAG AATGTG TAACT ATT AG AATGT AAAGCTTTTAAGTATG CAAAT AAAAAG TTTAAAACCTTGGT ATGGTGAGATAGC CTA 
GTGAGTC T AGATT CTTATTGAGCCAGCCTGG TG AC CTAAG AT CTATT C CT AAACCT CACACAACATG AAAG G AG AGCACCAACTT C 
ATAAAGCTAGCTTCCAGATGCATACCATGGCACTTGCTCCTACACATG^ 

5 5 AGAATCTGAAAAAGAAAAGAAAACTGAAAGACTGCTTTTACAGTTTGAAAG 

CAATGTAATTCCTAGCAAAGCACACCAGTTCCTTTTC 

AG AAATTCAAGGGAAAGG AGG AGGAAGGGCCAG AG GGAGGG G GAGAG AGAGGACTT CAGAAG G AACAAAGCTG ACGGCCTTACTTC 
CT AATTTCAAACTTTT CACAAAG CT ACAGT AAACX3G CACT ATGTG ATGGGGG CAG AG AACAGACA.CAGACCTGAG CAACATGGATG 
CAAAGTC CAGGAAT ACAC C CCACGTTTATCATC CAGGATG CTGGGAT AATTCT ACAGAGAAAGGACAGTTT AGTTTTTT AAAATAA 

6 0 ATTTATTGGAGTATGTGCAAAAG AACCTTG CT CAT ACACAAAAATG AACT CAAAACAG ATTAG AG AC C CAGAT ATG AACAG AACAG 

ACTTAGTTCTTATGAGAAAATGTTAGCAAGTAATAAACACATTTGG 

AAAGTATTGATGTGTGCTGCACAACAGATAAACCITACATCATC 

ATGTCATTTCTATGAAATGTCAACACAAGCAGTT<3ACCAACA 

G G ACTATTTCTG AG TG AATATGG TTCCTTCTTTTAGTGGTAATTAAG ATTC C CAACTCT 

6 5 ATGCATATACTAAGCAGTCCAACTGAATGAATAGTGTGCCAAGAGTTTACATTAAGTTTATACGG 

CAT TTT AAT AAAG CTG CTGAACATT TAAAAATAAATTTAATG AACACATCTT T AAAAC CCT C CACAG ATCACG CCCTCCACT AGC C 
CAGCACCCTAGTGCAAATGTATACAGTGCCCTCCTAAGGCTGCGTTGCA 
CACCGTCCTCTCTAAAAATGCAGACAAAGTCCCCGGGGCTCCAGAGCGAAGCCCT 
CTGAGGCGTGCAGCTTACTCTGGCTAATAAAGCCTCAAGACAGAAGGGT^ 

7 0 AG CACT CTGTGG ATGCAGAAAACCTGAGGTCAGAAGTGAAACAGGAG ATG GAGCCACACTGAG CTAG AGCAG GG CCTG CCAGTCAT 

ACCTCCTGTCCCCTAGAGCAGGGCCTACCACACCTCCTGAC 

TCAAAATG G CT AT CGT CAAGGAAACATCAAAACAACAGATATTGG CAAAG ATGCAGG CAAAATT AAC CCTTG CACACTG CTG ATGG 
GAGTGTAAATTAGTACAGCCACTATGGAAATCAGCATGGATATTCCTTAATAAAACAACT 
CTGTTCCCCTCCTCCAAGTATACCCAGTGAAGTCTACGTCAGCrrCC^^ 
75 AGTAAGCAAGTTATGTTGCTGACCCAGGTGCTCACTGATAGA 
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ATATATACATATATATGCATATACATATATATACATAT^ 
AGTCAGCCATAAGGAAGAATTCAATGCCATTTATA6AA 

CAGTAAATGGACTAAACAT AACC CAAAGTACATTAT AAACAT AC CTGAAGAGTTACAATGAAAT CCATTGTGTAAGACAATT AACA 
GTTAGCTGAT AAAAGT CTTTATAACGTGGAGGTAAACATT^ CTTCTTTGAGGTATATCTTAGAA 
5 CAAGTAATAAAAATTAACCTCAATTTAATATGTCCAAAAGAATAAAAAGTGGGTCA 
ACTTGAAAGTGGAAGCAAGATCAGXaAGTTCAAAGTCAGCCTATGCT 
CTGCTTCAACCCCCCCTCCAAGAAATAAGTAATTTGAAAGCTAATAATTAGAAAGCTC^ 
GTTGGGGGTGGCAGCACTCAACAGCTTCATGCnTTCTC 

CTGTTCTC CGGGATCACG CTCCGCGAGTT CCAGTCCAG CTACAG ATG ATGG CTTATTATTAGGT AAACAAGACTGATTTTGGTACT 
10 GACTGAGCTTTAGAAAGTGACGTGGAGGCCCGCAAGGATACGGATGATTC 

CAGAAAAGGCCATGCGAGGAGCAAGAAGAGCAGCT 

TCTTCTCCAGAAAAGC^CCCGCAGCCTCTCTT 

GTC^TTACCCTGCTCrrTGGGATGAGTTAAACTCTCAGAGACT 

AGATTGACACAGACCATX3CTGCTCCTGCT 
15 GG AGGG TCCTGGGG ACGATGAG CAGG TGGTGGCT CCTAG CCAAAG CCGGAAATGAGTATCTC 

GGTTACTGGATATTT CAAGTAG CX1AAAAGAAT ACTTT CCAGGGT ATTATTGAT CCATCTCT CCCCACTA CTTG CAATGA 

ATTACATGTGTCTGAACGAACTAAGAGGCCGCTTCCTTTAATGTAAGGTGCCA^ 

TGACTGAGGOVAACATAGAGCACTTCGTTACTCATC 

TAACGTAGACTTGAATTGGAAACTCGGAGACAATAGACCCCCAATGAAAACTC^ 
2 0 CATACCGTGGAGGATGTGGTTTAGAAGGGAGAAAGTTGTTTCAGAATTGATTATTT 
TCTGCCATCAGGGCTGCAGTACCAAGAGAACAT 

GGC TGCTCCACAGCAGGAG CGTGAGG CTTTCCTTGAGGGGAATCATGGGAAAGTGGTACTTTGGCGTTGG ATAG CCTI TTGATAAG 

TTAGGAAATG AAGG AG CTAGGGAAACAGGG ATGGCAAGTTTTGGTTTTGTTG TTTTAATATTTTTGTTTTTTGTTG CG CCGCCCCG 

2 5 COSCGCCACCCCCCCCCCCAACACACACACa^ 

TCTGTAGGGGAAGGACTGTTTCTTTTAAGAAAAGCAGCACTAGGCAATC^ 

GG AAGGATGGTGGAAG CAAGGT ATG CTT ACAT T CACACAAG G ACGG ATATGTGGCAATG TG CAAGGCAGCTT ATGT CTCATT CAGT 
CTT CACAACACT CATTTGGCAAGGAGAGAGG CTG AG ACATG G G AAGGTGT TTCAATGTG CCCGAGATGACAT AGTTAGTTACT AGG 
TGGCATATGAAAGATTTTCA.CACGCTGATATCCTTTCTGCCCTTCCCCTCCCCCTATTA 

3 0 GACAGTAGACAATAAATTGTGAATCGTCTAACTGGGGGTGGGA^ 

TCCAATTAGATATTTrCTTTATTTATATTTCAAATTTTGTCTCCTTTCCT 
CCCACCCACTCCCTGTCCTGCCATTCTCCTACACTGGGGCIATT'G 

ACAAGGCTGTCCCTCTGCTACATATGCGGCTGAAGCCTTGAGTCCC!TCCTTGTGCACTCTT 

3 5 CGATTGGGAACTCTGTGCTCAGTCCAATGGTTGGCT 

AC^GCTATATCAGGCTCCrrcTCACCAAGCACTTGTTGGCATCCACAGTAA 
CAGGTGGGACAGTCACTGACTGGCCTTTCCTTCAGTTTCTGCTCCAA^ 
CCCTTCTAAGAAGGACTAAAGTATCCACACrGTGGTCTTCCTTCTTCT 
TCTGAACTTCTGGGCTAATAGCCACTTATCAGTGAGTGC^TATCATC 

4 0 ATTTTCTAGTTCCATCCATTTGCCTAAGAACTTCCTGAATCAATCATTO 

ACATTTTCTTGTTCAAGGAGATCTXSGGTTCTT^ 
CCTTATTACATGTTGGAGCATCTTCTGAGTATATGCCCAGGAGTG^ 

AGGAACTGCCAAACTG ATTTCCATAGGGGTTGTACCAGCTTG CATTCCCACAGCAATAGAGGAGTGTTC CAT ATCCTTGC CAGCAT 
CTGCTGTCACCTGAGTTTTTGATCTTAGCCATTCITGACTGGTGTGAGGTGGAATCTCAGGGTCATTTGATTTGC^ 
45 ACTAAGGATGTTGCACATTTCTTTAGGTGCTTCTCAGCCATTCGGTATTCTGGAGATATGTTCTAATGTGTATTCTGA 
CAAACCAAAACATATGTTATGAGTTGTGTTGATGAATTA^^ 
GAATTCCTGTCTACTCTTTCTTTGCAAAACATTTTTATTAAAATGAGTGTTAA^ 

CGTTATGTGT CITTCT AATT AAGGT ATTTTGATTTCTT AGAAAG CTATTGGTGGGGCAC CTAGAAAACAAAG CTT AGGAATG CTGG 
TGGTTATTCAATGGTGATTGAAAGGGAAAAGGCCTGGGATAGGGAGGA 

5 0 TTTTAAATAGAAACTGCGCATTCAGGTATTATTTACAAAATACAAAG 

CCTGAGAGTAGGGGGGCTATGATACTCATTGCCTGAGAGGACTCCCTGGTTATCACGGGGGCGGGAATGATGACC 

ATTGCTTCGTGTGTAGCCCTTGACCCGTATCTTCCCTGAACAAAACAGTTCCTACCCAGGAGTT 

CCCTCTATGCATAAATTTCTAGGTTTAAATTTTAAAAAA 

ATGCTCTAAAATAGAGAAATCACGGTCCACGGCAAGGATATGTTATTATAAGCATGGAA 

5 5 CGTAAAACACCCAGGAGGAAGGATGACACACAGCACAATGCCTTTTTCCTAAGTCTGOT 

AAGCTGAAACAAGCAAGACATCGAGAAGCACCGGCCTCAGTATCAGTTACCTCT 
AGTTCTTGGATGAGGGGTGGGCTCCTGGATGCTCATTTTACTTACA^ 

AAGATATG C CA.CGGACAC CGAAAACCACAC!AAAACAGAT ACAAACTTGAC CTGAC CAAACACTG CTGG AAATACC CCCAG AGTGTT 
CATCGCCCCCCACCCCCAAGAACTACCCCTC^^ 

6 0 CCTGATCTTCATAAAAACAATTGGCTACCGATC^TTAAAAGA^ 

CTGTATAGCCCTGGCTGTCCTGGAACTC^CrrTTGTAGACCAGGCTGGCCTCAAAC 
GCTGGGATTAAAGGCGTGTGCCACCATGTCCGGCTTTAAAAGAAATCTCITAAAGCIAT^ 

CAAGTCAG CAAATGG AAG CAACC CCAGGT TTT AAGCAACT AT AT ATTTTACTGAACACACT AT AGT CTATTT AC CT AGTTGATGGG 
CATTGCA.GCTATTTCAGTGTCTGGGCTAAGTGAACCCCnX3C^ 
65 ATTATCAACTCTTTCCTTATGTATATTCTGTAGGTATACTGTTGTGTTTTATTAAAAAAA 
TGGAGGTGTGGTATGGTGTGGGAGAAGGGGGTGCCTCTGTGGG<:CCATG^ 

ATGG TATAG CAT AGAATAGAG T TT ATTCAGGG CAT AGGGAGGG G AGTTG AGGGGGT AGTAG AG ACAGAG AAAGG CAG AGAG CGAAA 
G AG AGG AG GTGTGGG AG AGG TAGCAGGAG AGG GAGAGAGGAAGAAAG AGG AGG AGT AGAAGC CAGT CATG AACAT AT GG AGAGAAG 
AGGGGGAGGGGAATGAGGAGAGAAAGCACAAGAGAGGGTAGAGAGGAAGAGAAGAGCAAG 

ATTTAAGT TTAT AT TCAGT AAAATT CACCTTTTCACTAAGGGAAAAATAAC TGTGT ATTTTGAC^ CATG TAATGG C 

CT C TGCAATCAAAGTGG GATTAGATAGTCTTT ATGT AATTACTTATTTTTT ATTATTAATATACATT AG ATT AT ATTGTTTAG 

TGAGGTTCATGCGTGTTGCAGTCTTTATCAATGGCrTATTTCTGTTTGTTACTG 

7 5 TGTAGGGCACTTGGG CTATTT CCCATTTCTGGTGAGTTTAAATAAAGT^ 
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GCATTTTGGACITGTTAGCTATATTGAATTTCTAGACCCTGTGGCCAT^ 
CAGTTGTACCAACTTTACACCGTCATCAGCAGTGTCTGAA^ 

T ACCCCCACAGATGCATGGAAATACCTTGCTGTGGTTGTAGC'ri"!' CAG'i"l u lT CCAGTGG CCAATGATGCTAAGCACAU"rrr AATGT 
GCTTATTGTCTAATCGCGTATCrTCTTTGGTAAACTACGTATGGAAATCATC^ 
5 CAAAGCGTCGAGTATTCCTTATTTATCTGAGATATAAGTCTATTATAAGGTATATGGCT^ 

CCATTCTTTGAACAAGG CTTCATGGACAT ACT CACTGTGGTCTGACT TAAGGGTGTCATGTTT CGTGTCCCACACTTTTGATCTCT 
CAGCAGCTCAAATCGAAAGGATT CTTTC CTAACATCTCAT AGCTGTTCACGCT ACATTTAGGTCTGTAAT CAAGTTTGAGCTAGTG 
TCTTATACAGCATATGGGAGTGCAAGTCAAAGACCTAACAATGTT CACCCAC TGCGGCCCCACTGTTCT CCGTCA rn\- i"i CAGAGG 
GAAGCACCTTTGACTCTTTATCACAAAGCAATM 
1 0 GTCCGTGGCATGTGCCCAGCTTTTTGTGAGTATCCTCTTCT^ 

ATTGCAAAATCTAAAACTATCTTGTAGTCCAGATGTATTGAACAGATAGTA 

CC CGTACTTTGTAT CATGGAGGTTTCCCTCCTTTACTGCCCTTT ATCTTCCAGATGTG CAGG TAGGGGTGGAGAGGGA 

GAATTTGGCCATGATG CTGCTTGAAGGACTCTGGTGCAGAGCACACCC CACTCATT CTGATTGTCAATGACTCTTGATATGTATGA 
GTATGATGGCTAATATTGATAATTGGCTTGCGATTCAGAACCACCAAG 
15 GAAATGTTTAACTGAGCAAGAGAGACTCACCCTGAGCGTGGATGGCACTGTGCTGTGGGCT 
GAGAGTAAGTTGAAATGAGTGTCCTGACTGCAGGCGCAGTGT 
CTCCTTGAACTGTGAGCCAAGGCZAAGCCAAGCCCTTCCTTCCTCATGTTGCrr^ 

ACAGGAAATAAGACAG CAAAGGAATGGACCCTGG C CAGGGTAGG ATTTCTACAGTGAAGG ATGTCCTGTGAG ATGCT CACACACAG 
AGCCACAGAGCATGAGTCTAGTTCACAGAACAGTTTTCT 

2 0 T AAGT ACACTGTAT AGGAAGGCAAAAAAGAAAAGT AAATTT CTG TTGTTCAAT AG AAAAAAGTT CAAAATT AGC CTAACAAG GAAA 

CTATCTTGGGAGAGAATGCAAATGATTTGTTTTGTGTTTCCTG^ 

ATGCTAAATTGACCTCAGGATCAAAAAAAAAGACTCCCAAATTATCTA^ 

AGATTGAAGCTTAATTTAGTTCCTGCAC^^ 

GCCTC!AGT CTT AGGTGCTGTCTACrrCTTGCTGG CAGTAGCTG^ CAGCCGAGT CATCAGAG CAG AG AGAGG AGAG CG AA 

TACGAAC^GTATCACCTACTAAGGAGGGATGACAGAGGACCAATCAGA 

AATAAAGGTTAACTATCACTCTTTCCCCAGTTACACAGTGTCCTAAGTCTGCCT 

CTGAGAGACTOTGTTTOTAATTAGTGCTCAGGTGACGCrGAAAGCAA 

CAG ATGT TTGATCTGCC TAGTTAG ACTT ATTC^TACTCT CTG CAAG TT AGAAG CT C CAG GG AAACGG CAAG ACG AATG TCTG CTT 

3 0 ACCCCAAT AATAACTTCAT CTAAAG AATGCG AAT TAGGGTTTGGG CATT AAT ATTT TTTT CTAAG AAAAAAAAAAGTCTTTG CTAT 

GGTCTATGGTCTAGATATATGTTCCTCAAAGGTTCCTGCT 

TAACAGAAG TT AATTAAAT CCCAAACGTCCACACTGATAAAAAG AGT C CCATCTGTCITTGAAGAGTGGGT AGTTAT AACGGGAAA 

CTACCCCAAGAGTATGGCCATCCCTTTCTGCITCTTCACACA 

CCAGAATCrrTGAACTTCAGTA<^CAGCAC7^CAGGTAT^ 

3 5 TGTGTCAGACTGGCTGAGGCCTTCTGCTTGGTTAGTCTAACCCGCTCACAGCCAGCCCTGAGA 

AGAAGTGGTTTCATTT CCTCTGTGATT CACAAAAAAGCTGrAAAGTT AT AGAATTT CAAACGT ATGTTG CATCTC CT AAACTAC CCA 
TTTTGTGAGAAATCAAGCCCAAAGCTTTGCATGTC!TTGGAAGCTATACCTGAGGTG 
TCTGAAGTATTCCGATGCAC^GTTTGTCAGGTCATTGTTCTTTCCTCTTGTAAATACTACCCCCA^ 
TGAAAGGTAAAATGTATTTCrrAAAGCTCAAGCCATTT'ATTCT 

4 0 ATGTGTTTATAAAAGGTTGGGGCAGGGGATGAGGGAAGTTCTGCGGTAATGACAGGGGAGTTCCA^ 

TCCTCTCTCATGAGAAAAAAAAGATAGAGCGGTAAACAGCTTAGCTGrTAACACATCTCCAAATCT 
TGCACACTCCTCTTTCCCCTTCACCACCTCTCTCCTTGTTTTCCGTTTGGGCATCTG 
GTAGTAAATTCCTAAGCTAAATGCTCAGAAAATAGCTTAGTATATCTTTGATGAGGGATGATTATGCT 
CGCAGAGTGGATTTGAAACAAAGTATCTTTTAGTTACAAGTGAGTGAAGGGTTTAG 

4 5 CTGG CTGGT ACCT AAGT AACAG C C TGGCAGAAGCGACCTT AGGGAG AAGGGTT CATTTT GTTTCAC CATTG G AG AGG ACACAGGAG 

TCCACAGTG<5TTGGGAAGGCGCGGTAGCAGATGTGAGGCAGCTGTC7VCATCAGG 

AACTTT CT CGGTTTT ACTTAAC C CAGGAC CC C AGTCCATGAGATGCTG T CACT CC CAGG CGG ATCT TC C CT GCTCAGTT AAG ACTC 
CT AGGTGTTT CAAAAT C C CAT CG AATT AAG ACTT AAT CATCACAGTTCT C CTAAG AATGG AT CTAAACACT AAG T ACATAAG AACT 
TGTTATTTCAGCATGCAAAGGACTCTACAGAATGGAATCATACAG^ 

5 0 C CGAGGGT CTTT TCAG C CTGATCCAGCAAG^ACACGCATG ATTT ACATTGTTTTGCATAG AAG CATAT ACCTTAGAGCCTTTCTCC 

TGGGCCCCATGTTACACAAACAAATGCTGGGCATGTGTGCATCTCCTCCACACTACAGAT 
AGTTGTAGTCAAAGCAGAGAATC^AGGTAATAAACrrcTCCAGTCATTTTTC^ 
XTGGGTGGTAACCC1ACAGTTGCTGATTCX5TGTGGTCTCTTGTGAAGTATGCCTCAGGGAG 
TTCTCACCACGCTCCCCTTTCCCTCCCTGCTCCTCCAGAAGTGAGA 

5 5 CTTATTCATTTCTAAGGTTTGAAGGATAATGGAAGTTTGAATTAAAAGAGACTGGCAAGCTGAGTAT 

CAG CACGCAGGAGG CTGAAACAG^AA?U^TTTX3 AGTC CIAAGACT AT AC AG AAC T ATGT AACAAGACCCTGT CTT ATAACCAAAGAT 
GAT CCAAAAGTG AATTGCAATGTG AACATCAGCCIAGCAG GAAAACCAAAAG^TTTT TGGAGT CAGGC CAGGG AACAGACAACCTT C 
CTGGTTAATTAAAAT ATTCTGGTTTG CCACAGTTG TGGTGGGCACC TAGAATCTG AGGACTCAGAAGG CTGAGG CAGG AG GAGGAC 
GAGCTAACAGCCTGGGATACATAGTGAGACCCTGTCAGCGAGAGAAAGAAAAGAGAGAA 

6 0 GAGGGAAGAGGGAAGAGAGGGAGGGAGGGACAGACTATTCTGGCCAATAAA 

CCCAAGAATTAACCATCACCTGAAAGAA(^TTTTCTGAGTCCCTGCTGGCTGGCAG 

CAGATTTTCCCACAGAACCAAGAGATGGATATTTACCTCATACAGACAGATTCATAGAAGCTAA^ 

AAAGGTCCCTAACTAATCCTGGTATTGTTGGGATTTGAACTCAAGCTCACTAAATAAAAATATTTTTTT 

TGATGATCAGATTATTTTCTTCTATGAATG&TAAAAGAGATC 
65 AATTTCCATGTCTTACCTTACTACCATCATTGGGATACCCATTCATTACTCA 

TGCAAACACTGCCTAGGCTGTTTCTGCrGTTGCAGAGTTATTGT^ 

GTACCCTCCCACCCACCCCCACTTTTTGGCCCTGGGGTTCCCCTGTACTGGG 

TTCCAGTGATGGCCGACTAGGCCATCTTGTCATACATAT^ 

TTCCACCTATAGTTGCAGATCCCTTTAGCTCCTTGGGTACrTTCTCT 
70 GACTGTGAG CATCCACT TCTXSTGTTTGCTAGXjCCCCGGCATAGTCTCACAAGAGAC^ 

TGCTAGTGTATGCAATGGTGTCAGCGTTTGGAAGCTGATTATGGGATGGjATCCCTGGA 

TCGTCACAGCTCCAAACTTTGTCTCTGTAACrCCTTCCATGGGTC 

TGGTCTTCGTTCTTCTTGAGTTTCATGCGTTTAGCAAATTGTATCTT 

GGACATATTGTGTGAGTTCCTTTGTGATTGTGTTACCTCACTCAGG^ 

7 5 TAAATTCATTCTTTTTAATAGCTGAGTAGTACTCCATTGTG 
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CTGGATTCTTTCCAGCTTCTGGCTATTATAAATAAGGCTGCTATGAATATAGTAGAGCATGT 

TTCTGGATATATGCCCAGGAGAGGTATTGCTGGATCCTCCGGTAGTACTATGTCCAATTTTCT 

AGAGTGGTTGTACAAGCTTGCAATCCCACCAACAATC^ 

GAATTTTTG ATCTTAGCCATTCTGACTGGTGCG AGGTGGAATCT CAG<^TTGTTTTTATTTGCAT TTCC CTTG ATGAT^CGGATGT 
5 AGAACATTTTTTCAGGTGCTTCTCAGCCATTCCG 
GTTATCTGATTTTTTGGAATTCATCrTCTTGA G TT 

TCCTTTCCCAAT CTGTTGGTGG CCTTTTTGT CTTATTGACAGTGT CTTTTGC CTTACAGAAGCTTTGCAATTTT ATGAGGTCC CAT 

TTGTCGATTCTCG ATCTTACAGCACAAG CCATTG CTGTTCTGTTCAGGAATTTTCCCCCT 

CACTGTCTCCTCTATAAGTTTTAGTGTCrrCTGGTTTTATGTGGAGT^ 
1 0 G AAATGGATCACTTTG CATTCTT CTACATGAT AACCX3CCAGTT CTGTCITTTTT CCACTGG 

ATGTGTTTAGCTCCCTTGTCAAAGATCAAGTGACCGTAG 

TTGTCTGTCACTATACCACTACCATGCAGTTlTrATCACAAT^ 

AGGTTCTTTTATCCTTGAGAAGAGTTTTTGCTATCCTAG 

TTGAAGAATTGAGTTGGAATTTTGATGGGGATTGCATT 
1 5 CCTG CCAATC CATGAGCATGGGAAATCTTTCCATCTTCTGAGAT CTT CTTTAATTTCTTTCITCAG AG ACTTGAAGTTCTTAT CAT 

TCTT^ CTCAGCCTGTTT AT CCTTTGTGTACAGAAAGG C CATTGACTT GTTTG AGTTAATTTT AT ATC CAG CTACTTCACTGAAG CT 
G CTTTTCAGGCITAGGGGTTCTCraGTGGAATT^ 
CTTCTTCCTTTCCAATTTGTATCCCCTTG^ 
2 0 TAGGGAGGGAGTGGACAGCCTTGTCTAGTCCCTG^TTTTAGTGGG 

GTTTAAAATACAAGCAAA(^GGTTACTTTGTATAGGCAGAAACTCAGTAACACACTAGTATAAAGG 
CACTGAAAAATCAAGCAGTCCTACTAAGTTCTTGCCAATGCGAGTCA 
GTTCATTCCAAAACAATAGAGTCGAGTTAGGAAGCTT<3TAGGTG 
AACAATTCACTTCCCTTC1AGGCTTTGGTTTTGTATGACATGAAATTATCTC 

2 5 AAAT ACTGTG AATAGAAACTGCAG CITTT (ZAG<3T CAAACAC CAATGGGCCCT AAAT TG CTAC C AT ATG AACTGTATTCCTAAG AAT 

ACAGCCAATGTCTTTGAACAAAATTATAAAATCTTAATGACCATACATGTTTCAGGTCATAGTGT^ 

ACCTTGCATGGAACTTGGCCTGTAGTGGTTAAAAATA^ 

TCACTGAACTTTTATGACTTGGTGATTAACAGCATGTGAAACCTTGAGA 

GT AATTT AACCATTCTATGCCTTACTT CATTTTAAAGGCGGTAACAATG CTG C C TGTCT C CCTGG CTT CTAGGGGGACTAAAAGGG 
30 ATACTAATGTATCTATGAGCTCTGTGAACAATAAGAGGTGTTGTTACTATCGCTGGTTGGAGGTGGTGATGTTTGGCAAAGAGAGG 
T AAAAATTG ACTTCATACAAACT AAAGAGAG TCAG CCCAGOT G CAGAT CG CAGTGGTAG 

GGCAGCCTAAAGAAAGGTTTGGTCTC1AAAAGAGGAAATGCTCTGGGTTGACTGCAGGGCCC 
GAGGGCAAGCAGGCCTGTCCATCAGAAGTTCCTrGCTGGGGTGT^ 
TCAGATGAGAAAACCACCTCCAGGTCCCGAGGCAAGAACTCCACTGTTTCTATATGG^ 

3 5 GAATGGCG AGG ATCTT CT AAAAGATG C CGGAG GGAGGG AGCACAACTCT CAT TCCAGAAG AACTG AAACGTTGACAT AAAAG CAG A 

CAACAGCGAGCTGTGTCTCCCTTGCAGCTTTATTCACC^ 

AG C CTGGAGTGCGATT TTGACCGCAGAGAATG CACTGAACTGGAAGGG AT AAG AG C CAGT TTG CAGAAGGTAGAAAATG ATAC G T C 
TCTGCAAAGTGAAAGAGCCACCCTGCTGGAGGAGCAGCTGCCCCTGGGAAAGGCTTTGTTCCACA 

ATTCCGGGCAGTACCGTTGCCTGGTCATCTGCGGGG CCG CCTGGGACTACAAGTACCTGACGGTGAAAGT CAAAGGTGAGTGTCCC 

4 0 CCGCAAGTCATCTCTGACGGAGGGCTGTGATAGCTTACACAACAG CATGGGT AGCTCAGGCCAAAACCAGAGACCGGGT CTTAAGA 

GGC^AGCCCCTTCAAAGGGGCACCTGAGCGGAAATGGTTTITGGCATCTGGTATAAGATCT 
TACAATGGAGCGAGGGGAAGCCAGGAAAATGAGCAGACACCAGGCCTCTTCCT 
CTACTAATGGAGCCACATCTGTGCCTTCAGCTCCATTCCCAAGGGGAACT^ 
CATACTGTCCTGACTTTCCAAGAGGGGAAAGCTTTTGGAATTAAAGT 

4 5 AGATTGG AAGCAACCCT CAAAG ATTTATTT CAGTGGTTGT CTTAGTCAGGGCTTCAAGTGCTGTGATGAAACACCATGACAAAAAA 

GCAAACTAGGAAGTAAAGGGTTGACTCAGTTTACACTTCCCAGATCACA^ 

AAGGCAGGAACTTGGAGACTX3GAGGCCATAGAGGGTGCTACTTACTGCT 

AACCCAGGACTACCAGCCCAGGGtfVAGGCACCACCCACC^ 

CGCTGGATCTCATGGAAGCATTTCCTCAACTGAGACTTCTTTCTCTGTAGCGACTCTAGCTTG 

5 0 QC CGGT ACAGTGACT ATTTGAAATT CAC CTGG ATCCTGT T AAG ACAAAAGCAT AT AAGA^ C CACTTTATTCTGAGC CACTA 

TGATGATTAAGACAACAATTGCTGTGTGCAAAGTTTGCCCTTGTCTTAGA^ 

TGCTGTGCAGGAGTAAGGATCTGAATTCAAAGTCACATCACCTGCAG 

AGTAACAGGCAGGACCTGGGAGCTTCCCTGGCCAGTCAGCCTAGCCAAAACAGTGAAC 

GAACAGTAAGG AAAGCCAT CTT C TG ATC C TT ACAT GGG CATG C CCAC CT AG ACAC AT ATAT ATG CACAGAG AAATATACACCACAT 

5 5 ACTCCTTATATGGACIACACACTCCTGCACACACATGTGC^ 

CCAGGGAAATAGAAACCATGAAGGATATAGAAATCTAAAGTAGAAATTAATCAATCTTTTAATGTTAGAGACTGAA 
GGGAACAAGGTAGCAAGTCAC^TCAGCAAGAGTACTAAGGAAGACGTGGGTGGTGGGCAGTGGAGCCCC 

AG ACCGTTCAG TG AG AGCCGGTG ACAGG AATG AAATGATT CTGCTG CT AGAGACACTGT CAGAG CAGAG CAG AGAG CTAAGTC CTT 
TAATTAACAAGAAAGCTGAGAGTCTGAACGGTCATGTCGCCAGCCCT^ 
60 ATGTTATTTTACCAAGCCACACAAGTGTCTGAGGGAAGAAGTTGATGGCATGAGAATOT 

GTG CTT ATG GG AAGATGG CAT CTGAAGG AAG AG GAAG CT C CCATTG ATAG CT ACATTTAACTACAAGTTGGT AG CTAGG AG ATTGC 
T AGT ATG AG AAAAACAG ATGTTGCT AAAG ATAAAAGG C CATGTTTT CATTTGG C CT ACTTT AGATTT CACTGTG CT CT AGCCATAG 
TGGAAATCAGACTAGAGGTTCCTTAAAAAACTAAAAGTAGTACrATCGTTTAATTGAGCTATAGTACTCCT 
GGACCCTAAGTCAGCCTACCATAAAAACACACACCACATT^ 

6 5 CAACAAATGAATGCATAAAGAAAATGTAGTGTGTATGTTTAATGGAGAATTCG 

GGAAAATGGATAGAACTGGAAATCCCTGTGTAAAGAGACATAATGAAGACTTG^ 

TTTGT ACT ACCATTAT ATG C CAT AGG AAAAATGG CAGGGATCTTTAAAAAG C CACATTTCATACT AATAAG AATCCACACT CTAGG 
G AGACATGACAATTCT AAATGTAT ACACAT CTG AAT ATAT AGCTTCAAT ATGTT AAACATAAATTGG AAAGATG GAT ATT C CTTCC 
TTG AAG CCCTAC CCT C T CT CAG AAGTT ATT TTTC T AACITTTTTCTTTTAAATT ATT TTTAT ATTTTG AAATT AT A^ 

7 0 TTAATTI^TTTAGCACCCCAAACAjCATATCTATCTT^ 

AT ATAC ATAAACG CACATAAAT ACACACAAACIACATATATG T ATACATAT CATACATACAT AT ATATCAT ACAACCTGCT CAATCT 
GTACAATACCATATATATATACTCAGTATTGAAT^^ 

TTCCTTAGTTGTCTATAATTGCCTTTAGT^TTTGTTTGGAGTTGAGGCCTCCT 
TTCTATTTGTTCAAAAACAATTGGAAGGAGAAGAGTAGGAGGAGGAGGAA 
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TTTGTGCATA.TGTGAGTTTTTGGTTCATCTCTGCTATAGCAGAAAGGACAAGA 
CACCTAGTTAACATGTCTAGAGCACCAACTGCAGAAGACMGGTGCTTTCTAGG 

AG CAAAGAAGAACAAAGCAAAGT T ATTG CAAGAGAAG AACTTCT CATTTT 
TATTCTTTAATCTTTTTTTTTACAGTTCAGACT^ 
5 CCCCACTCCTCTOTCTCCACAAGGATGTCCCCATGCC^ 

TTAGGTACATCTTCTCTCAACIXyU3GCTCCTTGGGTGTAAA 

GCAGTCATGATAGACCCCAATTTGTAAGCACACCATAGCATAAGTAATAGTGTCAGGCC^ 

ACCCGTCACTGGACCTCCTTTTCCTCXaGTCCCTTCTCCAGTTTTGTTCCT^ 

TCAGAGTTTGTGACTGTGGGATGGCAACCCCATCCCT^ 

GGTCCTCCCACCTCCTACCTCCTGAGGTTGCCTGTTTC 

CGATCATATTCCTCTCTTCCCCrrCCCrGTCACCTTTCCCACCCAGATCCCATCCCCACCCCC^ 
TGGG ATTGAGGT AT CCATGGAAACTGAACAG CTCTCT ATTCAAT AACTTGGTCAG^ 
TCGAATTCAATGAAAATGAGGACACAGCATACAA^ 
1 5 CCCTGATAAAGAAATTGGAGAG CTC CT ACACTAGCAACTTAACAGCACAC CTGAGAGCTCTAG AACAAAAAGAGAAAAAACACACT 

GAAGAGGGGTAGAAGGCRGGAAATAGTTAAACrrCAGGACAGAAATCAACCAACT 

AAACT AAAAGCTGGTTCTTT GAGGGGATCAACAAGATAAAC C CCT AGCCAAACTAAAT AAAGGTCC CAGAGACAGTATG CAAATTA 
ACAAAACCAGAAATGAAAAGTGAGACGTAAAAACAGAAACTGAGGACATTAAGAAACATCAGATCCCACTACAAA 
AACAAAACTGGG AAAT CTAGATG AAATG CAT TGTTTT CCAAACAGATAC CACATACCAAAGTT AAATC G AG AG CAGGTGAACTATC 
2 0 TAAACAGGCCCATATCTCCTAAGGAAATAGAAGGA^ 

AAAACAAACAAACAAACAAACAAAAAAACAGCCCAGGGCCA 
CTTGAACTTTrAAAAGTACTTCCACCTCTGAAACrrCATTTGCT 

TAT ATTGTTGGGG AT AAT AT CT CTCCAACT ACT TTC CAATGAAAAATGGACTGTG AAGGCATTT CT 
TGGGGGTTTCCATCTCTTCCTGAAACCATGATGGTTCCTTTAGCT 

2 5 CCCACAAGGCAAAGTCTCAGCCAAGCACCTGGCTCCT 

CTTGTTTGAAGCCCTCCCATGGAAGTTGCCTGGATCGCC^ 

AAGT C7^CAAAAGTAGAC!AT AAGAT ACAGTATGAT ATAGTGAAAT ATTTCTAAG AAG ATGCTG 
TCTAGAGGTGAAATGAGTACTTGACATTTTTACTTCAGAT^ 

GGGAGGACT AG AC CAAATCATTTAGTAG AC^CTTTCAGAGT CCAAG CTGGGGATTG CTAG ATTGAT AAAGAGAA 

3 0 GGCAATCTTTCTGTTTGTGTTATTTCTTGGCCCATTGTGGTAT^ 

CTCCATCTCAAGGATTCTACTGTTTCCTTGTTTACIAAGCTACGAAT 

ATAAGATAATGGTCGGGTTACCA-GGCTAGGGCCACATTTTATGGCCACTTAACCCCrCCTCTTC^ 

CAGCATACAAGAATGCAAAGCCATATTGGAAGTGAACIATCATGTGACAGGTGAATGGTGGCCT 

TCCCTATTTGGAGC!ATCTCTGGGTGACTTACATGGGAGAGC^TGAATAAGAG 

3 5 ATCTAGAAGCACAGATTTTCACTAGACCCnXSTTCTAGTTTTATTTCTC 

CITGAAGGGATTGTTTGGCTTAGAGTTCCACCT 
AGGAGCIAATGAC1AAATGAATGCATGCACATTTGCTT 
CCTAGGAAACTGCACCACCCACAGTGGGCTGGGTC!TTCCCATAT 
AACTCAATGTGGACAGCCCCTCATTAAGGTGCTTGCCTCAGAT^ 

4 0 AGACCCTAATCGATTCATAAAAGATAAGAAGATCTTTCTTA 

CAGTGCCCTCGGATTAATCCCTGGTGATGGAATTAATCTCTGGGGA 

TAATAAGTT ATT ACIATGGAAGCAGAACAGAGAAAGACAGAGAGG C AAAT CACATGGG AAAG G CT CTG AG CATGTCTTCCTCCTCAC 
TGTTCTACATTAGGTAAAAAGGTCMGCCGCAAAGCTTAlZAGTTAAGA 
TAGGTATTrAAGGTTTTTCACATAACTGTTATCCACCTGGGTAAAGTCA 
45 CCAG AGGG CAG C AAATGT AAATT CATC CTGG AAT ACGTAATGAGGT CAAGG CT AG CC TAGG CT ACAT TG AG AG AG C CTGT CT CAAA 
AACAAAATCTAAAAAATATAGTTGCTATTAGTT^ 
TTAAGAAACTACCTTATGACACCTGTTATGATTGCCTTTCACCACT 
ATTTATTTTAGCTCACAGTTTCAGAGGATCCAGTCCATCATGATC^ 

TCTCTTTAGCTTGTTTGGGTTCCCAGCCTGTGGGATGGTGCTGTCTATGTT CAAGG CAGACTTCCCACCCACTAGATA^ 

5 0 TCTTGTCCAATCAAATTGACAAGATTAAC 

CTTATTTAAGGAAATATTTCCCAGTTAATGAAAGGCAGGACTC^ 
CTTCCAGCTATATACATGTGTAGCGGTATTCGTTAAT^ 

CTCCAAATATCTCCTTTTGGGTTTAGATTTCCGTGTTCTAGAAAGTAGAAAATCACACATGAATAGTTCA 
AGAACATATCTGTACTGAGAAATAAGAAACACACCCCAGCTCTTTTTTTTTT'TACAGCT 

5 5 CTAATCACAAAGATAATGGAGGGAAAGGAAGAACTTTTGAGTAAACTTTCTACT 

TCTAGCCCTAACCATTCCCTCAGCTTTCACC^TCCTTCA^ 

AGCAC!AGTGCTAGGATGTAAATTTGTAGCCriTCCT 

CTGTCAACCTGTCTTCAGTCAACCTCAACAAAAATTATG 

CAAACTATACAAG CTAATGAACCTGAGACATAGGAAATATTTTTGAGTTTAGAAAAATGTAT CACCT AAGATCAATAGTAATCACG 

6 0 TGTTTCTTCTTCCATTGJVSTACIAACTGTACAGCAATCCAAAA^ 

AGACACTAGGATCCTGGAGGTT CCAGGTACAGGGGAGGTGCAGCTTACCTGCCAGGCTAGAGGT TATCCCCTAG CAGAAGTGTCCT 
GG C AAAATGTCAGTGTTCCTGCCAACACCAGCCZACIATCIAGGACCC CCGAAGGCCT CTACCAGGTCACCAGTGTTCTG CGCCTCAAG 
CCT CAGC CT AGCAG AAACTTCAG CTGCATGTTCTGGAATGC T CACATGAAGGAGCTGACTT CAG CCATCATTGACCCT CTG AGTAA 
GTGTGGCCCCCATTTCCTAGTCCTATCAATTAGAGTTCAG 

6 5 CATTCAGAAGCTTACTTCTGAATAAGCTTTCT 

TCCTCTCCCAG CTGACTGGTTCLAACCTG AGTTCTCTTGG CTTCTGACTGAATTGCCCACTTGGCCTCAAACTAACCGGCAATATGT 
TTAATCTTCTGTTTCCTTCTCAGTCTCTGGATCGTTCTGTCTTCACCT 
GTCTCTCTGTTGCTGTCATCTCTGTCTCTGTCTGTCTCTCTCrCCCTCCACT 
ATGAGAGTTGGGCATATCCTATCTCTGACTCATTCTGTCAAGTCTTTCTCA 

7 0 TGGCTGCTTCCTTCTACAAACTAACCAAATCTTCAAGGTTTGGAATTAAAGGTGTGTACT 

GAG ATTAAAGGTGTGT AATT CCAGCCAGATTACAT AGATCT AGGT TT TTGGATGTGATCC CTTGC CAAAG CAGT CATGTTGCTGGA 
TTAAAATTCCTCTACAAGAAACAGATGGTCCGCTCATATG^ 

AGAATATCGAAGAACTACAAACATAGAATGGGGCAAAGATCTCTACTGTTGAGAAAGAG 
GAAGACAGCTGTGAGGCAGCTCCTCTGGGAGTGCAGCTTCGAGGAGGGAAGTATGTGACATACCACAACCACT 
75 AGTGTCCTACTGGTGCCTCCCATTGGCTGGTTrGTGTGTGTGTGTGTGTGTGTGTGTGTGT 
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CCCGTCCCTTCTTOICCATGTGTTCCCTG^ 

CAATAGAAAGTCTGTATT AG CCAGCTGTC CAT CATACTX3TGTGTTTG AG ATTATGGTGTAGTTAAC CTTTTTCAGTTAACAAGAAC 
AGTTAC CAGAAGCAGAACTACAGAAG CCAGAAAT AAAG CAAGGTTAGTTASTAC^ 

ATGGAT AAGGTCTI^GTTTTTATTTG TCAGATGGGG CT CATGGAGT 
5 CAGTTGTGCT AAGGTCTGAAGGCATAGTAACCTCTGGAGCTCTATT ACAT^ TGAGTT AAACCAT AGG 
CTCATTCT ATTTTAGTTCAAGGTAT AAGGG CCCGTGGTTGATCAAATCAGGAGAATGACATG TGCAGTGAG CACTAATAGCAGATA 
GTTAGCCAAGGGGACTACGAGAACAGACACTGGCATCAATTATCT^ 
TCCTTCCTTCCTTCCTTCCTTCCTTTCCTTTC 

CT AATT ACTC TTGATCAATTAGCAT AGAAACAACAGGCT CTTCCCAG GGCT AGACATACTTTTCTT CATCTCGTGAGAAGTCTAAA 
1 0 CXTTCTCCAATTTT ATAGGAT ACTTTCTGCAGGGG AATCT GGTT TGGTGAT ATTGCTGTGAT 

CCAGATGAAT CAGC CCAT AAGGCAGAATAG CAGGCATCAG AATAAGGTAAG CT ATT ATGTTAAAGGTTTTGTATATT AGG CAAGTT 
TCAGGTCCTTGTAAGTCCTGTAATATTAATTTGTGTTGATCCTAGTCA 

AGTTCCTATCCCTCTGTAGTATGAGGG CTGGGCTGTCT AAACATAACCAGCCATCCTGG CCAACTTCTGGCTGGGAGTGATGAACT 
CCCGGGGGCCCITCCTGTGGCATAAAGTCCGATTCCCCAGGTTTTCCCTGTTTCCCAAAG 
1 5 TTTAGGAAGCCTCAGTTTCACAACCCTAATTTTTACAATGGA 

GGTCACAGTGCAGGAGACAGTCCATTTGGCAACAGTGTCTGTGGCT^ 
TGCCAGTTGTCTTCACAGCCATAGGAGTAGAAAGAATCACTGTGTA^ 
CACrTCACTTGAACAAATAGAAACTGGTGGTGAAC^ 

ACTGGACAGAGGAATAGTCAGAGAGTTCTGTCAACTGCTCGTCTCTGAGCTCGAG 
2 0 TCATAGGG AGTAAGTCCCTC CC CATATGGGGTGAAACAGGT CTATGCAAAGCAAAGG AAGGAGG CCTATCCACC C CTCTCATTGGA 
CACTAATGAGAGTTTTGTTAGGGTTTCTTTTTATGTCCTGGTTATTCTTT^ 

GAAGTT TTCAGT CAGTG C CTAAAG CTTTAGCTATT ACTTAAT AGGTTTTGGG CTGTAAGCTACGTT AT^ CC CATTGGGA 

GGGGGAGGCTAAATCAGGGGACGTGGAGGCGAAAAGGACCTGGAGGAGTTTTTAAAACT 

GGAAGTACTGGGTGGTGGTGGTGGTTGTTGTTGTTGTTTTGATCCTC^ 

2 5 TTTGTGATGTGTTGGCCTCTTATTTGGACCCCTTCCTGGGAAAGGGCT 

G GTTG AAG CCTC CACACTAGGCTGT C CAGTCTAGGTATAACAT ACCTTGTTTTGAG CAG CTCAGAAGTTTTGTGGACCCCAAATGA 

GTGGCCTGGTAAAGGTCAGTTGCCAGAATCCTGCCTGTTGTTTCTC^ 

GTCTTTAAGCTGTCCATTTCTCCCATGGATTCTC^GCCAGTACCAGGTCA 

TT AGGG ACATTTGTCTGGT AGCCAGGT CAGCAGGCAGG CA CAAAT AGCCCCT AGG AGG AC 

3 0 AAAGATTTTCTAATGTTC1TTTCCCCAACACCAAGAAACCCTCTTTCTCT 

ATAGCAACTGCTGGTGTATATGGTAGTGGACITTCCCCA 
AAATTTGGACCC^AAATCCTTTCCGTTAAAGT 

ATGAGCACTTCTGGGTTGTCAGCAGGAAAGAAGGCAGCAGGATAGGCAGTGGTGGGGTTTTAAAACTGGACTCG 
GAGTTGGGCCTGGAACTGGGTCACAC1AGGCATTAGACAGAATTCTAGTGCTCCTTA 
35 TGTGAGCATAACATCTTGACCCAGAGTTAACATCATTATGATCATTAT^ 

GTGGGCCATCCTGTGGCTACAGGGTCCAGTCOTTTAGACAGGTATGTCACAGGGCATTT 

GTTTTTAAAGCCTCAAATGCCTTTTGTTTAATCrc 

TATTTCCACAAACACTCGTATCCAGAAGCAG CGATGTCCTG CTAGACTGTCTCTCAG ACCTCAG ACTCTCAAACCTGTCCGTTAGT 

4 0 CTTTGGAATTGGGATCTGACAGATGGCTGCACTCCTACTCACACAGGCT 

CCTCCAGTGTGTTTTCATAGTTCGCGTGAATCTTCTTTGTATCCAGATCATA 

CATG ACG AATTAGAAACATAATCAGTCATAAAG AACCT CAG CT C C C T CATCT CAGC TC C T AGT CTACGT AG ACAGAC CAG ATGTGA 
CACCCTG CCTTGTGACCCCAAACAATAG CAGCAGG AT AAACACAGT CACATC CTCCCAAGTGGTC CTAC CTCTTAAAACCATTG TC 
TAGCTGATTCTTCTCATTCTTATCAACCGAGOTACTCACCCACTGCTGGCTGCTGACTGCTGTGAACCACCT 

4 5 T AAAACAAATGCAG AAGTGAGG TG CGAGGATGC CTG CTACT CTAG AACAG AAGG AAGG CAG AG GGTG CCGG ACACTCT AG AACAGA 

AGG AAGG CAGAGGGTG C CGG ACACTCT AG AACAG AAGG AAGG CAGAGGGTGC CGG ACAC T CT AG AACAGAAGGAAGG CAG AGGG TG 
CCGGACACTCTAGAACAGAAGGAAGGCAGAG GGTGC CAG ACACTCT AG AACAGAA CAG AGG GTG C CGG ACACT CT AGAAC 
AGAAGGAAGG CAGAGGATGCCTG CTACT CTAGAACAG AAGGAAGG CAGAGGATG C CGGACA.CT CTAGAACAG AAGG AAGG CAG AGG 
GTGCCGGACACTCTAGAACAGAAGGAAGGCAGAGGGTGCCGGACACTCTAGAACAGAAGG 

5 0 AACAG AAGG AAGG CAG AG GGTG CCGGACACT CTAGAACAGAAG GAAG GCAGAGGGTGC CAG ACACTCT AG AACAGAAGGAAGG CAG 

AG GGTGCCAGACZACTCXAGAACAG AAGGTGGAGGATGT CTGACACT CTAG AACAG AAG ACTGAGTGCACTAAGTTTTGGT CAG CAG 
GTAGGGAGTCATCAGGTCATTTTCCTTTAGAAACAGGGCTGTCCT 

AGGTCATGGG ATT AACCGTAAAAT ACCTG G T AGAG CT GG CAG AAAAGTG AAG CCCTCTTGGGACCG CT CAGGTAAGGAG CTAG AAA 
AGAACGTGGACTCTGAAGCCAGACTTCAAACTTTCAAATGCTATCTCTG 

5 5 CCTTCTGGGAACTCAGTCTCACAGGTATACAGTTACAGTA 

TATGATACTAACAAGGAGTAGGGAGGCCACTTTGTCACCTCCAGCAAGACTCCTGGCTTC 

GAATAGGATGGATATTTTTTTCCATCACATATCTTATCATTGTTAAAGGGAATGCAGTTATCTTAAAATAATAGAA 
AGGTCACCCATGCTGCCATAATGAAGATTCTCAAACATGCAATGACTCAACA 

AGAG AGAAAACGCTT CAGG CAGCTAAACAACTAC CTTCCGT AAGT CATTCAATG AC CCAGG TTATTTCCAT ATTGTTACT CTGT CA 

6 0 TTCTCCGGGATATCGTCCCTGTTAAATTGTCAAATCAGGGTCA^ 

G AAG AGAACI^TGATGTTCnAAGTCAAG AT CTGGAGAATT CACTCTT AG C CG GAATTT AGTTT ATACTACAGGCTATGTTTAAGTG 
CT G ATGAGGT CTAG AAAAG CAGCC CCTCAGAGAG CT CT ATGTC T AACTATT AAC CT AGCACTGGGAAGG AAGGAGCAGGT AAGATT 
AC CAG CAGGTTGCT ACATGGTAGCTACT AGAGGGAGTAGGTAAGATT AAT AAC CAG CAGGTTG CTACAAGG TAACTAACAT AGACT 
AGTCAATTCAGCCTAGGAACTCAGACTTAAGTCTCTCCTGCAGTGGT^ 

6 5 AACTCTATCCTCCAAATGCATTCTAAGAAAACTAAAGTCTTCAAGGGTAAGCAGA 

CCTGGACAGAACAAGGGTTCCCAAAGACGCTGAGGCAGGGGCCTGGCTCACAGCA 
AGCAGTGTG^TCAGTTAGGGTTTGCAACACACCTCTACCTC 

GCGCCATCCCCACTCTCGGAAAAACCCATTCTTCCCAGCTTAGCTCCGCCAATGGCCAGCTGGTGCCTTTC 
ATGCTCCCAGCCCTTCCCAGCTAGCCACCAGCTACTGCCTCTCT 

7 0 CTTAGTACTTGTCAGATAGCATAATTGCTGGGAGCAGAATCrCATTGTTCCCAA 

CCTCCTCCTCCTCCTCCTCCTCCTCCTCCTTGGACCCGGAAGTTCTACCTTTACCCTTC^ 
ATTGACACAATCAAGAACCAATTCAGCTCCTCTCCCTACATCTACCAAACTC 
CTTGGATAAAGATGATAGCTCTGGGGCTTGAGTTTTGGATCAAGAA 
ACAATTACCATAGACAC^TTAGCATCCCTGAGGCTTCCTTG 
7 5 AGATTTTCCCATG GT CCCAT ACTCTGT AG AGGG AAG GTCAAGAGCAG AG CAGCCAG CAG AAAT AGG C ATAGGGTAACTTGTGTG C C 
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TATTCTTTGCCATGTTTTCAACATCTCTGTAGCCCTC^ 
TTCCCCAGCCTTTAAATTAAATCGCCTCrTC^CrAAATGGATGTGTCM 
AACTTGTATCTGAATGGCTGGATTTAACATGTCA^ 
TGAATAACTAACAGTGCAGAGTCTAGGCTCTTGA^ 
5 CTTGAATGAGAAAATCCAGATGGTGAAGAAAAGAGAGATGGAGGTA^ 
AGATGACGAATAGGTGGGTGGGTGGAAC3VACAAAGCGACAAAAGGACT 
CATTTCTCAGCCTTTCTTCTTTCATGGGT^ 
CTCTACTTGAGAGAGATGTGGAATGAAGGCTTAACTTCATTTC 

1 0 TT CATrcCGGCCTGCACCATCGCITTGATCTT C CTGGCCAT AGTGATAAT CCAGAGAAAGAGGATCTAGGGGAAGC TGTATT ACGG 

AAGAAGGTGAGTGACTTACAAGCCATGTAAGCCTGTGGCCT 

TGGTrTTTTTGTTTTTTAAGGGGAAGTTTGTTCCAG 

G AATCAG CCAG ATTC CCAT CAATGATAAACAATTTTTTAAAAAAATGTGATATGTGTGC^^ CACCTTGTAGCCTT AAAG 
1 5 AAAAAATTAAATACTGTCATTTTTCAGGGGAAGAAATGAAGCTAGAGGTCATCGTT^ 
ATT ATATTT CTTTTGATAT ATAG ATTCTAGACTT A 

AATGATGGAAGTGTATATGTGTGTGAAAAGCCATGATGATTAGGCTGGAGAAA 
GACTTGAGTTCAATTCCCAGCATCCATATCAGGAAGTTCCTGATC^CCTGTAACTCCAGCT 
GCCTCCCAGGGAGCTAAGTTAGTCCACCAGAGGGCCCTATCTGAGCCAATGCTGAGGGCCTAGTG^ 
2 0 AACTCTGGCCGACAGGAAGTCCCACCTGCCTCTTCAGC 

CATAACT AATT CACTGC CACTGTGGTGTAGAG CTCTTGAGTC CTTGCTAG AGCAGAGT CC CATGCAG AG CC CATTAAACTT T CATG 




TTT CTTT CT AG AAGC CTC CTCGGCTC CTGTACAGATTTATTCAT CAG AT AAT CACAACTTATG ATGCCTGCTG AGT ACTGGG AGGG 

2 5 CAGCCATTGCCAGTCCATAAAAGTGTCITATTAACTATTCCnrrrCAT^ 

CAATATGTT C CAG AGAAGTTGGCAAATT CTAGTTCTGTTTT ATCAG CTAGGCTT AAGTATATGGGAACTATG TTTCTCTCAT CAGT 
AG AT AT ATGTTCAAT ATCT AAAATGCT ATGTGGTAT ATATAT AT AAGAT AATG CCCAAT^ CTTTT TTCAT 

CCCTGACAATTTGATACATGCATATATATGAACCTATCATG^ 
CCTTTCCTACTGACGTTTTCTTTCCAACAAGTCTCCTCTCT^ 

3 0 GAAATTGTTTGCGGGAGCATGAGGCATGAGCGTAGGTTACTTATTAAAGGTCAACT 

CACCCCTTTCCATAGCAACCATTACCTCCCAATAGTGCCTCAGAAGGGGGCATCGTGGATTCCT^ 

GCAGGACCAATCTTGTGCATATTTGAGCAAATATACAAG CTATG CTGGGTTCCCGAGTGC^CCAGCTATGTCTGACGTTTTGTTGA 

TCCTCCCCCCACCCTCTGTCTCrCATATTCCTTCCTCCCCCTCrrrCCCAT 

TCTCATCTAAGGCAGTGCTTACTCTTGGATTTTGGTCTGTTATGGGTT^ 

3 5 GGTGAGCATGACAGACCTGCTTTGGCCATTTGAGTAGCCATATTTAAAGC^ 

ACTGCAGAC^GCCAACATC^CTG<!ACATCACTGCCCAGCACTGTGATAAA 
CCCAGGACAGAAAGCAGTGCCTTTATTTTGTCCATGAAGGAACTG 

T AGACAGTTG AACTCT AAAATG TGTGACTCTTTTTATGGCAAAAG AG CTT AAT TG AAACCAG CAAT CTGAATAGT ATT ATC CATAC 
AAGGGAAGAGGAAGTCGTAACCCAAAGAATTAAATGAGTTOTCATTTACATGCTGGGCCAATGA 

4 0 AGAAGTTACCTGTACAGAGTCTAGATCTGTAGAGCTTATTAAACTAC^G^ 

CATGCAATGAGCAATAATCAATTCAAAGATACATATATAACAAATGCAAGGT 

CTTCATTAAAGCATGGTCTGGACATACAACACAGTCACTTTACACTGT 

AAACACCTTGGGTCCAAGTTGAGACTATTCTGTGTTATGGCTTGCAGTGTTTT^ 

AAT AGAATAACTT AATTCTAAAAGTATATTTCAAGAGAAGOT C 
45 TCTAGCTGTCTTGGAACTCACTCTGTAAACCTGGCCAGTCTGGATCTCACAATGATCTGCCTAC 

TTAAAGGTGTGAG CCACT ACATGC CTAT AT AAAAACACTTTTAAT ATl'TTi' CTACCACTTTT C CTCAAT ATGTGTCAAATT CAT AT 
AACTTTATATTTAGTTGCATTAGAAATTTTATTTCTAAAGCTGATTTAATTGCTAGCTTGAGTTTCATTAAA 
TTTTGGGTTCTTATTAGGACAGAATTTCCAAAAATTTCTTAAGTC 
TGAGAGGTGTTCTCTGCCATTCACCATAAAACTATTAATTTTAAATACAACTAACT 

5 0 ATAAATGGTAGAATTATATATGCACAAAGAATTGCTTTTAAATAAGTTAA 

ATACACTCTCACACATACATCCTCCCTCACATACAGTTTCACACAC^ 
ACACCTTGTCACACACACCCTCTCTTTCTCTCACACACATACCCTGTCAC^ 
ACACCCTGTCACAACACATACCCTATTACACACATACAT^ 
CACACACACATTCTCCCCCACATACAGTCACATA^ 

5 5 CC C CCTT CCACATACACAT AGAATAAAAT AAAATATG AATAAAACTTGGG ACT ACTATGT^ CC C ATTT CAT ACAG AC 

CTCTTAGTTACTATGTAGCCCCAGCTAATCTC^ykACTA 

TGGCTTC CTATTG CAGTT TTT AAGAACT CCTTTACAGTGTAT ATT ATGAAAGT AT CAACAACACTTTGAAAATAAAAAG CACAGAT 
GTAATACACACACACACACATACACAAACAAATCTGGTTCTTC 

TGTGTGCACAAGTGTGTGTATGTGTGCATGTACATGTGTGTGTACATGTCTGTGTGC^ 
60 GTACATCAGTATGCTCTGTGTAGAATACAGATATCAAC^ 

AGTC!TTCCATT ATGACGTG AGGCTGTTCACTGAG CCCCAGGGATACATTCACCTCCCAGG CATGTGCTACTGCACCCAG CTCTTTA 
CACAGATGCTGGGGATCAACTCGGCTTCTCACCCGTGTGC^ 

TTATTAG AT ACAACAAT ACAGCACACTTCAATAACAG AAAA^ C CAT AATTCTT ATAAAAGT CTCTAAGGTTTT AGATT CG AA 
TATTAATCCTCGTTTGATCATGAGCTGGTAACTAGCA^ 

6 5 TGCCITAGAAGACCGAGTCGGGACTTTGCCAGACTCTAAACCAAAAGTCAAAGGCGTTAT 

AATATTAACCCTGACCTCIGGCAGTAAGGGGGCTGCCACTGAAAACCGAGACATTCCAGAGCA 
TTCCTTT CAAATACCAAT CTTCTTGAAGG AGACT ACCCTGGCTATGGCACTGAATTGATCCTTGACTCTGA 
TCAGGATGGTCATGATGGAACTGGTAAAAAGCCAGATAGGCAGGAGC^ 

TGATAGG AAAT AGCAAGTG CT ATTTAAGGAAG AAATAAAACTAG AAAAACCTGGTAACAG CAACT CAT CGTGCAAACGT ATG CTTT 

7 0 TGTTCTCTTAGCAACTCTGACATC^GTCrTTTACACCTTAGTTA^ 

GCCAGGTATAAAG CGTCAGTCAAACAGGC C^TTCAAAAAAGCCACGTGGG AACCTATTTTACAAGTATATATATTCTTC^CCTAAC 
CTATGTATTCATAGAATCTATATATTCATATTTTAAAAGTTTAATTTAGAGTTACTCT 
CAGAACCCATAACAAAAACAAAAAGCCAAGTAAAGGTGTGGGTTGATGC 
CAAAACAATACAAACTATTGCCATTGCTCTTGATTGCCCCCCAGAACCTGATG^ 
7 5 AAATATTAAGTTGCCCCTACCCAGGAAGCTTCTTCCCTGCTGGCCACCTTTCA 
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TGGGGfTGAGCCATCAACACCrTACCCAGC^ 
GTAGTAGCAATAATATCACAGTTATGAACAATCTGA^ 
GCCTGGTAGTTGGTATTGTATCTGTGACCAAGAATCCATGGTTGGCGAGCT 
TGCTAAAGGGACAACAGTACC^TCATTTCTCTAAATTTCTA^ 
5 AGTTTCTTTGTGCAGTGGACAGTGGTTAGCAAAGAAACTCATAATGGCT 
CAAAATAGGAC^TCAATATCGCACCCTTACCTCTTTAGGTTTACAGACT 
AAAGGGTTTATTTGGTTTATATTTCCACAT^ 
AGGCAGGAGCTGATGCATGCTTGCTCTACATGCTTTCTTATG^ 
CTGGGTTCTTCCCCATGGATCACTAAATAAGAAAATATCCT^ 
1 0 GTTTT CTACTCCACCTTTGACCATTTAGTTCCCAGATAAAAGATACTCATAAC CTTT ATATTT ACAAT AAGC CTTAATCAG CACAA 

GAG CTGGG CAGATATTTATTCTCCATGCT ATTATGT CTGCCATGTTTTGTCTGG 
GCTGCTCTTAACTCCAGTTAGCCAAC 
TCTCTTCCCACCTCXSTGGTTCTCCTCTGACCCAAGCCT^ 

CAG CATCTTTATTCAC CAATTAGGGATAACTTGGG AGTCAAGGT CACAT AG CATTATGTGG GT CTAT ATG TGGTCTCTGGGAGTAA 
1 5 CCAGTATTTCGCATAG CAAAAG ACCAAACCTTAACATTATGG AGGCATTTC CT CATTTGAGGCTC CTTCJTTCTGCAATGTCT CT AG 

CTTGTGTCAAGTTGACACAAAGCCAGCCAGGACAGGGGTC^ 

CAG ATCTAGAAT AGTGTTTTCCGGACATT AATGGGAC CACTACACTCAT AAGTTTACAG CAGCT ATAG AG AC CTGTATAAG ACCTG 
AACTTAGAGGATGCCTTCACAACATTT CAGCACAGAT CAGGGAGGGG CT CGTGAAC C CCAT ACTCAGCTGAG AAGTT ATTG ACAGGT 
GATGTCATCTGAGGGAGGGAGGATC^GTGAAGGGTATGGGCTCTGGTATGTTG 

ATGAAACGCTCAATTAATAATGAGATATATATATATTGGTTTra 
CACTTTGTAGACCAGGCTGGCCTCGAACTCAGAAATCC^ 
GCCCGGCTAATAATGATACTTTTAAAGTGAAAAGAGAGGAGT^ 
CAAATTCCCCCACCACAACAGAGAAGTGAGCAGAGCCAAGGGTGAG 

2 5 TGAGTTGATAAATTCAATTOTCTCCCAATCTCCACGGGAATTGGTTCT 

GAAAATGGCCCAATGTTTG CAGATAACC TAT AT ACATCTT CACATGGG CTTGAAATAATT CTG ATTCCTGAAAAATACCT ATG ACA 

GCGCAGATGCGAGCTTTGTAAATAGTAGGGACATTGCATTGTTTATGGAACAATGGCAGAA 

TCAATGCAOTCCCCAAAGACC!AGACAAGAAGCAGCTTAGGTCA 

CAGTGTGAGCATGTATGACAATTG CTTGCTCACACCGCCCTAAGGC CAGAAG CT CCAGAGG AACAAG AAG T AGAGAC G AGCTATTA 

3 0 ACCTCAAGACCAAACCCCAGTGACCTACTTCCTTCAGCTA^ 

TGTCTGCTCTGGGTTATTTATTGTCAGATAAGCTTACCTCAAGCTGACrrTTTCTACTC^ 

AAGGAAGGGGAGTCTAGAGCAG^GGTACCCAGGCCTCCTAATCCTGACACCCTGGAATACAGCT 

ACCATTAGTTATTTTCATTGCTACTTCATGACTGTAACCT^ 

3 5 GTAGGAAGTATATCAGX3TTCTAGGGATTCCAGGAACCTCCTCAGAACTCTTTAA 

CTGACCGGTACTTC!TGAGGTGAGCTTAAAACATCTAATTCTGAGCCTTCAA 
TTCTCTGAC^GATCTTATTrCCTATAT^ATAAATATATAGGGAATATATAAAATATGAAGTATGT 
CCCCCCCCAGTTTAAGTATATTTTGTCCTACIATCTTTTTAGCAATGGTCTATCACC^^ 
TCCATACTCTGTATG<3ATGTTTTCCCTGAAGGGCATTTTGAGAAGAACACAACATGGC 
40 crCGTTGCACAGCTTAGCGACCCTGGCTCTGGCTCT 

TGGTACTTGTTGCCAGATGAGCZATCCIATGGGAAGGTACX5AGATGGAACAATGT 
GCACTGTTGAGTTTTTATCTTGTCACAGTTAGGATTACTATTGCTGCAAT^ 
GGGTTTATACAGTTATATACTTCCIATATCACT 
GCAGAAGCTGATAGAGAGACC!ATGGGAGGGGTGCTGTTGA 

4 5 TCGAGGGCCACCAGCTTGGGGATAACAACACCCACAATGG^ 

GGCTGGAGAGATGGCTCAGCAGTTAA^ 

ACAACCAACTGTAATGGGATCTGATGCCCTCTTCTCGTCGGTCTGAAGAAAGTCACA^ 

CTGGGTGGCAGTTTTGCAACCAACAGTGATTAAGAAATACATTGTGGGGCT 

TCTTCCAGAGGTCTTGACTTCAATTCTCAGCAACCAC^ 

5 0 TGTGTGAAGACAGCAACIAATGTACTCATATACGTAAAATAAATAAGTAAATAA 

TGCGTCACC^GAGACCATGTGGGTGTACGAGTGCAGCAC^ 

TGTCTAGGACTGTAGTCCTGTCCCAGACAGGGTCTGTGTTGATGTCTGAGGC03ATGTTACCAATAAAATGTTGTGGGATG 
CT ACGG CCTACGTGAAC CGGGT ATTTCTGGAAGG CAGGC TGGGG CTGAAAGAGACTAG ACT GTG AGAAAAGAATG ATG AAG C CAAG 
ACAAATTTTTCTCTGATCAAGGCTCGTGAGTTTACTAAG 
55 CCCCCCCAGTCCATCCTTGCATCCTTGGTGCrTTGTCX^ 

AGGAAG AT AGGT AGCAGAATCTCAGGG CAGTTGACACTT CAAAAG AAGACAGT^ 
CrCAGTGGTCA.CAAGGTCTGTACCAGCCTGCTTCAGGTTCAGGCT 

TGGG CTGC CT GAGG CCACGTTG ATGTT CAAG CACTATGCTG AGCTGG CAAC CAT AC CATCCC TTACGTAGG CAGTGTGCGAG AGAC 
GGCTCTGGTG CCGTGGGTTCAGAAGAGCTGACCATGGTGGCAGGAGGGCAGGAGATGGTCCTGCCCC TCACCAGCTGCAGCCCCCA 
6 0 GGTAGGAGAGTAGGCCATGTGCCTCATCTGGGCAGCAG 

CTTGTCTGACATGGGGCAGCATGGGTAAAAGAGAAATGCCCCCTC 

ATCATGAGAGCAGGAGAGCTGACCCTACTTCTCACTGGCTACAGTACCCAGAJGAGGGTGTCCTGCACCT 

AGAGCTGGGCCCCTGCAAAAGAGCTGGCCCTGCACCTTGCCTGCTGTAGATTGGGTGAGCTAGCCAGGAAGTGTTAGA 

CCTGGTGGTGAGTGTAC^GGAGAGCAGAGAGGCTGACCAACTCATCTACCACCCAGG 

6 5 CCTCAACTCTGAACTGCTGGAACTGTAGGGATGCTCCTACAAATCTAAAGCTGCAGGG 

GTCCAAAGGAGTCCCTGTAAGGGTCCAGCAGGAGCCAGAAGCCATGAACCAGACCAAAGTCTCA 

AATAGAGTGGACCAAAGGGAGTTCTGTGGGACTGTGGGACIAGACACACT 

TGGTTTTTTTTTTTTTTTTTTTTGGTATTTTACT 

ACTAAGGGACAGGAAGATGAATGGGGTTGGAGTACATGATTCACACACACACACACACACAC^ 

7 0 ATAATTTTTTTTTAAAAAAACTATTCATTACAAAGCCAAAACAAAATTCTGATTA 

TAAGTTGGAGGTTTTTAAATATTCCATACTAAAGTCATCAGAGG^ 
AGCCAGACACTAGGGAGCATTCAGGGAATCCTCCAGAAAGCAGGGAGGG^ 

G AAAAC CACAG AATC CACTAAGCACAG AAAG CAG GG AGGGAGGACTGCAGGAGC CAGAGGGGT CAAGGACACCGCAAGAAAACT AC 
AGAATCCACTAACGAGGGCTCATAGAAGCTCACAGAGC^ 
75 CATATATGTTATGGTTGTGTAGCTTGATCTTCTCGTGAGAC 
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TTTGGGACCCTTTTCCTCCTACTGGGTTGCCAGTCC^GCC 
TGTTTGGTTAATAACCCTTAGAGGCCTACCCTTT^ 

ATGAGGGAAAGAGGAAGGGTAAACGGTGGTTGGGTGTAATATATAAGAAAAGAACAAAA^^ 
ACCAGGAATACATTATGGGACTATGAGCTGTCAGCAATGAAGCAGGAGCAA 

5 AGAACCAGCGCAAGAGACAGGATGTGCTGTGGTTGTC^GAAGGGAGGGCT 
GGCTTCAAAT CAGGGCTT CTGGGACCCG<^T AGGCTTGGGTGTO 

AGTTTT ATAACCTGG CGATCC CTTG CTGT CTG ATGAGACAGGCT CTG CTCGTATCT CCCAGACTTCAlTi' CTT CCAGACAATACCC 
TCCCCCTTCAAACCTrcAGTAATGACTTTTAGAC 
10 CAATGCTGACAC CCACAGGCCTT ATGTTTACCTCGG CCATTCTT ACCTA^ CCTAAATCTCGGGCACTGTCTCTGAGTCAACAGT 

AC CTATTCCTGTG AAAGAAG CCAG ATGTGTTTTGTAAAGAATCT CAGTG TAAG CATGGCTTAAATTATTATATACTCGGG ACTGAG 
TTCATTGTTCTCTGAATACTTTTTTTTTT^ 
AATCCTGGTTCATCACCCGCTACAATOIAATCAGCCTG^ 

TATAAAGCTTGCAGGGGAATGGCC^CACGTGAGTTATGTGGGACAAATGGAATGTTTAC 
1 5 TGG TTT AGCACT CATGTG ATGTGG CACT TGAGTTG ACCAAAATGATATAGAG CAATCAGAAGACITTGAAAT ACCT CTGTGTCAGA 

AGCTTGG CATGTTCAAG CTG CTGTGTAT AG CACCAGAATT CCAAACC CACCAGTTATGAAAACTC 

TCrCCACAGTTACCTGGCAACAGCTAGGTAGGCCTGAT^ 

CTTACTCTTACTCTCTTGATCTTGCTCTGTCCTCTTGTCACTTCTCCCCC 

CTCTCTCTOTCTCTCrrCTCTCTCTCTCTCTCTCTCT 
2 0 TTGTTTTGTTTTATTTTTGTTTTGTTTTTTGAGACA 

GTTGGCCTCGAACTCAGAAATCTGCCTGCCTCTGCCTCCCATGTGCTGGGATTAA 

CCCTCTTAACrCCCCTGACCATGCCCCAAATAAACTCTATTTTATACTATACT 

CCTCAGCATGGGCCCTCAGAGGCACTGCTTTCCCCCATACCTGACTACATCT 

AT AAACACATCAAAAACAAC CC CAGGAAGC CTTTGTCT CATCCGT AAGAGATAG CAATC CTCAGT ACAAAAT TCGAT GGAAAAG AA 

2 5 TGAACATGGGCTCTCAGACTTTTTCCCACGCCT^ 

CCATGGGACAAACTCGGGAAGTTCCACACTGAGGTCTGATTACAACTGAGCT 
ATTTTCTCATCCGCCTTGGCGAGCCrrGATTAAAATACCCATTTCrTC^ 
ACCCGGAGGCACCTGCTTGTGTAGGTCCGATAGGCTCAGTGTCTTTC^ 
GTGGGAGGGGCCTGAGAGCAGAGGTGACAAACTCTGGTCTCCAGCAAGCAAGCATGTGA 

3 0 ACCAG ACAGGGCTTGGG CIAACCTGC CAGCTCAACT CT AACTCT C CACTG G CAAGAAGACAGACATG G AGGTGTTTATGGAATGTCT 

GCT AGTT ATTAAATG CT AG AT AAAAAATT ACCAG ATT AGCCAAC TCACACACT^ G CTAGAAGTA 

AACAAACCGCTAACAGCGAGG CC CGAGGCATGG ATCATTT AGGG T AG AG ACATATTTG AGAACAC CAAGAACACATAAT CT CCTGA 
ATGTCTCATCCTCTCGCATGCCCACGTGCTCTGCGGTATTAGAACGTCTTTGTTGAGTCAAC^ 

TGTTTTTT AATT ATTTTT CTTTTATTT ACTTTTTT TTTTTTTTTTTTTTG AT CATGAGTATTTTGT AACAGACATTT AACCATAT A 
35 . AT CAG AAATGT CTCAGG CCT C CTTAT CTGACATTCTGACAGTG AAGT AACTG GCAGAGAAATG AGAT AGC^ C 

TGTTGTCCTGGGGCC!AGGGTCTCTCCAGAGAAACAGAGCCTGGAGTGGGGGAGGGGAAAGCAGC^ 
TCATTTTAAAGGGTCAGTGCCCATGACTGTGGGGAGAGTGAGGCTGACCrrTTGTGCAG 

GTTTCCIATATCACATTCTCCAGGCAGGGTTTCTTCAT CTCTAGG AATTTGTTCTCAAGGTCTCACCTGAT TAGAAGGAAGCTATAT 
TGG (^TAT AAGCTATGT AGAATTT AGCTGATACAACACATTAATT ATAAC TGT AAAGCG CATGGCAACATCAAGACT AG CATTTG A 

4 0 TAAGATIGTGGGCCCTGTAGCCTATCCAAGTAGATATGACAAATGAACCAAGAGATGAGGGAGCA^ 

G CTGAGAGCATAG AGTAGGCCAG CTTGTTGGAACTGCCAAAGCCrTT AAAAACAG AAGTAAAGGG CCTGGAGAGAG CTCTGTGGTT 

AAGAGGGTTTGCAGCTCTTGCAGAGGACCTAAGTTCAGTTCCCATTGTCTC^ 

CTGACACCCTATGAAGTGTTTCATTATAGTATAGCTGTCTG 

GAGAGAAACTTAGAGTGGGGAAATGGGAGGGTGTATC^TGAAAGGCTTTGAACCCTGCTGGTCCCGATAC^ 
45 TTTAAAGGTGGTTTCTCTCTCCAGTCTACTTTC 

TCTCTTCTTCCCAGATCTGGACCTGCGGTCTTGGGAGTTGGAAGGATCTGATGGGAAAC^ 

AGAATCTTGCAGGACCTGCCATTTGCACTTTTGAACCCTTTGGACGGTGACCCAGGGCTC 

TCTTCCCTCTGTCTCAAGACTCTCTGAACAGCAAGACCCCAATGGCACTTTAGACT 

CCTAAGGCTCCTAATGACTTTCAGGGTGAGAACAAAAGGAATTGCTCT^ 
50 TGGAAATTCCCAGTTACTAAAATAGATTGTCAATAGAGTTATTTATAGCCCTCATTTCCTCCGGGGA 

GTTTTTCATAAACAAAGTCATAACTGATGTGTTTTACAGGATCCT 

TTTAAGCAACACGTTAAGTACCCCCACTGTGGTATTTGTTTCT^ 

ACCTGTGATATCACTGTTTCTTGTGTCTCTTCTTTCAACTACATCTTTTAAAACAAAACGGTGTGGGG 
AGTGGTAGTGTTTCTCAGTGGTATCTCCTTAAGAAAAAAAATCATCATGCCA^ 
55 CTGGAATAGCCTGTCCCCCAAG CTAAGCCTTCTTCCCTAGCTTTCTG CGTGATTTTACATTGAGCATTCCTGTTGCTTTGTTTCTA 

TAACTGTAATGTGGTGATGT C ATTGTT AGGG CACTTGAGGGTGGGCGTT CTG GAAGTCCTTTCAGGTT ACT 
TGCTCAGAATACATAAAGGTGCTAACTTAAACTGCAGCCATGGAGCrCAGTGAATTCACTAA 

ATTAG CTACTGTGAC C CCTGT AGGTTAGGGAG C CTGAAG C CAGCT CTTTACCTGG TG TTT AG AC TCAG CAGAATTTGGAGT CAATG 
GGACCAAATGGTTGTGAAATTAAGATTTGAAGTGTGCATCTTATTTTATCACCAT 
6 0 AAGCACAAAAATGTAATCGTTTATGTGAAATCTCTX3 

CACCTTAGCCTAAGATACCAGACAGCAGGACAGAGAGAAAAGTCCTTCCT^ 

AGGTGTGTGATTGGTAGGCCAAAT CCCGATAGCCAATCGGTGTTGGGTG CTTTGTCTGCTCTACTGGGAGTCCAGTGGTACAATGG 

ATTCTGGCAAAATGCTGCCATCTTGGCCCTCGCTGGGCTGCTTTCTAGGATATTCATAGAGAAAGGGCCG 

TAAAATCCTGAGAGGAGAATATAAGTTAGTGTGTCTCACTATAACTATC^ 

6 5 ATACT AT ATG CCTAATAC TGGT AAG CATTTT AT ACACACCATTGG ATTG AATC CTCTCAAAATC CT CAAAAAGGAAG TT ATTAAT A 

CCTCCATAGGCAAGGAGCCCAGAACCCAGAGAGGTCAGG(ZAGTCTAGTTATAGATGCCTGCTTTGTTT 
CAAATTATTAATGTGCCCTGGTTATTAATGCGCCCTGGTTACCTGCTGGATGGAACATCAAGGTG 
CCCAGAGGTATTTTGGCTATTCACGGATTAATTTCACACGAAGTGTTTCAGAGACATGTG 
TAAGATTCAAAATCAAAAGAATGAGCAGTAAAAGAATGCTGTTTACTGCCTGCCTTTATCA 

7 0 AATGCTTGATAAGTGTGTGTCAGTCTGAAGTCCCATGTACCAGCCGCTGTCC^ 

CAGCTGATTTACCAGCAGCZAGATTTCTTCTTCTAGTCCCATCC CTGAAG^ 

TCCAGGAGTTCCTACACAGCCCTCAACTTGAACACAGGCAAAGTGC^ 

CAATACCTGATTGCTGGAACTTCAAATCTTCCTGAATAAAAGCTTC 

75 TCTCTCTCCCTCCCTCCCTCCATCCCTCCCTATCTCT^T^ 
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CTGGAGCACACGTTATCTCCCCGCTTGGACTCCTTGAAAGCAfiAGJVGA^ 
ATAGGTGCTTTGTGTGTGACTGACCCTGAAACTTTAA^ 
TGTGATTG6CTAAAAAAGTAAATAATG6GAACTTTAAAAAATGTAATT 
CTAACCCCTTCCATGGCTCCrrCCrrCCAGTCCTCCTCATGGCTCCCACT 
5 ACTGTCACTG ACATGCACATAAAT ATATGAGCATAACTTGCTG C CTCTGTT AAGTGTCG CTTGTGTT AATATGATTTTAGAGTTC 
CCACTTTGCATTAGGAAGTCAATTAGGGGGCCATCCCCGGGA 
TCACCTCGGACTGGTGACCTTTGTGACTTTCCTCrrTCCATA 

AGT CATGT AGTTGAAGAATCATGAACGTAGCTTCCCAGACATTT CTACAAGACGTGATCCCACAGC^GACTTTC CGGTTCTCTGGT 
TCrCCGCACrCTTCTACTTCCATGATGTCCCTTGAGCGTT 
1 0 GATGGTTAGTTTATCTCTGCATATGTAAAAGGTAATTTTAAAAATTTCTTTA^ 
ATATATATATATATATATATATATATATATAT 
AGATGTAGATGTAGATGTTO^AAGAAAGCCAGAAAGAGGCTGATCT 

TGGGT CCTCTGGAAGGG CAGGAAGTG CTCTT AACC^CTGGG CCAT CT CTCT AGCCACTAT AATGGCATrrT'IlTTT AAAAAGTGGA 

TTAATTCTTTTTAGTGTTTTATATATAAGTGAAGTAAAAATAGGATTTAGAAGCG 
1 5 GCTCTCATTCAATCACAAATGTATTGAGTTCCATTACTTTTATGTTTGTCTATT^ 

TTTTCTATCCTTTGAAAAAATCrGTTATGAATCTTAAGCTAGCACATATGCTG 

TGACAGTGTTTGCTTAAACCTTCCTCAGCTTCTGGAAGA 

CACACAC^CACACACACACACAGAACACAGAG 

GTGAATGTTAACCACATATTTCCTTGGAAATTCTCAAGTGT^ 
2 0 TC TTAACAG TGTT CT CCTGTTGAGTTCAT GGGGTCAGCCATGGGG CAGATCAGAGGGCTAGAG AAGG CTGATCT AGACGT T TTCTC 

CAGGCTTTCCCTCCAAGGGCCATTGCTGACTGTCTTGCCCAGAGC^ 

G T CTCTTACTCTCTCAGAGAGAAAGATATAGACAGGCAACCTG CCTG CC CTGTGGTACAAT ACTAT ATCCAGCCCACACATTT ACA 
AAGACTCCCTTTAATTAAAAGTTCACATTTTC 

2 5 TTCTTAGGTAAAGAGTGGGTACTCTAGTTATGCACTGCTA^ 

CTCGGAAAATACCCCTGAATTTGTGCTGTTGTTATTGATATTGTTATCATC 
AGATTTGAAGTTCTTTGAAATGTACTTCCTGTTTC 
CCAGGTGGTGAGGTGCAGACAGCCAGTTAATTGTATTTGTTTT^ 
CCTCTTAACACATCTGAGAGTGTTTACAGAATGAGTCTGAGGCTTAG 

3 0 TTCIAAACCTCAGAACCCACAGTGGAAGGAGAAGAGAAAACCC^ 

T ACACACAAACACATG GAAATT ACATATAAAAGTTGCT ATCTTGTGTCT CTTT ATTACTTTT AGAGTGAATGTG 

AACCIAAGTATTTCCTTGGAAATTCTAGAGCATTCCCT 

TGTTTTGCCATGTAGTGCGGTTTTGTTATCTTGACTTCTTCGGGGTGGTTAC^ 
AGGCTTCTTAGGTGGACCACTTACTTTCTGGCGTGCITTGTATTCACATTGCTACT 
35 GGCTTCACTTTGCCTATGTAACAGGTTAAAGCAGTGATACTTTTACGTTT^ 
TCACTCTTCCTGTGACCTTGGAATCTGAGTGTAGTCTA 

T AATTT AATCATTTTGAT ACATTATTTTGTTTT AAGCT CTATCT ATG GACTTC CT AACAATAATAC C C CTCTC 
ACTGAACTGTCCT ATGAAGATGATG CATACCTGATTT AAAAAAAGAAAAAC CTATCCAACAAAT ACAGAAG TGG ATG CTCAC AGCC 
ATCCATTGG ACTG AG CACAG AGT CC CCAATG AAGG AG CT AGAAAAAGGACACAAGG AG CTGAAGGGGTT TG CAGCC CC AT AG AAG A 

4 0 AACAATATGAACTAACCAGAACCCCCAGAGCTCCCAGGGACTAA^ 

GGTGCATATGTAGCAGAGGATGGCCTTGTGGGACATCAATGGGAGGAGAGGCCCTTGGCCTT 

GG GGAAT G C C ATGAGAGGGAAGTGGGAGTGGGTGGGTTAGTGAG CAGGAGG AGGGGTAAT AAGG AT AGGGGGAGTTT CAG AGGGGA 
AACCAGGAAAGAGGATAAAATTTGCAATATAAATAAAGAAAACATCTAATAAAAAAACTGTTCAAGA 

AAT AAAT ATG AAT TAAG AATGAATAAATT ATGTAAATTTTGAAC^U^TG AT TGTGT CAATTTTCTTTGAAGTTT ATGTTAAG AG TT 
45 AACTCATAAATTAGTATTTTGATGATCTAGATAGTCCAGTGCTATCTTATTTGTGGCTTTAGATTCTTCA 
TCTTTGAATAATTATACTTTGAAATTATTTTTATTAGTCAAGATATATCTCAGTAAAATAAACTCTGAC 

T AG AGGG AG AAGCTAG ATTCT C7VCTTGT AGTGAGT CATATCTGGTTTCT AGT CTGTAATTCAAGG T AGG CTTTT CAAATG ACACAG 

ACATTAATTCATAGAGCCATTCCCCACCAGGATTCCCGTGGGTTGAC^CATTCTGAATGCrrAGATGATAGCTC 

AAATCAGGAAGTCCAACTCCGTGGTGATGTTTCCCACCACACCATGCTTAGAG 
50 TG AATTCT CAGTACTT CAG AGAAGCCACTACT C CCTGTT C CT CATGGTG AC CCAAGGGG CAAG ACAAGAT ACAT AATT AT AC ATG A 

CCAGGTGGTGAGTTTGCATCACGGAATATAGCCTACTCTTATGCTCATATTCC^ 

ACTGCTAACATATTTGTCTCATAGCTTTTGCTGTCTTACAAGCTACCCCAAACGCAGJIGAT^ 

CACAGACTACTGGCAGXjC^^TTTTTCAAGACTTAAAGTTGCTCT 

TGTCTCTCTGTCTCTTTGTCTCTCTCTCTCTGTCTCTCTGTCTCTATCTCTC 
55 TTTGTCTCTGTGTTTCTCTCTTTGTCTCTGTGTCTrCTCTGTATCTGTCTGT 

TGTCTCTTTGTCTCTTTGTCTCTCTCTATCTCTGTCTCTCCCTATCTCTGTCTCTCTCTGTCTCTGTCTCCTTC^ 

TTTCTCTGTCTCTGTGTCTCTCrrGTATCTCTCTGTCTCTGTCTCTCrGTGTCTCTCTGTGTCrGTCTCTCrG 

TATCTCTCTGTCTCTCTGTGTCTCTCTGTGTCTCTGTCTCTCTGTTTCTCTCTCTGTCTATCTCTC 

TTCTCCTGTGCACTTTTCTTTTCTCGAACTCGCACACTCCTACACACCTCTGTGCGTTCCC 
CACATCTCACACACACCACATGCACTCTCCTTTCACTCACACACACATTCTTCTCT^ 
ACACTCTGTCCACACACACCTCACATTCTCTCTTCACTCTCTTCACATTTGCTCTCTCTCT 
TGCACCTCACTCCCACTTCACTCTCTCTCCACTCTCTCrrCCACTCGCT 

6 5 C CAAAAGCAGGGAGAGAAAAG ACATTGTATTCT CATTGC CAAAT CAAATTCAAAAGAATAACCACATCCCACAAAG AATCAAGAAT 

TACAGTTGTTCCCCAAAGTTT CAAGCTTAC CCCTATTTCCAGGCTTGTGGCCAAAATAATAATTG CAAAGTCATCACTATTTAAAC 

TTTAATTTTTTTTAACTTATACTTC^GAGCTCAGAGGTCCGAGGTCA^ 

CGTGGGCAAAGCTGGCAGGCAGTGGCCAGAAAGAATCAACCCTTAACCCT 

ATTGC^CACAACG ACTCTCCTAAGAG TCTCAGT GTTTTG ACTTAGT^ CT ACAATTGCTT AT 

7 0 CCAGGAACCACTCTCAAATGTTGTGCAAAACTTATTTCATAACT 

TTCATTTTCTAATAATTTCTTCAAACTTTCTTTGCAAGTCA^ 
AAGATGAGLAACACACAGCAGGCACCTGGGTCATTAGGACTGTGCTGTGCTGTGCCC 
ATTGCATCTAGGAAC1ATCGAGTTTCAGAGAATTCACCAGAGCAGGAGAAAGG 
AGACCAAGGCCAACCGAAAAGCAGTCACGAGCAGATGAATTCTATACAACT 
7 5 ACTGTTGT CTACAAAAGACAGCTGTGAGCTACTGACATGTCTCCACCCTGGCCTACTGCAGGC CAGG 
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TCCCCTGGAGGG ATCTAACT ACTCCTTCTCAGGTT CCCAGG CAC CAT CC~1TTITCACAAGGAGCTG C CACAGG CTTCTG AG ATGTC 
TCCATCCCTTTCTGCCCCTTTCX^CTCTGAAAGAGACGGG 

CTCATATGTGGTTGTATCAGAGTTTCAAG CAAAAGAAAGTT AGTGT CTGCAGATGGGAAGGCGGGACTT CCTTCATTATGAAGGGG 
AGATCAGGGACAGACTCAGGACAAGTGTGCTCAAACAT^ 
5 ATCTGCCAGCTGCGGAATGCTTGGTTTGCTTCTC 

TTGAACGCCTGCTTAGAAC CCCCTACTGGTG CTCTGCCTACGGT TT TAGGGGAAATCTCTGAG CTTG CTGCTTT CTTCCTGCACAC 
ACACTAAT ACAGAAGCAGC CACAT CACCTDIATTCCTTT ATGACCTT ATCTTT ACT A^ 
GTTTCCAAAAATTCTACTTCTCCATCAAGCCTCCCTCXKMTCTCAG^ 
TGGGGATGGACGACACCAAGGAAATAAGACCTTCTAGATAAGAGAGC^ 
1 0 CACGCACAGGACCTGCACAGTCCCACTACTGA 

CAGCCATTCACAGAAGAAAAATCAGTTTTCTCCAGTGGAGTCTCACTG 

CAG CAGACTTG CAAACACAAAGCAAACTGCATGGTAGTTTTTGGAG ATGTTAT CTAGCAATACT 
TTTGTTTTCTCTTTCCTCTTTATACCTTGTCTTTTGCTTTOCTATTAT 

CTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATTC^TGCAAATACATACAAGCTT TTTCT LTl 1 CT CTTTTri' 1 U -C CTGTTTCTTTGC 
1 5 TTTTATTCTGGATTGTTTTATTGTTTATTGGCTTATTTTA 

GACGATGGAAGGGAAACTGTGATCAGAATACTGTAGGAAGAAGTATTTTCAATTA 
GTATGTACGTGATCCCCGCACTTAGGAAGCCAA 

GTCTCAAAC CAAAAGAGGAAAATCATATG CAAACTGG CTTATGT ACAAAAG AAGGA CATTTGTGTAAATGAGCAGCCTAG A 
GGTGCAACTGGTCCCATCAGAAAACC 

20 

MOUSE SEQUENCE - . mRNA 

ATGCTGCTCCTGCTGCCGATACTGAACCTGAGCTTACAACTTCATCCT 
GTACACCGTAGACCTCGGCAGCAGTGTGAGCCTGGAGTGCGATTTTGACCGCAGAGAATG 
GTTTGCAGAAGGTAGAAAATGATACGTCTCTGCAAAGTGAAAGAGCCACCCTGCT 
25 TTCCACATCCCTAGTGTCCAAGTGAGAGATTC^ 

GACGGTGAAAGTCAAAGCTTCTTACATGAGGATAGACACTAGGATCCTGGAGGTTCCAGGTACAGGGGA 

AGGCTAGAGGTTATCCC CTAGCAGAAGTCTCCTGGCAAAATGTCAGTCTTCCTG CCAACACCAGCCACATCAGGACCCCCGAAGGC 
CTCTACCAGGTCACCAGTGTTCTG CGCCTCAAGCCTCAGCCTAGCAGAAACTTCAG CTG CATGTTCTGGAATGCTCACATGAAGGA 
GCTGACTTCAGCCATCATTGACCCTCTGAGTCGGATGGAACCCAAAGTCCCCAGAAC^ 
3 0 GCACCATaSCTTTGATCTTCCTGGCCATAGTGATAATCCAGAGAAAGAGGATCTAG 

MOUSE SEQUENCE - CODING 

ATGCTGCTCCTGCTGCCGATACTGAACCTGAGCTTACAACT 
GTACACCGTAGACGTCGGCAGCAGTGTGAGCCTGGAGTGCGA^ 

3 5 GTTTGCAGAAGGTAGAAAATGATACGTCTCTGCAAAGTGAAA 

TTCCACATCCCTACTGTCCAAGTGAGAGATTCCGGGCAGTACCGTTGCCTGGTCATCTC 
GACGGTGAAAGTCAAAGCTTCTTACATGAGGATAGACACTAGGATC 
AGGCTAGAGGTTATCCCCTAGCAGAAGTCTCCTGGCAAAATGTCAGTGTTCCTGCCAAC^ 
CTCTACCAGGTCACCAGTGTTCTGCGCCTCAAGCCTC^GCCTAGCAGAAACriTCAGCTGCATG 
40 GCTGACTTCAGCCATCATTGACCCTCTGAGTCGGATGGAACCCAAAGTCCCCAGAACGTGGCCACTTCATGTTTTCRT 
GC^CCATCGCTTTGATCTTCCTGGCCATAGTGATAATCCJVGAGAAAGAGGATCTAG 

HUMAN SEQUENCE - GENOMIC 
AATGGAGCCTGTGGGCTGCGTTGCGAACAATGCCX^^ 

4 5 CAAGACAAGAATGTTAATCa\AACTGCACCATTTTGTAAGCGCCCT 

GTCTTTGGGCCATGAGAAACACCCTGCCTAACCACTTGACCACAAGGCCCACAAAGGCCCAACTAAAGAA^ 
CTTCTGGGCAAAGGTCCAAGGAACACCACGATGACATCC CGC CTGAAC^ 
ATATCCTACCGGGCAGCAAGCCATACnX3CCC!AGACCCCTCCCGCCCATAC 

CTGGCGTTAGGCTGGTCCCCGACTTCTACAGTGTG CATATTT CTTTAACCCTCG CCTTCCCTTCAAAACCTAACAGGGATCAGCCT 

5 0 GGGCAATATGG CAAAATGCCATCTCTGCAAAAAAATACGAAAATTAG CTGGGTGT G GTGGCGCATGG CTGTG GTTC CAG CTACTTG 

AGAGGCTGAAGTGCTAGGATCGCTTGAGCCCCGCAGGTTGAAGTTGCAGGCAGCCCTGATTATGCC^ 

ACAG AGTCAG ACC CTGT CT CAAAT AAAT AAAT AAAT AAAAAT AAATT AG AAACACATACAGC C CTTTGTTGACTTCTTT AAATGTT 
T T CTT CTTGTTAT TTTT CAGTAATCTAACTAT ACT CT CCATT AT TG CTTTT AAAAAATTATTT CTGT CT TTACAAAAGT AATTTGT 
G CT CATG GCAGAC T ATTT AAAAAACAAAATG AAG C CIAAAG AAAG AAAAT AAAAATTAACTG CAGGAGAAGGAGG CAAGAG AG CT AA 

5 5 CTAGATGCAGCCAGGAGAAAC!ACCTCTC!ACAGAGATCTTCCCAGATTTTCA 

CAGAGGCTGGGCTGAAGCAGGAGGAAGCTGAAAACCCTGCATGAGGCTACC^ 

CTGCAGAAGGGATGAGTTGAACAGGCAAGGAGAAACATGCTCTTGC^ 

CACTGTGGACACTTGAGTTGGCAAGGAGAGCTGCrTAGAGAAGTGGTAGGGGCAGAGCT 

60 GGGAGCTGTCGGTCCTAAACTCTGCAGGGCGGTGTTGCCCATGAGATAGGGCCAGTCTGACCTCA 
TTTCCCAGGGCCCCAGCCTGTCCATGCCTGCTTGCAGTGC^GCCACCAGGTACCTCCT 
GCIAGACCATGCCAGACTGTCAGAGTGCTGCAGCAGACTAGC^ 
GCCTCCCCCTTCTTGC!TTTGCTTGCAC!ACACT 
CTCACCTTCCCTTCCCCACCAGCATACGTGTGAGCGTGTGCCCTGC 

6 5 CTCCCTGCAGCACTGTCCACTGTGATGGCAATGGCATTGTCATCAGAGTGCTGTTGG 

GTCCCCAGCCCTGCCCCTATGCGAACACTGCCTATGGCACAAAACTAGGCAC^ 
CAATGCCTCCC^TGTGAATGTGCACAGGGGGCACACA 

C^CAAACATGTGTACAGTTGCTGGTGAGGGCCCCCCAACCTGCCTGAGCCATGCTGCCACTGCTGCTTCTATGAACACCT 
AGGCTGGCACTCCGGCATCCACTAGCACCrTGCroCAGCCAACAAGTGTGCACCCC^ 

7 0 CATGCGACTGAGGATGGATCATGTCTCCACAGCCCTACAAAGC^ 

CAGGAGCACCTCAGGCCCCTCCAACATAGCAGGCTCCTAAC 

AGAGCTATAACACACAGTTTGGGAGTCCTGAGCTGAGACCTGGGCCCCTAAAATCTTCCAGA 
C CCTATACCACJUVTCAAACCCCGAAGGTCACT AAATATGAT ATAAAAAGGGG G AA 

7 5 AAT<^CACTAGCTCTCCAATAATGGTTCCTAACCAGGCTGAGATAGCTGAAATGACAGAAG 
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AAAGATCATCGCAATTCAGGAGAACACTGAAACCCAATTTAAGG 
AAATAGCCAGTATAGAAAAGJ^TGTAACTAAACTGATAGAGCTGA 

AATAGCAGAATAG AAJCAAGCTG AGGAAAGAAT CTCAGAGCTTGAAGACTGACTTT CTGAATTAAGACAGACAAGGAGACAGTCAGA 
CAAGAACAGAGAAAAAAGAATAAAAAGGAACAAACAAAACCrTTTAAG 
5 TTGGTGTCCTTGAAAGAGATGGGGAGAATGGGAGCAACCTGGAAAAC^ 

CT AGAGAGGCCAACATTCAAATTCAGGAAATG CAGAGAAC CC CCACAAGAT ACTT CACAAGACTGTCCCCAAGACACATAATTATC 
AGATTCTCCAAGATCGAAATGAAAGAAAAAAATGCT^ 

GACTAATAGTAGACCTCTCAGCAGAAATCCTACGAGCCAGAAGAGATTGGAGGCCrATA 
CAACCAAGAATTTCATATC(3UK:CAAACTAAGATTCATAAGTGAAGAAAT 
1 0 CATTAC(^CAGGCCIXX:CTAAaU«5AGTTCCTG AAAGAAGC^CTAAATATAATAAGGAAAGAC^TTACCAGCCTCTACAAAAAT 

GCAGTTAAGTATATAGACCAGTGACAGTATAAAGAAACCACACAAACAATGCAC 
AATGGCATACTGTCTTCAAGGGACTCATATCACATGCAGTAACACACATA^ 

AAAATAGAAAACAGAAAAAAG CAGGGGTTG CAATCCTAATTT CAGAO^AAACAG ACTTTAAAC CAACAAAGAC CAAAAAAGAAGAA 
GAAGGGCATTA£ATAATGATAAAGAGTTCAACAAGAAGATATAACTATCCTAAATATA 
15 TCATAAAGCAAGTTCTTAGAGACCTTCAAAGAGACTCAGACTCCCATAC^ 
TTAGACAGATCATCAAGTCAGAAAATTAAAAAAGATATTCAGGACCTGA^ 

CAGAACTCTCCACCCAAAAACAACAGAAT ATACATT CTTCT CTGC CACATGG CACATACTCCCAAATCAACGACACCATC 

AGACATAAAACAATCATCAGCAAATGCAAAAGACCAAAATCA 

AAGACTAAAAAAAAAATTGCTCAGAAC CATT CAATT ACATGGAAATTAAACAATCTG CT C CTGCATGACATTTGGGT AAAT AATGA 

CGTTACAAGGGAAATTTATAGCACTAGGTGCCCACATCAAAAAGTTA 

GAACTAGAGAAGTAAGAGCAAACCAACCCCAAAGCTAGCAGAAGACGAGAAAT^ 

GACATGAAAAACC^TTCACAAGATCAATGAATC 

TAATAAAGAAGAAAAGAGAGAAAATCCAAATAAACACAATCAGAAACAACAAAGGGA^ 

2 5 ATAACCATCGGAGACTACTATGAACACCrrCTATGCACACAAACTAGAAAATCTAGAAG 

CC C CAAGACT AAACCAGGAAG AAAC TG AAT CCCTG CACAG ATGAAT AATG AGCTC CATTACTGAATCAGT AATAAAT AGCCTACCA 
ACCCCCACCCCCAAAAAAATGCCCAGGACCTGATGGATTCACAGTCAAATTCTAC 

T AG ATAAATT ATTC CAAAAAAATTG AGG AGAAG GGGCTCCTCC CCAGGTCATTCT ATGAGG CCA.GCATAATCCTGAT ACTAAAACC 
TGGAAGAGACACAATAACATCAACAAAAAACTTCAGGCCAATATCC 

3 0 CACACTGAATGCAGCAGCACATCAAAAAGCAA 

CTCAAATCAATAAATGTGATTAATTACATAGACAGGACTAAAGACAAAAA 

T ATAAAATTCAACATTTCTTTATGTT AAAAATG CT CAACAAACTAGGT ATTG AAAG AACATG CCTCAAAATAAG AGCC^CT ATG A 
CG AACTCACAGCCAACAT CAT AC TG AATAGGCAAAGG CTG G AAG CATT CC C CTTG AAAACCAG CACAAGACAAAAATG C C CT CTCT 
CACCACT CCT ATTCAACATAG T ATTGGAAGTCCTGG CCAGAGCAOTCAGGCAAGAG AAAGAAAG AAAAGGCAT 

3 5 AGGGAAGTTAAACAATCCCTGTTTGCAGATGACATGATTCTATATCTAGAAAA 

AT AAACAACT T CAGCAAAG TT TCAG GATACAAAATAAATGT ACAAAAATCACTAG CATTCCTAT ACACCAGCAACAGCGAAG CCG A 
GAGCCAAATCAGGAACACAATTGCACTTGCAATTTCCACAAAAA 
GAGCTCTACAATGAGAATTACAAAATACTGCTAGAAGAAAT^ 
AGGAAGAATCAATATCATTAAAACGACCAAAGCGGTTTATAGATTCAATGCTG 

4 0 AAC T AGAAAAAAATATTTTAAAATTTATATGG AACCAACAAAGAGC CCAAATAGC CAAGG CAAT CAAAG CAAAAAG AACAAAGCCA 

GAGGCATCATCCTACGAGACTTCAAACTATACTACAAGGGTACAG 

GACCAATGGAACAGAATAGAGAACCCAGAAATAAGGCTGCACACCTACAATTATCTGATCT 

ATGG AGAAAGGACTTC CT ACT CAATAAATGGGG CTGGGAT AACTG G CT AGC CAT AT ACAG AAGATG GAAACT AG ACT C CTT CCTT A 
CACCAGATACAGAAATCAACTTAAGATGGATTAACAACTTAAATGTAAAACCTAAAACTATAAA 

4 5 CAATACCATTCTGGACATAGAAGTGAGTGAAGATTTCATGACAAA 

GGG AT CT AATTAAACT AAAG AGCTT CT ACACAGCAACAG AAACT AT CAACAAAGTAAACACACAACCT ACAG AATGGGAG AAAAGT 

TTTGGAAACTATGCATCTGACAAAGGTCTAATATCCAGTATCTATAAGGAACTTAAACAAATTTACAAGAAG^ 

AATT AAAAAGTGGGCAAAGG ACATG AACAGGCGTTTTTCAAAAGAAGACAT ACA C CAACAAGCATAT AAAAAAGCT CAATA 

TTACTTATCATTAAAGAAATGCAAATCCAAATCACAATGAGAGATCCTCTCAO 

5 0 AAGTTGGCCGGGTGCGGTGGCTCACACCTGTAATCCCAGCACTTTGGAAGGCTGAGG 

ACCATCCAGGCTAACATGGTGAAACCCCGTCTCTGCTAAAAATAC1AAAAAAATTAGC 
AGCTACTCAGGAGGCTGAGGCAGGAGAATGGCATGAACCCAGGAGGTGGAGCTTGCAGTGAGCT 
GCCTGGAAGACAGAGTGAGACTCCATCTCAAAAAAAAAAAGTTAATAAACA 
TATACACTATTGGTGGAGTGTAAATTAGTTCAACCATTATGGAAAGCAGTGTGG 

5 5 ATT CAATC GAGCAATCC CATT AC TGGGT AT ATACC CAAAGAAATGTAAATCAT T C TATCACAAAG ACACATGCACACACATG TTCA 

TCATAGCACTATTCACMTAGCAAAGACATGGAATCAACCTAAATGCCCATCAACGACAGACTGGATAAA 

TACACCAAGG AATACTATAG AGCCGTAAAAAAGAGCATGT CCTTTGCGGG AACATG^ CTGGAAGT CATTATG CTT AGCAA 

ATTAATGCAGGAACAGAAAAC CAAATACCACATGTGTT CACAT ATAAGTGG GAG CT AAGTG AT AAGAACACATG G ACACAAAGAGG 
GGAACAACAGAAACTGGAGCCTAGCTGAGGGTGGAGGCCC 

6 0 AAAAAAAAAATGACTGGGTTCTAGGCrTAGTATCTAGGGGACAAAATAATCTGTATGACAA^ 

GTAAAATAAAAGTTTAACAAAAAACTAAAAAAGTTACCTATGTAAAACAAAAGTTTAA 

CAGAAATGCCTTTATTAACACTTTGGTTTGTAACCCTCCATTATTTTCT 

AGACACTAGCTTGAGTTTCATTTGGCCTTGCTGTGAATATACCTGG^ 

agag ctgaactg c cc aaaggaaagtgtgaagaggagaggcagg c caag cctggag aaagg ctacc caagagagag g aaaggaaact 

6 5 gtx3gtgagtgcacctcactcctaaagaaggacagg 

acaggggacatatgtccattcctcagccagtcactgtggccagg^ 
acatgcctggagccgtgggtgtagaagcagatccccagaggaagcacgggtcccacagac^ 

AGGACAAGGAGGAGGAAAGGAGATCCTTGCTTTTGTAGAA 

7 0 TTTTCTTAAAAACATCCTCTGTCTGAGTGTTCAGCCCCACATCTCTGCTGGTCCTACCCCTATAAGCACAA 

GCAGCAGGAGAAGGATTGATC^TTCAGGCCTATATTCAAGTT^ 
TGAAGGTATAGTGGGGCCACTGAGGTCCGAGTTCZAAATTGAGCCATGCAACTA^ 
CATGTGTAAAATGGGTCTAACTCTAAACCACACATTGCACTGAGAAT 
AATGTTCCCCCCAAATTCATGTCTACrGAGGACCTCAGAA 
7 5 AGGTCAT ACTGCAT TAGGGTGGG CACT AAT C AAACGG ATGGT AT C CTCAT AAGAGG AGAGG AATGTAG AGAT ACACACAGAAAAGA 
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ATGTCATTTGAAGATGGAGGCAGAGATTGGAGTGATACAACT^ 

G CCCTAGG AAATGAGTC^UITTT CCTGGCACAGCACTAAGACATGCTGGTATT CTTCATTCCATCCCTTTCC CCT ATTCCTAAATGT 
G CTCCAGGGACAATGGGGAAGCTGGAG CACATAT AAGAATAGG GGAAAGAG AGGGAAT AAAAGGGTATTAAAAACTCTCAAGGTAT 
GTTAAAAAAAAGACTCTCAACAAACCAGGTATTGAAGGAACATACCTCAAA^ 
5 CATCACACTGAATAGGCAAAAGCTGGAAGGATGCCCCTTG 

CCATACTATTGGAAGTCCTGGCCAGAG CAATCAGG CAAGAGAAAGAAAGAAAAGG CATCCAAGAAGGAAGAGAGGAAGTCAAACTA 

CCCATGTTTGCAGATGAC^TAATTTTATATCTAGAAAATCCCACA^ 

GTTTCAGGATCC(^CATGAACATACAAAAATCACT 

AATCCCATTCAC^GACAAACTGCTAAAAGCAAAACC^VAACTTTCCAAATA 
1 0 GGTTTGACTCACTCTCATGTTACGGCAAACCTTAAGCT 

AGTGC T 1 M 1T1 TGTTACCCATCCTCATATGTCCCAGCT AGAAAGAATCCTGGGTTGGAG CT ACTGCATGTTG ATTGTTITGTTTTT C 
CTTTTGG CTGTTCATTTTGGTGGCTACT ATAAGGAAAT CTAACACAAACAGCAACTGTTTT TTGTTGTTTACTTTTGCATCTTTAC 
TTGTGGAGCTGTGGCAAGTCCTCATGTGAGTAACGAGTGGGTTG^ 

1 5 AAATAAGCTAATCTCCTCCTGCCCCAGATTTCTCTCACTrc 

ATTTT CTGCAC GAAATATGTGCACTGTAAAAT AAT AT AACTATTCTTAAG AT^ TTTCGGAGG CTTAATG 

GTGAAGAA(HTAAAAATCTGTTTTCCTATGACTGACATCTG 

TTTGT TT ACAAGCTATG CCT AAG AGG CAGCTGTTG GTGGAGGGGG AACAAAAACAAAAAACCCAAAACTATT ATCT 
TAG AGGTAAT CATTATTG AACAAAAGCATG ATTTGGGGACTATGG AAAGG CTT ATTCTT T C CAT AT AAAATGATCAGCGAAAAGGT 
2 0 AGAAAAATTAAACAGAACTGAAATAATTGCTATTGATTGCTCACCGCCTGGATGC^ 

TGTGG AGGGG ATGGAGGTGGAG AGG CT AAG CCTG AGTG AGATT AC CAACT AATCAGG ACTi"!' TAACC CAAAAATACT ATGG CTCTG 
AGGGGCAGGGAATGAGAGACCTAAATTGACAAAATGCAGAGTAAAAATATTCAGTG 

CAAATACGG AGTTGG CCTGGATGTCCTCCAAGCTCCTTTC CAAG CCTAAGATTCTGTGAGTGG ATGAACAATC CTGGTCCTGCTG C 
TCTTGCCACTGGAACCTGATCTATTTGCCTTCATTATTTGAGGTCTCCCCATCA 

2 5 TCTTCTCCCCAGATTTCTCCTGTTGGTGAGCITTTCCATTG 

GAGTCC CAAGTT CCTAGTGCAGTACCTGGCAATACACATAGGTG CTTAGTGGGTG CTTGC CGAATTGTTGAATGG ATTT ATTTG AT 
GAAATGTCAGG AG AATCACC AGAAATGAAATGTT CGTGT AATGG ACAGG GGG AAGTTTG AGGTTG ATTTGAAAAAGGAGATGGTAG 
AGACAGTATTATGGGTCATGCTTCAGAGTTTATTTGCrTTACAATACfi.TTG 

TTCCACATG CTAATGTCCAAAT ATGTCIATATGT CTTAATTGTTGCAT ACGTGGGTT ACT CTGTAAAGGGG CAACTGTAGT AT CACC 

3 0 TCCTCATGCC^CACCCrrcCTCCACTGTAGCTGGGCT 

GAAAGGTTATCTAGCCCCAGTCACCTCTGATGCCTTCCAGCTGTCTCTAACACACCTGTGGGCTGA 
CX5GCTCCCTATGATGAACTGCCCCTGCTTTGAGGGATTTTCCTCCATTGCCTCAGAAGTACTG 
TTCTGACCACCTTGTCTCTGCCnTTTTCACTAGTCCTCTGTTCCTGCTCrCCAAATT 
TTTATAATCTTTCTTTTATATTCTCCCCATCAGATGGTTCAT^ 

3 5 CCTTATCTTTAGCCCTAAATTTTGTCCCAAACTCTGGACCTACAATC 

TTACTOTCTTCCCTCTAAAATCAGCTCCCCATCTTCTAT^ 

AGTCTGGTCTTAAAACCACAGTCAATCTTGTTGCTTTTCT CTTTGTCTCTTGTGCATGGCTGATCAGCAGTCAATTGTT TTGTCAA 
TTGTCTTATATGTTTCTTCTTCCTCTTCCTTGCT^ 

CAGTGACCCCCTAATTGAGAGGCTGTCTACCAATTCCACCCACCCTCAATTCT 

4 0 ' CAGATACTCCTTTGGTATTGTCACTTAGTTTTTGAGTGACTGCC^ 

ACATTCTCAACCTAAGTGTCCAGCCAAGTTACTCTGTTC^ 

TGAGTCAGCTATGCACTGCCCTTTAGAATAGCTTCAACCCCAAATCCTACCAGTG 
TGGGAAGTCTCCCTTCCTTTAGTCAACTTCTAGGGCTTTTGCAACT^ 
TGAGAGTTTTAACCTTCCAATTATGGTACAATGTCTTTAAGGGCAAGGGCCCT 
4 5 GATAAATTATTCTATGAATTCTCACAACTTTCTAATATAAAGGCT 

AG ATTTAATTTG AACTTTAAATGCTCTATTAAATTG AG T AG AAT AG<1AT C TT AAGG CT ACTGGTAGTC TAT CT ATGCCAC AGATGGT 
TTAGAGATCAAATAATTACTC!ATATACCTATGACAGCAGA^ 
GTGAGGCAAGAAAAATAAGTGAATCCAGCTAGGTCTAATACAATG^^ 
AAAGGCIATTCAAATTCAGATTATGTTCAGACATTGCTAATTA 
50 AGGTATTTCAGATCAATTGGGTAT<3CATTTATCTAGGTCT^ 

AATGGGAT AAG ACTGG CAGTATTT AAC AAAGAATGAAGT ATTT T ACAAACAT G AT CCTTT AGG C TGG TT ATT GTTACAC AAAGG AA 
TGATTTGCTAGGATTCTTGAGATACTTCCATAGAATGTCTATGCCTCACTTATTG 
CTGTGTCTTATTTACCCTTGTATACTAGAGCTTGGCAC!ATAGTAAATACCAAGTAAATGTTTTGAATA 
AGCTTCAGCACCACAATTATTTGTATTTCIATCTATCAGTTTGGAAAT^ 
* 55 TATAGCAACGAAGAAGTGAAACTGCTAGAACCTACCATCATC 

TCTGTTTACXIAACrrTATCCTCAGAATTTACAGATGTTTCCTTGATTTCTC 

TGATACTTCTTATGTAGAAATAAGCAGAATAAGCTGGGCAAGGTAGCCCACAACT^ 

GAGGATCACTTGAAGCCAGAGTTTGCTIXSAGGCCAGGA^ 

CTCCAGGCTAGGCAATGAGTGAGACCTAGTCTCTAAAGAAAAAAAAAAAAAAAAAATUVAAGACAAAATAA^ 
60 GAACTGATAAATACAAGGTAGACAAGAAATATCCTGAGAGGCGCAA 

TTATGAAGGAGGTAGCCCTGAGTGAGTCTTAAAAAATGGGTAGACTTTTACCAGGCAAGATCTTAGTGAAGTTAG 

GTCATATAGGAATTCATGGGATTTTTAGACCAAAGATTACCACCCTCATAAGTGATAATGACGATGAA 

CTTGCAAAGTATTTTTCACTVTACCTTGTCCAAGTTCCCACAAGC^ 

TTAGAGAACGTGTTGATTTCTAGTTTCTAACTCCGCCCAATAACATTAAGATGAAAATGGATAGCCACATAG 

6 5 AAACATATTTAGAAAACACAAGTGAATTTTAAAAGGGGAGACTTAAGAGGTGGCATG 

TGGAGAAAAGTCAGCACAATGCTCATTCAGCTACATAAGTGTTTCTGTTAC^ 
TCTTGTCCCTTtSTCCIAGAAAAAATGAGGTATGTGGACAACTGGAGGGTGAGC^UVGGCGGAGAGGAG 
GCTTTCAGGAGACCCAAAGTGGGTAGCACTTTTCCATAGGCAGGAAATGAGTGTTCAACTCT 
ATAGCTCCTTACCACAGGCAGGTCATCCCAGAGAGGTGAGGAGACACA^ 
70 TTGGGAGGCCGAGGCGGGTGGATCACGAGGTCAGAGTTCGAGACCAGCTTGGCCAAGATGGTGAAACTCCATCT 
TGAAAAATTAGCCAGGTGCAGTTGTGCACGCCTGTTGTC 

G CGGAGGATG CAGTG AGC CAAGATCACAC CACTGCAC TCCAG C CTGGGTG AG AGAG CAAGACT CCAT CTCAAAAAAAAAAAAAAAA 

AAAAAAAAAAAGAAAAGAAAGAAAGAAAGAGGAGATCTGAAGTGAGCMCTCCTTCCCCTAGCTGGTAGTCCCAA 

GTCTGGCTGAGTCCAGGGTTTTGTGTTTGTTTCTTTTGTTTTT 

7 5 GCGCAATCTCGGCTCACTGCAACCTCrrGCCTCCCGGGTTCAAGCGATTCTCCTGC^ 
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ACCCACCACTACATCCAGGTAATTTTTTCTATTTTTAGTA^ 
CCTCGTGATTTTCCTGCCTCGGCCTCCCAAGTGCTGGGATC^ 

G CTCAGAAGGGAGGAAGAGTGTGCTGATTGGCCCATGGGCAGCCATGGA CTTACCCCAGG 
CTGTGGACTCCACCCACAATTGGCAACCTGGCTCCCAGG^ 
5 CCCCTTTCCGCCAAGGAACCTGTCTCCCTCTTGCCATCAACATGCT 
AGGCCTGCCCTGAGCC^CCCTCAGCCCCTGCTCAGCCTCCCT 

GGGGG<nAGTGTGTCAGTACXIATCCTGAGCAC CCACATGCCT CTAGGCTTGGCCACAACTTTGCTTC 
ACCCAGTAGCAGGTGCCAGGAGTGGGAAGAGGCCAGGGAGTGGGAGCAGGCACTT^ 

1 0 AACTCAGTACGGGGCGGGACTCCCACCTGTTCTTGGCCTCCCTGGCC^ 

TGCCATCACTGTGAG CTATGATTGTGC C^CTGACTCCAGCCTGGGCAACAGAAAGAGAC CT ATATGGTATGTTAAATGGTAATGAC 
TG CTTTGGGGGAGAAAAGATGACT AAGG ATTGTTGGGAATG CTGGGGTAGGGTTGGAGTGCTCTTGCAATTTTACATAAG ACAGTC 
TCAG GAGAAAAAACAATGAAAGTGTCCGGGT AGAGGG ACATGTACACTGGC!AGCAAGGTCTTC C CT CGGAA 
GAGGCCACACAGGCCTGGAAGCAGGCTTAAGGCCATTCAGGCTAGGAA 

1 5 TGGTAGGTCAAAAGCTCCAGATAACATCTCTCAGCCCTGCGAACACCT^ 
AAGCATGGGGTCCAGGACAGGGGCCTCTTGTCTTGGCTGTTGAAAAAACT 
GGTTTTAAGCAAATATGGTTTGACrTTTATTTTTTAAAAACCACT 

TAG CATC CAGTTGTCAGACTAATGCAGAAAT CAC CATGAGAAATGACAATGACTCTG ACCGATATGG AAG CAG TGG AGGTGGTGAG 
AAATGGTCAGATTCTGGATATATTTGAAGGAAGGGCCAATAGGG 
2 0 AAGATG ACT C CAAGGTTCITGTCCTTCACCT CAATAAGTGGAGG ATGGAGCAG AGGAAGG CA 

GAGAAGGCTCTGCTATGGTATAAGAAGAAATGATAAGAAGCTGAATTAAGGTGATGGCAGTGGGGTGGAAGAAA^ 

TG C AAAAAGTAT CCAGG AGGG AGAATTAACAGGACT AGGGG ATGGGC C ATATTTGCAAG ATG AGAAATG CAG AGGTC T AAGATT CT 

AGCTTAAGGTGGGTGTTGCTACTATTAGCATAATAGGAAAACAAGGCA 

TACAATTTGTAAAACCAGGATTAAAC CCAG ATGTGTCTGATTTTAG AGC CTG AGCTCT TACTCATTG CATAAACCATATTTTCC CC 

2 5 AGAGGAGGATTAGTAGGAAAGGAAGCTGCTGGTTGGAAAGTATCTTTATAGCAGTGTCT 

GTGCCAGGAAAGTCCCCGTGGAAGGGCAAGGAAGAAGGGGAAGTTAAAGCCAGTGGCAGGTGATCCAAGA 
GAGCTATGTTACATGCTGTCCTTTCATGCTCrAAAAA 

CTTTCTTGGATATATATGCATCTGAATAAGGAAGGCTATCITCTGGTCAAGCTAAGGTATGCCATGA^ 
ACTTAATTCTGTTCCC^GTTGTTACCTGCTGTAAGA^ 

3 0 TTTCCCTCTAGCTACTGACTCATTTCTCTTCTCCTTTTTATAGCACTCTTCTTGAGAG 

ACCCATTCTCTTTTGAACCTATTCAAGCTTTCATCTGTACAAAACTCA 
ACTGCTAAATGCAACTCTCAAGAGTATTTGGCTCTACTGATCA.CTCCTTTGTA 
TTTAGGTATGGTGAGGCCAATATATCAGGAAATGACTGTCGTTGAAAAA^ 
ACACCATGCCACAAAGGGCCACATGGGGAAGCACCAGGGTCAGCCAGGAGGTGG 

3 5 CAACAGGAAGAAATGGGTGAAGCAGGGTGAGTGGATTTAGGATTAGCTGATATAAATAATTTCAGCAGGCTCT 

TGTCCCTAGTCTTCTGGTACTTGGCCCTGGGGTGATTAAGGCAGTC^ 

AGTTGTGGGTATGGGCTCTGAAATGGGTTGGTTTGCATTTGAAAGGTGTGCTCATGGGCAAGTGGTTTAC 
GAATTGGCTAACCCTGGGAGCGGCAGTCCCTTCAGGGTCAGCAA.GGCCCCAGGTGTCAAAG 
ATGGATAATACACACTGCCATTTGCCTTTGTACCCTTCCTTTC!AATCTTCTCT 
40 GTTGGAATACCCCATGTCTCAGTCCTTGGGCACTCTCTTTCCTATCTCTCTGTAGGTGATGTAATGCAGATATCCAT 
TCTTTAACACTTCTGCATTGATGACTCCTAAATTTACATCTCT 
GCATTTCAAACCAAATCTACAACAAACGTAACTCTTTTTCCCCTTCCTTAA 
CAGGATCTCCATTGCCTTAGTGACTCAAGCCCCAAACTTAGGAAT 

CTGT CAGCAGTTCC CTTCAGGT C'l'TTTiT CAAACTAT AGAAGGC CT AAACAAAGGAAAGACATGCTGT ATTCATG AATT GGAAG AC 

4 5 TAAATATTATTCAGCTGGCTGTATTCCCOIAATTG^ 

AGAAATGAGCAAGTTGACCCTAAAATTCATTTGAAAATGTAW 

ACAAAGTTGGAGGACTCACACTTTCCAGTTTCAAAACTTACTACAAAGCTACAGTAATCAACGCT 
GACATATAGTTCAATAAAACAGAATCGAGAGTCCAGAA 
ACAGTTCAATAGGGAAAGAATAATTTCCTCAACAAATTGT 
50 AAAGTTCTTATATTTAGGTTCTTGATCCATTTTGTTTAATTTTTGT^ 

GTGGACATCCAGTTGTCCCTGTACAATTGCTCCATTTATATTTATCTTAGCAGAAAATACA^ 
AGGCAATGGTTTCTTAGGTATTACACCTAAAGCACAAACAATAAAAGAAAAAAGTAGATAAAT^ 

TTTGT GT TTCAAAATTAAATAAAATCAAAATTAAATTAAAACATTTATCCTT CACAGGA CT AT AAAG AAACTGAAAAGACAATG 
TG CAG AATGGG AAAAAAT ATTT CCAAAT CATATATTTGATAAAT AGT TTGG CAATT C CTCAAAAAGTT AAAT AT AG AGTT AC CATT 

5 5 TAATTCAGCAATTCTACTCCTACATATGTACCCAAGATAATTA 

ACTTTATTCATAATAGCCAACAAGCAGAAACAATCCAA 

AAAAAG C GAG CCACAAAAGTCCACAT ATTG CTTGATT CCATTTTT^ CAAT TG ATAG AGAC AGAAACT A 

GATTAGTGGTTGCCAAGGGCCATAGGGAAAGGGAAGGGGAAATAGGAAGTTTCTGCTAATGGGTAAGGGGTTCCT 

6 0 TGAAAATGTT C T AAAATTGG ATAATAGTGTTTGTTG CACAAC T CCG T G AACAT AC TAAAAAC CACAGAGT CGTGCACTT AAAAAAA 

GTTTGTGTTTTAAATATATATACACACTTAGACACATATAACCCTCTTTCGTATA 
TTTAAAATAAAATTTTAAAACAAAGAAAAAATATATAAACTCGC 

CTAT CAG CATTTCTG CAAACACATTCCTAACCG ATCT CACTG CTTG TAATCTTGC CAG CAAC CT CT C C CT CT CAGCAATAGT CT AT 
TGCCTACACCAAAGCTTAGTTGTCTCTTAATGATCTAAATGAGGT^ 

6 5 ACACTCAGAGCAGCTCTGCTGTTGCCTGATTTAGATGTATGGCTCCAACAGATTTCCCCTGAAGAAATGATTCCATGGCTG 

AGTTGGAAAGCCTCCTCAGTTTCAGACCATTATCAGATTAGCTGTGTGCTCTC 

AAGAAAG CTTCAG AGTAAAGGAG AAAG CATGGG AGGTACAGCAGG ACCAAG GTGGGG CATTCG CAG C CC CC AC C CT CATCAG AGCC 

AGTTCCCTACTCTCCCTGTCTAAACCTCTTAGTAAGAGGTAGTTCAAGAGAGGGGCA 

ACTACTTTGCTCAGTCAACTAGCAAGTATTTATTGAGAATGTAGCTCTGT^ 

7 0 ACATCCAGTCCAGGTAAAGAAGATGGTGTCCATTATTCATTTGACAAACAAAGTTGGGG 

ATGGAAAG CTT CATG AGACGTG CAGGTAACTG CCAAT ATGTGTG GTTCACAAGGACTG GTTCATATT CAG AAACGGCCATTAGAAA 
AGGAAGAAGAACTTCTCATTTGGATTTATAAAGAGTGTCTTGTTTACTCTTO 
TAACTTCTCCTCCCAGCTCCACTCTACCATGGTCTGTCACCTTCCCCAAATG 
GAACCAAAGACCCAGC^AAGTTTCACAAGGCCCTGAGACTTT CAATT 
7 5 G CT AATGTTGAG CCTGGAATTGCAGCTT CAC CAGATAG CAGGT AAGAAAGG ACAAAGG G AGAGG CTT AAG AAAGAAG AG CAG GTGG 
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TGGTTCCTAGCCAAAGCCAAAAATGAG^T 
fljVGCACXIACAAAAAAAAAAAGAGTCCCCATGG 
CTCGATCTATTCCCACTATCCAGTGAGGGTAATGACAAGAA^ 
CAGGGGTTGAGTTCCCTTAAAGGTGAGATGCCAAGCAGCTC 
5 AACACCXSTTACTCTAAAGATAATGTCTTGAAGACATTCTGC^ 

TAGCRGCCGTTGGTCATTGATACCTTAATGTGGTTTACTGAGTC^ 
TTGGACTTGAAGAAAGGACTTCTATTGGAGGATGAGGGTGAGCAGAAAGAA 
GTAGTTCAGACCATTCGAGGGTGTAGTTAGAACCATGAGTGCACTATT^ 
TGTGGACAGAGGATTAAGTCCTCAAGACAATAGCCCCAGCCCCAT^ 

CTCACTTGAGCTGCCCAGCCrcGTCTGGC 

CTTCT ACTCTGCTC CTGC CT GCAGGGCCAGCTAAAGGAACTTTT CATGTTTTC TTTG CAAGGAACC CTGCCTGGCTGG CA 1 x i"iAG 

ACTGTTCCCCATCCTGTGGCAGGACTGGGGGCTTCAGGAGAAAACTTG^ 
1 5 ACTGAGAT CCTGGGTTTTCTGTCTITTGlTTlTrG CCTTTT CT CTTAGGAAAGGAGTG AGCTAGGGTG ACAAGGGGCAACATTTTTT 

ATCCCTC^TTGGCTCTTTCrrACAGAGGAAGGATCTTTTCTTCTA 
GTTAGGTATACTAATGGGGCAAAAGGAAGAGCATTTACATTTATTGAAGATTCACTA 

AT ATGGT AT AGTTCATTT AAT CTTCACAATG ATCATTTTG CAGG TGAGGGAACTT AG AACTCAGAAAAGGTACTTAATTTCC CCAAG 
ATTACAT AGTT ATTAGGTGAGACCGGCARGATTTCAACAAAG CT AATGTCCTTTCT ACTIT? ACTX5TG CT ACXIATGATGATG GTAAT 
2 0 CAAAAATGGCAGACAAC CCATAAATCTTT CCAACTTTGG AATA TG AAGT CTGAAT ATGGATACGTATTGAATGTTT 

ATT CTGGATATTCACAGAAT CAAAAAAT ATGTGT AATGAATT ATGTTG CTG AATT AACTG AAAGGAAAGTAAAAATGT AG CG CTTT 
CTCATTTTCT T CACGAATTTGGAATT CTTTTCTGCTTTC CACT ATG CAGAT AACAT CAGTTCAG ACAAATATT AAAT ACCTACCTA 
AAT^AGAATGCCTTCTCCTCATGGGATTTTTTTAAAATCTTGTCATTTCA 
GGTACCTGCAAAAGAAAACAACAAAAAAACTAAAGGATCTGAGAAATAAT^ 

2 5 TGGGGGTAAAAAGGATAGTCTCATCTATTATGTATATTCAG^ 

ATAAAATATAATTTTAAAAATACAAGAAATTTCTCATATAAAAATATGAAAGTAATCAGACre 
GCTACAGCTATCAGGGTGTCCAGACAGACAGAAGATTACATT 

G AAAAGAAAAG AAG ATACCTGAATT AAATCAATGTGATG CTTAGT AC C CTATCAGTGCACATTTCITT TCTATTTT T AAATTTTAA 
AAATAACACTTGGCCAGGCGCAGTGGCTCACGCCTATAATCCCAGCCCT 

3 0 AGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCTCITGTAOTAAAAATAGAAA 

TAATCCAAGCTACTCAGCTGAGGCAGAAAAATTGCTTGAACC 
CCAGCCTGGGTGACAGAGTGAGATTCCATTTAAAAAAAAAAAAAAAGAAAAAAA^ 

ATAAATAAAAACAAT CGATATGC CCIGG AG AATTTGATTCCAGGAGT AGGT CT AG AAG AACTT CAACTGGAG AATGG AT AG AG AAA 
TCATGGTATATTTGCAGAATATATATTATATATATAATAGATAGCATGTGAATAAATTA^ 

3 5 TATTATATAGCATGTAGATAAATTACAAAAACATGTAACTACATCT 

TAAAAATT AAGGCCAG AAG AT AACACAT AGCACAATGT C CTTTTCATAAAT CATACCTTATAT AT AGAAG 
ATAAAGCTTAAAAAGTAAAGAAGAGGCCGGGCGTGGTGGCTCACGCCTGTAATCCCAG 

ACGAGGTCAGGAGACCGAGACCAGCCTGGCCCGCATGGTGAAACCTCGTCTCrACAAAAAATACAAAAATTAG 
CACAAGCCTGTAATCCGAGCTACTCGGGAGTCTGAGGCAGGAGAATCGCT^ 

4 0 CACGCCACTG CACT CT AG CCTGGG CGACAG AGTGAGACT C CGTCTCAAAAAAAAAAAAAAAAAAAGT AAGG AAGAG ACTGATAAGC 

C CGAT ATTCAGGACG CCAGTTACCTCTGGTGCAGG CAGACATACATCATTAGTAATGTG CTAGTTCTTAGGGTAGGTGGTAGGTTC 
ACAG ATGTT CATT TTACITAATT AAAT AT AT ITTG AAAATTT ATTT AAAG CTTT CTGTTG T AAAATAT A 

TG AAACAACT AT AAAG CTT AACATT ACT ATAAGGTG ATT AC TTTTGT AAC CAC CAC C C AG GTT AAGAAG AACTTTGT CAGCT C CCC 
AGAAATGOTTCACATACCCTAGCCCAATAAAACCTCCTTTGCAAA^ 

4 5 TTTTTTAAATTAAAACAAAACTTTTTAGAGACAGGGTCTTCCTOT 

GCAGCCTCCAACTCCCAGGCTCAAGTGAGTCTCCTGCCTCAATCT 
CTAATTTTTTAAACATTTTTAGTGATGGGGTCTTGCT 

CCCAGCCTCCCAAAGCACTGAGATCACAGGTGTGAGCCATCACTCTCAGCCT 
AGGACTAAAAGAAGTTTTAGTAAGCATTCAATAAATGTATGTTCTTTATAGTTTCCAAATCAG 
50 TAAGGAGATTTATATATTTTATTGGACATGCTGTAATTTATTTAACCACTTCCCTGTTGGTAGACATTATTTCCA'^ 
TAGATTAATGCTTGAAAAAAATGTGTGCCTCCTAAAGACTGTGATGAAAGTTGCCTCTGAATAAAACTCAAACAA^ 
TTAACTCTTTCCTTACTTGTATGCTCTTTGGATGCTCT^ 

CATTTTAT AT CATTTTTAAAG G ATAT ATACATGG ATGT ACTT ACAT ATG CATG TTTAAATTT ATATACCAT AAC ATTT ATTCTT TT 
TTT AAAAAAGTCTTATG AATTTTAACAAATG CAT AGT C CTAT AACT AC CACCAC CACAGTGAACAT CCAGG ACAGTT CCAT CCCTT 

5 5 GCCAAACAAACAAACAAACAAAAAGCTTCACCCAAGTCTTAAGTCC^ 

TTTTCCAGAATGTCATATAAATGGGATCATTCAT 

TCATTCATATTGTTGTGTGAATCAATAATTTATACCT^ 

AATTCCCCAATTGGGGCACATTTGGGTATTTCCCATTTTGGGTG^ 

TTGGGCAAAC CTAGGT TTTCATTT CACTTGGATAAAT ACCT AG AAGTGGGATT GCTAGGTCACATGGTAAGTGT ATGGT TTATTGT 

6 0 GAGAAACTGCCAAACCTTTCCATAGTGGCTGTACCATTTTTTCATTCCC^ 

CITCCCAGCACTGGTATTCTTTTCrAT 
TGATCTTGGCTTACTGC^CCTCCGCCTCCCGGGTT 

CACCACCATGCCCTTCTTTTGTTTTGTATTTTTAATAGAGACGGGGTTTTGCC^ 

C^GGTGATCCACCCACCTTGGCCTCCCAAAATGCTGGGATTACAGG CATGAGCCACCACG CCCTGCTCTTTTCTTAAGTCATTCTA 
65 AT AGGTGTGT AG AAGTGG TTCTAACTTG CATTTT C CT AATG ACT AATGATATTAAGT AT ACAT AT ATT T ATAT ATTATTTT ATAAT 
CTTATATATAATATAC^TATTTTATATATTATATATATATTTTATATATATATACACA 

AG ACAGGGTAT CACT CTGTTG C CCAGATTGGAAT ACAGTGGT ATAATCAT AG CTCACTG CAGC CT CGAT CTCTCAGGCT CAAGCGA 
TTCTC CCAC CT CAG ACTCTGTAGTAACTGG GACT ACAG GC CATGCCAC CATG C CTGG CTT AATTTTTTTTTTTTAATTTTGTAGACA 
TGGGGTCTCCTTGTGTTTCCCAGGCCAGTCTCAAACTCCTGGACTCAAGCCAT 

7 0 ACAGG CAT AAGCCACTG CAC CTG GCCTTAGTTAAGTTTTT ATG TACTTATTGT ATAT OT 

CCTTTGTAATTGTATACTAAGXiiAATTA 

TTTGAGACGCAGTCTCCATCAGTCGCCCAGGCTGGAGTTCACTGTCGTGGCTCACTACAGCOT 
CTCCAGCCTCAGCCTCTTGACTGGCTGGAATTACAGGCATGTACC^ 
TTTC^CTATGTTGGCCAGGCTGTCTCAAGCTTCrGACCrCAA 
7 5 GTG AG CCACTG CCAC G AC CTT AAGAGTT CTT TATT CT AGAT ACAAGTT CTTTAT CAG AT ATTGATTTTCAAAT ATT CTTTTG AAGT 
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CTGTAGCTTGTCTTTTCATTTTCTTAACTGTGTCT 
CTTTTATGGATCATGCTTTTGGTATCATATCCAAAAAOT 
ACACATTTTGTAGTTGTATATTCTATATGTAAGGTTTATAATCC^^ 
TTTTAGTTTTACATGTAGATGTTCAGTIGTTCCAGCCCCATTTGTTAA 
5 CTTCATCAAAATCAATT AGTCATATTTATGTGGGTCT ATTT CTGGACTTTTTGTTC CATTAACTTATGTGTAT ATCCTTTTGC^ 
TATCACACITATATTTGTTTTTTTTCTTACATTTTTA 

GTCATGTCTTAGAGGAAAG AG CX!ACCCCAGTCTTTTTTCATTG ATGTTTTTCTTCTCT C CAGAGGTAGATGAAAACC 

AGAGGGCCACAATGACCATGGTGATGCCTGAGGT CATTCTGGGG CACAGACCTCAG CCTAGGTTACT CCACTTCGC CT ATCTTTAG 
ATC CAAAACT ACCCTG CTGACTGCTGAGATAAACAAAGGAGAAT AA GAAAGGATTTCT ATG CGAAGACATGTCTCC 
1 0 ATGC7U3TCCTCCTAC!ACTGAGCAGAGC^GAGTCAGGTGCOT 

AGAATGTGGCTATGTAAAGCACCCCCCCACCCCACCCTTACTTCTCAA 

CCTG AAT AAGTGACAGGTGCTG ACTACCATTGATGCTTCATTGCAA CATCA 
AGCTAGGTGTTCTAATCTTIGCAAAAGGACCTGCATACA^ 
1 5 GGG CCCCACATG AATGGAAAG CCTGTGTTGTCAG CTTAA'iTT TGT AGTTGTGGAAACCTTCGAGTTTT C TCl; i"ixGT CT AAT ACCT 

T CAGG AGTTCJUVTC CT AG GTTG AAGCTT AATTTAATAACC^ CATGT AAAGTAGAAAACT^AAACATCTTTT CCTT AG CAT AC 
AGCAAAAAAAAAAAAAAAACTCACTCATGGATGTAGTGTACACATGCCAG 

AG AAAT AAATGTGCATTGAAGACACAGAG ATG AATTTGAATT C CGGTT C CIATG AG CTACCATTCTTTGACCTTGGG CAAGTTGCAT 
CACTTCTTTG AG CCTCAGTTT CC CCAT CTGTAAAATGTGGAT AGCAACAT CTACCT CG CAGAGTTGCTGGGAGG ATCAG ATGAAAT 
2 0 GATGGGCTAGCCCAGTGTCTGACITACTGTGTTTAAGAAATATCAACT 
GTGTTATAACTCTACTTCTC!AAACATTAATGTGCATGCAAATCACCGT 
GGGAAGGCITGAAATTCTACATTTCTACCTAGCTTCCAGGTGAAGCT 
GC^CGTGCAAATGTTACTTTGTGTAACTAATGGTTCGCAATACrc 

ATG CTG AC CTCATCT C CAAAATAATGTAATTTTAACAGATC CAAAGT AGTGC CTGGGC^ C CAGGT 

2 5 GATTCCAATGTTAGCCAAGATTGAGGCCCTCTGCTTAGTGAATCT^ 

GTTTGCATCAGTGTCTTGATTGCCTCTGTAACrTACAATAAGCAAGA 
CTCTTATTTTGTATGGAGTAATCAAGCrrCAAAGTTTGCATGTCTTA 

AT CTTCT AGGTATT CTGAGCT ATTCAGAC ATATGATTCATG TTTG CT AATTGCTCTTTTCT CTTGT AAATATT AGCTGAAAAATGT 
CACCTGTCTGACAAGTAGCATATTTTATGCCTATCACTCCrrGGC^CGC^ 

3 0 GACTTTTATCAAAGGCCCAGGCAGTAAAGAGGGGAGTTCTGCTGTAAGCTAAGGGGAGTTCC^^ 

TCTTATGACAAGAAAGCATAGTGCAGTAAATAAATTTGCTAAATAG 

TTGTTATGCAGACTCAGCAACTAACCTrCCTTGTAAGCCCCATTTTCT 

AAGTATT AT AG AAGAGTAAAAGT AGTAGGT AATT CTTGAACTTGG C!AT ATGATTACTAC^ 

CITCAAGGACAGATTGGATTTGGTATCAGGTGGCTCTGCATTAAGTTATAA^ 

3 5 CATCATAAAGGGATCTGC CCTT AAG AGG GT CC C^TTATGGAG ATC 

AAAGAATCATACAGGAAATCTCTTTTTACATC 

TGTTTTACACATGTACACTCACCTTATAGCCCTGCCCCT 

TGTAGAAAGGCACATTAATGTTCTAGCCATGGTTAAAAC^ 

AGTTTGTG CAT AT CACTATCAAGTGTGCTGTGTGG CAATT AAGAATGC CAATTTG T GTGAT CACAG G CAAGTTG CAG TTTGATG AA 

4 0 AGGAAAGCAGAGGTGAATATATAACCAGGGTCATCOTTTCTITTCTCCCT 

TAGAACTTGCTATGTGCTAGGTACTGGATATATCAAAGCAAACTCAGCCTGGTCTTTGCCTT 

G AAAAACTTGAAT CAGAGG ACAT C TGCAGTGGGAATCAT TCAAGCAGCAG AAAACC CAAAAGTTACT T ATACTG TGAAATCTGATC 

AGAGAATGGACTGTCCTGGTTAGTAAAATATCCTGGAGGATAAAGATTGGCCATGCATTCCAC^TATGAATTAC 

AATTAAAACATGGTACGAAAGAAAGGAACGAACATTTGCTGAGTGCCTACTGCCTGCCA^ 

4 5 CGTCCAGCCTCCACCACAATACTGAGAGGTAGAG 

GGACTCACAGTCAAAGGACTCCTAATCCTGAGTGAGCATTTGAACCCAGG 

CATTTTATACTGAAACAACTTCTGGAATAGCTAATGCTTA^ 

ATG AATAAATGAATG AAGCAAGCT CTACTGAACATAATTTGAT CT AAT CTT 

AT ACCT CCATCAT AATG AATACC CATTCATT AATG ATGG AAG CAGCCT AATTTTG T CATTTTT CACAC TTT ATTGATGT AACACTA 

5 0 CCTTTACTAGTTTGG CCACTCCTT ATGCTTTT TIT AT AG AACT ATTT AGAT CAATT CAACTTTTAAAAAAT AAAGC CACATACCCC 

TGTGGTAG ATGAAAAACAAGTAT C!ATTTGCACTGGTAAAT AG AG AAT AGGAAG AAAAAT AAATGCAGTG AAAATAAAGCAGTGT T A 
TCAAATCCTACCCAGATACTGTTATCTACCCGGAAGCTTCCTGT^ 
CTAGTTGAAATTATATGCAATTGAAGGATTAAAATAGAATTGAAAAGGGAATA 
CTTTGATTCTGCCTCATCTAAAATCATCTTACATACTTCTAGTGGCGTGTCCCTCACAT 

5 5 GATTTTGTAGTCAAACATCTGACTGGCTCrrGGATTTTAACTTTACTAACATAT 

CAACT ATATAACCAT AATGACAAAACAGG TTTCACG^TCATGTTTTCAGT AT TAAACAATATAGTT AAAAGT AC CT AG CACAGTG C 
CTGG CATAGAACAT ACTAGAT ATACATT AAGTATCAGTT CCATT TTT CCT^ CTACAG 

CATTTTTGTTTTCTGAAAAAAAGGATAGAATATTATTACCTACCTTGCTGGCT 

6 0 CAGAGTGTCTGACAATTGTAAGAATTCAACAAGTAATAGTTATTATTACCATCACTGGTGA 

TATATGGATTAATCAATATGGATTGAGGGAAACAAACCTGGAGAATAGGATGTGAAGGTATTTAAGTAACATGAG 
ATGGTAGGGAAGTCGAAAGGAAGCATTTTGTTCTTATATGACAGATGACCTGGAATGACTGCAG 
AGGTGGGAGAGGCCTCTGAGAGCAAGCAGTGCTGTCCACCAGAAGCTCTTGCT 
TGCAC!AGGAGGGAATGTTTGGAGGAGAGAGCCACCTCAGATCAGCGGGTCAAGAATCCCACT 

6 5 GAGAAAAAGGATTCGCCACGGGAATGTCCAGATAAGACAGGTGCCTTTT'GGAAAATG 

GT AAGAACTGGAATGTAAAGT AAAGG C^G ACAATGACAAAATAT CTTGTTTTCTTT^ GACAGTCCCT AAG 
G AACTGT ACAT AATAGAG CATGG CAG CAATGTGAC CCTGG AATG CAAC TTT G ACACTGGAAGT CATGTG AACCTTGG AGCAATAAC 
AGCCAGTTTG CAAAAG GT GGAAAATG ATACAT CCCCACAC CGTGAAAG AG C CACTTTGCTGGAGGAGCAGCTG CCCCT AGGGAAGG 
C CT CGTT C CACAT ACC TCAAGT C CAAGTGAGGGACGAAGGACAG TAC CAATGCAT AATCAT CT ATGGGGTCGCCTGG GACTACAAG 

7 0 TACCTGACTCTGAAAGTI^AAAGGTGAGTGGTGTCAAGGACTAGAATCCATGGAAGCATCTOT 

GAAACCGATTAAAGGTAGCTCAAGCAAAAAGAAGCAGGCTr^ 

GGAAATAATTTTTGACATCTGAGTACAAAGCAGCCGAAGTACAAGTGAAAGGGGGTAG 
GAAGCCAGGAAAATTAGTCCCTGAAATGTGGGAGGGTATGAAAAATAAGCTTTC 
GACTTGATTATTAAGATACTCTTTTTCAATAGTTTATACCCTGA^ 
7 5 T TAAAAT ATCAATTTAGTGAGT AG CAACCAGATG CACXSTTTCC CGCCCTTT AAAAAAT AATGT AT AGAAGAG AATAGACAGAGT AG 
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ATCAGACGACAT(^CAGAGTAGGAjCTGAGTAC^ 
ATAAATITITTITTI^ITACTTT^ 

GAAGTAGCTGAGATCCAGAGAAGGGAAGTGGGTAGCCCAAGAGAGTCC 
GGATCTCTCCACTCCAGCACAATTGAAAATCTAAA^^ 
5 AAAGAC^TATAACAACTATAAAAAATAG ATTT^ 

C CAATATrrCTC^GGCCTCTTCCCXTTCCCACG CATTGGACCTCAGATGGTCTGT ACTGT CTTCTAG&GGGTTTGTGGGTTTTGGC C 
CCAAAAAACCATTAACCTTGGCAGAAAGTGTGTGACT^ 

TG AATATTGTGTTCGCCCTT ATCCTTGGG (^GGTTATTTTG CCTTTCATAAC CT 
AGTAGTACCAAACACACAGGATTGTTGTCAACATTGAGAGAGATA^^ 
10 GTGTCCAGAGACTTGGTTAAAAG ACCCTCAGTTGTTACAGGGG CAGTG ACCT C CTCACACCTCAACCATCAATGAGTCACCAGGAA 
AGCCATTAGCCTAGATGTAACTGTTTTCTATCT 

AGTTTGT CTGAGTT CCCTTAATGTAAGGCTGTACATTCTCAGG ATGCCTTGATGTACTCG AAT ATCTG CAACCCT AAATCACCACC 
TCTGTTTTT ATTGAT CI'CT ATCTGAATGCTGTATT AATGGG CCAGGCCTTCTGC CCA i. lt± CAAACTGAGAALTGTCTCTCATT 
CCTGGGGAGGCACCCTGCCTACTCCTTACCTAGATC^GGG 
1 5 AACIt3GGATATTTGG<^TTACAAAGACTTTTCAAGGGATC 

TTCCCCAAATACTCGTTCCCAAAATTGACGAGCCTGACAATGTGC^ 

CTTTTAAG CCTACCACTCACTCAT CATG AAT AT AGTCCATTGTC CCAGGGTG TAAAACCCTCT AT AGTGTTAAAT AAAAG AATGAT 
TGGGAACATTGACACCTGATGGAACTGTTATC 

TTGAAGGAG^CCCTTATAAAGTTAATAACTGATAATCAAAAATGAGTAATTTTTGCCATGTA 
20 ATTG CTGTAACGAAACTG C TT CCATTCCCATTGATT T ACT CATACGAACAAG ATTC CTT AG CCTTT AT AAGCT ACAAAAAAATGAA 

GAAAAG AGAGCCATAT CCAT CTTATT AAG GGACACATTCCGAATAAAT TTTCATCTTT CATGTTT AATA^ CAT AA 
CATTTTACATTTTGATCAAATATGTGTTAATAATAATAGA 

TATACAC CATAG AATACT ATGCAG C CATAAAAAATGATG AGTT CATG T C CTT CGTAGG GACATGGATGAAG CTGGAAACCATCATT 

2 5 CTCAGCAAACT AT C C CAAGGAGAAAAAACCAAAC ACCGCATGTTCTCACTCATAGGTG GGAATTC GG ACA 

CAGGGAGGGGAACATC^CACACTGGGGAC!TGTTt3TGTGGTCGGGG 

ATGACAAC^AATGGGTGCAGCACACCAATATGGCACATGTATACATATGTAACRAACCT 

CTT ACTGTTAAAAAAAAAAATTAGATCTTAATG CAGAACACCCTGAACATTT AAAG CTTCAT AGTCACAAG AG AAAAGTTTTCAT T 
TCAATAGCTATAAATATTrTGTTGTTGTAAAGACATATAACGATAA 

3 0 ATAAAATT CTGTAGGGAAGG TGAAAT TGGAAG TGAGTTT CAATGAATG AAAAGAAACAATTT AGrACAGAGAAG AATTTTTTCA^ 

AAT AATTTATTTATTTTT ACTTAG AAGG AATTGAAT AGATTAG GT TCCTTAC CCAAAAAG C CTCTGT T ATTTGTCTTATTT ATTTA 
TTC TCttti~1'TC CACATT CT CCAGT CT CATTCCCCTT TT^ CATGTT T CATACAT ATTC1TTTGTT 

TGTAAGAGCTTATTTAAAATATGTAATATTGTTTTAGATGCATATATTTTTTTTCTTGTGGAAACT 
ATTTTAGAAATGGACACATTAGGCCGGGCG{XiGTGGCTCACGA 
3 5 AGG TCAGG AG AT CG AGACCATC CTGG CTAACATGGT G AAACCCT GTCT CTACT AAAAAAT ACAACAAAAATTAG CCGGG CG CG GTG 

GCGGGCGCCTGTAGTCCCAGCTACT<3GK3GAGGCTGAGGCAGGAGAATGGCGTTAAGCCGGGAG^ 

TCGCTCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCTGTCTCAAAAAAAGAAAAAAAAAAAAAAGAAATTGACAC^ 
TTTATTGTGAATU^CATGATTACAAGCAATAACAGATATAACAGCAGAAAAAAT^ 
TTTTTTGGGTATATGTGAATGTATTTTATCTITTTTTTAATTGACAATAATTC 
40 ATATACAATGTATAGTGATCAGATCGGGGTAATTAGCATATTTATCATCTCAAATATTTATTA 
ATGTACCCCTTCTGGCTATTTGAAGCTATATACTGTTGTTAAATATTTATATGTATT^ 

AACTT CTTT TAT AT TTAGTCAC CTTTGG ATACAT ATGTAAGAGT AAAAT AAT TGTTTTACT AAAAAAATTCATAAG ATG CTG AACA 
TTATATGTTTTTCCAGTATAGAAACTTATGAAAAATTATAAGAATCAACTTAATAGGTGAGAAGA 

TTAGGAGGAATACAAGCAAAAAAAAAGTTGGACCATCACCTAAG AGTAGGCTACCCTGATCCAGGGTCT CTAAAAG AAG CAAGATA 
45 GATCTGAGGCCCCAATGAGGGTTCCCATGAATGAAGTAGGACTAGAAGAAGGACATGTTTTTATCr^ 

ATAAAG CTTGAAAATGGGTTTGT TTAT AG CAC CTTCGTT G ATG T CTG^G CAG AAATT CTT ATCAAATAT AAT GAAAAG AGT TAATT 
TCAAAAGGACAGAACAAATTATGAGGTGGAATGACCG 

GGTGGCACCTACATTTCTGGGGGCAATCCATCTGAGGAAGGGGCGGGAGCATGGCC^ 
GTCTCTCAGAGGAGCTGGGGTTGAGAACTGGTCTGTGCAAAGGTTTCCAGA 
5 0 CAAGAGCCAAACCATCCTGAAGGCCTGGGCAGGGCAGGGGGATGGGAAGAGCCAAC^ 
CTTGAAACCAGAAACAATGAATTTAAACTATGCCCTACGACAA^ 

CAG AATATACATTCTATTCATCAGCAT ATGG AACATT CT CCAAG AT AG ACTAT ATAGT AGG CCACAAAACAAG TCTCCAT AAATTT 

AAGAAAATCAAAATTATATCIAAGTACrCTCTTAGATCACAGTGGAGTAAAACTGGAAA 

TGCAATTACATGGAAGTTAAATAATGAGTGATCATTGGGTGAACAATGAAATCAAGA 

5 5 GATAAT AGTGACACAAC CTATCAAAACTCTGGGAT AC AG CAAAAGC AG TGCTAAG AGAAACATTCATAG CATTAAATGCCT ACATC 

AAAAAGTCTGAAAGAGC^CAAATAGGCAATCTAAGGTCACACCTCCTGGAGCTG^ 

CAGAAG AAAAG AAATT ACAAAGATC AGGG CAG AACTAAATGAAATTG AAACA^ CT 

GGTTATTTGAAAAGATAAATAAAATTGATAA 

ATTACTACTCATATCACAGAAATACAAAAGATTATTC^ 

6 0 GATGGATACATTCCTGGAAATATACAACCCTTCGATTAAACCAGAAAGATATAGGAACTATAAAC^ 

ATTGAATGGT AACTTTTAAAATTGTCAACAAAAAAAAAT C CAGGAC CAG ACAG ATT CAT AG CTGAATTCT ATCAG ACATT CAAAG A 
AGAATTGGT AC CAATT CTATTGACACTATTC CATGGG AT AGAG AAAG AGGG AATCCT CCCT AAATCATTCTATGAAGCCAGT AG CA- 
C C C TAATACAAAAAC CAGG GAAGGACATAACAAAAAAAGAAAACT ACAGGCCAGTAT CCCTGATG AACATAGATG T AAAAATCC TC 
AACAAAATACT AG CT AACCAAATC CAACAGCATATCAAAAAG AT AATCCAC CATG AT CAAGTGG GT TT TACACCAGGG ATGCAGGG 

6 5 ATGGTT TAACAT C CACAAGTTAAT AAATGTGATACACCACACAAACAG CACATG ATCATCTCAAT AGAT T 

CAGAAAAAGCATTTGACAAAATCTAACATCCTTTATGATTAAAACCCTCAGCAAAATTAGCAC^ 
TAAAAGCTAACTACGACAAACCCACAGCCAACATTATGCTGAATGGGGAAAAGTTGAAAGCA 

CAAGACACTTT CAC CACTTTTATT CAGCATAGTACTGGAAGT CCTG GC CAG AG CACTCAAAGAGAAAGAAATAAAG GG CATC CAAA 
TTGGTAAAGAGGAAGTCAAACTGTTGCTGTTrGCTGATGATATGATCATATACCTAGA 

7 0 CTAGAACTGGT AAACAAATTCAG CAAAGTTT CAGG ATACAAAAT TAATGT ACACAAAT CAGTAG CTCTGCT ATACACCAACATCAA 

CCAATAGCTGCAAAAATAAAATAAAATACTTAGGAATATACCTAACGAAGGACATGAAAGACCT 

CT ACTGAAAG AAAT CACAG ATGACACAAACAAATG GAAACACATCC CAT GTT CATG AAT AGGTACAAT C AATATTG TGAAAATG AC 
CAT ACTTTGTAGATTGCAAAAG CAAT CTACAAAGT CAAT GCAATCC C CATCAAAAT ACT ACCAT CATTCT TCACAAAAGTAG AAAA 
AGAAAT C CT AAAATT CAT ATGG AAC CAAAAAAG AG C C CACATAG CCAAAG CAAGG AAG ACT AAGCAAAAAGAACAAACCTGGAGG C 
7 5 ATCACATTG CCTAACTT CAAACCATACT ATAAGAC C ATAGT CACCAAAACAG CATAAT ACTAGT AT AAAAATAG G CACATAG AC CA 
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ATGGAACAAAACGGAGAACCTAGG AAT AAAGCCAAAT ACTT ATAG C CAACTG ATCTTTG ACAAAGCAAACAAAAACATAAAGTGGG 
GAAAGTACATCCTATTCAACAAACGGTGCTGGGATAATTGGCTAGCCACATGGAGAA 

AT AAAAAATCAACTCAAG ATGGAT CAAAGACTTAAATCT AAG AGCTG AAACC^T AACX5AT TCT AGAAGATAACAT CGAAAAAGCCC 
TTCTAGACATTAGCTTAGGCAAAGACTT 
5 ACTTAATTAAACTAAAAAGCTTCTG CACAGCAAAAGAAATAATCAG CAGAG TTAAOUJ ACAACC CACAG AGTGGGAGAAAATCTTC 

ACAATCTATACATCTGACAAAGGACTAATATCCAGAATCT 
CAAAAAATGGGCTAAGGACATGAATAGACAATTCrCACA 

CACTAATTCTCAGGGAAATG CAAAT CAAAACCACACTGCGATAC CAC CTT ACTCCTGCAAGAATGGCCATAACAAAAAAATAATAG 
ATGTTGG CATGGATGTGGTGAAAAGAAACACTCTTACACTGTTGG CACTATGGAAAACAGTGTG 
1 0 GAG ATT C CTTTT AAAAACT AAAAGT A3ATCT ACCATTTGATCCAGCAAT 

T ATACAAAAAAAAGATACTTGCACACG CATTTTT ATAGCAGCACAATTTGCAATTG CAAATATATGG AACCAG C C CAAATG CCCAT 
CAATCAGTGAGTGGATAAAGAAAATGTGGTATATATATTATACATACATATA 

TACTTAG CTATG AAAAGGAATGAAATCATAGCATTTGCAGCAT CCTGGATGG AATT AG AGAC TATTATTCTAAGTG AAGT AACT CA 
GGAATGTAAAAC CAAACATTGT ATGCT CTCACTCATAAGTGCGAGCT AAG CrATGAGGATGGAAAGGCATAAGAATGAT ACAATGG 
1 5 ACTTTCAGGACTT AG GGG AAAGG ATGGTTGGGGGTGAGAGATAAAAAACTCCACATTTGGGC CAGGCGTGGTTGCTCACAC CTGTA 
ATCCCAGCACTTTGGGAGGCCGAGGTGGGCAGATCACGAGGTCAGGAGATCAAGATCATCCTGGCT 
CTACTAAAAATACAAAAAATTAGCCGGGCGTGGTGGC 
GGAACCTGGGAGGCGGATTTTGCAATGAGCAGAGGTCGCACC^CT 

AAAAAAAAAAAAAAAAAATGG C CTT CAC CCTAAGAAG GAAG AAAC CAAGGGCAGAG GAAGG AAACAATCAAAGAGAAGCAAGGCAG 
2 0 CAGTGAAAAGACAACTGTACGTTTAAGCCTCTAAGGCCCAAGGAA 
CTTGGGGAAACAGAAGGGTGTAAGAAAATCGGAGTCAAGAAAATC 
ATAACACGTTTTTTAAAAATAAAGAAAGAAATTATTTTAAAAAAGAA 
AGCTCCTGGGGATTACAAACTCGTTAACCTGCTATTCCT 
ACTGATTACTAGAAACTTGCATAGATCCTCTAAGACCAAGGCATC 

2 5 TAAAACAGATACTACCATGTATACATTTCATCTTTAT^ 

AAACCAGGGAAACAGAAACCATTTAAACCTGTGGAAATCTAAAGCTGAAACTAGTAAT^ 

AACAGAGAACAGGGAAGCAATT CAGAAATT AGCAATAGCAG AAAGCTG CT CCTACCACTAGGCTATAGTG AAAAGGGGGAAGGCAG 
GGTT AATGGAG CC CAGGG ATT" AAAGTCACT TGACAG ACACTGG AACC CAGTG GGCCTGTC CATTGGTAT CTGGAGC CAGG GAAG AG 
ATACATCTTCTGCTGGAGAGGCCACAAAAGTATAGAGCATAGGGGAGAAAT^ 

3 0 T CCTGTCTTG CTT CCTATTGGC TGTCACGTTC CTAGGCTATACAGGCCACAGGAGT CTG C CTT CCTG CAACACAGAG CAG AG AAGA 

AAAAGTGAGAGATGGATCIGAAGGCAAACAGACCCAGGACACACAAAGCAC^ 

AATAATACAGTTG ACTAG AT CT ATAACAGACTG AAAAG CT AGT AGTTGTTG AGT CAGGG AC AAC GAAGGCTGG CT G ATGGC CT CAG 
T CAACTTGG AGACAATCTCCAAGTG CTAAGGGGTCTGTTATTCAGGTTGTAAGGATATAG CCAACCTAACTGAGCAGCAGATCATA 
TATTTAGCAAAACTGATCITATGACCTTAGGTTACATTAAAACCCTGGATATT^ 

3 5 GTGTTACATATCCTTTTTAAAGAACGTGGCAAAACT^ 

TGTGAG ATAT AGTT AAAGGAGGT AGG CATGTTTT ACTT AGGGTTGAAAAGACT CAAAT AT CA 
GAAGGGTTACCTG CAATAATTGCTAACACT7UVTTGAGTGTTT ACTACGTTCT AGACAT CT AACT AATTATTT AACAC C CTTGTTAA 
GGATAAGTATTGCTGTATCCATTTCCTAAACAAGAAAACTGAGGGTTTA 
AAACTGCTCAGATTCTGAACTCATGTTTTTAACTACGGTGCTTACCATATGTTGTGTTAC 

4 0 AAGCTACAAAGAGGCACIAATTTTGCTTGGCATGAA 

GACTG CTTCTAGAGGAAGTAGGAGTTTCCATT CATAGATGTTTTCAGGCAAAAGTTATTT AG CAGTTT ACTAAT AT CAG AAG AAAA 
AT ATT ACT AAAGAT AACAAGGGACACTT CATAATG ATAAAAG AGTGAGTT CTAAAGG AACACATAACAATCCTAAATGTGTGTCCA 
TTTAACAGTATAGCTTCAAAATATTAAACAACAATTGGC!AGACTTAGGAGAAC 

CTCATCT CTATTACAACTG AT AG AACAAACAATT AAGAAAAT AGAAA^ TAGC CATCT TG ACCTAATAGACA 

4 5 TATGAT AT AT AGAACATACACCCAG CAACT ACAGAACACACA.TTGTTTTCAAG CGTATGTGGAACAAT CATCAAAG CAATTAT CTA 

GGAGAATTGCTGTAGAAGGTGTTTTTCATTGAGCAATAAGTCAGATGACTTGAATTTT 
AATTTACTCATCTTTTCCTAAGAATATCACCCTCAGTCC^^ 

AAGAAAGAGGCTGTGGAGACATT TCTGATACTGCAAAAATGTAGGATGTGGGCTCCTGTCCCTTC CCCAAACCTTGCTGATTCTTG 
TAGTGGAAGAG CTCTCTCTCTGTCTCCCTATCTG^rrCCCTCATGTATCCCAAGTCAAAGCCTCAG CCAAGCTCCCAATTCCTGG CA 

5 0 T CCCTACTTCTGAAATC^CTTCTCAACCAGGACACCAAGATCAGTTT CTTGGTCCAAGCCACCTGCAAT AGTGGGCTGGACCCTCA 

CCCACAGTACTCTGCTCCTGTTCTTGTTCTACTTCATTCCCCAAA^ 

TAAAATGATTACAAAAGACATTGAACCAAGGGAAAATAAGTGTAGATAAATAATACAA 

ACACACTTCGGGTACTGTTTGATTATGAAAGA 

CAGTAGGCTGCTTCAGTGTCTGAATTGCCTTATCTrCrACTACTGCTGGGTG 

5 5 C TT ATG TTG CAC CAT C CCT AGACATCTG CAATT ACACAGAGACACAT C CT CTTATGGTGATTCCAATT AAGAATG CATT ACAAAAT 

AGT AATATGAATTTTACCCAC CT CZATAGGGT CTGCAT ATCATTTAAATG ACATAATGT ATTATATGT ATTATGTAT ATTTATACA 

TAGGGGGAAAGATGTCAGAGCITTTTGGGACATACCAGG 

AGAGCCAAAGTGCACGCIAGGTTCTGGTGGATGTTTCATTGTGCAG 

TTG CTATAATGAAT CAAATTGTCAACTCTT AACATTGAGAC CCACAACACAGGA^ GGGAACAG AG GGG CAAAAATTG TGAAGAACG 

6 0 G CATGAACATT CCTTCTCTT C CAT AGGATGTAAAAGGACGTGG CACAAAG CTCAACACAT AGAAAACAAT AAGT AAAT CATACACA 

TCAG ATGGTTTTCACATGGTTGT TAC CATT ATTTT AATGTGAAATTGTTT IT CACAAAGTTTGTACCT ATTT ATACTTCCAACAAT 
TTAGGAGAAAAAAGTTGTAT CACACCTCTG C CTTTACTTCAAGACTACACTT ACTT CAAAAATTGT ATTG AAAT TT AATTTCTTAA 
AAGAATATTTCCTACCGAATGGAAGCCAGTGCTCATCCCCTCTTCCTCTCACCAAATATCAACAG 
TAACTGCATATGGTATTGATATTTGTTTATTTTTAGCTATTATCTAAGACCACT 

6 5 G GAATATCATG CCT ATTTT ACTT ATTTTTAAAGTT TTTATAAGGTTGTAATG TTTTATTTTAATT TTAAAATTT ATTTTTAACTG G 

CAATAATGT ATAT TT ATGG GGTACAAT ATGATGTTTTGATCTATGT AT AGAT T AT AG AAAGAGTCAATCAAG CCGGG CACAGTGG C 
TCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCA 

TG AAAC C C CAT CT CTACTG AAAAT AGAAAAAATT AGCCAGG CATGGTGGCAGGTG CCTGCAAT CC CAGCT ACTCAGGAAG CTG AGG 
CAGGAGAATTGCTTGAACCTGGGAGGTGGAGGTTGCAGTG^ 

7 0 TCTGTCTCAAAAAAAAAAAAAAAAAAAAAGAATAAATCAAACTA^ 

TTTTTTTCTCTTCTTAATCTCAAAACAGAGAATGTAAAAAATTCATGCTTTTAGCA 
CATCACACATGGCAATAGATCACTAAAACATTTCCTCTA^ 
CCTTTCTACTCTGTTTCTATGGGATCTACTTTTTTAGATT^ 
TTACTTAACTTAGCATAATACCCTTCAGTTCCATCCATGTTGTCATC^ 
7 5 CCATTGTGT AT AT ATACCACATTTT CTTT ATT C ATTCT CTGTTG ATGGACATTT AGT TTGGTT CCACATCTTGGTT ATTGTGAAT C 
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ATGCTGAAATGAATA?U^3GAATACAGATATCT CTTCAACATACTAAATT CAATT C CTTTGGATATATACC CAGAAGT AGGACTG CT 
GGATCATGTGGTCATTCTATTTTTAGTTTTTTGAGGACCTCTATACTA 
AGGCAGGGAGCCATATCTGTTTAATTCACTCAGCATATACTGCAA 
GTATCTGCATGTGTTAGGGAGAAGTGCAGAATAAATATACCCAACTCTTTACTATGT^ 
5 TCTCTTCTTAAT CTCAAAGAAAACAG AGGAAAGGAGGAAGTAAAAAG 

AAAGTGAGATAGTGGGTAATCTAGTGATTTTTATTTTTCCGTCCT 
GAAAGAAGCCACAAATTGTGGTAGCrTGGACCACTGTTGAGGTCTGG 

ATAAGCTCTATGACTATGAACAAAACACTCAACTTCTCrrAT CAGTAACCGTCCACAA.GAATTGGGAATAT CAACAATGCCATC 
TTTATTGCATTGTAAGAATTAAATAATTCAATATATGTGGAAAATAGCAT 
1 0 CCATTATTACTTAATATTGATAGTGA^ 

TCTTATTTCTTTTTCTTCTCTATTCTCCAGCTTCCTACAG 
GAGCTCACCTGCCAGGCTACAGGTTATCCTCTGGCAGAAGTAT^ 

GACCCCrTGAAGGCCTCTACCAGGTCACCAGTGTTCTGCGCCTAAAGCCACCCCCTGGCAGAAACT^ 
CTCACGTGAGGGAACTTACITTGGCCAGCATTGAC 
1 5 ACAAGAAACAGATGGCATACT CGAGTG ATTTGAGGAGAGTGTAATAAAGGGACT GTTTACAAAGGTGTGAT CACCATTTGGAGAAA 

CTACAAAGGATAGTGCAGAACACrGGGGCTTCAATGTTGGGAGGGCAA^ 
CTGTAGGGAAAGCCCCACTTCCCAGGAGCTGTAGCCACAGAATAGGGAAGCTGCCA 
ATGAATGAATACCCCAACTCATTCTCCTCCCACCCTCCAATCTCOTGCTAGCACCTCCCA 

GAAT ACAAGGGT CCACTGTTGTATTCCAT AAAAGT CAACT^ CTGGAG 
2 0 AGGAAACAGAAAATATCTAGTACAATAGCTAATCACTGTGATTCATGCACAGTGTCATGAGCCAGCAG 
GTACTTGCroCCAGTCAGCTGGTTATGGGTTTTTCCAAGAAATTTGGTCTCT 
GG AT ATTTTTGGAAGGGATCCCAT ACTTTTG AAC TTCAT ACAGCAGAATTT C^UU\CAAT CTTGGG 
CCAGTAAGGACAACTAACACCTAGTATCATAATCATTTCGTAAGAGACA 
TCTTATATCAAATTACCACyu^CTTCACAGTTTAAGACAACA 

2 5 GAGTTTCACTGGGCTCAAATCAAGGTGTCAGCAGGATTGCAGAGGCCCTAA 

AGAGGCCACCTGCATTAATTGCCTCATGGCCCCTCCCTCTA 
CCACATCTCTCTCTCTCTCT^TTTTTTTTTTGAGATGAAGT 
CACTGCAACCCCTGCCTCCCGGGTTCAAGCTATTCT 

CTGGCTAATTTTTGTATTTTTAGTAGAGATGGGTTTCACCATGTTGGCCAGGCTGG 

3 0 CGCCTCAGCCTCCCAAATTGCTAGGATTACAGGCGTGAGCC^ 

TGCCACCTCTTAAGGACCCTTGTGATTGCATTAGGTACTCCATCCCCCTGGTTATTTGGGGTGATCTTCCCATCT 
TTCAAAATTCCTTTTGCCAT13TAAGGTGACATATTCACATGTTCCAGGGATTAG 

TATTTATCCTACTGAGAAGGGATATACTCTCAGACTAAAGGACAGTCCCTAGTACTGATTC^TCTGGCTTTATAGA 
ATATTGTCATTGTATTTCACAGTTTGCCCTTTGTCT 

3 5 TTAATTGGGAACCCCAGAGGGCAGAAGCAAGAGATTTA^ 

TGGACAACAG AAACT CAAAT ATTCAAGGAC CAG CATGTAGACAGC CT CCT AAGATGTCT ACT CAGACAAAG AAT TT CAGGTG GAAG 

GACTTGTTCATCTGCTTCACGCCCATTGGTTGACAGGAATATGAACTCCATTCTGCT^ 

GAGCTTTCCCCAGTATCCGTAGCATCAG 

TGAGTGTAAGCCTAACTAGAATTCTACCCCAGCTGGCTGAAGTGAAAGG 

4 0 CCAATTGTACTGTTCAAATCCACTCJATGCCCCT 

TGCAGAAGATTTAATATACACCAGTTTGTGGAACAAGCTGTAGTTCCTGCTGC^ 
TGTCTCACTCATCCACCATACATTCCAGATTTCCCTCACCTAACACCCCAGATGGAATGGTTTCT^ 
ACTTAACGACCCrTAAGAGATCTGAGCTTTTGATCAGCATGCCCrrTAACAGACTGGGGTTG 
GATATGGAAGTAAAAAGAACCAGTGAATCAGCGGACCCCTGAGTTCCAGACATACTCTTCCTTGTCTC 

4 5 AAGATTTTCATGATAATCAAGATTGATTTCTCCACCAGT^ 

AAGGTGGCCAGGCCAGTAGCAGAGCTTTAGATGCAGTAGAACAAGCAGAGTATCCCCTC 
TTTGATTCAG(IAGAGCCTAAATGTGTGGGGTTGAGTGAACTTCT^ 

CAG CAC TGGG ATAG CCCTGGTACAGACGT AGAACA.TGCTTG AGGAGGTTGT CAGG AGG AAATGAGT TAG ACCTTG CACAGAACT AC 
C&CCATCAAGTACAGTTGGGGTGAGGCAGACATGG 

5 0 ATTTTTTCATGGAGTGACACTGATCCACAGACCAGACT 

GTATTTTCTTCTGGAAAAAATTCTTCCCT^ 

CT AC CT ATC C1ACAATTTACTACCCAATGCCAACACTAAAGTTAG C C C AACT TC CTTCTAACTAAATT ATTAGTT TAGAAGGAAAGA 
GAGGAGTCATGCTAAGGATCTTAACTGAAATC^AACATTAGAGG CTGG GCATGGTGGTTCTTGTCTGTAATCCCAGCACTTTGGG 
AGGCCAAGGCAGGAGGATCACTTGAGCCCAGTAGATTGAGGTTACAGTGAGCTACAATTGTGCC^TTG 
5 5 GCAACCGAGTGAGACTCTGTCTCCAAAAAAGAAAAGGAAAGGGGAGAGG^ 
GAAATCC^TCAATTTTAGGACCAACTTCAGGTAAAAAAATGAATTAGGC^ 

CATGGTTGAG AC CAGAGACTTCTGACCTCATGAAAGATGAACT CTAAAG CCTCT AAA.G AG GATTAA 

ATATGAGCAATCCCACGAGAACTTTTTTCCCCTGGAATTGTTTATTCAACTGTCGTTCGTTATATG 

TGTAGGCCAGTACTTTGGATGAATTGTAGTTTTCTAGAAAGACGCTTCTTATATAAGAACCTCTCC^ 
60 G AG ATG AATT GAG AAATAACTTTACACCACTG ATT ATGTCAGTGTT CTATTCTG CATGGTAG AGATGTGAAAG GGCAGACTGACCA 

TTX3CTCTGGAAGCCTTTACGCTGTGAGAAGTTAACAGTGGAGTA 

CTAAATAGTCAACAACGATGGGAGAGACCTGTGGGGTCTTCATC^ 

TCAGGGAGATGGAGAAATCTTTGTGCAGCCAGAAATTCTCAACCTGG 

TTACTGACATGGATCCTCTGCTTCATTAACCATCCCT^ 
65 CCTTCTTTGCTO\C4GTTTTTCC^CATAAATATC^TTATTTCCCT 

TCACTCCAGGGCACAAGGGGGTCTTTCCCTCATCCCACATTT^ 

TGCCTTTTTAATTGCTATTTTATTTTCCATGTATTTTCATTGTTCACACAAGTCT 

CCTGTGTTCATAATAAGTGCTCAACAAATGTTGGGTTGATTGGGTTGGAGATTCCA 

CTACCAGACTGTGTGGACAGCCAGGTCAGAGCAGCCAAATG 
7 0 TCACACA.TCTGCCTCTGAGTTCCTGGCTTCTATTTCTCAAG 

CATAAATTGAGCTGTGGACTGCTGTGACCCACAAGCTTCTCCGGAAGTCAATGTATA^ 

GGTGGAGGGTACTCCACTGACTCTAGAATGGATGACTGAACATTCCAAATTTCAAGCACAAGTTAG^ 

TTTTGAAATAGGGTTTCTTCTGCTCAGCCAGTTGTTGTATTTTCAT^ 

AAAGGACCCTTGTAGGGCTGGCAGAAAAGAGAATCCTTCCTAGGAGACCTGGAGGTGATTCCAGGCAGTGGTTGAGA^ 
75 CTGATGTCAGACAGGCTTAAATTTCAAACTCTCTCTCTATTAGCTTTGAGAACCTAAATATCC 



2226 



WO 03/008583 PCT/US01/51291 



TCTCTCATGTGTAGCATGAGGGTGAGTAGTGGTAGACAGTT^^ 
TTAGTACTCAGAAAGTCTGCAAACAGTAAAAAAAAATGGTATAT 
GCAAGGATAAGACAAATATTTTTTGCCATTGCATCATTATATCATTGCTAA 
GCCraTAATCCTAGCACTTTGGGAGGCTGAGATGGGTGGATO 
5 AACCCCGTCTCTACTAAAAATACAAAAAATTAGCCAGGCATGGTGGCAGGTGCCT 
GAGAATCACTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCCAAG^ 
CCTTCTGAAATAAATAAATTAATTAATTAAATTAAA 

TATGAGGATACAGGCTAAGCTACCATAATGAAGAGACCTCXaAAATACAGTGAGAA^ 
CAjCTTCAGGTGGTTCAGAGCAGCTAAGCAGCTATGTTCCATAGAG^ 
1 0 TCTCCAGGATGTTGTCCCTATAAAATTGTCAAAGCTCAGTCAGTGC 
GAAAACACATTCAATGTTTTAAGGCCAAGACCTTGAAAACTCA 
CTATAGGGGAGGCTTGGAAACATAGTCTCTGAGAA 
TAACCAGCAGTATACCACAAGGTAACAAATGACTAGGAGGATC^ 

ATTCCATATTGACACrrTT CATGTAGCTGTTGCTTGGCT CTAT CTTAG AGAGGACTCAG AGGTAGTTTAGATAAGGCCTTTG CCCT C 
1 5 CAAATACAGTCTAAG CAGACTG ATTTCCTACTGGATGTTCAACITTGGAGTC 

GACTATGAGGGAACCTGCACAGGACAAGGGTTTGCATAAAGACACTGAGGTAG^ 

CAGGTCTCC TTGACTCACAAAGTG CTT ACT AAGCAC^ CAG ATTATAATCAAT ATGGGTTATCCAATGTT 
TGGATGAGCAAGGCTCCTTATCTTTTCTTCGTTAATGTTAATCACA 

TGGGTCAGATAGATCTCGGTTTGCAATCCT CAGTGATGAC CT CACATAGATTATTTAGCAACTCTGA 

2 0 GACTTACTTC CTCAACT CT AAAATGAGG GG CTCAG ACTAAATAATGTCT AAT CT CTTCTCCAGTAAAACAAT CCGTGGTTCTCAGA 

TAGCACTGTGCTTGGAGGTAGTGGGGTTTGAGGGCTGGGAAGTTGGGAGGACT 
TCCTGTACCAGCCTGTCATGTTTATTCCATGTGAATGACTCCAGAGGCAAAAT^ 
GTACCC7ITTCATTTCCCTGTAGGTGAACACTCCTCTAGCCCTC 

GAGGACTTTACAAGGCITTC CCCAGCCTCC^ C CT CTACTAAACTCAC C CTTTCCT CAGTGCTAGGA 

2 5 CGTTGAAAAACCGAAACAAGGCAAAGGGCCAATTGTAtt 

CT AGGATATTCT ATTTCTTCCAAAAG G AT CCTGTT AATCCTTG AAATTTAACAAC T AATGGTAT AG ATT CTAAG CACTGTGAGTAC 
TTGTCAGTGGGGGAAAGACATTTTTGGGCTGAGAGACTTTGCC^ 

GGAAAAAGAGAAAAAAGAAAGAAAGATGATTAGATGATAGATGGATGGATTGATGGATGGATGGAA 
GTAAAAAGCACAGAATCTGGAACCAGACTGTATCTTAGCrrGGGTCACTTACTATGTAACT 

3 0 GCCTCCCTTTCTTCACCAGTGAAATGGGGATCATAGCAG 

CAGAGCACTTAGAATAGGGCCTGGTGTGGJIGTAAATGCTC 
GTTGGTCTACCrCTTAGTTGCCATGTAACAGGATTCTGGTGCTTTTCCTTTGC 
TGGCTG CTTTCAGATITTCATCCCCrTCTG CATCATTGCTT^ 

AAAGCTGTATTCTTCAAAAGGTAAGTGAGTTTTATTCATGGTAACCCAATGCACTGGG 

3 5 CTGCAGGCCTATGGCTTGCTGCTTTC^TGCTAAACCCACTCAGAGCTTATGAAC 

CCAGAAGAAAATGGCTCTCATCGTCAGTGAGCTGAACTTCTTA^ 
GAGTTTGTCTTATTCTTTGAGCCAAGAGGTITCA 
GATTTTGCTGGTGGGAGGTTGCCAGGATCCTITCTGAGGATTCC 
GGTTAAACCCACCCCCTCTCCCAGGAATCTCAACCTCACGGTTGGGTA^ 

4 0 GGAAACAGCTGGTTGGTGGTGCATTAACGTCTTTCAGCAG^ 

TCCTGTTCTGTTGTACCAGCTGGGACAGAGCATGCTGAAGCCTTTCAGCCCT 
TGGTTTCCTGGGTGGAGCCTGCCCACAAAACCCCCAGAATGTATGCAGG^ 

ACTCAATGAACTT CTC CAAAGAT CT ATACATGG C CTC AT CTATAGAAAG AGAAATGAC^TGTGGAAATAATTCAGTAGGAGTTTGC 
AGCAG CACTATCTGAGGACT AGG GG AATTTTAAGTGGTTGTT AT CT T AC^TTTATACTC7\.T AACTT CT AT ATTT TCATCTGCCAT A 
45 AAATATTGTCATGTTCTATTTGTCCATTGCCCrATGTGTGTATGTATTCACTTGGGTGCTG 
GCAAGGAATTGTTGCCAAAAGAAAAAGACAGAGGGAGGGAAGA 

AGAAGAAGAAAGATAGAAAGAAAAATAAATGTGGGAGGGAAGGGAGCTTCATGGTAAA 

CTTTTAAATAGAACCACTTCAGAGAGTTGTGTCAGGTGCACCTCAGTTGTCTTATCTTCTGCCATTCTTCTTTTACCTCTC^ 

50 TGGGAGAATGGAATGGAGTCACACTCAAAGATAGAGTGGAAATAAATCCTCTCCTCATCCTTCACCCCAATCTTA 
GGATATCAGTAGCTrCCGAGCTGGGAGGTAAAGCTCAAGTTCTAACTGTGATTA 
TAAATGTGCAAACAATTTCTTTTATATTTTTAATGAACCAGAGAGAAA^ 
GCATTCAGLATCTGCTTTCTTACATGTGTCTGCCA 

GACAGAGCTTTAAACCAGGGTCAGAGGG CTAGGCTGGGACCTTG TGCTGCCATTTGTTGGCTTTCTGACCTTGTACAAATTGCCTA 

5 5 T CTTCTTTGAGATGGT CTTT CTATCITGTAAAAGGGGAAGCAG CAAGAG AAGGAACATTTTACAG CTT ATTGGCCG AACTTCACTGC 

CGCTAGTGTGGTTCMCTTGGACTACAGAGAAATCTTCCTAACTGGTTTCCCTGTATTCACTCCTGCrrACCTCCAACTTGGTCTGT 
TCTCACTTTTTGCTATAATAGGCTTTTAAAAATCATAAATCTACCATGTGTCCT 

TCACTGGATG AAGTTCCAACGAGCCCAGGATGGTTTGAC TCATGTC TCCAGCTTTAACTGCATCACCATCACCTTCATTGTCTAAA 
GCTCTAACCACACAGGATTTTCTAGTCCTCAGAGGCATGGCAGT 

6 0 AAATATTT/TTTCTTGTTGTCCACCTGAGTTGGAG 

TACCTCCACGCACCACACCCATTACATTAAATCCCCTTATTATATGTTTCC^ 
ACAATC^C^TGTCTAGGATCTGTTTTAATAGCTTGGACTACCAATTAAATT 
ACCCACATGCCCATCTTAGCAAGAAGTTCAGTGTCTCCCTCTTATAGCATG^^ 
ACT^AGCCCACAGGGCCAGTGCTAGGCAGCTAGGTTAGTCCT 

6 5 GCAGAATCTCTGGTCCTTCCAGATGTCTCCATCCACTGTGCAAAGGTAACCTTGCTGGTTCCGA 

ACAAG ATT ACAGTTCT T ATGGTT CCCCAACACATG CTCTGTCATTGGT C CC AAAG CAGG ACCC CT ATGGG TTG ATGAG GT AGG AGG 

AGGTCCCTGCCTTAGCCACAGCTGCACACAGCCMCCTCTTCCCTTCTAGGCCCTCAT 

CCTTCTCTTCAGTTCCTCTTAGGGCC!ATTGGTATCCTGAATTrcrT^ 

CCCTTTCCTTCCAGATGGCTTCITAGCT^ 

7 0 GGGAGGGCAGCCATTATTGGTCCATAAAAGGTCCTGCTTACCATTTTCCCCTAAGAG 

TGACCTGCTGGGCTCTAGAGAAGAAGCTGGCAACTTCTGTTX3CAAAATAATC 
CAAGAATACAGTAACTGTGTTTCTCTGATTCACCAATATATGCCCAGTGTCTAAA 

AAATTTTAAAC AT TG AACCTAC T AT ATGCAGG TGAGTATG CTAGATCTGCTCTGTCCAAT AT AAT AGC C ATCTGTGGCTCTTTTTA 
AGTTAAATAAAAATTCATCAGTCACATC1TAAGTATTCACACA 
7 5 TGGACAGT ACAGATAT ACAACATTTCCATCATTG C AG AAAGTTCT ACTG AAAAC CACTGTG CTAGAC C CTTTACAAACTTCTC ATT 
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TAATTTCACAAAAATTGCTrGTGGCAAGGAGAAGTGTCT 
CCAAAGTCATACAGCTGGGATTTGAATACAGCT 
TTGGTTCAGTCAATATTATTATCTTTGAACAATGGAAGGAGAAA 
TTG<TTGAGCCAATGGCAAAGTCAAGTrAGAATC 
5 TTCAGAACAGTGTTTCTTAAACTCTCTTGTGAAGGAACAGTT 
GAACCAGACTCTAAGTGGCACAGCCCTAGAAGATTGAGC 
CCHTSTGGCCCACAGAGAATGGCAGCACTAACCTCA 

GTTAACCATTGGGCTC CTTTCAAATG CAAACCC TCATGAAAGAGACT ACAGTG CTG AATAG AG ACCTC CAAATTCCAGGCCAAG CT 
CAGGATAGTCATGAGGGAATTACTAAAAACCTGGTATATAGGGCAAAAGCAGAATTAGGAATG 
1 0 ATGGCAGGAGTTGGGCATTAAATCCTAAAAGAGAATCAG 

CTATT CATGATTGCAAAGATATG AAACCAACCT AAGATCC CATTGACGAATAAGTGG ATAAAGAAAATGTGGTATAT ATACACCAT 
GGAATACTACTCAGCCATGAAAAAGAATGAGAT^ 

TAACTCAGGAATGGAAAACCAAATACCATATATTCTCACTTGTAAGTGGGAGCTAAG 
TATAATGGG CATGGGAGACT CAGAAGGAGAGAGTAGGGAGTGG CAGTGAGGAAT AAAAAAGCTG CAT ATTC 
1 5 ATTTGGGTG ACGGGTG CACTAAAATTCAGACTTCACCAGT ATACAATTCATCCATGT AAC CAAAAACCACCTG TAC C CCTAAACCT 
ATTGATATAAAAAAAAATTAAAAGAAAGATGTGAAATCAAGGAAACTTACT 

TGAACCAGT AACTTGAGTTACTTTG AGCCAGTATCACT CAGACATT CTGACCTGGA 

CCAGGGAAGGCAGGCAGAAGTAGCAGTCAAGACTAAAGCAGAAAAGGGAG 
ACCCAAATTAGTCCCTCAGCCITTAAGTGCCTGAGGGCCAG^ 
2 0 CTTTGCCTAGGGGCCAGATTTTGCTTTAT^ 

TATTTAGATTTACrrTTTTGTCCTGCCATATTTTAA 
TCCTTTTCAGACACAACAAAAAGACCTGTC^CCACAACAAAGA 
TTTCTTACTTTCITTTCTCrrCTCAGCTTGAAT^ 
TTATAGAATCTTCCTGTTTGGTAGCATTTC&GCG 

2 5 TTCCTAAGCACTCCTAGATGATACTGCACTGGACCTCrrGGTCrCACATAGTTAGAAA 

ATATTCAACTGCGATGCAATTGACAATGGATGTTTTTGCAACAAACAATGATT 
GTAATATGGGAAAAACACAAGTCTCTTCATGAGGTTGACAGGTTTGG^ 

TCAGAAGAAGTGGGTTACTAAAATCATTAAGCCTGGTTGGATGAAAAGCTTAGACTCAGGGGAA 

GGCAAGTTTG AAATGTGT AGGGCAG GCCAG CAGTCTGGAAAC C CAGG CAGGATTTCTACC TT ACAGT CTTGAGG CAGAATT C CATC 

3 0 TTTTCTGGGAAACCTCAGTTTTTTG CT CTTAAGGC C CT CAACTG ATTG AATGAAGCCAAC CT ACATCATGGGAG AT AATCAG CTTT 

ACTT C?U\GT TAACTG ATAGCAAATG ATAATT ACATCTACAAAAT ACT CAACAT AT AG ACTAG CAT TTG ACT AAG CAACT 

GAGCACCATAGCCAAGATGACACATAAAATTAATCATCACAGG 

AAGTAAGTCAATTTGTCATTCCTGCATGAGCCTTTATAAACAGAAGTAAGTA^^ 

AGGC CAGGGTT CTGTCT AATAC CTGTTCATG CATGAACATG CCAAC CT AG ATTG CATG CAGAC T ACCAGTTTTGGGTTTTTGTTTA 

3 5 GTTCAGCAGGATTTTTCTCAG CTCACTGCCTCTCAAACTTC CTGTG AT AT CAGAATCT AC CACTCT AAG 

T ATTTGGATG CAATAG CAATGAATAT C TGAGTAAATCTAGG TGGG G AGTGGGGG CAC C CTGTAGCCAAAATG AT TTAACAAAATC A 
AACCAAAArrr rGGAAATGATGCCTTG GTACAATGAAGAGAC T ACTTGAGGT AGGTTTGACTT AT CT AAT ATCTT ATTTT CTTTAC 
CAATACCTAATGAGGAATTTAAATATTTCTAGA 
TCCCCCCAAACACTGTTTTCATCATCATCATGTCATCTCTTGTCTCT 

4 0 TTTGACAACTAGCCACATTTCACTAAGCCAACT 

CCACTGCCATGAGCTCATTCCCACTTCTGTGGCTTTGCTCATGTTGG 

T C C CAATC CCAACTTTTGG CATGGT CTACTTTAAGATACAG T AATG AGT AACTTTATTAT TATAACACCAGG CTTCTGTTCTAAG A 
CATTAGAAGTGTGTTTAGACT AG CT CAT TTAAT CCT CACAG TAGCC CT CTG AAGTACTTACTC CCTGCTT CC CATTTTAT AGGTGA 
GGAAGAAAAACATGAAGAGGTTAACTAACTTGCCCAAGGTATATAGCTAATAATAT 

4 5 GTGTCCAAGTCCAGAATCCACACCCATACACTACACTCTGCTTTTTT^ 

TCTATC^CCCAGGTTGGAGTGCAGTGGTATGACCGTGGCTC^CriXSCAGCCTCAACTTTT 

CTCCTGAGTAGCTGGGACTACAGGTG CATG CCACCGTACCTGGATTTTTTATTTTTATTTTTTGTAGAACTGGGGTCTCACTATGT 
TGCCCAGGATGGTCTCAAACTCCTCGGCTCAAGCAATCCT 

GTGCCCAGCCTACATACACAACACTCTCTTGCTTAATCTGTAAGACTCTCTCCCCCAC^ 
50 GTACACACAATCTATACCACTCTTAAGCACATGATTACAGCGTTATTTTCTGGCT 
GGTCAATATAATAAA^TGGGATATTAGTGTTAATGCAACTATATGXCT 

TATAGTTTATTATATAATAAATTTAAAAGGTGTCTGATTTTGACCAAATTTGACTAAATACTAAAA 

TGTTT AATTT TAG TTGAGTCTATCAAC T CAAAAAAGAATAAC CT AC AACAAT AACAAGTTT CAGAACATTT TTT AAATTACTGATT 
TTATGAGTATAAAAATAAAAAATAAAAAAGTATAAAGAAAACAAAAATAATTTAGAATCCATT CCT AG AAATTG C CATGGTTAAG 

5 5 ATTTTAATATTGCTCAGGCCCAGACAGCTCAGGGCTTTGACATTCCCACACC 

AACCATCCATT ATG AGGAGAGTGTACAG CT CT AGG CTGC CCGGG AG CCAT CC CG CACTCT CATTTTGTGACT CGG CATCTTGGG AG 

ATGGAGTCTTGGGACTTAGCCTGGACATGTCCCTTGCATGTACTTCTTA 

TTAAGAAGCACAGGGCTTGCTGGTTTTGCIT 

GTATT AGCGCTGTTT CACCATTTCCTGGATCTCTTGCurl~i"GTG CACATGATAGAATTG CAC TTCTCTGTG ATT AATTTG TAGTTA 

6 0 AGTGTGGTCATGT AACTCGCTTTGGTCAATTAAATGTAAGCAT AAGTGATG CX5TG GGTAG AAGATGT AAGAGT TGGC 

ATATGCTTTGCCATATTTTCTTTATCCATCTGGCATGGTAACCAGTAACATTCTAGGTAGTAATTGCT 
GAGTGACTAAAATTGACAGAGTCCCCTGCTGACCCT 

ATTTTGG AGTTGTTTGTTCCT ACAG CATTAC CTAGTTT AC TC T AATACAACATG GAAAAAACTGG AACCTAT AAT AAATAG ACC CT 
ACGTTGCCATTTAAACTTCTAGTTCTGAGGAATAATAATCTGGGG 

6 5 AAAAAT ATACTTATGT AT CATTCATG T CCTATT AAAAATGTT ATTT AT AGACT CACCAT ATTCCC TTCCTCCAGAAAAATAGAAGT 

AAAAATATGAAAATGCCTGTAATCATGTTTTTGGATTATGGAATCAAGTATTGCTTTTT 
TACTTCACTACATTTTTCAGTGCCCTG 

CACCTCTCTAGATCCCAGTAAC CAAATACATT AT ATAGG ACT CTTCATCAGTC CTTAT CAAGTTT AGGAAG GG CGATGCT ATACCT 
TCTTTAAAGGACACCTACCAATGTCTTAGTTGCCTTTCAAAGACTCCTAGCACAGCTAAATGTG^ 

7 0 AGAGCTGAAGTGACTTGCATAAGGTCATATCATAACTTACTGTTAGAAATC^ 

GACACACTGCCCTTTCCATTTGTGGAAGTTGTTCTTGTATCTAACTTTATCTGTGCTACTATTTGGGCCT 
ATGCAGACAAGCAGATAAACAGTAAAACTTTAGGAGTGGATTATGATACCATAGATATATATCA 

TTACAGTCATCAAGCCTTGGTTAGAGT TTACAGAC CATGT ATCCTAGCT AC CT C ATTTCACAGATTATGACATTGGG GGCTAAGAG 
AT ATTAAGTG ACTTCC TTAAGTG AAACAGTAGCAGAC CAAAAG AAGTCATGATT CCCAGCATAGTGCTACT CACTCATTTATTCAT 
7 5 TCATTCACCCACTCACAAACCTGTATTAAGTTTCTGT^ 
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TTTATCAGTCATCCAAGATTCTTACA^ 
ACAAAAAGAAGCCAGCTTTCTATATACTTTTGGTCCAG^ 

rrcCTTCTAAGAACTTCCACCTGAGAACTGACCACAGCGTCAGCATTCCACATGGG 
GGTTTCCAATTTCTTTTTCTTTTGGCACTATAAACCT^ 
5 ATAGTGGTGAAATTAGAGTGGAGGACATAATCACCCTATCATAT AGG CTATTTGTCCATATCAT ATTTGTCCCTACAAAGGC CTCT 
AAGGCAGGGGTCCCCAACCTCTGGGCCGCAGACCGGTACCAGTGGCCT 
ACAAGCGGGCACTACTACCTGAGCTCTACCTCCTGTGAGATCAGCAGCAGCATTAG^ 

CTATAGCCTACCCCCACTCCCACACCCAATCCATGGAAAAATTGTCTTCCCTGAAACCAGTCCCT 
10 - CACTGCTCTAAGATACCTTGTGCTTCTTGGAACATATTTGGA^ 
TTACCATCAATCCTCAGAAATACCCAACCCATGTC^ 

AGGAGAGTGCTATTAGCAAGAGGAGTCACTAACAGCTT CACTCACCTGTG CC CATGAATTTTTAATGGTGTGAAAAGTCTGTGTAT 
T TTTTATGGTTCTCTATGG CTCATATAAGAGGG ACAAACAAT ACA CAGAGATGGGGACAAATATCACTTTAAGCTGGG 
GATAATCAGGGAAGAAGAAAGTTTTCATGGAGAAGGTGACrTTO 
1 5 G AGACACT AACAGGAAAGGCAGAGAGG CAGGAAG GTGTGGGAAAGTGCACGTGACCCCGTTCAG AGAGAAAGCCAGG TGTGAGATA 

AGGGGGAAAGACTGTTAGGGCATGTATTGTAAACCACTAATTCCAGGCAAAAGTTAGATTT 

TT AGATCCTAGC!AGGAAATGGAGGGTATGCTTAGAAG AGGTAACTG AG G CAAGTTT AATTTAT AAAGG TGTGTGC AGCATTAAGGG 
AAACCAGCAAGGGATACTGAGCATGCCAGGATGCAAGAGCAGGCAGGGAAGGTGACTATTCCT 

G AGCAGTTCCCAGAACCCT AGTAAAAATGGCAATG AGAAAGGT C CAT CTGGCAGGAC CT ATGGT CTTTAACAGAGGGACAAAGTCA 
2 0 ACCCACAACTTGTCTGGGAGGTTGCTGAGGAATAGATACCCC 

GAGCCCAGTCAAAGACIAGAGGGAGGAGCCCAGTGATGCAGTCTGCAATGTCATCATCCTGGAGCATGAATAGAGTG 
AATAATG AGT CTG CAGG AATT AATAGAAATATCTG ACACAATAGG GAACT ATAAGAGGTTTTGAATAGGAG AG G CCCCTGAAATGT 
GCTC C AATATTACTGAACT ATGTGTGGCC CAAAGAATGGAAGAGGAACAGCT CTTGCAAT AGGT CTGAGGAGAGAAG CTGAAGACT 
TGG ACT AGGG CAATGGTAAAAACT GTGGAAAGAAGT TTTAAATGAAAAGTTTTAAACCATG CGG CTTCCAGCTAG ATGAACTTTTT 

2 5 T AAAAAAATT AGTT CCTCACT CAAATTT^GG GG AGGTT AT AT ATTTT CTAAT CAT AAAAAATGATTTT TCTTATTTGTGGGCT TTT 

CTCCCCAGATCTGAACCTGTGGTCTTGGGAGCCAGGGTGACCTrGATA^ 
GGTGGCCTGCAGAGCTTGCCATTTGCACTTTTCAAATGCCTTTC 
CAACACCTGGCCATC^AAACTTCCCCCTTCACTGATCTGGACTCACCTCT 
CAGAATTACCCCACTGGATCCTGGACCCACAGAATTCCnTCAGGAT 

3 0 CCTCAAGTTTT CT AAGTG ATTT C C AAAAG CAGAGGTGTGT GGAAATTT C CAGT AACAG AAACAG AT GGGTTG CCAAT AG AGTT ATT 

TTTTATCTATAGCTT CCT CTGGGTACTAGAAGAGG CT ATTGAG ACTATGAGCTC ACAG AC AGGG CTT CGCACAAACT CAAATCAT A 
ATTGACATGTTTTATGGATTACTGGAATCTTGATAGCATAATGAAG 
GCACAAATTGTGGAGTAAAGTCATCAAGCTCTGTTTTTGAGGTCTAAGTC^ 
TTAATGGTGGTTTTTTTTTTGAACTAC!ATCnTTCC"riTAAAAAT 

3 5 TATACAGTCAAAAATATTT'GATATGCTCATACGTTGTATC 

AAGCCTCCTTTTCTGGCCCTCAATATGACrTTAAATTTGACrTTr 
TAGTCAAGGCCTTTGATAATTGGCACTATGGAAATCCrGCAAG 

GT AACAGCTTAATTTTGTT AAATTTGTTCTTTATACTGGAG C CATG AAG CT CAG AGCATT AGCTG ACCCTTGAACTATT CAAATGG 
GCACATTAGCTAGTATAACAGACTTACATAGGTGGGCCrTAAAGCAAGCTrCCTTAACTGAGCAA 

4 0 AGGGTGTGAAATTGACTAACAGACAAATCATACATCTCAGTTTCT 

CrCAATTGT^TATTCTTCAACGTTCTTTATATATTCTACTTTTGGGTAAGAGGTT^ 
ATCCTATTACGCCATTTGGCTCCCAAAGATTTGAAGGCCT^^ 
ATGCTTGATCTGGGGCTCCTGCTGGAGGAGGAGCACTTCCTAACACCACAGAGAG 
AGAAGCCCACC!TATGCTGCCCCTACGCCCACCCATGCCCCAGATATGTGATTGGCATGTTGAA 

4 5 CTGTTAGGAACCCAGTGGCATAATGGCTTCTGTCAAAATATTCCCATCTTTC 

AAATTTTCT ATACTACAGCATGCT AT CTTAACAC CT AACATTTATCAAGT ACT TG CTATTG CCCAAT ATTGTTAAGCATTTT ACAC 
ATACTATTGGAACAAATCATCCCCAAACTCTCTGAAAAAGACACTATTAATTACrCCA 
GTTAAGTAATTTGGTTAAGGCTGCTTGTTCAGTTTGAAAGGGATGGAATT^ 
50 ATGTGCCCTGATTTCCTACCACATGGGGTTTAAAGTGGTATTTTTATACTGTGT^ 

AACAACAGATGCTGG CGAGGGTG CAG AG AAAT AG G AACACTTTT ACACTGTT GGTG GGAGT GT AAATT AGTT CAAC CCT TGTGGAA 
GACAGTGTGGTGATTTCTCAAGGGTCTAGAACCAGAAATACC^ 

AT AAATCATT CT ACT AT AAAG ACACATGCACACGT ATGTTTATTG CAG CACT ATT CACAAT AG CAAAAACTT GGAACCAACC CAAA 
TGCCCATCAATGATAGACTGGATAAAGAAAAGGTGGCACATA 

5 5 TGTCCTTTTCAGGGACATGGATGAAGCTGGAAAGCATAATCCT 

TCACTCATATGTAGGAGTTGAACAATGAGAAC^CATGGACACAAGGAGGGG 

GGAAAG CAGAGAAG AGCAT T AGGACAAAT AC CT AGATGACATGTTGATAG ATT CAGCAAACCACCATGGCACATGTATACCT ATGT 
AACAAAC CTGCAGGTT CAG C ACAT GT ATCC CAGAATTT AAT AAT AAT AATT AATTT CACT CTTTT CCAGGG AAGT GCT AT AGG AGT 
GTAGGTGGCGTGTGGGTTGTGGCATCAGAGATTCAAACCCCAGCTCA^ 
60 CCTXTTACGCATCAGTTTACTTCAATATAGTGAGACCTGGCCAGGCGTGGTGGCTCATGCCTGTAATC 
AGGTGGGTGAATCACTTAAAGCCAGAAGTTCGAGACC^CCCTGGCCAACATGGTGA 

AGCTGGGGATGGTGGCACGCACCTGTAATCACAGCTACTCAGGAGACTGAGGCAGGAGAATCGCTTGAATCTAGGAGGT 

GCAGTGAGCCGAGATGAGACCACTGCACTCC^GCCTGG^ 

CAACAAAAAATGCATATATATATATATATATAT^ 

6 5 ATAAAAATAAAAAAAATTTAAAGGGGCCGGGTGCAGTGGCTCACACCTGTAATCCTAACACT^ 

ATGAGGTCAGGAGTT CAAGAC CAGTCTGGCGAAGATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAAGAGTGGCGG 
CGGGTGCCTGTAATTCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCATGTGGACCCGGGAGGCA^ 
TGAGGTCATGCTACTGCACTCTAGCCTGGGC^CAGGGCAAGACTCTGT'CTCAAAAAAA 
ATAAAGAAAGOlGTTATCTTTCTCAGTTTATCATGAGGATrc 
70 AAGGAGTGAGATTTTTGCCATGTACCTGGTACGTGATAGGTACCTCATGCATGGTCACCTOT 
TGGCTACTATTAAGACTTCTACTTTTTTTTTTTTT 

ATCTTGGCCCACAGCAACCTCCGCCCCCCACCAGGTTCAAGTGATTCTCCTGCCTCAGACTCCTGAGT^ 

GCACCACCtACGCCCAGCTAATTTTGTATTTTTAGTAGAGGTGGGGTTT^ 

AGGTGATCCGCCCGCCTCAGCCTCCCAAAGTGCTAGGATTACAGGCGTGAACC^ 

7 5 C CTGGATAT AATGGG C T AAAACATATATAATGACAATG CTT AAAAGACTT AG ATGAATC TAC T TTTT 
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ACAAAAGG AGCAAAAATAGCATTCAGGAGAGTTTGAGG^ 
AAGCACAAACACTTT CATGTTTG CTGTTTGTTTCTGCTCTTTGTCCATTT^ 
TTTCTATACTTTAAAAAATTTGGGGCTGGGCAC^ 
CCTGAGGTGGGGAGTTCAAGACCAGCTTGACCAACATGGAGAAAC 
5 TGCATGCCTAT AATCCCAGCTACTCAGGAGGCTGAAG CAGGAT AATGGCTTGAACC CAGGAGG CGGAGGTTGCAGTGAG C CGAGAC 
AGCACCATTGCACTCCAGCCTGGGTGAO\AGAGC 
ACTTATGTTGCTGACZAAGTGTTCCEAATTATCACTTT^ 
CTGTTCCCCTATTAATGTAAGATTACCGTTGTTTCTGGrrrr 
TGACCCTCCAGATCCACTCTCCAGCCTTGTCCTCCCTGCTTTGGGTCCT 
10 ATTG CTCTCTGGCTTTTGATTGGGTTCAG CCAATGGGGACCCTGG CAG GT^GATCAG AGGATGGGAGAAGAGTATTGCTGGGGTATT 

TATTCCCCAGGCTCTCCGTCTCCAGGGTCACTGCTGGCn^ATAATT 
TCC^CATGGCTCCATCTGCCTGTGGGTTCCAGTAACTGCTCTCTCCAGTCCCCTTTTCA 

C CCCTGCAACACACACACACATGTGCAAACACACA^ CTGTTC CTGCTAGCCTTTGGGTACTG CACTATTC 

CTTAGAGTTCCCTTACACCTGGCCCACATCTCTACAAAGA 
1 5 T ATATCATG CTATTTCAGAT AATGCATGT CTTTTTACAT CT AATT TCATTAGGATGGATACCTGAAAAGGGAATTATGG AGT CAAA 
GAATGTGAACACTTC^GGCTCTTGAGTC^TACrrGTCAATTTGCTTTTGA 
AGCACTGGATATTTTAAAATATAATTTTGCTAAGTCAATAGGTTAAAAAATGGTACTT 

ATTACCCT CAAGTTTGCTTAT AAAACCTGGTT ATTG CTGTGTACGTTGTTATTTTGGAGTTTGTTTAAATGTl'rCTCCCTTCCCCA 
CT CATT CTCT AG TTGTATAATCTTTCCTTAATAATT CAATTTTTTAAAAGATCAACTTTTT AAAGC CT CAATGT CTTCAT CTATTT 
2 0 ACAG ATATC CT AACAAT AACAC CTCTCTCACAGAATT ATTGTGAG AATT AAATGAAAAT AAGCA CATAT AAGAAACATCCA 

AGAAACATGAATGCTTTCCTCCTTAATAATTATGCATGGAGTT^ 
TTTCTTTCAAGTTTATATTGAAACTTACTTCATAAAATAA^ 

ATATATTATGTGACTTTAGAAAATTTGTAACCCrTTTAAGCCAAAATCrTTAAAA^ 
TTTTATTTTATTTTGAGACAGGGTCTCACTCTGTCTCC 

2 5 CCCAGGCTGAAGTGATTCTCCCACCTCAGCTTCCTG^ 

TGGTTTCTATTATACAGATCAAGTAATGCTTTGGCAGGCT^ 
GGAGTTTCACTCTTGTTGCCCAGGCTGGAGTG<MTGGCACA^ 
TCOTGCCrCAGCCTCCTGAGTAGCTAGGATTACAGGCATGTGCCACCATACA^ 
TCTCCATGTTGGTCAGGCTGGTCTTGAACTCCCAACCrrTGGGTGATCCA 

3 0 TGAGCCACCACXSCCCGGCCCACATCTATTTAATTCTTAGA^ 

GCTACTCTGGGATCTCATC^TCTACATTGAGATCAGGCAGATT^ 

TGTT(ZAAGGGTGOTGGGG CT TCTCCCATT AAAGCA'rTl"ri l C AATG CTTT AGAG AAG AAACTATTCCTTAGGCTCTTTCAG GACCAA 
GTGTGGTGAGGGAAG AGGCACAGTT ACAT AT AATAAAT CACTCCTTTATTTCTATCTCCTTAG AGATT CTAATGG CTT AGG CCATT 
AGAATCCTTCCTATACATTATTCCAGAGAAATTCTGGCAACTCAA(ZACCA 

3 5 AATCAACTAAGCAAATAGTGCCATTC^CAGCTGCTCTAGCTAATACATTTGTTTCAGCT 

AAACTTAGTGACTTAAAAC^CTGTTTATGTTTTTATGATTTTATAGGATAA 
TGTGCTCTACGTGGTGTCAGTGGGATCATTCATGCATTTGCAGTCAGTTGGCAGTTTGGCT 
TCATTCA.TATCTTCCTATTGGTGCTGGCTGTTGGTAAAACCACGTATCTTCAGTAGGTT 
AATCCAAAAACAGCAAAAAAGAAAGAAAACCCCAATGTGCAAGTGCTTTCCAAOT 

4 0 CCATTATCCAAAGCAAGCACATAACCATGCCAAAAATCAATGTGTAAAGGAAT^ 

CTACAACATTAAGTCCAAAACrTATCTGGAAAGTGTAGTAT^ 
CTCCTGGGTCTAATACCTTTAGAATTCATCCCCATACATACTAGCCTGGTTTCCT 
TTTTGAAGTATAAGACTTTTCCTGCTGAGCTAGACTCTGCrCTTCCCACCT 
ATGAGAGTTGGGAGCACrrGGGGCTGAGAAAGAATAGGGATCrCCCTTGCAAGGCA^ 
45 AAACAAGGAAAGGCTAATATGATTAGACAAGAAATGAGAGGCTTCT . 

ACATTCTC!AGTTTC!ATCCAAATCCACCAAAATGTCCCC^TT^ 
CACATATGAGACTTAGCCAGTCTGTGAATGATTGATTTGGGTTGAATTTTTC^ 

TAGAAGG AATAAG AGGT ACTTT CAGGGCAGTCTT AGAAG CCCTCTGGTG CT CTGAAT ATGATTTG AGGTGAAAG CC CTAAACTTGT 

5 0 TTAAATGC!AAC!AGCTACTTGGCCTCCAGAA 

ATGATGTCAATGCATGCCATAGATTACTAGTGGCAAAGAGTTCAGCTCTCAA 

CAATCC CAGAATGT CAG AATTTG AG AAT CTGTTT CCT ATGAC CACA^ CAGCCAG AGT CCAGTCATA 

AAAATAGAGTGTATTTAATATTAGGAACTTGTAATAAAGATATAAGAAGACTGAA 

CTACAGACMCaGCTACCTTTCTACTACCTATTGCTACAGGCATGACACATTC^ 

5 5 ACCTTCCAGTCCCCAAGGCCTTCCCATTTACTCAACITAACAGGAAGCTAGCT 

ATT CCAGC CCTGGCACATCAGAGCAGAGGAGAGGAGGATGGACTTAGAGTTGAGAA 

TCAAAATCTATCTGGATGAAGTGGAGACATATCCATTAAGAGATAAAAAATAAAGAGACTATAAAT 

AGGAATTGTCTTAAACTCTGTGGCATCTGACCCTGTAACAAGGGCCCA^ 

ATTGGG CATCTG CTTTGACCATGAG CAT AGG ACAAAATAAACTT ATTTATT T ATTTATTTATTTATTT ATTTATTTATTT ATTT AT 

6 0 TTATTTTTGAGACGGAGTCTCACTCTGTCGCCCAGGCTO 

TT(^TGCCATTCTCCCGCCTCAGCCTCCCAAGTAGCTAGGACTACAGGTGTGCGCCGC 
GTAGAGACGGGGTTTCACCGTGTTAGCCAGGATGGTCTCAATCTCCTGACCTCGTGATCTC 
GGATTACAGGCGTGAGCCACCX5CACCCGGGCCTTTAAGTACATTTTGTAGAAGTTGTATCAAAATTCATTGGCTC 
AAACCTGATGCAAACTGGCTTAAATTTTTAAAAGATAATGTGTTGXSCTTATGTA^ 

6 5 GGGTCTAACAAATGTCTTCAGGATTCAAATTGTCTCTATCATTTCT 

AT ATCTCCTCATAATTACAAG ATGGATG C CAATG ATTATTAAAC CTTTCAGATTC AAATATTTT ACAAG AA 
TTTTTCTCAGCATTCTCAGCAAAATCCrGAGATTTACAGTGGAAAACTTATGT 
AAGAGTATGTAAGTGATATCTTACATCTGGAGCTCCACCCAGACCACATGAACTG 
TGTGCTAAGATGTTGCAATTGCCCAAATGCATCTGTGGACTTCTAC^ 
70 AAAATATC ACTATCATATAACAC CCCTGGGGGACATATTT CTAAATTT 

HUMAN SEQUENCE - mRNA 

GCAAACCTTAAGCTGAATGAACAACTTTTCTTCTCTTGAATATAT 

AT ATGTC C CAGCTGGAAAGAATC CTG GGTTGGAGCTACTG CATGTTGATTG r ri u iGTTTTrCCTTTTGG CTGTTCATTTTGGTGGC 

7 5 TACTATAAGGAAATCTAACACAAACAGCAACTGTTTTTTCT 
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TATCAAATACAGAACATGATCTTCCTCC!TGCTAATGTTGAG^ 
AGTCCCTAAGGAACTGTA<^TAATAGAGCATGGCAGCAATGTGACCCTG 
GAGCAATAACAGCCAGTTTGCAAAAGGTGGAAAATGATACATCCX: 
CTAGGGAAGGCCTCGTTCCACATACCTCAAGTCCAAGTGAGGGACGAAGGACAOT 
5 GGACTACAAGTACCTGACTCTGAAAGTCAAAGCTTCCTACAGGAAAATAAACACT 
TAGAGCTCACCTGCCAGGCTACAGGTTATCCTCTGGCAGAAGTATCCTGGCCA^ 
AGGACCCCTGAAGGCCTCTACOU5GTCACCAGTGTTCTGCGCCT 
TACTCACGTGAGGGAACITACTTTGGCCAGCATTGACCT^ 
TTTTCATCCCCTCCTGCATCATTGCTTTCATTTTCATAGCCACAGTGATA^ 
1 0 T CAAAAGACACAACAAAAAG ACCTGTCAC CACAACAAAGAGGGAAGTG AACAGTGCT ATCTGAAC CTGTGGTCTTGGGAGCCAGGG 

TGACCTGATATGACATCrrAAAGAAGCTTCTGGACTCTGAACAAGAATTCGGTGGCCTC 
GCCTTTGGATGACCCAGCA 

HUMAN SEQUENCE - CODING 

1 5 ATGATCTT CCTCCTGCTAATGTTGAGC CTX3GAATTG CAG CTTCACCAGATAGCAGC TTTATT CACAG TGACAGTCC CT AAGGAACT 

GTACATAATAGAGCATGGCAGCAATGTGACCCTGGAATGCAACTTTC 
1 GTTTG C^VAAAGGTGGAAAATG ATACAT CCCCACACCGTG AAAG AG C CACTTTG CTGG AGG AGCAG CTGC CCCTAGGGAAGG C CTCG 
TTCCACATACCTCAAGTCCAAGTGAGGGACX3AAGGACAGTACCAATC 

GACTCTGAAAGT CAAAG CTT CCT ACAGG AAAAT AAACACTCACAT CCT AWVG G TT C CAG AAACAG ATGAGGT AG AG CT CACCTGCC 
2 0 AGGCTACAGGTTATCCTCTGGCAGAAGTATCCTGGCCAA^ 

CTCTACCAGGTCACCAGTGTTCTGCGCCTAAAGCCACCCCCTGGCAGAAACTTCAGCTGTGTGTTCrrG^ 

ACTTACTT TGG CCAGCATTGAC CTT CAA&GTC^ C C C CT C CT 

GCATCATTGCTTTCATTTTCATAG CCACAGTGATAG CCCTAAG AAAACAACTC TGT CAAAAG CTGT ATTCTTCAAAAGACACAACA 
AAAAGACCTGTCACCACAACAAAGAGGGAAGTGAACAGTGCT 
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Table 110 

MOUSE NOMENCLATURE 
ICSGNM N/A 
Celera mCG2257 

5 

HUMAN NOMENCLATURE 
HGNC PKDM11 
Celera hCG25389 



10 MOUSE SEQUENCE - GENOMIC 

CATCTCACACMTGACTTATAATATGTTCAQATTATTATGGGATTACT 
AG-GGCACTACACTCCAATXSAAGGAATTG^ 

AGTTAAAAGAAAAATAG AAATCTCACGAG GTGAAGAATTAGTCAAAGGAGACT AGGAAC CAACTGATAGAAATATAAAAGTT AAAG 
CTGG AACAGTTCAGTAATAAAATCAACAGGG CG CTATTGGATTAAAGTGAAAG CATAAAACAGAGATT CCCCAATCCACACTGATC 
1 5 AATTAGTGTGCACACTGATCAACTAGTTTGTGATGAGTTAGGAACT 
GCTAACTGATTAGTT«3AGGACTATGCCCCGCT^ 

CTGGTAGATCCTTCCCATCAGGTGTCTAGGTCTGCCCTAAGCTATATTGACAAGAGAGATTCCTG^ 
ACTTGGTCTCTGCCTCCTTCCACTCCACAGTCCATGAC 

CAAGGCT AT CCTCAAATGG GTGATATT TCTCCCTAAGCCTCACAAGTG CTGGGGCT AGGAGG TATCTG CT ACCACACCTGGAG ACT 
2 0 T ACACT ATT CT CTiATTTATGAT AGG ACAGGATT AACCAGTGG CTTCTGTTCATTGTCATACTGAATAACT CAAAAGGGCTAT CAGA 
AGGG CTTGTTGTTCCATG ACACAGATAT AAAACAGGCTAGATTG GCAGCTAGCAT AGACCATATTAGCACACTAG CT AAAGAACCT 
TTCAAAG GACTGGAATT CTGTG G ACTATGCT CCCTCACAGCAGTGG AATG AAATGGGGAAGTTTATGG AAAAAGC^ 
TCTAATGCTTG CAAATT AACCAGCAT ATTG CT ATATAG CG CATG GGTT CAAGAAG AAAC CATG CCAAAGACACTGGG AAATACTTG 
GAATGAGAAGAT AGGG AG AATCATG AGATGTGTAT AAGGG TAAAATAAAATTGG AAAGT TGGAGTCATT AGAAGAACAAACCGTCA 

2 5 G CT ATGAGG ATTGTCTAATATTGTGAGACC CCTGCCACIX5ACATCAGC CAGTCTC GGC CTTT CTGCATG C CGT CT AGTTTAA 

TCTCCAACAGCTTAAGTTAAATGAACACATCCATTAAAAATTTAACT^ 

GACTTGGGAGG CAGAGGCAGGCAGATTTCT AAGTTTGAGGACAG C CTGGTCT ACAGAGTGAG TT CCAG G ACAG CCAGGG AT ACACA 
GAG AAAC C CTG T CTCAAAAAAAAGAAAGAAAG AAAG AAAGAAAG AAAGAAAGAAAG AAAGAAGAAAAATT G ACATTGCCACATGAA 
AG G AAGTT AAAGTG AGACAGTATGG AG CGATGG CTCAGTGGTTAAGAGCACTGGCTGCCCTTCCAGAGGT CCTG AGTTCTATTCCC 

3 0 AG CAACCACATGGTGGCT CACAACCATCTATAATAGGATCT AATGC CCT CTTCTGG CATGTG AATGT AT ATG CAAAAT AAATAA^ 

AAAT AAAT AAATAAATAAATAAATAAAT AAAT AAAG TG ACAGTGTGGT AG AGGTG CAAAATGCAATCTGTATTTT AAATT AAAACA 
GTCCTGCACAGAGATGACAGGGTTAACCAGTTTTATCCAGTTGGCATACATTCTGAAGAATTACAGAA 

T C CAT AG T ACAT ATATAAAACAGACTAG ATTGTTTCAGGGTCAGATAG CTGTG CTGGG AATTTTCAGTGCTAATCTTG CATAAATC 
CTTT CAGAAAAT AG AG AAAGAG GAAAT AACTC CAAAGTCGTTCTG CAAGGC CAG CT ACT CCTTGT G ACAAAAT ATG ACATTTCCTT 

3 5 ATATTTTTATAGGAAAGGAAAATTTTAGACACATCTGTCTC^TAAATATAAATAGAAAAGTTGTATGTAA^ 

GAGAGAGAGAGAGAGAGAGAGAGCTTATCTGCATGTCC^CTATGTACT^ 
AAGT<^CAGAGCTGCCCCATGTGTGTGCTGGGAACCTGACCCA 

CCAT (^AAITTTITCTAAGGATTGTTTTAAACAATCT TTCAATGT AACTT AC CAGATT AAC CT ATCAGGGGAGAAAAAACAAAAACA 

4 0 AAAAATAAAACAAAACAAAAAAACAAAAT CAACTCGAGACAC^ CGTG AG ACAAAAATTCAG CAGTTTTT CAT AG CAT AC 

AGGAACAGAAAGGAG CGTCCCCCATCTAGTCTCAAGTATCTAGACAAACCCTTAGAGAACTTTGT ATTTTAATGTT CCAATAGCAA 
ATGTX5CTCTCCCTCCCCTGTCTCTGGAGTTAAGGAATGCAAAGCTACTGTCA 

G C CAGTGCTCTAGAGAAGAAAAAGCATATGATTTGAG AGAAAAGCATAAACGCACTATCAT TTCG CACATG GT G AATGATG ATG AA 
TGCCGGAAATCCTAAAGGATTCAACGACAAACTGTTAGAATTTTAAAAAAGGGCAAACCTT 

4 5 ATGTGTTGTTT AAACATGGTGACTTCCAAAT CAATTGTATT TCT ATAAACAAGTAAAAT AAATTGG AAGG C CTT TCATTTG CAGTT 

G AACTG TTAAAATCG AATG CTGAATG CTTACGAATAAGTGGTAT ACACC CTGG ATACT ATGAGT ACCAGCTG GGTAGCAGTAAAAG 
T AAAGATT ACATGTGT AT CAACTGGAGGACTT G AT ACTTTGTCACCAAAC CAAAGAT CT AATT CTACATGCACAG AAATT CAG AGA 
. AGCCAGACTGCCAAGCCTCACCAGACAGTAACCCTAAATTTAAAGCTACAG 
AATGGGGTCTTAGTGTTACTTTTGCTGGGATAAAACATGAGGACCAA 

5 0 GTTCCAGGTCATCAACCCAGGGAAGTCAGGGCAGGAACCTGGAGGCAGGAACTGATACAGA 

ATTTACTAACTTACTCTACTATAGGTTTGCTCCGCCCATTTCCTTTTATCAT^ 

AG AGAG CTGATT CTTCTAT ATCAGTTAT CAATCAAAAAAATGT AOTACAGGCTTGC CT ACAGGC CAAT CACATAG AGG CAT TT C CT 
CAATGGAAAGTC CCTCTT CCCAAATAACTCT AGCTTGTGTCAGGTTGACATAAAACTAG C CAG CAC AAACAG AGT AAGATAG ACT C 
CAAATAGACTCACACATGTAGAGTTGGTTGATATGTATGTGCGGGGCACATATATGAAGTGGGCTGATAA 
55 CAGGCATGTGTATGGAGGTCAGAGGACAATGTTCAGGAGTTGGTTCTCCATTCCTACCTTCTTTTAAGATG 
GGCCCTGGCTGGGTTGGAC^CAGCGATCCTCTGTTTCGACTGAAGGTGTGTACra^ 
TGATGCAGGACCTCTGCTTTC!TCTGTTGCTGTX3TTGCTG 
TTCCCTCTAGCCATAGGAGAGCTGGGGTTACATGGGCTCCACGAATCAATA 

ATTTTTGGCAAAATAAAATTAGTGAGGAG AGG ATAGCCTTT AAATG CATGTTC C CAAACT CATTTTTTATG AGAT CCACAAATACC 

6 0 AAAT ATT CT AGAAAG TTGTT TACAACTTTCAAAAATGAAATTAACC C CTGAG CT A 

CAATAGGAGAAAGCTAATATACACACATCCATCAAAACCCATGTGCCrrA^ 
GTACAACCGAAGTGTCTACAAGCAGTAAAAGGAATCACCATGTC 
GAGGAACATGTCTGCTATTCTCAGTATGGTGGCTGAATCCCAC^ 
CTATATGATTTCCCTATATGAAGTTCAAGATTGGCTGTAGCAAAGGCTGCCT 
S 5 GCCTGCAATTGCTTTGTTGCACAATTCAGTGTTGACTGCATAG 
TATAGCCCAAATTCCTGGTAAGCATGAACAGAAGGAGGAT^^ 
GGGTTGGAAGCAGAGGGCTCAGGTCCTGGTTGTGGCTGCGGGAATGTGAAA 

AGGAGCATCCTGCCTAGTTGCCACCTCCCTTGTCCTATTTTTATTCAATCCA.GGATCCCACTGGG^ 

AATCCTCTCTGGAAACCCMAGGTGTGCTCCCTGGATGACCCTAAGTATTTCTTGACCTGATOUU3GTCACAAAGTTTACCATTCC 

7 0 ATCGGTACAAATGGTGTGCTCACTAAGCAAGTTGGTAGCAGTAACAGGA 

AT CAATGGAAAGTT CTGG AGAGAAACTT AAGG AAAATCTTAAAAAG GTGGT ATG GTTTG AAT ATG CTTGGCCTAGGAAGTGG CACT 

ATTTGGAGGTATGGCCCTGTTGGAGTAGGTGTGCCACTGTGGGCATGAGCTTTAAGA 

TTCCACTAGCATCCITCAGATGAAGATGTAGAACTCTCA 

GATGATAATGGACTGAACCTCTGAACCGATAAGCCATCCCCAATTAAATGTTGTCCITTATGAGA 
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GTTCACAGCAGTAAAACCCTAACTAAGACACAAGGCATTGATGATTT 
AAATGAGGATATTACCATGCCATTATTTATAATGGCAGAAATTGGCAAC^ 
CATTTAAAATCATCTTAGAGCTAAAAAAGTGCTTGTAAAA 
GTGAGACAAAAGGGTG CX3TTGAAAATGATCCATACATACT AT ACTGTTTC 
5 GGAGG CAGAGGGAAGTGACTCT CCAATCACGGGGTCGGAAGAGGGG 

ACCGGGGGCATAGCGCAGTTCTTTTGAGCTAT 

TCT ATTAAG CTGCTTTTGCCAGGGTAATT CAGCACAG CAACAGGAAAAGATACCAGGACAG AAATTGGTC CCX3GG AACTGGGG CCG 

TTGCTGGATAAGCCTGCTCGTGCGTTTCTTAGGC^ 

CTTGGGCTAGAAAAGCTCTTGAATACTGGACACAGAGATTACTC 
1 0 TGGGGCCAGCTCAAGGQGTTTCTGACAAGACCTCrCT 

GCTTCATTCTGACCCAGCCCTCCACCTCCTAGAACCTGAATGAAGTTGAATGTAAAGG 

CTTCAAGCCAGGAGAGCTTTCAGACCCGGGCTGGGGAAGCGGCTCTAACT 

AT C CT ACTGG GAGAAGGATCCTGAGGACAAAACCATCAGAAG CTCT 

AATGTTTCCCTGGGATCAGCACACAGGAGTCTATAGAGOTGTTGGCACA^ 
1 5 GGCAGCAGAACTCAGCAGCAGCATCC^TATAATGCrGGCTTTGCGAGTAT 

CTTGCTGAATATTTTCAGAGAGCTGCTAAGGCCAGGTAATG 

ACCACAGAAT ACTGGAGATG CCAG AACCAAGGG ACACACACT CTGTGAACATGGACTGGAG CCAG C CCAGGAG AAAAC C 

TGTGTGTGTGAAGG CAGTAT AG CTG AATGG CTGATG CTATCCT AGTCTTTTGGAG C C CAG CTGATTTTGT CACAAG CTCCGGATGC 
CAGACATGGAGCTCAGGATTTGATGTTTTCC C CTGTTTGATTTTGG T TTTG'r rrGGGAGAAGAGGGT AT CAGAT C CTCTGGAAG 

2 0 TAGAATTATAGGTGGTTACGAGTCATTGGAC^GGGTGCTG<K1A^ 

GCTCCCACCCTCTCACCAGCCCATGGACTCCCAGCTTTTTTT^ 

CCTTATGTACTGTCTGAATGTGGTAACAGGCACATAAGTTACCATTG 

TATTTATTTTTTTGTACAAATGTACACAGATAAGATAAG 

TCAATGTGCACATCTGGAATCCCCATCCTCATGGCAAATTTCTGGATTT 
25 J GTGCACTTCTAGATTTTGATCTTTGG 

AAAACATCATGACCAAGAAGCAAGTTGGGGAG G AAAGGGTTTATTCAG CTTATACTTCCACATTG CTGTTCATCAC CAAAG GAAGT 

CAGGACTGGAACTC^CACAAGGCAGGAGTTGATGCAGAGGCCATGGAGG 

GCTTGCTITTCTTATAGAACCCATGACCACCAGCCCAGGGACGGCACCACCCACAATC 

TGAGAAAATGCCTTACAGCTGGATCTCATGGAGGCACTTCCTCAAGGGAGGCTCCTTTCTCTGTGATG 
30 TGACACAGAACCAGCTGGTACAGCCCTTCTTTTTGCCACCATTC^ 

TGAGACTCTAGACTTTTAAAGTGTTCAAATTGTTAAAGACTATGGAGACTTTTAA 

GGCTTAGTGCTTAAGGGCACrrGATTGCTCTTCTGGAGGACCTGGGTTCAAA 

AACTCCAAGAT CTGACACT CT CACATAG ACAC CCATGCAG ACAT AACACCAATG C CCAT ACAAT AAAGGTCAAT AGATTTTTTT AA 
AAG AACATTTGAAGTTG GACTGG GTGTATTTT AAAGTAT AACAATGTCATG AGACTTTGAGG AAGG GTGTAG AAGTTT ATGG CGTG 

3 5 AAT CCATGT TTG GG CATCAGGTTGACAAGAAAAG G AACT AAAT ATG CCTGTGGGGTATTATCTTGCTT ATATGAATTGATGTAGGA 

AACCCATCTTGATCACTGCTGAGGCCGTTCCTGGACTGTGTAAGACAGGGAAAGCAGGCTGAGTACT 

GCTTCAATAAGCCCTTTCTCCCTCATGTCACTTCTTTACAGACTATTTCCT 
GTGAGGCGAGCTCTTGCCAACACAAATCAGAGCCTTTGTCTCrCAGCTCTC 

4 0 CTGGGGAGGCTGGGCTTGGACCCCACTGCCACCTTCCTCATCTTTGCCACrrTCC 

TCTTCACAGGGGCTGTGTGCCCTGCAGAGACTCTGCCGTTCTTCrCriTGGCAAG 
CCCAAAGTGTCACCrCCTCAGAGAGCrTGTCCCCGGTCT 
AA.CCCTGTTGCCTCTCCTTGCTTGTCACACAGrAGAAA 
CTCAATGGAATGGATCCTTAGGGGACACTTGGGGTGTTGA^ 

4 5 GAGTTGCCTGTCAGTAGCTTACAGAAGAGCAAGAG 

ATCCCTTGGTCTTAAGCCAGTCCCTCTTTCCTTCCAAGAGAGAGATCCCTCAGCrrGTTGGGCAAGTGGGCAAGAGTGGAACC^ 
CTG<X!C!AGGAAGATGGCC CATGGGATCCGGGTCACTTCAGCA^ CAGAGTCATTGGAAAGATTTATGTCTTGCCA 
TTAACAAAGGGAGACATTTTTTTTTTTTTTTTTTTTTTTTTTTTAATGA 
ATGGGGAAACAGAAATGAGAAGGAAACAACTGCCAATGCTGTTGGGAATGAATTTC^ 

5 0 AGTAAAAAGGGAAG CCTGAT CGGTTACAGAGTTC CTCTCATT AGCAAGGAAGACG CCTGCTGGTT CCT CGG GGGAG ATGT AATTTG 

TTCCAGTTTTAGATTTTCCAGGCAATAAATCCTGAGTCAAATTTCTCCCTTGAGGATGTC 

AT AAT ATGCTTGT CTTAT AGTAAGTATCAAAAAG ACTCT CCAGGG AC C TGGG GG CAC C CACCATG CAGCTCACTGGGTG ACAGTTT 
TGTTTGGGTATT CTTATTTTTAAATTT T ACT AGT ATTT ATTT ATTTAT TTATTTATTTAT T TATTT ATTT ATTTATTT AT AAG ACA 
AGGTCTCTCTCTGTAGCACTGGCTTTCCTGGATGGATATCTAACTATTCA 

5 5 GACTATTCATAAGTCACATCTCGATGACTTCATGGGGACAATTTTGCAAGGAfc 

ACACTGGTAAACTCTGTGTGTGCAGTATTTGTGGAAGAAAGCAGCAGAAATCCATAGTTGC 
AAGTGGCAGAGAAGACATTGTCTGATGGAGGGAGGCAGGCTGGCT^ 
ATGTTGAACATCTCTGAGCATGTTGCCTCTTGGTAAAAAAGCCATACTTGCT 
AGCATGGGATGTGTGTGCGTGGATGCCATGCACCTACTGACTIT^ 

6 0 TCTCACCCACATTTCTGGGATTCCGGCCCAACTTAGTGAGAGGCAAGCTTGAGACAC^ 

TAGGGGGGTCCCACGAGCCCTGCCAAATGGTTCAGACCGGAGGAAACTAGTCTCCAAGGTCTC 

CCCTGTGCCAACCCCTGGGCACTTGGGTCT 
TGTTTCATGTATGGTCTCTTCCCCTGATAAGGG^ 

6 5 TAGTTAGAACTTGAGGTAGTCCTACAGACTTCTGACTGGGCT 

CACACACGACCCGGTGCTTCAGAAAGCTACACTGTGGAlCACACAGCA^ 

G CT CTTT ACAT ACACCCTCAGGGCT CAC C CTAGAG CTGGCAGCTGGACCTTTACAGCTTTGCAAG AAGAGGAAACAGGGT CAT AG C 
TACAGACACTCAAGGTCATACTGCAAGTGACA^ 

GTTGCCTCCTGTCCCAGCCCCCATCAGGTCTCCTAAGCAAAGCATCCrCAGGAGTACTTGACCTTGGAAAGCCCT 
70 CTGGTGGAAGCTCCTTTCCCTGGGTAATCGGCTCCCCTGAGGTGACACCTTCTTGGGACTATCTGGACCTTCC 
ACTGGGCAGGCCITCCAAGATCTCTTACTCAGGTAGAATC^ 

ACTTCCTTTAAACACCCCAGCCTTACTACATCCCTTCACTATCTTTCTTGCTGGGTGCCA 
TAGGCAAGTTAAGTTAGGA<^CAGTCCCGTTCCAGCTTTGTA 

7 5 CACCCTTTAGTCCCCACGTAAATAGTTAATTTTTCTCACAGGCCCCAGGGCCCTTT 
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CCAACTTCAGTTGATACTTCCAAACGGC^ 
AACGCATTTAT CGCCCT CTGACTATTTATT^ 
GCTTGCGK1AAACGCCACAACTTTCTAGCACGA 
CGTCGCCCTCCCCX5TGGGGCGCGCGGCGGGGTACX3CGCGAGCT 

GATGGTGGTCAAAGGAGGGGAGGAGAGCAGCAGCGGAGTCGCCGC 

ACGTGAACATGGCTTCGGAAGACGATGGGCGGCTCGCCAACCGGCGCGGGG^ 

GGGGGCX3CAGGCAGGCGACGGGGCACGGACCGGAGCACCGGAAGCGGGGCGGGTCT 

.CGGCTCCGCCGCGCGCTGACGGACGCGGCCGCCGGCCAATGGGAGCGGCTGTTCGGGCCCGGCGTTCG 
AGGGGGCAGGGCGGGGCGAGGGCGGGATGCAGCCGCTGGCCTAGTTGCAGGGACCCG^ 
GGCTTAGGGACCGCATGCCTCTCTGTGACCrGCCGCGGGCCTGGGGCGGCAGAGGGA^ 
GGTTGCAGCACX3GACTCTTCGGGAATTCGCCCCrrGAGTGTCGTCT 
TGATGCAGAAACTGCAGAAACTTGGTGGCCAATGACCTTCCT 
TGAGCAGCCTTGGGCTAATAAGGCAAAAAGTTGCTAGTGTCTTGAGCCCA 

G ACCTAGAGAAGGAAAGTCTGTCTTTTGTTrrrriTG AC CATGG CTT CGTGTGGAGGACAG C CGTGGTTTG TGATT AAG AGT GATC 
AGAAAGAAGCACGCCCAGTCCTTTGTGCGTGGGGACTGCX3^ 
TGGGAGTTTTGTCTATCTGAAGCAGAATAGTTTTCCATG 
ACTTTC 

CTGCTGGGAACCTTCTGGGCATATGGCCC^^ 
GGCTTTGGATTTTGCAGGGCTGTAGTTGATCTCTTCAA^ 

CAG CTGGGAG AGCATTG AGATAGG ATCAATTAAGTACCTGGC CCAGGGTCAG ACTCCTGG CTAGAG CTGTGAT CCTG 

GGCGATGCTCTGTGAACCATTTTGGGAATCACGATTGC 

GAGCAGATGCTCTCAGGGTGTGGTACCTrCTGAGTATCCGTCTCAAAAATGAGGAAClACTT 
CACGGCCATCACCCCCAGCCTGGAGCTTT 

TGGCTTACTCTT(^CTT AGATTCAATGTGGGATT ATCAGA C CTG AAAG AGGCAAAG ACTATTG GAAGAG GAATTGGGGGGTTG 

GAGTTATGCTGCCTTTGTGGTCTCTCACrTGGCTTTCCTCCA 

ATCCAGTCAGTCCACCCATCTGCAAGCCOVGGGAG 

(^CACTGGGGAACAGTACTTCCTGTTCCCAGGAC 

CAATTGGGGTCACCTCAGTCTCCCTAGTTCCTGCTCCCGCTCCCCCTCCAGTCTGCCTGGTGCACTTCTG 

CCCAGCCTGCCCTGGCCAGCCTCTCTTGGATC 
AATGTCTGCTGGCCTTTACTTTTTGTTTTO 

ATCCCATTACAGATGGTTGTGAGCCACCATGTGGTTGCTAGAAATTG 



ttgccttcagtcctcttattctaggtcaaggccatg^ 

AGCTGTCMTGCAAGACTCTCCTAAGATCTCCTTCT^ 

ATGAGC^GTGTACCTTGGTGCTCCACCATAAATGTGCAGTACAACTGAGGGATAGACCTTTGTTGTTTGAAAC 
GTTGGCTTATTCTTACAGGATAGCTTATGTCACCTCCTCCAGGA 

GGTCTCATCCTCGTCTCCTTGGATTCCTGTTGTTACTTAGTGATGATCTTTCATCGTCACC^ 

GAGGCTTGTGTCTGAGAGCTGCTGGGTGAATGCCCTCTGTGAGTACTTGGCCCTGACTGAGGACAGGCAGTGTGCTCCGAGC 

CTGAGCTGCAGTTCCAGCAGGAAGGTTTCAGATGCAGCAGTAAGTCACTGGTCGAGGGC^ 

TGTCTGGACCCAGTGGCACTGAGCTCCAGAGAACTATGATAGGCAATTCCAGGAGA 

CTCTGCTGACATAAGGX3GTGATTCAGTGCAAGGGTGGGAAAGGAATGGTGGTGATATCAGCTAGTGTCCC 

TCAGGCACTAAGC^CTCTGTCCTGTATAAGTCAGTTTCTCAAGACTGTGTATCACCTGAGGCCGTTTA 

ATTCATTTAGCTAACAGTTTTGGAATCTGAAG 

GCTGGATAGCACAGTGGAGGGGCATGTGACAGGGGAGATCACATGGTAAGACAACAAAAGAGAGAGA 

AGAGAGAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAGAGAAACCCAGGGACCTC^ 

AATCCTGCCTCTTTAAACATGTGAACCATTATC^ 

TACCCACAAGTGTTTTACAGATCGGGAAACTGAGGCITCTGAGAGGTGGGGTGAGAGGTATAGAGCTTATC 

AAAGTGGGATGTCACCTGAAAGCAGCCACTTGGAGCAAA 

AC^CAC^CACACACAGACACACAC^CACACACA 

ATGTGGAGAAGGGACCTGTGGCCAGCCTTGGAGAGGCAGTGGCTGGTGAGTGTGTGTGGGTGTGAG 
TAGAGCAATCTCTATGAAAGTGTCGTGTGGGGTGAGTGACCGTT^ 
TGGCCCAGGAGTACCTGGGAGTGGGGCTTGAGGGAATGAAGCTGGCT 
AGGCCCCAGAGCCTCATGTTCTCAGAGTGGGGC 

AG CCGTG CAG AAAGAAAAAAAAAG C CAG AAGAAAAAAAATGTGTGTT GTT ATTTTT AAAAAG C CTATTTCCT ATTG CAG ATTTGT A 
TCTGCTGATACCAAGGGGGCTGATGTTCACACG 

TGCAGGAGGGGAAGCCTGGGCCTGGGTGTCATCCTGCCCTTTGTTCCTACCTGAAGCTTATATATTCTTC 



GAAGGGGAGAGCTGTCAATCCCACTCAAGGAGCAGTCTCCTCAGT^ 
GC^GAGGAAGAGAGAGAAAGAAGAGAGGCGTCTGAGTGGGTCAGGGCTGGCGGGCAGrc 
TGTTGGTGCAGGAGGGTGGGATGGATATCAGAGCACAGTGTCCTGGT^ 
GTTGGCTCTGGACTGGCTCTGTCTTCAGCAGCAG 

CGCACGATTCCCTGTCTCAGCTTCCTGCTTTCTCTCTTTATGACAGATCT 

GCTGTTCTTAGCCCTGTATCTGGCACTTGGTTTACC 

GTTGTGAACGAATGGGGAGCGTTCACCCTTCACAACCT 

AAACCGTCAGCTATGCGGATTGTCTAATATTGTGAGAC^ 

TTTCTGCATGCCGTCTAGTTTAATCTGCAACAGCITAAGTTAAATGAAC^ 

GTTGTGCATTCTTTTAATCCCAGCACTTG^ 

TCCAGGACAGCCAGGGCTACACAGAGAAACCCTGTCrcAAAAAAAA 
TTTGAGACACCGAGGCTCAGCAAGGGGAAGAGACTT 

CTGCCTTACCTAACCCCC^GTTGCCAGCTAGGCTGGAGTAGCTGTGCTGGCCAAGAAGGAGT 
GCCCCTGAGGGATTTTATGAGACACCGTGACCAGATCCTTGGTCAAGAGTGCACAG^ 
AGAATG AG GAGG AAG C CAGCAGAAGAGATG ATTACAGGTTGG AGTGGACAGGT AAAGGCTC^^ 
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TGTTCGTTGTTTGGTTCGGGAGGACITGGAT 
ATTGTCTCCTGCAGAJGGACCTGTGCCACCTTCCT 
CCAAGGACJVAC^TACCZTGAAAAGAGATGGACAGAGGCCACACCCT 
CTGGCTCCAGTGTCCTCTGGCCACAGAGATT^AGGGTGTCAGTCCCG 
5 ACCAAATTGAAGTCACTCCTCCCCAGGAGGGGTCGGCTGCTGGAAGGAAC^^ 
TTCACATCCATTTATGGAACCTGGGCTCAGAGAGGTTGATTG 

TGAAAG CTACIATCACAAAGGAGGGCTTTATGTCTTATCTAAT AATTCC CAG CAGGCGACTTTGG CGCTTTAAAAACGATTTGATAG 
GTAGGGTCTTGGCACAGATGTTGTCAGGACCTTTTTCCACCCTGGGAACCTCAGAGTCA 
CAGAGTCTCAGATGAAGTGATGATGGGGGAGGGGAGCAGACATGGGCAGCTG^ 
1 0 GTAGCAGTGGAGGGCTGGGTTCrATCTTTCMTGAGACACCTGAACCTCTG^ 
CTGACAC^GAAGGAAACCCAGCATTCTCTTCCAGGTTCTGCCI^ 
CCCACAGGACACCCCATTGTGCCATTGAAGAAACnTlGTGTCTTAGTCAGGG^ 
CAAGTTGGGGAGGAAAGGGTTTATTCAGCTGACACT^ 

GTCAGGAAGCAGGAGCTGATGC7VGAGGCCGTGGAGGGATGTTACTTACTGGCTTGCTTCCC 
1 5 T AGAACC CAAGACTAC CAG CC CAGGAATGGCACCAC CCACAAG G GATCCT CCCC CC CTTG ATCACTAATTG AG AAAATGCCC CACA 

GCTGGATCTCATGGAAGCATTTCCTCAACrrGAAGCrrCCTTTCTOT^ 

ACCCAGTACACTGAGGCTCAGAGAGCAGAACAGACCTTATAGGAATCA 

TCTGTGCCCAAGGTTCTCCCTCCTGGATGTATTTGTAGTAACCAGTGTCCATACT 

CAGGCTACGGGTGGTATATGGTGTGACTTAACCTTGAGCTGAGCTGGTGTGGTCATATCTC 
2 0 GACCCCCACCCCACATACAATCCCAGAGCTrCCCAGTCT 

AGCGAGGGGGCCTGTTATGAGAGAGCTTGTCATATGGCAGCACATGGG 

AT AGGGTGTGAC CTGGAGGGTGGGGAGCTATTG ATTTGACCTGG CTCAG ACCT CACAAC CC C AGCAGAGTCAGGGTGACTGGTCTT 
OTATGTGTCCCTGTATATCTGTGTGCCTGTAAAGCCCCTGTGCAGACrAGCAGGGTTCCAGGAA 

2 5 GTTGATGTTTGCCAAGGGTTCAGTGTCCCAGCCAGAAGGTGTGATG 

TGTGACITTAGTGCATGAGAGAACAGGACTAATCCCTCCCAAGAAAT^ 
CGTGTGCACAGCCCTGCCAGCCTCAGCTGCCCCACTGTGCCCA 

AAGAAGTGACCCAGAGTCACACAGATAGGAAAGGCTGAGCCGAGATCACCCACTCTGGGTCACTGCTOGTGGTG^ 
ATCTTTGCACGGGGTGACTGGTGGCTTTCCCGTTTCACCCCT 

3 0 TAGCCCAGGCCAGTCTGGGGCATGGAGGACTCT TGGCACAGTGCAG CTGCATTGTGGGGTGGTTGGGCCTATCTACCTAACACTTT 

GCCATCATATAGTTGACCAGTATTCCCTTX3GGGTCTGGG 

AGCAGACTGGCCCATGCCACG CTGGGAATGCTGGGTGGACCTCTGCCCTAGGTGAGCAG CTGCGAGTTACTGTTCTGCTACAGATC 
(^TC!AAGGGAGTGAGTGTC!ACAGGAC!TTGGAG 

TAGTTTGCAAACGCATCCAAGGGCCACTGTTTGGAAGCTGAAACAAGGTGGT^ 

3 5 TTGTTGTTTTTAATCACAGGTAGTTTGAGCCIATTCTGGCTCCCATCCTGA 

GGCAATGGGGAAGAGAGTGCAGGTTCACTTCCCAGAGGTAGGGGAGCAGCrGAGC^ 
GAGGGGAGGGAAGGGCATTTGGGCACTGAGAACAGGTATGTGTAGGTAAGCATTGGGAAGGCC^ 

GTCTTGG CATT TGAGAG AG GGG AG GGGATG ACCCTAG AGACTGG AAG AACTTG CG TGGAG CTACTGAGGAGGTCTTG AT CAT T CCT 
TAGTAGAGGTGAGATGCAGCGAGCTTTTATTTTAGGTTGAGAACTTCTGTCCCTGCAGCAGG 
40 ACAAGACCACCCTGGK3GTGCGGTCCTTGTTTTT 

GCTATCTGCCCATTCTATGGGGCAGTCCTGGGGTGGTGAGCCAGCAAGGCCCCTAGGGTGAGGCCACAGCC 
GGTCCTGAGCTTGTGGTTGCTGGCTGTGGGTGAAGCITCATTCCCCCACCTCT 
AGCCIAGTGGAGTGGCCAGGTTCTTTGCGGCCTGGTGGGGCTCGTGCACGGAGGAG 
TACCCTGGTITGTGGCCTACCATCTGGAGAGCTGGGG 

4 5 GCCAGGTCTGAGGCCAGCATTTCTGCCACACCTGGAAGCT 

GGTTACTCCTTCAGCTTTCTCCTAGGCTCTTTTCTGAGCTTGAGGGA 
GCCTGGGCCTGCATGTGGCCACAAGCCCAAGGAGGGGGGCTGCCCAGGAGG 
TCTCCCTTCTCCITGGGGCCCCGTTAAGGGTGCCCCTGGCTC^CTCTCT 
CGCAGGCTGGGCAGCTGCTGTTCAGGGGAGTCTGCGCTCCTGC^G 
50 TTGGGCTTCAGGCCAACCAGCCTGGAGTCACTGAGGTGCCI^ CG CTCTGT 

GTGTGTGTGTGAGGGGGTGAGGAGCAGATCCTAGGTCGGTGGTCTCAGGTTTCCCCCTGCTGCCTGCCCACCOT 
TTCCCTGGTTAATGTCATGCAGGACTGCCTAAGTCCTGTTTCT 

TCCTCTTAGGTTGCACCCAGGAGTCAGGG CTCAGTCAAGGGCTTGGTTTG CCTCAGGCACCATTTAGTATGCCCCAGTAGCAACTC 
CCCGCTCCTCTCTTGTCTTTGCTTTGATTGTTTTGACACAGGGTCCAACCATGTAGTCCTGG 

5 5 CCATCAGGGTGGTCTCAAGTTTACAAGGATCTGCCTACCTC 

TGTCTTGGAGCCCAGTCATTGC CCATTTATTTTCTTGGTGCCTTC CTAGAGTGTGG CCGTGACAGCGCGTGTTACTGGA 

ACCACTGTTTTTCACAGAGGGGTGAGCTGATCTTAACTTATTCAT^ 
GACTAACrGAGGCTCCCAAGAGTTCAGTGAAGATGGTAGGGTCTTGTGTGGGTACTT 
CTCCTGGCTCTTCTGTCTACACTGGAGTTGGCTTGAAGTAGTGATGTCATCAGTAT 
60 TAGCTGGGTTCGGTCTGTTTGGGGTTAGGATTCACTGG^ 

CTATGAACACTTCAGATGTTTGTCTGCTTTTGCCTCTCTCCTTTTGGAACTTC^ 
ATCCTGGAACACCCCAGAAGCTCGGCAGGAGCTAGGCAGCCTCTG^ 

TGGGTCCCTGCTTCCCTAACCGGCTCGGCAGGGACACCC^TCCAGCTGTCCATGGGGCRTTAGACTTTTCTCTC 
CTTGTTGTAGGTGATTGTATTGCTGTGGGGTGAGGTGAACCACCATTTCCCAA^ 

6 5 CTGAGTTTTAACCCTATAAAGGTCATACAACTATATATTTGCTGTAAAAGTGGAATGTTGACAAATA^ 

GGGCATTCCTGACTCTTCCACAAGGCTAGCATCACAGTAGCCGTGGTC 

GGACACAGACAGCGAGAGTTGCCTGTAGTATTGACTATTGGGTACATTAACTTTCAACATTCT 
GTAGGGTGTGTGTGTGTGTGTGTGTGTGTGOBTGCGTGTGTGTGTGT^ 

ATGTGCTGGTGCATTTATAGACCAGAGGTCAGCACTGGATATTTTCCTCGGTCACTTTCCACGTTGAGATGCGTGGCTGCCAGGGT 
70 CTCTCTCTCTCTCTCTCTCTCTCTCCATGGGGCATTATACT^ 

GGGTGAGGTGAACCACCATTTCCCAAGGCAGATCTTTGTGTCTTTTAAAATT^ 
CAACTATATATTTGCTGTAAAAGTG^AATGTTGACAAATAAGGCGTTCCTTAGCAGACGGGG 
AGCATCACAGTAGCCGTGGTCCTGCAC^TTCTTTCTGGATTCAGTCTTGTAGTGTGT 
GTATTGACTATTGGGTACATTAACTTTCAACATTCTA 
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GTGTGTGCTTGCACTTGCTGGGTGTGTGTATACATGTGCTGGTC 
ACTTTCCACGTTGTGATGCGTGGCTGCOIGGGTCTCTCTCTCT 
TCTCTCTCTCCTAACCrGGAGCTCTCTGATTTGGCTGGGCTGACT 
AGGAACTCACTGGTGCACCTGACTTATTTTTACATCCCTGGGGTTTGA 
5 GGCCGTCTTCCTGTTCTCCCCCGCCAGCCCCCCTCTCTACTATGTAGCCCAGGC^ 
TGTTCTGAATACIXIAGATTATGGGCTTGAGCCATCAG^ 
CACTAATGCATAGTGCTTGGAAGTCATTGTTATTTAGCCATTCC 

ACTTC CTTGGG C CTCTCTGGAGTTG CTCTTG CTGGGTTATGGGAACCTGCTGTGCTTT AAAAATGTATTAT CCAATA 

TGT CCGTT AAG AGTTGGTGAAGGAC CAAACACCT C C CATGG ACG CCAGTTATC CCAGGCT CACTTCTTTTG AG CTGGATGGAAAAT 
1 0 CTGCATCCTTGACCCCAGTGTCTCCTCTCCTTCAGTCTCCTCTGCTOT 

GTGGGTGT C CAGAAG ATCAG CAGCCTrCTGGAATCGGAATGCCCAGGCTGAAAT C CTGACC CAGCTC C T CCAAGAG CCACCATTT CT 

AGAACTGTGGCACCTCTGTGGCTTGGCTGTATCXACTGTGAACCA^ 

AGGGCTGGCCAGGCAAAGGGTAGGCAGCCAGCAGTGTCCCTGTAGTGGGGTCCC 

GCGCAGAGCCCACGTCCCTGGTGAGCAGCCTGGGTTGGCTCTCGCCTTTGTGACGCCT 
1 5 GGACATTGCTGCCTGGGAATGCAACGGGTGATCAGATTTCT 

G AAAGTGG CTGTC CTG GTCATTTGCTCTGAGGAG GTGGCCAGGAGCAT AGCACCT AATGT AAC CCCACACCGTGGACA 

GGTCCTCAATGGGGCCGTTGTGGCTGTGGATTTGATCTCTGTTGGCCTGCATGGTGG 

CTTGCCCCCTGGGAGCTGGGTGTCTTAGCACCATGTTGGCCCAGGCAAGGGT^ 

TTCCX5TGTTGAGAGAATGGCTTCTGACTCTTCCTTACAGCCCTGGCAAGCAGCAGAG 
2 0 AAACAGGTGCCCCGGTTGTGGCCGTTGCCCAGGAGGAGAGAAACCAAGGTTACA 

ATGGTTGACGACTGGCAGGGCTGGGAGTGGTAG CCAGG CCAT CTGGCTGTCTATTCACTCCTTTCTTACAAGGCCACCCAATGTGA 

TCATTTTGGCAGAGGGTGGCCAGCaSGGCGGCCTGCTTAGCCI^GCCTCT 

TGGGATTACATTTTAAAAGCAGAGGAGTGTGTGGCTGGCCCCA^ 

TCAAATGGCCAGCTGAAGAGTCGCTGCTCCAAGGTCCTCrGCCCTC 

2 5 AAGCCATCACTGGGCCAGTTTTCAACGGGGAAAAGTA 

TCTGCCTGCCCCCTCTGTCCAGCTGCTGGCTCGTGCTGGTCTGCTGTCACCTCAT^ 
TGAAGGTTGTAAGCATAGTGTCAGAGGGATTTGTAC^CGGGTTTCACACATCGCTT 
ATGTTCAGTGCCTTGTGATGAGCCAACTACTCCTCAGAATAACCCATGATGGCCGCrrGTTGTTAGTTCC 
GCCTCACCACGTAGCCCAGACTGGCCGCAGATTCACAGGGCTTCTCTTG 
30 GGCTAGCCCCATCCCAGTCTTCTAGCACAGGCTGTATGAGCAAGGGAGCCATCAGACTGTGGAATAAGGG 
TGGTGACTGGGGGGGTCTTGCGGGAGTTACGTGGACTCTTTGCAC 
GCTCTGTACCCAGGGCCCCGGCCACAGCAGCTGTGCTCACTCTGGGAGACCTGC^ 
GAGGTGCTGCTTCTCCCAGCTGGCCATACACACGCTCAGCTGCCAGCCCCTGGA 
TGGGTGTTACTGGTGCTCTGGGACC^GAGGGACAGAGGGCAGATAGTGTGCCCACCTTCATA 

3 5 CTCTTTCAGCCTACAGTGCTCGCTCCCCCCCCCACCCCCCGCAATCCCCTCCCCGCCCCAAGCCTCC^CTGAGCCT 

CAGCTCAGGAAGGTCTGGTAACTTACCTGACTCACTCGCCnTCAGAT^ 

CCACAAATAGAAACAGCTG CCTAGATG AG AG CT ATAC CCAGGG TCT CTACTGGGT C TGG TT AG TT C T ATGAGTG ACACAGC CACAT 
AGGCCCCTGTATGCAGGGAATCCTTCATTCCCTCAGCCACGTTTACTCTGCACAAATAGCCCAATAGCAGGATCTCT 
TTCACACTTGCTCTGTGGCCTTTTGTAAGCCCCACTCCCTGGGTCAGTAAGGGATCTCCCCCCACCTTGCCAGTG^ 
40 GAGGGCTTGCTGTTGGGTCTGAGACTGGAAAACACAT^ 

TGGAGATCAAGGGGTCTGGGTTCAGATCCTGCTTCGTCGCATCCCTGTAGTTGCCCAAGTATCCTCTCGGAGCTGACT 
CCTGTGAACTCGGAGGGCAGCAGAATGGGCAGCTCACTGGAACCCAGGCATGTCTGCTTGCTGGAGGTCTGCTGCCAGGGAGGO 
CAAGGAGGGACTTAGGAAAGCTCTAGAGCCCGTGCCAACAGCCC^GGAAGCAGAGGCCAGAAATAG 
TCCAGATGGCCAGGCTGCCCTCTGCCAACCCCAGAGGCCAA 

4 5 AAATTACTCCTTGGCAGGGCAGACTAATTTTACCTTCGTGGTTTTCATGA 

AAGGACTCTGCTAGTTGGGTGCTAG CATGCTGTGTCCGTGTGGGGTTCCCCTGGGCTGTCATGATTTGGTAT AAATGGTG CAGTGG 
ATCCCAGGGCTGGAATCCATCAGGGTTTTGTTTGTTTGTTTGTTTGTTTGTTTGl^ 

AGTAAGGAGGTTGATGGGAAGGTGTTTGTCACTTGCTGGTCCAGTGCTGTGAGTTTTTGGATGCCGCCCCCCCCACCC 
CCCTACATTGTAGTCACAGAAAGCCAGAGCAAGCAGGTAATGAAGGAAATGAAGCCCAGTCGCCTCATTGTTCAGACAG^ 

5 0 GGGGCTCAGCGCCTAAGTGGAAGAGGTCCCAGGCTCACAGCAGCACAGCCCGACTTGTCT 

TTCTCTAGTTCTGATCACAGCAGGGCGTGTCCCTCTGCC^CTTGCCTTCCCCGATTCT^ 

CTGCCTGTGCTCTCATTTCCGGCCTCCTGTACCTCTGGGAGCACAAGATGTCATCCACTTGAGCTCCGAGTCTGCGGGG 

TAGGGAGCAGTCCAGAGGAAGTCGAGOTGGCAGCTGGATGCATCCCAGGTCCTCAGTCTGCCATACAGGTCACCAGCT 

ACCACAGGAAACTCAGAGGCTGGCTGCCTGGCTGCCTGCCTTCAGAGCTGGGGAGGCTGGCT 

5 5 GTCCAGCACACCGCCACTGTGTGGAGCCACCTGCGCA^ 

G AGGGTTCTTCTGT ITCTrr AAAAC CGG CAG AAAG AAAAGTTG AG AT AG CAACAG CTG CCAGTGTG C CACCT ACATATACTTCCTT 
CCTTCCTGTTGCCTCAGGGTGGAGCAGGTTCGACAGTGATTTTCTTGTCGGATCOT 
TTCCTCTCTTAGTTGCAGTTCATCACCTGTCCTACAGGGGTGCCACCTGAGAATAG 
ATTGCATATGGCTGCTGTGCIAGCTTTGGTCAAGCCACT 
60 GGTCGTGGACTTTATGGAATCACCACCTGCCAGAAA 

GAGGATCCTGAGAGCTITCAGGCAGCATTGGCTCCAGGGGCTCGTACAGTGTCT 

GCATCCTTTTCTCTTCGCAGTGGTCTCCTGGGACACT 
ATAGGAAGGAAGGTGCACTAATCACTGAGGACTGCTGATCCAGGCTCTC 

6 5 CAAGACACCAGACAGTCXMTGCCAGAAAAAATAAGGGTCAGTAAGGCATTCCCATAGGCA 

CCCTGTTCTCTGGATTCCCAGCAGGTAGGACAGTGCTGGCATGCA^ 

CTCCACTGGGTTTAGAAGACACCCCACTCTGCGAGGGCATTGTCACATT 

TGAGGAAGGTCACCTTCCTGGCAGTGTCACTGCTCTGCAGAGCCTCT 

ACCGCAGTCTGTCCC1ACAGGTGTGACAAGCAGACATTACGAGTGGTAGTGCCCGTGTCACCATA 
70 CATCATCCCCCCTTGGTCTCCTTCATCTGTAAGCTTCAGTAGCTCCCCACGCTGCTTGTCCTTTTGA^ 
CACCCATAGGAAGGGAACAGTGAGGAGTGAGATCATGTGTAGAGTAGGTGCTTGCGAAAGGGCAGGG 
AACTCATGGATCTGTACGTGAGCCCTTCCACCTTCGTGGAAAAACTGOTTACTCTGGTGTTTAT^ 
CACACACACACACACACACACACAAGAGTCATA^ 

TTTATTGCCAGCAGCTCCGCGCACAGTAGGCTCCCCCTGCATGGCAGATAACTG 

7 5 AAATCACAGG^AAGAATGTGATGAGCATACAGTGGGTC<3CTGCGTAGGGGAAGAGCCCA 
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CAAACAGAGAAGCCAATGTCTGTCTTCTGGAAGCCATC^ 

GCGGCAGAAATGGTTTGGGGGCCAGAGAACAGGAAGCCCTT^ 

TCAGGCGGGCftGCCCTACCCCATCTTCA^ 

TCGGGGGTGGAGCCTAGGGTTCTGTTTAACCAGCGAGTTCACCT 
5 CGAGT ACATCTGAGTATCAGAGCCATTCTATATGTG CTG CCAATGGCAT CAAGTATTTG AAAATGGATTTCTTTGTCTATGTGTGT 

GTACGTGTGTGTGTGTGTGTGTGTGTGGGTGCACACGTC 

CCTTCCACTATGTTATGCCTGGGGATTGAGCT 

TGAGTACATITCACACAGCAACCTTACC^GTAGATGCTGCCCATC 

CTACAATAACTTGAAGCATCATGGAATGCTCT 
1 0 TCCCAGAGCCTGTTGAGTGTCCACTTTGCACCAAGCCCrrGTTCTGGCrG 

TCACTTCCTGG AGTCT ACATTCT AGTTGGAGAG AG AC CGGG CCGATCTTAT AAATAAATACCATG CACATT GGTTGG AT AACGGTG 

TGGT CGGCAGAAAGTACAGCAGG CAGGAGAG AAAGAGCAAGGGG TACTAGTTAATGTAGAATGCCCAG AG ATTTG ACAT CAGTCGA 

AG C CTGGAGGGAGGGGCAAGCCATG CGCATCCTCTGGAAAG AG CACTCCIT ACAGAGGAAAT AG CCACCACAAAGTCATGTGGG^ 

GGAGGGCAGGCTCCTGGGGCTATGCTAGGGGTAGTGGGAGAAAGAACCATGAGTGACGAGGTCGCAGG 
15 GTATAATCTGCTCTGCATGAGATGGTCACTGTGCAGGGTTTGAAGCAAGGGGCC 

CCCAGTAGGGCKCCACTCTGGGGAGGCTGCTATGTGTGAGTAGCTG^ 

TAGTAAAGCTGCCTGATGTGTTGATGTGACX5GAGCTCCCAAA.CCCTAGGGAAGCTCGGAGA 

GCCCACGTGTAACTTTTACCTGGTTTCGTGGTGTG 

GTGAAGGTGTGAGGTATCACATCCAGGTTCTGTGCATCCTGAAGAGGCCCCCGTACCAGGTCCCTCTCCGTAGA 
2 0 AACTCCCAACCCAGAGAGCATGCATCTGAGTTGGTTTCCAAGGCCCAGA 

AGGGGCCCCGTACCTGCAGTGGGGCAGGGCAAAGCAAGGGTGGGTGACAGCTG^ 

G AGAG AGAG AGAG AG AG AGAGAG AGAGAGAGAG AAGCTTCC^AGG CAGTG AAGAGGTGGGCTC C TGAC CGGCAC 

GCTTTCCAGGCTCAGCACAGACTCCAGAGGACTO 

2 5 TGTCTATGCTTCAGACCACTGTAGATCAAGGCTCCGAG^ 

AGGAAATGGTGTTTCTCACC^TTCCTGGCTTATAGAAAAGGCCGAAGCCAGACACTGCCAG 
GTGTGCATGCATGTGCOTGCGTGCGTGTGTGTGTGTGT 

ATGCACATGTGAAGACTGGGGGTTGACGTCCACCCCCATTTCTGTCC!ACCTTCCTTTT 
CTCACTGTTTCAGCAGACTGGATAGCCTGCAAGTCCCTAAGCTCTGTCTGT 

3 0 GTGGGTGCTGGGAACTGAACTCAGTCTCTGTGCTAGCCCAGCGCTTTAC^ 

TGTTTAGTTTAGTTTTCTGGCTCTGGTCATTGGATCCA 

GCCATGTGAAGTAGGGCTTAATCTCCAGCCCCTGTTCCCCAGTTTCAGAGCTCAGTGGGTGAGGOT 
CACCCTGAAGCTCTCTGGAGTCC^CAGAAGTAAACTCAGGTGCT 

3 5 GTGCTCTGO^ 

TTCACTCACTTTGACTACCCCAAAGTGGGTCAGCTGTCAGCTGCAGAGGAGAGAAATCGTGTGTAATC 
GTTCTTGCTGGTAGGGATGGCGCACGCCTTAAATCCC^GCACTTTGGAAGCAGAGGCTGCCAGATCTCTGTGAGTTCGA 
CTGGTGTACAGAATGAGTTTTAGGACAGCCAGGACrrACACAGAAAAACTCTTTCTTG^ 
GGAGAGATGGCTCAGGGATTAAGAGCACTGGCCGCTCTTCCAGAGGACC 

4 0 CTC CAG CTCCAGGGG AC CCAGT ACCTT CACT CACGAATGCAGGCAAAACR.CGAATTCACATAAAAT AA TAAAAC CAAT CAT A 

AAAAAG AAAAG AAAAG AAAGAAAAAAAGGC CAGG CTT CATGT CCAGT CGC CGG T AAGACCATGGAGGT GTACCAT TG AT TG ACACG 
GTGGCTCCTGCTTCTGGGGCTGAGAGTGGCCCAGCTGGTGCTTGAAGGGAGCTGTGACACCTGAGTCT 
CAAGCCCACAAAAATAGTGCCACT'CTGAAACAGCTTCTTCIAGGGC 
GAACATGTGGTACATTTCAGAAGTCCTTCCC^GAGGGAGCACCTTAGGAGGAGC^CA 
45 AGCAGGAAGCCCTGAGCTGACTAGAGCTC^AGTTCCACAGTTGCCTATGGTGTTTCCTGCCTT^ 
CTTTGCATGCTAGTTCATACAAGTACACACACACACIACACACACAC^ 
TACCC^TGGAACCAGCACCCAGGCTGAGAAATCAAACCTGATTT 
CCAAGAATAACTCTGAACTTGACCCCTACTTTACTTCCTCTCGTAGATGTGACT^ 

GGCTGTATTTTGGCTCATGGTCCG^^GATATAGACCATTGTGGTGAGAAGGGGTGGTGTCTGGAACTCTACCTGTGATAGATGGG 
50 GTGAGGCTTATTCACCTCTCAGTGACIAGGCCATGCTGCAGAGAGACCT 

TACCTGCTGGAACCTAGTGCAAAAGGTTCCCACAC CTT CCCCAAACAGTGTCAGCAG CGGGGACTAACTTCTCAAACCCTGAGCCT 

GGGAGGGAACTGGGCTAGGAAACAGTTTCACACTACAGCTGTCCTGGGTTTGTTCCCT 

GTTACGGTTGAGCC^TAGTGTTCATATAGTTGTGATTAGAATACTCCCCTTC^ 

TTAAAACATG AAGCTGAATACATTGGTAT ATGCTTGTCAT CT CAGCTCAGGAGGCTG AGG CAGG AGG ATCAAGAATT CAAAG CTAG 

5 5 ACTCGGCTTTAGCAGACTGTCTCCAAAACIAACAAAACAATCAAAA 

GGGCAGAGTGAGTGTGCATGGTCCTTTAGGGCCTTTGCTTAGGATGCTGACCGCTCX3CCAGCCTGT^ 

CATGGTGAACATACCAGGCTGCCTTGCTCCATTGGTGGGCCCCTCACCCATCACGTATGTGCCCAG 
TCTAACCTGGATCCCTCCCTTTGCCIACCCCX3GGGTTCAATCTAAGAGTCTGTTGACAGAT 

6 0 TCTCAATCTGTGGGTGTGGATTGCAATCCCTTCGTGGATCAAAGGACCCTTTC^^ 

ACATTTATGTTAGGACTGCAGTTATGAAGCAAC!AGT 

AGTGTGCAGTGTTAGAAAGGCrrGAGAAGCACTGCTTTAAGTGCTTAATTCTCTGAGCCCCTCCCTCTTT^ 

CTCCCCrCCCCATCTTCCCCrCCCTCCn , CTCCCCTCT(ZATGCTGAGGTa\AACCTCATGTGG 

TACTTTGTGTCTCTTTGTGCGACTCCTGGGCTCACTCCACACTGT^ 

6 5 GCTCCTCGCTCACCCACCACAAGAGCATGTGTGTTATTTCACTTGGTGACCTTAC^ 

GGAGTCCCTCTTGGTATTGGCGTTCTCTGATTCCAAACAAATGGGTGACTGGGTGCTGATATGTGTCTGCT 
ACTTCCTGAGCAGTCTAGTCCTCTGCTGCTTCCCCCCCCCCATTAC^ 
TAGCCGTGCAATACACJAAATTTCCTCAGTGACCGTCTCCACTCT 
AGCAGATTGTGAACTGCCATCCAGCCTTCTAGCCCGTGCTCTGTGCCAGGTTCTCT 

7 0 CTTGGTGAGCATAAATCCCAATGTAGGGGACAGCTCTGCAAATCCGGGTTTGC 

CTTCCTTAGAGCGGATTCCTAGGGGGAGATCTCTCATGCAGAGAG^ 
TCCCACGAATX3TCACACTCCAGCCCCTAACATTTGAGTAAAAACTTCT 
TTTTGTGTGTTTACTAATGGAGATGTACAATGGAAAGGGCTGCCTAGGACTAAGAA^ 
CGAAAGGATGAGAACTCTGATGATGAGTGATGTCTGCCATGCGTGTCACATTGGAGACTATAGGAGACTA 
7 5 CTGACA.CCCAGAGCCCAAGTGTTAATAAGATTTTACACTTTTCCACTGAGCATTTTAG 
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AAAAT TTTTCTATTGAGTTG AGCATCCTGAT A CA 
TCCAAGTGCTAAAGGTACGATTTAGGGCTTCACACATGATTG^ 
CCCCCCTTTTTGTTTTGTTTTGT 
TCCTGGAACTCACTCTGTAGACCAGGCTGGCCTT^ 
5 GGGCGTG CGCCAC CACTGCCTGGCTCXrCTTTAATG CGTATACTTTTCTTTCTTT C TTTCTTT CiT'11'GGAGACATGATCTTACT AA 

GTTTCCATGGCreGCCTTGCACTTGCGACTCTCCTGTCAACCT 
AACATTTAACTAACATTAGAAeTACTGTTAATAAAGAAACCCrrGTCT 
AGGGACTAAACATTGTGTGCGCAAACCTGACCreCGAAG 

1 0 TTAACTGGTTTGTAGTGGCTCACACCTTTAAT CCCAGTACTTGGGAGGCAGAGG CAGATGTGAG T TCAAGG CCAGCCTGATTTACA 

TAGTGAATTTCAGGACAGTCAGGACTATGTAGAAGGATCTTGTCTCTAAAAGCAGGCT 

TAATAGATTTATCTCACGATGATGCACTCTAGTAATCCTGGCTTGTC 

TTAGTGAGTCTTGGAACTCA.CTATGTAGACTAGGCTGGTCTAGAACTCAGAGAGTTCT 

AGGTGGGCTCCACTGCACCTGGCTCAGACTCTTAAAAAAGTAGGTAAT 
1 5 TAGGAGCAGCTACCTGCATTTGCAGGCGTATCTTACAGGTAAATTCA^ 

GG CACATAAGCTCAACCATTCAGGAGCTGG AG AGATGGCTCAGGGGTT AAGAG CATTTG CTACTCTTACAGAGTAT CCAAGTGTGG 

TTCTTAGGACCCACACTGGGTGACCCACAGCTGCCT^ 

CACACATTTACACATAC CTTGGTATATGTGTGTACACATCTAT ACAT ACAT G TG CACACATATACAAAT AAAAT ATAATT TTAAAT 
TCAGCTATTCCTTCTAATACTTCCCAAATGTCCCCGCTCCCGCTCCCGTGGTCTATTCATT 
2 0 ATTAGGAGGTTGAATCGGGAGGATTGCCACAAGTTTGAGACTACATAGTGAAACCCTGTCTTAAAAGAA 
GGGAAACTGAGGCTTAGAACATAGGTAGAAAATGTCTCAATTTATGGCAC^ 
GAGTCCTTTAGATATTCAAGGGGACAGGCAGGGTC 

2 5 CTCGGAGGAGCCAGACTGGAGGGATCAGTGCCAGCAGCTTGGAGAGCTTC^ 

GAAGTGGCTTTTAGAGCCTCACCTGGCCTGTGCCrTCCC^ 

TTAGTCACATCATATTCTGACGCATGTTCCTGTAAGCCrTCCATTGAG 

TTCAGGCTTCAAACGCACAGCACTCAGACMGTGAGGCrrTGG 

TCAGTAGATGTCAGGGACAGTGAGAGCAGGGGCACCCTTTTTCTTTTGTT 
30 CAGGTTATTATAATTCATTGGGTGCCACATGGAAGAGCATAGAAGACAGGGCAGGCTCGTCA 

ATCCTACTTTGTACAAGTTGATTAACTATTCCTTGTCTGAAAACCCAAATCCAATGTATGTCAA 

ATCAAGTTTTGGAGCTTTGGGATTTGGTAAAGAGTATGCCAGCATCCT 

GAACATCTCAGATAAAAGTCATG TTACCTG AGTGTG ATTCTGT AAG AAAAACGT GAAGAACCACATG CAAAATG CACATGTGAAAG 
CCACAGGCCTCTCTACTCGGAGGTGGAGGTGCTGTCTGGTGGCT^ 

CCAAGTCCTCCCCTCAGTCTGTATCCCAAT^ 

CTTAAGGCTTAGCACACGGCCTTCTGATGTGGCTGCTGATGCTGTCATCrrACA 



ACCTACAATTCCAGCTGCAAGGGATCTCACAACCTCTC 
4 0 ATCAATAAAAAGCTTTAAACACT ATTGTAAGTTTATTT ATT CTTA CAT GCATGTATGT CTGCAC 

ACCACATGCATGCCTGGTGCCCACAGAGGCCAGAAGAGGGTGTTGGA^ 

GCO\AGTGCCAGGGATCAAAACTGGGTTGTTTGGAAGAGCAACAAAACTGCT(^GCTATCTCTT 

AAAAAGAAAAGAAAATCTCATGATGTCTTAAGTAAGGTTACGGTTITGTGTTAGCCATCCTTGCCTC 

AGGACGTACTGTAAGAATTG43GCTCCCAGCTCTATAATCTGAGTC 
4 5 GTCATCTTTCTTGAC ACGCACCTTCCCCTGCC^ CACAAGG CTTCTCCTTCTGAAGTCTC 

TTGTACAGCTGAGTGTCTCTCGAGCCGGTGGGACACAGGCGACACAGTGCATTT^ 

GCAGATGCAAGTTCTGCCGGGGTGGGATGGGCCCGAGGCCGGGGTGGGATG<3GCC 

GAGTGCAGAACTGAAGGATGTCCACTCCTATGCCTCACAGACTC^ 

TTGTGTGCTTTCCCAGCCTCCAAGCATTTCCTGGCGCCCGTTACAGGTG 
50 CCAT TTGTCCCGC CTTCTGGGAGCTTACAGTTGCAAGCT 

GGGGGGGTCCC^GGCATTGTGGCTGGTGGAGTAGACAGGGTCCAGAGAAGGGGAACACT 

CAGAAAGCGAGCGTTTGTCAGTGATTGACTAAAAAAGAGAAACTGAACAACCTAG^ 

CTGCCTGGCTCTOTTTCTGATTAGCK5TCTTTTTTCTTCGTTTCTTATTTTTATTCA 

TTGAGTCAGGGTCTCACTATGTAATTCTGGCTGTCCTGGAACTTGCCATGTAGACCAGGCTGGCTTCAAACT 
55 TGCCTCTGCCTCCTGAACTCTGGGGTTAAAGGTGTGCACCACTATAAGTGGCTCCTGATCAGCTTTCTATCCTGGGGCrrGTA 
CTTCCACATTTATCTCTGTCCCTGCCACCCACCACATCTGATGGAATCTGTCA 
CTACAAACTGTGCTCTGACCTCCACACATTCATGACCCCCCCCC^ 

AAAAGGTAGGGAAGTCAGACAACCTTTGTAAGACTCTGGGGAGCCTTAGCTTACCAGGGACCCCCTGAGTGAAACTCGTGGAGTCG 
GGATCGATGCAAAAACCAAGAGAAATTTATTCTTC 
60 CCAGTTCTGCTGCGTTTTATATGGTTTCCAGGGGCAGAATAGAGCATCAGCAACTAG^ 
CTTTAAACrrGATTGGTTTTTAGGGAATAGGT<3ACA 

CGACCTTCCCAAAATTCCTGAGTTGACCTTTTCACCTTCATTATCAAGCTTC 
GGCTTTACTTCCCAAGTTCTGAAGGTTAAGAGGTCAA 
65 ACTTTTCTACTGTATCCTC^CAGGCAGAGAGCCCTGTGTCACCTTTCTTTATGAGGTTACCTGACCCTGAA 
TGCCTCCTCTCCAAATAGTGTCATTAGGATTTAGGACTTGAATGTAGGGGAGCGACAAAGGGTTCG 
TGCCTGCGCAGQSTGATTGAGACTTGAACTCTC^G^ 
ATCCCAGTCAGCTTGCGTGACCTTGGGCCAGTCCCT^ 

CCATCGCCAAGGCCGGGAG^TGGAGCAAAACGCGGCGCTCG CACCCGGGGCGGTGCAGGTGGTGCAGGGCGCGGG CTGCGGTGGCC 
70 GAGGTGGGTGTCGGCGGCGGCGCTGCGGAGGCGGCTGCGGGCCGCGCGCGACCATCCGCGCGTGAACCCGGAAGAAGAGTGTCCCG 
GGGTCTGGCATGAGGACCGTGGGGAGGGGCTGCAGCCAGGTTCTGTGGTCA 

GAGAGAGAGGTGGGCTTGGAGTCAGGGCAGGGCTAGCCCTGGCTGGCACACAACCAGCTCCCCAGCCCCGCCTGTACCrrGCTG 
GCTGCCCAAGGGGCCCGGCTTCTTCCTCCTGGCAGC^ 

CACTCCCTCGGGCCCAGTTGGATTTGTGGAGTCCCCTTCCTGCCCACTTTGGCAGTTTGTCAGGTTCrrTGGGG 
75 TTTGGACGTAGTCTCCGGGCTGTGCTTCTCAGGCCTGCCTCCT 
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TCCAGTCGTCCCCAGGACCAGCCCGGGATAGAAATGGCATTTCCTC^ 

GGCTTCCTCTTCGGATGX3CAGGGAGGGTAACCAGACT 

CTCTTTCATCTTCTTCCTGTGCTGGTCCCGCCTCCCTTGCCTC 

CACACCAAGGCGGCTGTAGG^KaACATGGTGACAGTAGTGAAGA 

CTX^TGAGGCCCCTGGGTGGTGAC^ 

GATGCCAGCCTTCCTTXnTGGACAGTTCT 

TCTCAGGAATTC CCGCGCTCCACTGCT CCTACAGTGAACAGCCAG C CTGTTGGAG'lT'rrTITGCCCATCTG CAACT CTCTGCCTCC 
GGAGCAGCTCTCGGAGACTGGAGCCATCTTCC^TGGAAGTTG 



GGGTCATTGTGCCAGAGTCGCAGTGAGAGGCATCCACCroGGGCACACCAGGG 

G ATGACCCC^U^CC CAATTTTTTTCAAATTTGAGACAGAAG CCTTG ACATT ATAGCATGT C CTTCAG TC CTCCAGGGAGTAAGTAT 
CATGGTTTATTG CTrCTGATGGACCTGGGACTGCAGCTAG CTGGGCAGATCTAGGGTTGGTAGGCTTGAGCTACATGCACAGTATT 
TTCTACTGATCCTCAGTGGCCCCCAGTTGTTTC 

TTCTTCACATTCCTCCAGTCTGTGAGTCCTGCCAAGAGTACTTCGTGGATGA^ 

GACACGCCAGTGCCTGTGGGCZATCCCAGATOjGGCAGCCCTCACCATTCCTCA 

GAGCGACGTGCGGTGTATAAACGAGGTCATCCCCAAGGGCCACATCITrGGCCC 

CAGCTGGCrrCTTCTCATGGCTGGTGAGTGTCCCCTGTACCATTCA 

TCAGGTGATAATTTATGCGTGCACAAGTGCTTTTCTGTGCACATCT 

T CCAAAGG CCATGCTGAAGGG AC CCCAC CTCAG CAG CAAG CAGGTGAGTCAT CAC CAGGTAAT AGGT AAGG 



TCTTCTAAAATGGAAGACCTATAGCTCGGGGCTOCAAGGTCATAGAACATGACGTTCTG 

GACGGAAGGCCTTCCAAGACAGGCACGGTTCCCATTGTTGAG^ 

GGGGGAGCACAGCATACTTTGGGACATTATGTTTGTGG 

2 5 CTAATGG C CflTGAGTGTATCACTTT l"iT CTGGGC CTGTTTTGCGACAGAGGCT AAT AAGAG CTG CGGTGAAGATAACTCCAGGT 

GGCCT GG CAC CGTT C"rTGGGAATGTGATi~r Lrr CAGT CACTAAAGCATTT ATCATTG CTTG ATGTTACTGCTG T TGG CCTGG TTGAA 
GGCTCxAGTTCCrTCAGAATAAGAATTGGAGAATGAGGTG 

AGATGGAGAAGAAGTCGGCTGTGGTAGCAATGGAGATC^CAGAGAGGATGGCGGGGACTGTGC^ 
GGACTCAGGGAAGCTGGTGATGTGGCCAAGCAGAAATGTGGCTGAGTCAG^ 

3 0 TTGGGGAGAAGGTTCATGTCAGATGAGCATGGTGATGG 

GAGCCCTGAGGCTGGTGGGACAGCTAGGGTGCCAAAGTGAGGAGTAGGGGCTTTCGAA 
CACGGGGGAGAGATGGACAGTTGGTG^GGCAGTGGAGTGGTAAAGGAAAGC^ 
TGAGCATCTCCTTGGTGCCAGGCCCGGGGAGCAGGCCAGATCAGGTGTAAATCCTACC^ 
AGAATAATGAAAATGCACATAAAACCA'CGGGCAGG^ 

3 5 TGAACAGAGGACAGAAGAGGCOSACAGATGCTGAGCTGGGGTTGAAGGAAGAA 

ATAACTGAATCCATCCATCCATCCATCCATCCATCC^ 

CTATCTATCTATCTATCTTTCTATGATTTATTAGACTGACTGACTATCTATCTATCT 
CCTACCTACCrACATACCTACCrrATCTAC<3ATTTATTAGAACGACCTACAGAA 
GAATGGAAGGAAGGTCCCATG AATCAAGCAGTTGTTCATTC CZATGAGGCTGGATGTC^ 

4 0 TTGAAGTAG CAGG CTC CAATG CCAGTGAAGG AATGAACTTG ATAGCAAGGCIAAGAGCAAGCA^ C CT AC 

TTCCATATCCTTCTATAGGCTTCTGGCCCCAGATTAGAGGTGTGACTTCCC^ 

CCATCCCATAATTAGCCTCCAAGCAATGGCACCATTGCATACACTAG 

GTGAGACTAGGCCAGGX^CTAGCAAACACAGAAGTGGATACT^ 

C T AGAGAAAGTATCCAAGG AG CT AAAGAG AT C TGCAACC CT AT CGG AGGAACAACATT ATGAACTAAC CAGT ACCCCAG AG C T CTT 

4 5 GACTCTAGCTGCAT ATGTAT CAAAAG ATGAC CTAGT CGG C CAT CACTGGAAAGAGAGGCC CATTGG T CAG GCAAACGTT AT ATGC C 

CCAGTAGAGGGGAATGCCAGGGCCAAAAAATCAGAATGGGTTC 

GTAATTGAAGAAAATACGTAATAAAAAAAAAAAAGAAAAAAAAAGCAGAGAGAATATCCAAAGTA^ 
GGTGGTGGGGGGGAGGGGCACAACGACAGATATCAAGGAATTGTGATGACACACTTTATTCAAT 
AAGT GGAT CC C CACGT CAAGTT AAGCAAAAAAAAAAAAAAAAAAAAACAAAA 

5 0 AAAACAAAAAAAAACAAAATCCTTTGCAGGAGTCCCCCTCAG 

AAGAATAGCCATCACATCACCAAAGCACACATTGGTGAGGGGGTATC^ 
AAGGGGCACACAGCATATGTGAAC^GCAAACGTGGGGCTGGAGAAGGCCTGGAGGTGGGGA 

GCTTGGCTGTGTGTATATGTGTTCAGGGTCGTGATGTGTGTGTGTGTGTGTTTATGTGTGTGTCTAAT^ 

5 5 TGAGACAGGGTCT CTCT CTAAACCTG GAGCT CAC CAATT CAG CT AG ACTGG TTGACCAATAAACCT CCAGGGATC C C CGTAC CT C C 

CCAGCACTGGGATAGTATGTGGACGCGAC^ 

CAAACACTGT CCACTGAGT CATGTCCCCAGC C CCACATTAATGT CTTGAAAGG ACAACATTGGCTG CAGC CTTTAGAG CTGACTGG 
GGCAGAG ATGTGG CACTGG TT AGGAAGCCGGTAGT CAATGT AATTC CAAAC CTT CAATGGAGGTGG CGTGGGTGGAGACAGGGACA 
GAGAG ACAGAG AC TG AAGGATCGACTTGAGG ACGAGATGGGGGATGAAGG C CAGTCAG GATGGCATGGCTTT AGT AAGGGAC CCAA 

6 0 GTATCAAGGCCAGCTTAGGTCCTCCCTGCCTCCCACC 

CTGGGATTCTGGTGTTGGCCTTGCTGAATGGCGATGTCAGCAGACAGGTGGATGTACCGTCC 

GATGTGACTTATGTTCGTAGGTGGCTAGTGGAAGTGGATGGGG C CTAGCCTCAATGTCAAGGGACTTCAAAAGTTAACAGTCCAGA 
AGGAAG CATGGGAGAGAGCGGCACTGGGCGAAGGCAGGCCAGGAG AGCGAT CTG C CCACATAG CAAGAAGAGAGG TGTACAAAG GC 
TCTAGCCTCTCATGCTAATGGGGAGCCCAGGGTGACAG 

6 5 CAGAGGAACGGTAGGTCCAACCTGGGATCTTGAGCTCCTATGGTCTTTGGCCATGCTGAC^ 

TACCTAAAGACAGAGGCTGGGGGCCAGGTTTAAGTGGTTTGTACTGCCTAGAACTGCAGCCTACAACAAAGGCTTA 
CTTG CAGGAGAGT ACAG G AGCTGGG CAC CTG ATT AGCATAGAGGTGG C CTG T AGAACAACTTCTCAG AATTTGGCTAGAG CTCAGA 
AG TG AAAAAG AG AG ATAAGAGAGGTGG ATGGTGATT AG AG GGT AAG GG GAG CTGGTACTAGCAAGAAAG CATT CT T AGGG CCAGCA 
GG AAGAGG CTT TG G AAAGG AGCAGGAGG ATG AGGTG AG AACACCATCCATACATG CATTTGTAG CAC CT CTGTCTGGAAGGT TCTG 

7 0 CAGAAGCCGCTGAGTATGGAAAGGGGCGGCTCAGCACIATTT^ 

AATTAATTGCTCTTTGAAGATT AATAGTGTT CATTG C C AAGTTGACAAGATC TGG AGT CAC CTAGG AAACAAGTTT CCAGG AGATT 
GTTGAGCCrTGTGGGGAGAATTGTCTTGATAGGTTGATGGATGTGGGAAGACCC^^ 



75 



CTCCTGCCC^TGTGGCITCCCTGTTATGATAGACACTGGGGCTATGAGCCAAATCACCCTTTOT 
TTTTTAAAAGACCATTTTATTTTATTTATATGTATACACTGTCACTGTCTTCAGAC^ 
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AGATGGTTATG AG CAACCATGTGGTTGCTGAGATTTGAACTCAGGACCTCTAG CGCTGAG CCAT 

CT CTCCAGCCCCCACCCTTTCTTCITAAGTTGGGTTTGTCAGTATrr C CACAGTGTC C 

CC^TTCTTGTCCTCAGAGCTGGTGACTOTCTTTCGGTGGCAGAAG^ 
AGATGAT CTTAGGT1TTCTGGCTGG GTTCAAGATAATTACAAAATTCT^ 
5 AAAATGGTTTCIX3TTGACCTCCCAGAAATGGAAGGGACTGGGAAGTGTT 
CCCCATCCTTTCTCATGGTTGGCTTCAGTCTGCA^ 
CAGCCCTTTAGGAGCACCCTGTCCTCTCTGOTCTTCTCATCCA 
TCTTGATCAGAAGAGGTCTGACCTCCAGAATTGTAAAAGAA^ 
GAGAGTTAGTTCAGC1AATAGGAGACTTAGACT 
1 0 TTGCTCCCACTACATTGATCAAGGCAGTACCTTTTGCTGACTC.^ 
GTACTGCCACACCTGCCCAGCTTTCACAGAGTCT^ 
AATCTCTTTATTTTCCCCGTGTTTGTCCCTGACTTTG^ 

CTT AGACAATG AGT ATCTGT CACTGTCTAGTG ACAAGTAGTGAT AACTGATATTG CATAGT CAACTCTTGTGTGACACT ACCTACC 

TCAATGTCAGGAAACACAGCCAATCGCTATTTTCTGTAATGGGCAAGATG 
15 ' GCAAGGACAGATGTCCTTGAGATGGTGCTAAACCCAGAATCT^ 

CACAGTGCTCTGTGCAGGATCAGTTGTCCCATGTGACTGCAGGGGTGCCAGGA 

CACX3GTATGGTACTTCGCATCCTCATCCATTGTTTGATTTTTTCCCC 

GCAGGTCCTTCTAGGTTTGCTCAAGGTTCTTCCTGCTCCA^ 

TAGGACCTGATGATGCCCTTTGCTTCCCTGGGAGAGCCTACTCCTCACCCT 
2 0 CCGTGGGATCAGGTGCCCCTCCCAGGCAGAACCATAGGACCCTTAAGGGGGAGCACT 

TTC^GATCCTGGTTCTTCAACTGGCTGGAAGTACGGCTTTGAGTTTGTCATCT 

CAAGCACACAGCCTCTGGAAGACAGAAACCCAGGGAGCTAGA 

GGTCTGGGTTTGTGTGAACAGGCCTCATTTAACACATCTCCAGGG^CA 

2 5 AT ATTTT AT ATGTGTGTGTGTGTGTGCGTT TATGG CAATGAATACACAG AAGGAAAG AAAATC CACC C CACAGCAGT C CAACAGTT 

CT AGGAAG CATCAG AAATCACCCAATGCT AAG AGG CGG AG GCAGAGAGT CTGGCAGTTG C CG AG GGCT GAG TT ACAGGG AAGCAAA 
GCACCAAGTGTCTCTCATGTAAGACGACAGTCCTCGAGGTCACTGTAGAGCTCAGGGCCTGGAACAGTGTTGTGTCAGGGCTC 
AG^MCAGTGTGGATAATTCATGGAGTCACAGGAACAATGGTCTA 
GTGTGCACTTCTTCCCAGACTCCTCTAGGGCAGACTGTTACAAGTGAACTCCGAG 

3 0 ACAAACACAAATTTT ATCAACAGCAGTCTT AG AACAG T AAAAGGAAGT CCTCATGG CTT CATTGGAACGATGGTAATAGACT AAAC 

CT CTG AAACTGT AAG CAAGC CCCAATTAAATGGTTTTTTCATAAG ATTTGC C^ 

TGGGCCTTTGGG CTAGGAGAGCGTTTGG ATG CTTTACGTGGGG CTTAC TGGGC CACTGTG AGGGC CT AG ATC AAG AAGTTTTAT AG 
GGAAAGCATATTACCAAGTGTCCTAGAAACCTTTCCTGTAAGATGTTGGCAAAGATTGTAGCTGCTGTT 

3 5 CCGCCTAAGGTTAAGTTGAAGAGTTTTGGATTAATGGCACTGGCAGAGGAGAT^ 

AATCACTC TGGCAG ACCCATAAGGAGAAGG AG CAAACTGTACAACCAGAAATC CAAAAT ATGCAGTTTGA^ C 
AGGAAACGTGATACTGGAGCCAGGATCTGTGCTCAGGGGGAACAATGTGTAAAGAAA 

GGTGACC TG AGG ATAGGACCCCATC CAGCCAAGGTC CTAACGGATGAGAAGGAACCAGTGG CIAAGCGT CAACAGTGJU^GGGAAG CA 
TCCACAACAGAGCTGATACACTGCCATTCTGAGAAGGGCCAAGTGCT^ 

4 0 T AG TTCTG^ CTTTAG AGTTCAGG AT ACAAG AAAGGGGTTGTAG AAT CT CCT TTCATGG CTACG G AAAGTTGC CAAGGCCAGGCATA 

TGTGAGGGGTATCCCTACATGGAGGCCTAGAGAGACCATTGTC 

AAGGTGTTGGTTATGTCAGAGTCCTGGGAGACI^GCAGAGGAGAGCTGCTAACCTCTTA.GGTTGG^ 

GTTGAGG TGG ACAAAGOTG AAGGGATTT AG AG ATCTGAAG ACCATTT TG AC CTCIAG CCAAG^ CAG AATTTGGAGTT CATTC 
TGCTGAG CTGTGGTCCTGTTTTGGC CCAGTATTTCCCCCCCACCCCAACTGCTTC CTTGG CT C C CTTTTG CAATG ATAATGT ATGT 

4 5 TATATGCCATTGTATGCTGGAAGTAGGTGATCTGCTTTTGATTTTGGTTT^ 

AGAAGAGACTTTGTACTTTTAAAAGAGTGTTAAGACTGTGATAGACT 

TAAGATATGGCTATAAACCTATGGGGGCCCAGGGAGTGGAATGTGGTGGTTTGAATAAGAATGGCCCC^^ 
GCATAGTCACCAGGAAGTGGCACCATTTGAAAGGACAGGAGAGATTAGGAAGTGTG 

GTTTCAAAAGCCCTGCCAAGCCCAGCATGGCTCTCTCTGTCTGCCTATGGATTAGGATATAGCTCTCAGCT^ 

5 0 AGCATGCCACCATGCCTCCCACCTTGATGATGATGTGTTAAACCTCT 

AGTTG CCT AGATTGTGGTGTCTCTTCAAAGCAGTAGAACAGGGAC TAAGATTTCTAAGTCTTCACCATTTTGTCC CTAAAGCTTGT 

CACTCTAGCTGCTCCTCTGATGGCATGTACTGAACAAGCTTGAATAAAGAAAATCCT 

AGCACTGACTGCTCTTCCAGAGGACCTGAGTTCAATTCCCAGCAACCACATGGTGGOT 

CCTCTTCTGGTGTGTATGAAGACAGCAACAGTGTAGTC^TATAAATAAAATAAATCTTTAAAAAGAAAGAAAGA 

5 5 GCAGTAAAGCTAAGCCAGGAACGTCCCCCAGGAAGCCTTTTGGCCTGCGCCAGCCACCATAGA 

TCCGTGTCCTTCCCAGCACAGTTAGATCCCTCCGCTCCACTGAAGCTCTAAGGAACCA 

CT CGCATACACCACGCCT CAAC CTAG AATTAACAACAGC^S AATTTTC CTCAATGG CAGTCATAGTCATG ATGCT AATACCCACCAT 
TAGCCTTTTGCTATGCTCAAGGCTGCAAGGCTGCAAGGCTGCAAGGTGGTC 

GTCCTAAACTGTGGAGGTGTGACATAGGTGTGCAGTGTTCGTGTGAGGACGGAAOTCCCGGGTATACTAGG 

6 0 GAACTXK3TGGAGGAGGGGGCCTGAGCTGCAACACATATC 

CATTT ACTTT ATGTATATGAGTAT ACCGT AGCTGTCTT CATACAGAAGAG G GCAGTGG AT CCCTT ACAG ATTTTGTG ATTG CTGGG 

AATTGAACTCAAGACCTTAGGAAGAATAATCAGAGCT 

AGGTTTCrCACTGACCTGGGTGTGCAGATTGGCTAGCCrGGCTG^ 

AGGATTACAAGGTTATATCACGATGTCCAGCTCTTTTATTTGGGTTCTGAATGGGAGAGAA^ 

6 5 G AAGGAAGGAAGG AAGG AAGGAAG GAAGG AAG GAAGGAAGGAAAGAGG AAAGGGTGTG TG C GCCTGGGAATGGT AGAGGAGAAAAT 

GT AG ATAAAAGGG AG AG CAG AGG CAGGGACATCOVGGAGAGTC CAGAGTG GGT ATTGCCAG ACTGAGCTGGAT CATGAGGGGAGAG 
GG AAGAGG AG AGC CAGG T ACAACAG CT AGGAGATCCAAAAGAGGAG AGGTAAATT AAAATGG C TGGATT AT ACAAGGAAGGACAAC 
CCAGCCCACGAGCTGGAGAACTCATGGGAGAGOATGGGGTATGCCAGCCATACCCT 
GAGCCCAACTGCCAGGTTAGCATGGATATGTTAAGCA^ 

7 0 TTCACGCTTGAATGGCAAAC^TGATACCAATGAGGTCATCTTCCCAGCACCCTGT 

AGGCTCATACAGATAAGTGAGGAGCACCCCACTTCTCTCTCCTCCTCT^ 
TGGGTCCCCATAGCTCTGTGCTGGTGACCACCTGCTGGT^ 

ACACCTTGGCTCACAGCTCCTTTGCCCGTAGCTGTGACAGAATACGTGAGCCAGTCAAG 
TTTCAGTCCCnXBACTCTGACTGCTTTGCTTTGTGGTTGCTGTGAGGCAGTACATTATGCCAGGGAG 
7 5 CACCATGTGGCCCCTAGAAAGAGACTGGCCTTCCATAAGCACCTTGA^ 
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TACCTCATAAAGGTTCCATC&CTTTCCAA^ 
ATACACAGATGTAATGCATGCAAGCTGCACAGCCAT^ 

GACCGGAGGATCTG TGTGGCACTC CG CAGTGTGGTAAGGCAG TGAAG AGTCCAAAGTTCTGAGGGGTT IX^nTTOSGTTTTGGTTT 

TATGTTTG CrrG'riTGGTTGGTTTTTGG'riTitJT CTTCTTCTT CTTcrrri'TT ITTT TTCAAGACAGGGTTTCTCTGTATAACAGC 
5 CATGGCTGTCOTAGAACTCCCTCTGTAGACCAGGCTAGCTTTGATCTCACAG 

CTCTGAGTGGTACCTCTTAAAAATGTGAACTGTGTTGCCnTGTAAGGTAT 

TAGCAAATTAATTGTTGGCCCCAGTTGGTACAGGCTCT 

CX3TTGTGGGTGGGTTTTTGCCCTTGCTTTTCA 

TATACACTAAGAAAGGCAGTGGGGGTGCTACCGGGGATGGCA^ 
10 CTCTCGAGACTCGGAGCTAACCAACAC^CCATCAACAGAAATGTTCTTT 

G AGTAAT CCTG AGAGTCCAGGGCTGTGG CTCAAACCAGGAGCTGCAGAGGTGGCCTAGTCT CAGTCC 

GGTCCCCACAGCTCGTCAAGGAAATGCTTACAGCAGGATGTGTGGTCGCTTG 

TGTCTGTAACTAAATGCCCAGTAGAAAAGAGGAAAGGAAGAAATGCTGATTTTAGCTC^ 

TAGCAAGGAAGTGCTGCAGAGTGTATGGTCTTGAAAATC 
1 5 TG GGCC CCT AAAGT CTTCAAATATAGTGGCAG CAGCAGGGGACCAACCACACTGT CAAAG CACAAGCCTC CAGGG AACAGTCAAGA 

CTAAACTGTAATATTTTCTAGAGTTTTGCAAAAGATCTAG 

GGGTGGGGGAATCITCCGGCTCCAGGATGTGCTGAG 

TTATTTGGAATTTGTAATGTTTATTTATTTGGAGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTC 
AGGACAGCTTTCAGGAACTGGTTCrTTCCTATCTGCATCTGTGATCrGGGGA 
2 0 CT AT AC CCACTG AGTCAT CTTC CTGTCT CAACTAGCAATTTTGGGGG AG CAAAGGG AAAG AT CTG AGCTGGAATTG AC TGGTGTGC 
ATTTTGTCGGCGCTTGCGGTCAGCTACGTTATCCTCTTATAATTTCTGCGTATTTGAAATGCT^ 

CATTATACTGACTTTTTTCTTTACTATGACAAATGCCTGACATAA^ 
AGGTTCCGGTCC!ATAGTCTTTGGTTCCATTAATGCT 

2 5 TTCACCTCACAGTGGAC!AATCAGGCAGAGAGC!AAGGAAAG^ 

GTTGTTCTTGGGAGAGGAAAACGCATGTTTCAAAAATGGGGTGTTCTAA 
CAGACAC^GGAGCCTGATGTCGACAGTTCATGTTCAGAC 

CAAACCTGAC CAGAAATGAATCTCATCAG AAGCT CAATGG AT AAAACCTTG AGAT CACATGAGT CAC ATG GT CTGATT ACAATTG A 
C!AACAGTTTGTTCTGCCATTGTTGAGGAGAATGGCGGAATGCTAGCGCC^ 

3 0 TCCTCAGGGAACCT^GCCTCGATOTAAAAAGTTTCCACAATCAAGTGT 

CAAAGAGGAAGGATCATGTCACTGTGAAACCCAAGGAGACAAA 
GTGAAAAATATCAATTGAATGAAAGATACTCATTACAAAGGAAGC^ 

CAGG AGGATGTGTGGGAGG ATTG AAAGTT CAAGG CCAG C CT AGGTACAG AGTG AAGG GCTGC CC C CAAAATTTT ACAAAC CG AAAC 
GAAAACAAAACAT CAGAAAAGCAAG AG AAGTAAGAAAGG AAAAAACCCCGATGG ATG GAAGAG ACTGGAGTO 

3 5 TGGTGGCCTGAAAGAGAG CAGAGAAATGTTCCCTGAC^CGGCTCCGAACGAC^GAGGAAACTATGGAGAAGCAGCTAAAAGG CATG 

GGGGCrGAATGAGCCATTCTGTTTATTTAATAGAAATTCTACCGCJAGGAGAGTGGTGTGAAGGAGAGGT 

AAGCCGGG GATTT C CAGGAAGTAAAATGTGGACGTG CT CATT AAGGAGTGAAGAGAACTTT ATAAT CTTTGCTTCACAAATAATG C 
AAAATGATG CC AAGTAC ACAAATCG AAAGGCT AT ACACTGAT CAC CT ACAAAAG AT C CAT AGT AG ACTAACAGTAGGTTG CTTAT A 
ACTTGGGT C AGGAAAGAACTAGAGACAC CATT AGATAAG CTGACAC C CGTG CT CT CTG ATGAATATTCCTTT CTGAGTGT ATCTCT 

4 0 CTTCCTGCTTTAGACTTCAAGCTTAAAATCTATAT 

CCTAGAATTCCATGC^TCTCGATACTCAAAATGATAGTAGAGTATAGACATTTGGACACAACCCC^ 

GAGCATTTGAGATTTCAGCAGAGGATGAACCCTTTCAGGTCACTCGTGCCTGGGC^ 

AGCAGAGAAAGAGGCTGCCTTCAGCTTTTGTCACAACAGCCTGGCACCT 

CCTGATTCAGTCAGCACGGATGAGAGTGGCGTCCCTTOTCAGGTCGTCTCC^G 
45 ' GAC^GCGAAACTTAGACAGCGGCTGAGGTTGTGGTGGGGGCCATGTAGGAGGGCAGAGGGAAGAGGTGAGATGGGAGGATGGAGCT 

G^CTCCTGGGCCTCAGGTAGGGAGCCCAGGCACGTGCGAGCCCCTTCTCTAATGGCCC^ 

TTGTGGACAAGAACAAC CG CTTAC^AAGTCCATAGATGGGTCGGATGAGACCAAGGCCAACTG^ 

GCTCCAGTGAGCCTCACCA.GCCTCTGGGCTGGGCCCAGGAAGTATCTTGGGCCTCCAGGATCT 

AACTGCAGCTGACAGGGTTTGAAAGAAGTGTTCTGTTCCCCAGGCT^ 
50 GCTGCAGGGATCGGGCTGC^TGCCATGACTCCCAGTAGAATCC 

GTTTCTAACATCrrCCAGCTCCCTTCCCTATGGGGCTCAC!ATTAAACTTGACTGCTG 

TGGCCTAGGACTGTGTtSTCAGAAGCATCCTTAAAGTCTAGCCA 

CAATGAGTTGGCATCCATGCTCAGAGCTGGCCTGTGTG^ 

5 5 TAACTCCAGTACCACTTGGTGGCCTGGATCCATG^GCAACGATGGAGCATTTAGATTTGTTTAGAAGAAAGGAA 

CCAGAGACCCAGCTACATCATGAATCGAGGCCCTATCATGTATAGTTCT^ 

CAATGGG TAGGTG AGAG ATGGGGG CAG CAAGAG ATGCCT CAGG AT GT CTGAG CACATGAAG AGGACCTGGTTGG AAG TG GAAAGT C 
ATGAGGAGGTCCCCTGTGTCCTTCTCCGTGGCTTCACTGACACCT 
CTCATCCTCITTATTGAGAGTGCAGCTGTTTCCACTGCGGC^ 
60 GATGGTCTGCTATCCTAAAAGTGGCACCAGAAGTTGTACTTGACGCTCCTAGCTGCCGTCT^ 
CAGCTTGGCAGTGTCTCATCCCTTATGCTGGGCATCTCGTCA^ 
GCCCTCCTCTGCTCTCTGCACCTCTACAGGCATGCCTGCTATT^ 

CAGATGGAATAGGGAGGAGCCTTTGATATGTGCCT TCTGAGGGTCCAGG ACTTTGTTGAGGACAGCTGG AG CAAGTGGGCATGAAC 
CTTGCAGAAAGACTGCTGACGG ACAGCACAGCCCTGGGCTAGGCCTT CG CATGCGTCACCACAGT CAGGTCCAGGCI^GCTGAAAG 
S 5 AGTGAACCTGATGTATCCTGTGACACTGTTACTTC 

CCATTTCTGAGCTTTTGTTTGGTTGGGCAAAGGGTTTTCCCGCTGCCCCCTAGAAAATAGTGCOT 

TGTTTAGCACAGCAGGGCTGCTGGTGTAACTTGGCTGGGGATGGGTCCGGAGCATCACGATTCTGCAAGTTCT^ 

AATGTGGCCGCTCAGGCTGACAGCCGTTGTGCTGGTGGTCATGCCCCAGTGACAAACTGAGACCC^ 

CACATTCATGTCACCTGTGGCAAAATGAGG CCTGGGGTCCTGTGT CAAGCCACAGTTCTTTCTGATAATAT 
70 ACAAGGGACCAAGCAGAGCATGGCTCCTTATGACTGTGGGTGGGTTCCGTGGGTGGGAGACTC 

ATGACTCTGTCCTGCTACCCCTAGCCTGTCCTGACCCCAGATC 

CCAGGAGCTGGTGATTGTGTTTGTGTGTGGCTTTGCTTTT 

GGGCTTTGGGGGTGATGTTGTAGATCGACATCCTTATGAAATT^ 

TTGAGTGTTAACCTGTGCAAGCTCTGTGTTAGGTGCTCTGC^ 
75 CTTTCAGACGTTCTAGGTAAGCACAGCTTCATGCGTGGGTACT 
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CTACACTGTACAGTAATCTCCTCTCCTGACCCAGACTCTGA 
CTTATAGAGGCCCTGTGGTCCCCAGGGATAGTCTCAAGTCTGTCrGAGCTCT^ 

AGCTTTC CITTGACTGATTACACGCG CTTGG CTGAAAGGACTCAGAAGTAAGGGTGGAGG AAC CG AGAGCTG CTGGGGCTTCTGTC 

ATTCTGGAAGCATCCAGAGGTGTGGGTTGGGATCCGAGTTAGCCTCA 
5 CTATAGCAAAAGGCAGGTAGAACCGGAATGCTCTTTGAATTCT 

TTATCCTGGTGCCTTTGCTAGGGTAGCATGTCTCTTGGTGGATGCT 

ACXSCCATG AGGAAGGAG CCTGTGGTGGTGCCCCTGAGGGAG AAGAGATTGTT 

TGAGAGACACTGGAAGATGGGAGACAAGTCACAGGTGACAGAATAAGCTTTCT 

GCCACAGCCA.CTGCAGGCCACTGTAGGCTGTGTTTTAAACTCA 
1 0 TGTGAGAAGATGCCCAAAGAGG CTAGAGAGGGG CAACAGATCCATTG GAGTTTGAATGAAGGGCATTTGTGAATT AC CTGATGTGA 

TTTCTGAG ATCCAAATT CT AAT ACTCAG CAGAGAGTTG TAGTAAATG TC CTT AAC CCCTG AG CTG CTTCTCCAGT CC CTTAAGTGG 

TTAAAGCTCGTGTACGTAGATGGAACCCTCAGAAACTGGTCTGGACG 

CCTGGTGCTCAAGGCTGATGAGAGGTAGGGCAGCCTACCCTGCAGTCACTGGGCTCCTACT 

GAGGCATCGCTGTAGGAGGAGGGTGTGGTGGGGCCTTGGTGGCCCCAGATGTCC^ 
1 5 AGGAGGACCATCCTCATTAACGGTCATTTACCCAATTA^ 

GTAAGAGGAAGCCTAAATTCTCCAAGGAGGAGCTGGACATTCTGGTC^ 

GAGACCATGCGGCTGTCCCATGCCGACAGGGACAAGATTTGGGAAGGCATAGCTCGGAAAATC^ 

CTCCATCAAGG ACATT AAG CACAGGTGGG ATGAC^TGAAG CGCAGGACCAAGGACAAG CTGG CTTTCATGCGGCAGTCACTG TCGG 
GCCCAGGGATCGGGGGCCACACCTACCCCATCZATGCTCGCrGCCCACGAGAGAGCCATCGAGG 
2 0 GGGCATGGCTTCCCCAGGGCAGAGCTGGACAGAACCGACAGCC 

TCCTCGTGTGCACTTATTCCCATGCTAGCTCTCAGCCCTGCCTGTCTCCCTGACTCT 

CCTCCACTTCTTGCTCATCCACTGTGTGCCAGAAGCCCGCTTGGGAGCAGGAAGGACAGGGC^ 

TTGTCGGACTAAATAAGCCCCAGGTGGATCC^^ 

CATAACCTTGTTTAAAACAGTACAAGTGrriTGTTGGTTTGTTTGGTA 

2 5 TCAAAAGGTTAGACGTGCCITCTTGGATTAGATTATTGTTCACCTTAGAG 

ATTTCCAGCTTTGAGCTTGTGTGATCTCCAGGCCT 

G AAT GG AGGG CTG CAG CTGT AGAGAAGGC CTGAGGCTGGCTGAG GAAGATGTTTG AAGGG CG AGG AAGAG TTAAC CAGG C CTGGTG 
CGAGGCGCAGAGGAGGGGAGGGGAAGGKTGAGCAGGCTGGGAACAGCATGTAC^ 
AGGAACTTGGAGAAGCCAGGGAAGCCCCATCCTAGCCA.GCCGG CAGGGGGAGGGGCA 

3 0 CTGGGCCGGGGAGAGACCTTTCTCCGTX^CTCTGTCT 

GCTCCTGCTTCTTGGTGCIC^CCrGGGACTC^ 
GTGTTCACG&GGAGGGGGAGCCATCGGAGTCCATCCCTGAAGAA 
TCATCTGCTGTCCTO^TGGAGTCCTGTGAGCAGGGCAGGAGGATC 
TGGTTCACCAAGTCCGACrrCCrCCCTGGCCCTTTGCCGCG^ 

3 5 CACTCACAAAGTCAGCOTCGGGAGTAGAAAAGAGTGGTCATGATGAGGGTGAC^ 

CAGAAAGTGGAGGTGGCCCTGAGACATCAGAACCAGACTTGGGCCT^ 

TCTGGAAAAGCCTATTTCCTGAATCCTGCCCTGTTCACCCCTCCGTCTCCTAGAGATCGAAGA 
TGGACHGCAGGGGCTCCCAAATCTACTAACTAATC^^ 

TAATGGCCGTTCTGAGTGCTCCTTTCTACCCTTTCTGGCATCCCAAGAGGTTC 

4 0 AGGCTCCACTGCGCGAGCATCAGGAGACCCACAGATCAGTGTGGTC^ 

CACAGCCCACATCAGGGGGACAACAAGGCTGGTGGCTCGCCCCACATAATCCTGAG^ 
GCCATTTGGCATCTCCCTGTGGTGGTTTGTTAGGATTCCAGCA^ 

TGTGGAGACCAGTGCTCACAAG AGAG CAGCTTAGAACCTCTCACAGGGATGGGTAGCTCTCT CAGCTC TGCAATACACCGTAGGGT 
CTGGGAGCCCCACTAAGAGCCCAGCCGGGGCCCCGGGGGTTGAGCAGAGGAGTCATTATTGTTTCT 

4 5 TGGC^CAATCXSTGGTCZATGAGCCCI^AGGCTGGCATGGCT 

TGTAGCTGCCTCTGTGGGAAGAAGGGAACCAGCTCCCGCTCACCCTTCTACCGATGCTC 
CTGG CATTTCTCTGTGTGTGGTGCCAGTGTAGGCTGACGGGTGGC CAG CAGGGA 
CCCCCTCCACCCTTTTGTAATCACCTGTCCTTCTCTCCTTGTCT 
GCAGCCTCTCCGGATGCCCCTGCTTCGTCCTCCAGAG^ 
50 ACCCATCCAAGATGTTGGACCCTAAGCCAGAGX3TCCTGCCCCATCCCTCACTCCAGGCCCGCA 
TCGCCTACCCTGGGCCTGGACTGGCATCTCCTGCATGCCCATGCCCATCAGACTGAAATGT^ 
GACTGTGCACCGAGACATGGCCGGCAGC!ATGCATGTCATCAGCCAGGCCATG^ 

AGACGCTTATGGAGATCCGAGATGCAGTTCAGG CATCTCAGCGTGGGCCAGAAGGGGCAGCC CCCATGGACCACACTTTCCAGTCT 
CAGGCACCCCTGCTGGGCCCTACACCAGCACCCCCAGAACCAGCCCCAATACGGACT 

5 5 CTAACCTGGCCAGTTTGCC!TAAGGAGAAGCAATGCCGGTGGTGTCTGGGOT 

C^GATGACCGTGGGGCACAGTCGTCTCTTCACCTGGTGCAGTCCAGTCTCCTGT 
CCCCTTATCTCAGGAAGATACTCAAGACACAAGAAAAGCTG 

CCTGTATCCTGTATCCTATGGAGCAGGCCTTGCTGGGGCZAGGGGGCAGCATCCAACAGAA 
AGAGCTGACCrCCCAGGTCCTGATTCTGCCCTTTCTC^ 

GATGTGCTGGTGAGGGGAAAGAGATGATTTTGCATTCT^ 

AGG C CAT CT CAG GAAG CCAAGT CAGTCTTTACAATCACIGATTACTGGTCATTCCAGTCATTGT CATG CATCT CAGGGTT CAG AAA 

TGATACAGCCATGATAGTGGGGACCTGGCCTGCTTTCTTCAAGAGGCCAGATCTGGAGTACT 

TCACCTAAGCTCTGCCTCAGCTTCTACAGTTAGAAGAGGGAAC^ 

6 5 CTCACAGCCAATGAAGGGGAGGTTCTGGGCCCTGGCCTCA 

TAAACCAATGAGACTGTGTCCAGTCCAGCTCCCAGCATGCCTTGCCTCTGCTTGCTT 

T TGT AGCTTTGAGTT TTGTTT AGGGGGCCCAGT CAG AGAG CTGGG ATCTTTGAGTGAGGG AGGAGAAAAGAAG CCAAAGGCTTCAC 

TCTGACCTAGGTGCAGCTTTGACCTCGATCCCTGGCCTG^ 

AACAGAGTGGACCTGGAAGCCTACTGGACCCAGTGCTCACTGAGK3CT 

7 0 CTACATGAGATGATGGACGGATGG CACCTG CTGTGCCCCTGAG CCCTGGGCTCCATTGCCTCAATCTGTGGTTCTTTGGTGGTTCC 

TGCCCCATCCATCAGACTGCATATTGTCTTGTCTAGTAATAGCCTGGTG 

TTGGCCTTCTCAAATATTTTTTGTTAAGATGGCAATAAAATGA^ 

TTTTGGTGTGTATTCTCTTTCTTCCTATCCC^^ 

GGGGAAGAGGTCCCTGTGGACCTTGCATATTCTGGATTCCATAAACTCCCTAGGGGCGTG 
7 5 CGAAAGGCAGTGACTTGTAGGCCAAAGGCAGCTGAACATCCATGGATGTGAACT 
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AACCTTATGCTCCCCCAAAGTATTACAGTTTCCAGGGA 

AGGGGTTATC CTGGGATACTCCTCAGTAGTGG CT ACAT CTCCACTGAC CTCAGTAG CTGC CTGTGAAAGGCAAGGCC CTG C CTACC 

AAGACTCCTGAGAAGAACCAGAGTTCTTCCT 

GGCTAACCTCCTCTCATGGAGAGAGCTTCCGTGA^ 

ACCrTTGGCTTTGGAATCCAGATCATTCTCTTAA^ 

ATACAGAGATGCAGCTGGCCTGGGTCTCCTCTAATACCCAAGACCrGTGAGACTGGTTGGC 

GAGGCCACAXTAAAGGCTGTTGTACAAGAACCAAGCT 

GTGTCTTGGTGGCAGTGGGGACCAGGAAGCCAGAATATATTCAATATATT 

GCTCTTCCCAAAAGGAAGCCTTGAAAGGAAGC CCAT AG CATATGTGGG CAACTG CCTGG ATATCAG AGACCTAACC CGGTGACATC 



GGCTTGTGTCAGTCC CATTAAACAAAGTGGACG ACT CCCTGAG CCA'l u ri"r LTTT CTTGAGTCX3GAACCCAT AGTCATTTCCTGACC 
CAGCTAGAAAATATGGTGCACTTGATAAGACCCAGTGGGTGAGGCCTC 
GCTAGTGAGGAGACAGGACTCAAAACAATGTGTTAT 
CACCrTACTTATTTCTAGGATGCCTGACTCTGAACTGCCAGACTTGAA 
1 5 ACC CTG CTCTGTTTT CCCAACAAGGGAGTGTGAGCT^ CAG AC C C CCTG GTAGACAACTGGT CTTGACCACAAGGAGGAGA 

CAGTTCCTGGAAACCCTGGGGTCCTTGGCCACGTGACACAGGACAG 

CCAAG CCACTGAGTGTCG CCACTGT AAAGGGAGGG G ACCTCAT^ CAGT AAAC TT 
TCTCTTGAAAGTTCTGTCACATAAACATGTTAACT 

TCAAAAGCCAAGTTITAAAACAACCAAATGCAAGAT^ CTG ATG CTGG AAAT ACTTTGTG ATCTGTATCAA 
2 0 GACTGAGTGAGAGTGCTCTCTCTAAATGCTGG CTCTTGTTAGGTAGAACCACATTG CAGGGGGAGGGGGGACGAAGGGAAATGCCT 

GCAAACCATTTCAAATGGTTAGACCCACTCCAGACATTCCTGT^ 
CTTTTCTCTGGCATTTAATATTGAATGGCTGGCCAGAAAOTGAAAGG 

TGG TT C CC CAGAGAAC&CAAGAATGGGATTTCTACCTT C CAG ATGTGT CTG^ AACAGAAAAACTTGGAG CTACATTCATTCTGGGT 
CCT CCTGACCATT ACACTGGATGGTTTGGGACCAAT C CTAGAC TCT'I'GTTAAATTGGAAACTTG GAG CTGAGG CTGG TCCTGTG AG 

2 5 TGGAGCAATAAGGCTGAGCATGGTTGCTGCAGTCTTGGGCTCTGC^ 

AGC CT CATGAGATTTTCACAG CT CATGGGGTT AGGGGGCTCTCTGATG CT CATG T AAGATACAGGTGGGGATG CAAGCGTGCTGAG 

GCTAAGGCCCTGGGCAAGACGGCTGACCCAGGGCTGTGAGCACCATTGATCTTGTCCAGATTGGA 

CACTGCTTGGGTCTTTCATCCCTC^GCTTACAATC 

AGGTGCCAGCCTCCTCTGGGGTAGTCAAGTTGATTCTTCAGGACCTGTCCAG^ 

3 0 AGGTCTAGCX^TCTCCAGTGTAAGTTCTTGGAGCTGTCAGCCCCCCATTOT 

TTCTCTTTGCCT CTTGAGTGAG AGAAG CAG CACAAACATTATTGAGG AAACC TGG AAGG TACAGACAATTC CAG AGACGATC CACC 
CAAAGTGATXX3C CGT C C!TGGTT AAAACACGGG TGTGTTGACTT CAAGCAG TTCTG CTGGG AGACACATTGTTAGAG CACAGT ATAC 
TTGGCTTTC^CTrCCAACCTGTCTCACTCTGCATC^ 

TCATTTGAAGGAAACTGGGTGAGGGGTGTGGATTCATTGGTGTCTTGGGCATTGC^ 

3 5 CTGTGAGTTGTAGCTCAGGGAATGATAGGCC!ACAAATT^ 

TTTCTGrrGTTGCCGATCTTCTGCTGTCTCTCCTTTATCTAT^ 

AAACTG AG CT AT CATCAGTCACT CTTAGAGG ATCGGT T AATT CTGGT AAG CCT AGCAAAAT ATG GAAACAG CCCAAAGC TT CC C C C 
TGCCCTAACTGAGGTTAAAGGTCAGTGAAAGTGG AGCTGG AGCTTAAGAGCCTG CTCTTTCATAGGACC CAGGTTTG AGTCCCAGC 
ACTCACATGGTAGTTCAC^CCATCTGTAACTCCACTTTCAGGAGATCTAATGGCCTCT 

4 0 TGGTACA.CAGACATAAATGCGGGAAAAACACCCCTACACACAAA^ 

AGAAGGGATAGCCACCTGCTATTGAGTCTGGGGTT^GACTGTGACTCTCAAGGT^ 

AAAC^GGTTCCTAGGGTCCAGAGAATGTGTTAACATGATACTGAATAGAGATGTAAGTTAGGAGGACC^ 
CTCTGGATGAGACAG<lA.GCTCTTCriTATGTGCCCACATACTTTAAG 
CCATGGTTCTCACCTGTCCCCTCCCCACCAGGTACGTGGTCATTTCC^ 
45 AGCGAGOTCATCTACTTCCGGGCATGCCGAGACATCCGGCCAGGAGAGC^ 

CCTGC^TAGCATGTCCCAGGAAACCATCCACCGCAACTTAGCCCGGGGTGAGTATGGCCA 
GCCAGGGACATGCCCATAATGCTAAGTCCACAAGTCTAGGTCAGTACTGATTGTGAAGTGACTCGCCC^ 
CTACAGGACCAAGACCACGCCCACGGGGCTAAGGCCATGTCAACCA 
GCACCTCTGGGAACCTTCCAAGCAAACCAATGAGGAATGCAGTATCGAGCAGTGCAA^ 

5 0 GTAAGC CAGT GG CCAC CCCGAGGGAATTCT CTAGT GT G C AGTC CTG AG AG ATCAG CAAAT CCCAAC CAAATAAAT CAGTTGAATAA 

CACCAATCTTGTTACCCCAACACTAACTGATGAAAAGACACTTCTCACTC^ 
GTTGCCTTCACTTTGCCATGGGATCAAGGTAATGAGAACTAACCTGCCAAGATGTTCTC 
CCGTTGATGCAATCTCTACCTCGACTGTCACTAAGTCCTTGGCTC 
ATTGTACAATTGTCCAAACIt3AGCTTTACAGAA^ 

5 5 AACTTGAGGCATCTGAAGTGAGGATCTAAGTTTTATCCAGACAAAGCCAGCCTTC 

TG ATACACAG CATGAAGGAATGGTTTCT AC CGG AG GAAGG CTAGTACAGGAG AAG ATGGG CTCAGG CTGGGAG AACATAG TGGG TT 

GGCTGTCCATCCATGTGCCACTG^AATGGCTCATTTAAAAGGAACTTGGCTTACTCCCTGCCT 

ATCCTCAGCCTGGCCAGGGTACACAGGTGCACCCAGGTCCTTTTTTCCAAGAGAG 

ACCCAGAGTCAATTTGGTCCTCGGAGGCAC^CCATGCCCAACGTCTTCTCCGCCGTCATCTTAACTAACCTCT^ 

6 0 CTTGTTTAGAAGATCAGCGGAGGAGATGGGAGCAGATGACTCACTGAGGTGAAGATC 

GGGTGCCCAAAAGCTGTCTGACTTAGTCAAAAGCATCrGGCAGTTGCCAGCGAG^ 
CTCCTTTGGAGCTGAGCTTGGGCCATGCTGGTTTC^^ 

CTGAAATTCCAGAAGGGTTCCTGTGGGCCTCGGAACCTCCAGAAATGTTTGGTGAACCTCA 
ACGAAAACAGCCAGGGGGTTTCCAGAAGAATCACATTACCCAA^ 
6 5 TCATCTCTTTCCCACCATCTTGGATTCTTCTGGAATATGTCTTCCTGTCCT 

GACCTGAGACCCAGGGACATCTCAAAGAGAGGAGAGAGCrTTGGGGAAATGAAGGCT 

AGC CTTGG AAGATACAG G ATTAAGGTAT CTATC TG CCCGTGACCAGGCT GG ATTGT AGG CTT CAG G AAG AAACCAG CAAC CACAGT 
GGACCCA.GCAGGAGCTGTGATGGAAGTGGTCCTTGAGAGTGA 

CAGAGTTCTAGACCTTTAAATGTGTCTTGGATGACCTATCAGCCC TG CTTAGAAAGGTCCCCATTG CCCAGTCACTTAAGGTATTA 
70 CAGCTATATTAGCTTTGTGTGTGTATGTGTGTGCIGTGTGTGTAC^ 
TGCGTGTGTGTACATGTGTGTGTACATGTGTTTGT^^ 
TCATGTGTCIATTCCTCAGTAATCGCTTTGCTTTTGAGACAACTCTCA.CT 

AGTGC ACCTCAGGGGTCTACCTGAACCCACCGT CCCAGCACTGGGATTGCAAGTG CCTGCCACAACACAGAGCTTTTTTTTTTTTT 
TTTTTT'TTTTTTTTTTGGTTGTTTTTAAATGTGAATTCTGGAGAT 
75 AGCTGAGCTATCTGCCAGTCCCCCCGCGCCCCCTTTGTGAGACACAGTT'TCATCTATCCCGGGTTGGCCTT 
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CTGAGGCTGGCTTTGAACTCCAGAGCCTCCGTCTCTACCrCCCAA 
GAGAGAGGAGCATCTTCATGGTCAGAGGCTGCTTAGCCAG^ 
GAGTGTTGTGTGGAGCCCAGAGCTCCTGGAGAGGATATTCrTTA 
AGGTGATCTCTGTGATTTGTGTGGGGATTATAAATT^ 
5 TCATGGTGACCTTTATTAAAGGTTGTCCGTGGACCAGGGATTT^ 
GAAAACATGGGTGACITTATCnTCCATTACAGAGACAA 
' CTGGGGAGCCCAAGACXJAGGCAATAGGGCACCCT 

ATACCTTGCATTACATTTTTAAAATTGAATTTGATTTATGTAG 
CATGTATGCCAGCTGCCCTTTGAGGTATCTTCTC 
1 0 GCTGGGAACTGAACTCAGGTCTTTTGGAAGAGCATCCAGAGCTTC 

taaagggctccatctok:ccccgagctttctgcagttctgctggc^ 

t ccgggatgaggttgttg cttaagcag cx3cctgtgaggacagaat agggacttg agtggctt aaac cctgct ctgt 
gggagcctagaggcgtgagggaagcagctgggggaggccaagc^tggagcctcctgacc^ 

ACAGAAGCCTGGACACTIX3AACAGAGGCTCAAGGTTGCAGC^ 
1 5 CACAGGTTCTG CTGGCAGAACLT 1' LTGAAGG CTGGGAAAGRACTCTCCTC^GCATGTCACCCCCTGGG CATCC CAGCACTTC TGT A 

GCAGTCTGTG CTGGCCT CGCT AAAGGATCT AAAAT AGCTT ACTATTGGTGAT CT C ACCAACATTTCAAG GT AAT TAAAAATG AAGG 
AAAGAG AGGCATCTTTTGACAGGTGGGGAAATGG CTGTAT CTGAG AATACTTT GGGTTTCGCAACTACCT TG C CCTAATCTTGAGG 
GAGGCTGATACTCTGGTATGTTTAGGAATCTTTCCCAAGGC^ 

G ACTTTGAC CCATCTTTCTTTGTGT ATTTCTTAT CTTT AGACTGCTG GGAGAAT AGAGAGGCAGCAGCTGT AGG GTGGCTGTTTCC 
2 0 CTTGGCTCTCTGGGCCCATGAGCATGGGAGGAAGGCAGGGCCTGTGCTTCCCCCGTG 
TAGTCTCTAGGTCTGAAGGATAGCTTGTCCCATGGCGGGCAGACAGACACTCAGCTC^ 
GAGATGCTGTCTCCTGGGGTCATAGTATGGCAAGCCCCATCCCCAT^ 
CCAACrCACAAGTGTAGCATTTATCCCATGTGAGGCCTTTCTGTAAAGGTCCT 
CATTTGGGTCTCACCTGACCCAGATGGTCAGATCCAGGTGGATGTGTCAGG 

2 5 CAATTTCTCTGTTTGGGGCTTTCCCCCACATCTTCAC^CC 

GAGCAAG CCCTGGAAAACCCAGAAGACCTAAGGGGTCCCACTCAGTTC CCTGTGTTGAAGCAGGGC AGAAGTCCT CAG 

CTTTGATGAGGGAGAGATA(^CCCTCAGGGAAAGAAGAAGAAAAT^ 

CTGGTAACGTGGAAGCCCGO^GCTGGCCCTGAGCACC^ 

AGAGCTGCCCTGACGCAAGGCATCTGCAGGACCCCGGGGGAAGGGGACTGGAAGGTCCCTCAAAGGGTGGCCA^ 

3 0 ACTGGAAGATGAAGAGGAGGAGCCTACATCATTCAAGGCT 

CTACC^CCTCCTTTTGTCCTAACTGCATTCGCCTGAAAAAGAAAGTGCGGGAATTGCAGGCGGAG 

AAGCTGCCTGAGCCCTCCCITCrGCCACCACAGGTGCTTGAGCTCCCA 

GTTGCTGTTAAAGGGAAGAGTGTGCAGTGCCACTCGTGCX»C^ 

CTGCCAGGAGACCAGAGGAATGCAACTCAGCAGAGGCAGCCCCCTCTTGGGGTGGG^ 

3 5 TGGCTTGAAGATCGGGGGAGGGAGGGCATTGCATGTGCCAACCrrGTATATCCTCCTT 

GGCTGTGTGCTGTGTTCTTGACCTGGTGTTTTCTCTTGGGGGCGG 
CAGTCTCTGCTGGGAGCTTTTACAGCTCCCCAAACT 

CATGGTGCTCTGTTCCCAGCCTGGAGTTCTAGAAGGATGAGGAACCAGAGAACCTGGGGCCGACAGOT 
GCTGGCTGCCrrGGATCrCTTCAAGTTGTTGGACCAAGGAAGAACTGTGGTTCGAT^ 

4 0 TTCCCTCAGGTTTCCCTTCTCAGCCACCCTCAGAATGGGTTGGGGTGGACGTAAAA^ 

CGCAGTTAAGAGCCACATGCTCACCCTGACCATAGCGGTGGGGATTCCAGGGC 
CAAACATTCTGCCTCTTCTAAAACATTCCACCCCAGAAGCACCCACAAATCAA^ 

GTGAGTAACGTCCAAGAACACCATGGGCAACTCATGGGGAGGGAACCTCACTGTGGAGAGTAGAGTTACCCTT 
GCTGGGCTTCCATCTCTGTGGGGCCCGCGCTCCCTCAGAGGCCCTTGCTGAGTCTGTGCTGT^ 
45 TCCCCTCTTATTTTCCAGCCTCAGAGAGTATGGTCTCCGGCCCCGCCATCA 

GAATCTGTCTCCAACGACGTGATGACGGCAACCGATGAACCCTCCAAGATGTCCTCTGCCACAGGACGCCGCAT 
GCAGGAGTGGCTGAAAAAGTTCTGGTTCCTACGGTACTCC CCG ACC CTG AATGAGATGTGGTGTCACGTGTGCCGCCAGTACACGG 
TTCAGTCCTCACGCACTTCCGCTTTCATCATCGGCT 
CACAAGAAATGCCTGCAGCTGTACAAACTGCGCATGCAC^ 

5 0 CACCGCCTACCACCTGGCGCTGGAGGGGAGGCCCTACCTCGACTTCCGGC^ 

TGGTGGACCAGTATATGAATGAGGGGGACTGCCAGATCCTTATCCATCACATCGCCCGGGCACTGAGAGAAGAC 
ATCCGTCAGTCGCCCTGCCTCAGCATCATCCTGGATGGGCAGAGTGAC^^ 

TACCAGCAGTGATGGACCCCCAGCCACJIGAATTCCTATCCCTGCAGGAGCTAGGCTTCTCCAGCA 
TCGACCGGGCCTTTGCTGCTCTGGGC^TCCGCTTACAGGACGAGAAGCCAACTGTGGGCCTGGGAGTGGATGG 

5 5 GCCAGCTTGCGCGCCAGCATGTTCATGACCATCCGCAAGACGCTTCCGTGGCTGCTC 

C CTGG AGAT C C TGGATGCCATCAGTGGGAAAGAGCTG CCCTG C CT AG AG GAGC TGGAG AACAACTTG AAACAGCT C CTCAGT TTCT 
ACCGCTACTCACCACGCCTCATGTGTGAGCTGCGCTCCACGGCCTCCACCCTGTGCGAGGAGACCGAATTC 
GCCGTGAGGTGGATCATTGGTGAGCAGAATGTCTTGAACGCCCTTATCAAGGACTACCTGGAAGTGGTGGCTCACCTGA 
CAGCAGCCAGACAGAGCGGGCTGACGCCTCAGCCATAGCCCTGGCCCTGCTGCAGTTCCTC^ 

6 0 TCTACTTCCTCCTGGACX3TGATCGCTCTGCT 

GACAAGATCGAGGAGGGAATCCAGGAGATCAGCCGGCTTGCGGACT 

AGAGAGTTTCAATGGGATCX5CCGTGAAGAACCTCAGGGTGGCAGAAGCCAAGTTC 

CACAGGTCATTCTGGCTCAGAGGTTTGACTCCCGAAGCCG 

AGGAACAGCGAGGAGCTCCTGAGCTrTGGCAAGGAGGAC^TGGTTCAGATCTTCGACCACCTGGAA 

6 5 GGATGTGTG CCGCG AAGGGACGGACCCCCGGGGGAGCCTG CTGATGGAGTGGAGAGACCTCAAGGCTG ATTACTACACCAAAAATG 

GCTTCAAAGACCTCCTCAGCGA(^TCTGCAAGTACAAGCAGAGG 
CCCACCTCCAC^CCTGCTGCGAGAAAGGCCGGAGCGCOT 

GCTCAGTGACCTGTTGACAATTGCTGTGAACGGACCGCCCATCGCCAACTTTGATGCCAAGCGAGCCCTGGACAGCTGGTTT 
AGAAGTCTGGCAATAGCTACACACTGTCAGCCGAGGTCCTCAG 

7 0 GACCATGG CTCTGAGTT CTACCCAGACATGT AGGGG G C CTGAG CT CCAG AG GCCACAAAG C CGGTGAAT AT TT T TTTAATCTATAC 

TCATAAGCTTTGATATATTATATAAATATATAT^ 

ACTTGG<IAGCTGCAGCCTCCACGGCAACTCATCCT 

AGCGGCATAAGCTAAATGATGATTCTCTTGTCACTTTTAGGTGGCTTCTTCTGTTTCTGTTCTCA 
7 5 GTTGGGTGCCTGCCITCTCCTTTTTTGGTATGG 
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TCCTGGT C (^GTT ACATT ACATCTGGGTTTCCATGAGGGTGAATTGGATTTCTGGACTAGACTAT CTGGTCTTTCT CCCCTCAC CT 
GACTTCAGCTTGGGAGATCTTTCTTTGTTCATTTT^ 

TCCCTCCGCTGTGAGCCAGCTGCGTTCCCCrrCCCACCCCCCCCCATGTCGCCGCT 
TCAGTT CCCTTCTTCAGGCCT CTGGAGTTTAAAG AT ATGAG CTGT^ 
5 TTGGGAGTGGAGGGAGGGTAAGCTTTATGCCTCCCCCTTTC 
TGTTGGGGAGAQATGAGCCACGAGAXCTAAACTCCCAA 
TATTCAACTTTCACATTGTTTTCATCTTrA 
TTTTGTAGTTCAAGGGCAGAGCAATCAACGAAAG 

1 0 ATTTTGAGT AGAGTTGAGATGTC CT ATAGAATAGGAG AAAACAG GAAATGAGGTGGGATTCTCAAGGT ACAAG CATGTATACAT AC 
CTACACACACAC^CACACACACACACACACACACAC^^ 
CACATGCACATACATGGACACACATC 
ACACACACACACCACATGCACATACATGCACAC^ 

CC CC CATAGCGGGG AAG AG AGTGGTTTAAACTAGAGGAGTCTGAC CAATGTTCATT T ATCT 
15 CAGAAATGTGCATGAGCTGTTAAAAACTTCTCTCITAATTTCCACA 

AGGGTTGTCrTGGGGAAGGGAGGAAGGTCTGAAGGACCrrGCCClATGGTCA^ 

ATTGGGGGTTCAACCACAAGGAAAGGCCCCCGTACCCTCAACCTCCCT^ 

GTCTGTCTCACCTCTGAGGCCTAACAGAAATGTCATTAACACTGAACACGT^ 

TC CCTCAGAG C CT CC CTTTGCCT CCATGGT CTTGGTG CT AAG ATAAATTTAG AAT CATTG CTGCTAG CCAAT AGCTTTT CATG ATA 
2 0 T AAAT ATTAT ATATATAT ATTATATATT ATTTTTG AAATGTTTTTGT TTTCAACAGTGATG TAG CATGTT AAAACAAGAAACAAAA 
CTGAACAAACGAAAATCTGGGTCTTTCCAACTGTCCC!ACTGCCAGGTCTC!TTAATGOT 
CCTGGTGACTTTCACCCCFCATCTCACTCTGGTCCCCATCTTAAC^ 
GGAAAACTC!AAAGCCCCTTGCAGAGGCCTGGGAATCCCCCAGC^ 
GCCCCTCCTACCCCXX5CCCAAGCCTAGCTGGAGCCACACGGGCAGCTGTTG 

2 5 T CAGGAC CCCATCT CCTCACAG ACACAG ACTTTGGT CTGGG ACAATTTTATTGAAAATTGAAT AGG CCG AGGTGGGTTCAG AGTAG 

GGATGGGTGGCAAGGCTGGTCTCTCATGGGCAGCCTGTAGGCATAAACCGTGGCCTGGGCCOT 
ACATCTGTCCTTCCCAGTGCATGGGGCTCCCTCACCCAGGAAGGGACCTCTCCCAGATG 
ACTGACCGTCCCCTCCCAAGGAGCTATAGCTGAGTGAACTCAG<3TCCTC^ 
TACCTGAGTCTCCTCAAGATACCCAGGATCAGTGGACCTGTCTTGAGGCCT^ 

3 0 TCAGAGTCACCCATTTCCTTGGTCCTCTGTCTCCTCAACCCCA 

GAAGCTGAGCTCTACACGTGTCTTCTGCTCAGTCTCCGAAACTC 

GCACCATGGGCCACTTTGCCrrCXSGGATATTGAATGACTAAGG 

GGAGGGTCAGGGCTTCTGCCCAGTGAGTTCCATTTGTC 

3 5 GGTAGGGCTTAGAGTGTATATGCAGTCATTTCTGATGGGACAGGGACTGTAAITC^ 

ATAGAATGAAATGTATCTGAGGGGCTGAGACTTATTCTCTCTGGGCT^ 
AGTGAGGCAGTGTTGAAGACTGAGGCTTGTGGCCATTGTGCCAAATGGGTACCT 
CCCCIAGCAAAGGAAAACAATGACCACZTGAGACAGGTGGACC^ 
GAGATGAGAGTGCGGGGATCAGAGGTGCCAGAGCTATTGTT^ 
40 CACTGGGGAAGGAGAGTGCACrCAGCTCCGGGACTCTGTGTAGTTAAGTTCT 

TGCCCTCTGCTCTGGCTGTGTGTTCCGAGCCTGCGTTGTTTTGCTGAGGTTTCT 
GAGCAAGGACCTTGGAGTGACTTGAAGCACCrTTTAAATGCCTTTCATTTGTGTGAAG 
GGGTGGGTGTGGCCGAAGAGAGGAAGAAATGGACTTTAAGATGTTACCCAGCCT 
(^GGTGATGGGGGCC^GGAGATGCCGGATGGGTTGAAGTCTTCTAGAACACrrCCTTTGTTTTTTC^ 

4 5 ATCCCAACCTGTTTTCAGTGGATAGTCTGGGTTAAATTCTAACCCGGATTTGTC 

CATTTGGCTTGCAGGCCCC!AGGTTGACTGGAGCTCATGTGT 

ACGGAAGCAGGGAGAACAGGGCCTCTGGAGTGAGGAAAGGAGGGCAGAGCCGACAGTTAG^ 
CCTGGATTCCAGCCrGAOSGTTCCTCrcTTCrTTGA 
TGCTCTTTTCTACTCCTCTGTTAGTCAGGGTCACTTTAAGAGTCITTG 
50 TCTCTGGAGAGAAAAATAGTCTGGCCCTCCTCTCCC^TCTAGCACTC^ 
AAATACCTATGCAATGCAGTCTCCCCAGGAGGCTGTGCATGGAGGC^ 
CAATTGTATATACGGACATATACAGTACGTATAC^TGCAGAGTAAGAGAGTAAATCA 
ATATTTATACATTTTTATGTTTTAAATAGATATAAAAATAGTCT 

GCTTGGGGGGLAGGAGACAGGGCrCTGGAGAGGAGGTGGGGATAATGCGCATC^GAATGGGTTG 

5 5 TTTCTTGGGGTGTTGGAGAGGAGGGTGGCTGAGCCTGGGCTCTCTCTCCATGC^ 

ATTGG AAGGAGATAGCCCTGAGCTGCATGCAACTGCAGT ACCCTACCTC CCAG CACCACCAGCCTGCACTTAGCTGCTCACTG CTG 
CTTAGCAGTAGGCHSGTTGGACTrGGTCCCTGACC^ 

GTGAGCAGCATTGATGGC!AACGGGAAAGGGGCACCCGGGACAAACCGTGGTCCAGAACGATGTC^ 



60 AAC^CTGTCTCTTAGAGCACTGGGAGCTGGGGAGGGCGG CGTG CACTTCAGGTG C1ACTTGATGCCTTTCCCCATCCAAAGGCCCCA 

CCTGGTGGAAGAGATATATCTG CACACTG ACATTTTCAATGGCAACTCAAATTCAGGAGCAGGGC CTTGG CCCACCCGACTCTGCA 
CAGCTGCAGCAGCTGCTGGCTGTACTC1AGGACAATGTTGTCCCCT 
CAGCAGGTTTAGGTTTAGTTCACAAACACTGTTCTCATTCT 

TCAGCTTGTCCCCAGGCAACaCCTCCCCCCTCAAGCrGGGTAGGGATGTCrGTGGTCCCTGG 
65 ATCCTCTATTCCCTTCTCTCrCCTGGAAGTTGACAGGGTCTGATGACAGA 

GCTCCTCAGAC^CACTC^TTTCCCATACCGAGCCTC 

TCCIAGCAGACCAGTCCCATACCACAGTTGAGACCAGGTCTC 

GGCTGAACACCTCCTAGTGCCTGGCTCAGTAGAGTTCAGGACCAATGGCCCAAGC 

CC!AGGGGATAGGCATTCCCAGTAGCAATCCCACAATGCACTGTAC 
7 0 GTCTCCATGCAGAGGAGGCAACGGCTC^CAATACT 

CTTTGGCCTCTCIGGAAGAACCAGTAATCTTTTTCAGCCOTGATCTCTCT 

ATGTCAATCAC AAAGAAAAAAAAAAGTGGGG GTGGGGGGGG ACTCCT AG G AGC CACATACATATTAATATGTT AATACT 

AAATAGAGCTATGCTGGTTTATAATCTGTATGCAGCGTCTTTCCT 
7 5 AAGCATTTAAATTGCCACCCCAAGCTTGGCTCCTTGCTTCTGTGAC^ 
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GTGTGGGTTTGGAAGGATCACAGGTCCTCTTTAGCTGGTCTCT 
TGGCAGGAAGCCCCTGAGCATGTCAG^TGGGCCAAAGAAG 
TGCAAGAATCGCTGTGGGGAAGCATCCAGGGCCTCTGCAAAGCCAATGGCT 
CATGAGCGGGCCrGATCCCC^CTTAGAAAAGTC^ 
5 CACATTTTTTAAGGJVAGGTATTTGACAGGAATATAGGC^ 

GAGATGCTGAGGCCTCTGCTCTCAGTCCCrATGCCCCrAACTTC 
TCTTATCATTAGACATCTCTCGTrC 
TGGAGtfU^GTGTGAACAAAAGTCTTCTCAATCCAGATA^ 
TGAGCCAATGAGCTTATTGGGGCTACTTGGAGAAA^ 
1 0 TTACCAAAAG CC CATT CCAAGGTGAG TAGT CTCATGAAAACCAGAACCCTGGC CATCTCTTGGTAGCTTG GATCIAACTCAACAAGT 

AGGACAGCCTACTCTTATAATATAACATTGGAG 

AG TTGTTTATTTCCAGAGTCrrTAAGAAG CCTCTCITTAGAACCTCTCAAGTCCT AATGAGACT CTATTCAGGACG CATTGTTT CAG 
TTGCTACACAGTACTCTACCC CTTC CTG CCAG CTCTTTCTG CTCTCCTTC CATTGT 

15 MOUSE SEQUENCE - mRNA 

TCCCCAGGACAGAATGACTGAGAACATGAAGGAGTC 

CAGAGGTCTGCTCACCACTCCGGGACCAGGAGTATGGCCAGCCCTGCTCTCGGAGACTGGAGC 
AAGAAACTGAAGGGAAAGCGTGACCTCATTGTGACCAAAAGCTTCCAACAAGTGG^ 

CTT CGTGG ATGAATGCCC GAACCAC GGT CC C C CCGTGTTTG TGT CTGACACGCCAGTGCCTGTGGG<1ATC CCAG ATCGGG CAG C C C 
2 0 TCACCATTCCTCAGGGCATGGAGGTGGTAAAGGATGCTGGCGGGGAGAGCGACG 
C!ACATCTTTGGCCCCTATGAGGGGCAGATCTCTACCCAAGACAAGT 
CCGCTACAAGTCCATAGATGGGTCGGATGAGACCAAGGCCAACTG^ 
ACCTCCTGGCGTTCCAGCACAGCGAGCGCATCTACTTCCGGGCATGCCGAGA 
AGCGAGGACTACATGAAGCGCCTGCATAGCATGTCCCAGGJUACCATCCA^ 

2 5 AGAGAAGGCTGAGCAAGCCCTGGAAAACCCAGAAGACCTAAGGGGTCCCACT 

ACAAGCGCAG CT1TGATGAGGGAG ACAT ACACC CTCAGGCAAAG AAG AA CAAGG ATGTGCT AGAGG C C 

TCACTGGAGTCTGGTAACGTGGAAGCCCGCCAGCTGGCCCTGAGCACCT^ 

TGACTATGGTAGAGCTGCCCTGACGO\AGGCATCTGC^ 

AGCTGGGCCCACTGGAAGATGAAGAGGAGGAGCCTACATCATT 

3 0 CATGAGCrGCCTACCACCTCCTTTTGTCCrAACTGCATTCGCCTGAAA 



MOUSE SEQUENCE - CODING 

ATGACTGAGAAC^TGAAGGAGTGCTTGGCCCACACCAAGGCGGCTGTAGGGGA 

3 5 ACCACTCCGGGACCAGGAGTATGGCCAG CC CTGCTCTCGGAGACTG GAG C CATCTTCCATGGAAGTTG AACCCAAGAAACTG AAGG 

CCTGTGGGCATCCCAGATCGGG CAGCCCTCACCATTCCTCA 
GGGC^TGGAGGTGGTAAAGGATGCTGGCGGGGAGAGCGAOTTGCGGTGTATAAACGAGGTCAT 
CCTATGAGGGGCAGATCTCTACCCAAGACAAGTCAGCTGGCTTCTTCTCA 

4 0 ATAGATGGGTCGGATGAGACCAAGG CCAACTGG ATGAGGTACGTGGTCATTTCCCGGGAG^ 

CCAGCAC^GCGAGCGCATCTACTTCCGGGCATGCCGAGACATCCGGCCAGGAGAGCGGCTGC^ 
TGAAGCGCCTGCATAGCATGTCCCAGGAAACCATCCACCGCAACTTAGCCCGGGGAGAGAAGA 
CAAGCCCTGtoAAAACCCAGAAGACCTAAGGGGTCCCACTCAGTTCCCT 
TGATGAGGGAGACATACACCCTCAGGCAAAGAAGAAGAAAATAGATCTCATCITCAAGG 
45 GTAACGTGGAAGCCCGCCAGCTGGCCCTGAGCACCTCCCTGGTCATCCGGA 
GCTGCCCTGACGCAAGGGATCTGCAGGACCCCGGGGGAAGGGGACT 
GGAAGATGAAGAGGAGGAGCCTACATCATTCAAGGCTGACAGCCCTG 

CCACCTCCTTTTGTCCTAACTGCATTCGCCTGAAAAAGAAAGTGCGGGAATTGCAGGCGGAG 
CTGCCTGAGCCCTCCCTTCTGCCACCACAGGTGCTTGAGCTCCCAGAGTTCTCAGATCCTGCAGGTAAGTTCCT 

5 0 GCTGTTAAAGGGAAGAGTGGATTCTGCAGATGAATCTGTCTCCAAC 

CTGCCACAGK3ACGCCGCATCCGGCGCTTCAAGCAGGAGTGGCTGAAAAAGTTCTGGTTCCT 
ATGTGGTGTCACGTGTGCCGCCAGTACACGGTTCAGTCCTCACGCACTTCCGCTTTCATCATCGGCTCCAAG 
CACCATCAAGCTGCACAGCCAGAGCAACCTGCACAAGAAA 
AGATGTGCCGCAACATGACCCTGCTCTTCAACACCGCCTACC^ 

5 5 GCAGAGCTGCTGCGGAAGTGCGAGCTCAAGGTGGTGGACCAGTATATGAATGAGGGGGACT 

CCGGGCACTGAGAGAAGACCTGGTGGAGCGC7VTCCGTC!AGT(^ 
TGGCTGACACAGTGGCCGTCTATGTGCAGTATACCAGC^ 

TTCTCCAGCACAGAAAGCT ATCTCCAAGCACTCGACCGGGCCTTTG CTGCTCTGGGCATCCGCTTACAGGACGAGAAG CCAACTGT 
GGGCCTCGGAGTGGATGGGGCCAACATCACCGCCAGCTTGCGrcCC^ 

6 0 TGTGCTTGCCCTTCATGGTGGACCGGCCCCACCTGGA 

GAG^^CAACTTGAAACAGCTC CTCAGTTTCTACCG CTACTCACCACGCCTCATGTGTGAGCTGCGCTC CACGGCCTC CACCCTGTG 

ACCTGGAAGTGGTGGCTCACCTGAAGGATGTCAGCAGCCAGACACAG 
TTCCTCATGGACTACCAATCCATGAAGCTCATCTACTTCC^ 

6 5 GGGCGAGTACCTCCTGGTGTCACAGGTGGATGACARG^ 

AGTACCTGCAGGAATTCGAGGAGAATTTCCGAGAGAGTTTCAATGGGATCGCCGTGAAGAACCT 
CAGTCCATCAGAGAGAAGATCTGCCAGAAGACACAGG 
CTGCCAGGTGTTCGACTTAGCGGCCTGGCCCAGGAACA^ 
ACCACCIGGAAGCCATTCCTGCCTTCTCCCGGGATGTGTGC^ 

7 0 GACCTCAAGGOTGATTACIACACCAAAAATGGCTTCAAAGA.CCTCCT 

GAACAAGATCATTCAGGTCCTTAAAGTTCTCCCCACCTCCACAGCCTGCTGCGAGAAAGGC 

AAAACCACCG CTCCCGCCTGACCCTGGAGCAGCTCAGTGACCTGTTGACS^TTGCTGTGAACGGACCG CCCATCGCCAACTTTGAT 
GCOUVGCGAGCCCTGGACAGCTGGTTTGAGGAGAAGTCTC^ 

CTTGGAGCAGAAGCCCATGCTGCATGTGGTGGACCATGGCTCTGAGTTCTACCCAGACATGTAG 

75 
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HUMAN SEQUENCE - GENOMIC ' 

AACCCTGTTGCAGACAGGCCCAGGCA^^ 

ACCTCAACCGTGTCAGTTAaVGTTTCroTCTCTGAGATGG^ 

CCAAC7UVTTCAATTCAATTCAATTCTGTAACTATCT AG AG CTGGTGCAGACCCCACAAGATAAGAG CTCAGTT ACACIAAGACTGCC 
5 CTCAATTCAGACACTGGTCATAAGTCCCAGGGGACTTGTA 
GTGTGATAATTCACTTGAATAACTCACAAAGCTCAGG 
AACGGCC^ATGGAAGAGCTGTGCAGGGCAAGGTGTAGAGGAAGGGACCTC 
CAGCACCITGCTGTGTTC!ACTAATCC^G 

1 0 ATCTGGGGGGTCCCACCCTCAGTCATCTCATTAGCATAA^ 
CCTATAACCCAGGAAATTCCAAGGTTTTAGGAGCTCTGT^ 
CCTATCCATGGGCATCCATGGATTCAACCAGCCA^ 

AGTGATTT ACATAG CATT AACATTGTATAAGGCATTATAAGT AATCTAGAGATGATT TAAAGT ATATGGGAGGATATCCAAAGGTT 

ATGTGCATATACTATGCTATTTTATATCCIAGG^ 
15 CAGATACCAGGGTGCAAATGAATGTGTGTTTCTTGTCCTACC^^ 

CTTCCTGCTGGGGTCCTCTGCAGGACATGGCTCCCATTTC 

CTTCC CTTCCCTTTGATTTGGTCAAAATAAGTCAAATGTCT 

GATCCAGTCATTGGGGAGGGAGGTGGCATTGCTAGA 

TGTCTCTGAATCTCACTTAGTTGGTCCTATAAGGGAGGTGTGGC^ 
2 0 CGGATACCATAGAAGCCACCAGCAGGATCCACTGTGiAGAAGCT^^ 

TAATCCCAGCAOTTTAGGAGGCTGJUjGCAGGTGAATCACCTG^ 

GTCTCTACTAAAAATCCAAAAATTAGCTGGGCGTGGTGGCGCA 

GCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCGAGATC^CGCCAC^ 

AAAAAAAAAAAAAAAAAAAAGTTCTGCGGGAGGGACCGCCTTGGGAGAATGTGT 

2 5 CTTAGTGGCGACTGCCCTGTGGGTAAAAGCTCAAAACCTCATTTTCC^^ 

CCTGCGGCTGCAAAATCAGGGACTGGACAGGGTTAGAGCT 
TTTCTTTATGTTTATTAACATATATATATTCAGAAAAGTGC^ 
ACCCATGGGATGAGCACCCAGATGAAGAAATAGAACATGGCCAGAATCrTAG 
AGAATAACCACAAACTTCACCrCTAACACCATAGTTGTGGCTGGTTGGTT^ 

3 0 ATGAGTCTGTACTCTTTGGGGGTCTCCTTTCTTTTCT 

TCTCATTTCTGCATAGTATTCTATCCACTGAATTTGGTGAA 

AACT AGGAAGATG AATT AATT ATTACCTAATGCATG CAGTCTTTTT AG TAGACATG AT CT TC CAAAATGGG AAT C CTAAACAAAAA 
TAAAAACAGGTATGCTTCGGGTAGAGATATGGGGGTC 

3 5 TATGCATCAAGCGCCACTTGTGTGCCTGATACTCAGTCTCTAATCTGGATTCCT 

ATCT AATGACAAGTGGGTAAAG G TCAT TGGAATTATCTTCTTT AG CAATTTC T AAACTAAATG ATT ATG AAGAGGTC CTCTAAAG A 
ATATTTAAGTGACTATTTTT CAGAAG CAAAAAGAAGAAAACAGAAAT ACACAACCCTC CT CCTGAACTAAC 

AAAATATAT CCT CC CAG ACC CATGTGTACAGAT CTG CATCATTATGT ATAGACTTTAAGTGGAATT ATTTTG T ACACCTGTGT AGT 
GTGGGAAATTAAAAGTGTTTTAATGGAATTATTTTGTATCTATCTTTT 

4 C GTTTTAGGTTCACAGCAAAACTGAGCAGAAACTAGAGT^ 

ATCCCACACCAAAGTGGAAC^TTTGTTACAATTGATCAAC 
GGTTCACTCATGGTGTTGTACCTTCTGTGGGTTTTGACCA 

TTTACTGCCTAAAAATCCTCTATATTCCACCTATTTGTTCCTCTCTCCCCGCAACCCCTGCCAACCTCT 
CGTGATTTTCCCTTTTCCAGAATGTCATATAGTTGGAATCATACAGTATGTACCCTTT^ 

4 5 TGCATT AT CATTTAG CAAT ATGTTG AACGTGTGT TTTCATGT CAATAAATATT C C TCTACAGCATTGTT AATTG CTGTCT ATTATT 

TCATTGTATGCTAATATTTCATTGTGATATAGCATAG CAATTTCCCC CATTTTGGAGGTTT AAATT CTTT CCT ATTTTTTGCT TTT 
AAAAAT AAT ATTG CAGAGG ACAG CTTTGT AGGTAAATCTTTGCACX3CAATTT AAATACAT CCTTAG GATC CATT CCTGGATGTGGG 
CTTGCTCCTACAAAGG^TTTACTTACTTTGAGGATTTTGATACATGTC 
CCAACCAGTGGGATATGAAGTACTAATTTCCCCTACACTCTTGCC 
50 TTAGAGGAGAAATGJVTATTTATTTGTTTTAATTTGTAT^ 
ATCTATTGGTCTTTGCAGATTGCTTGTTCATGTTT^ 
TTTATAAAAAGTATTTATTAGGGGTATTCACTGCTTGTAAAG^ 

CATGCCTGTAATCCCAGCACGGATCCCAGGGCAGATCACCTGAGGTCAGGAGTTCGA^ 

GTCTCT ACTAAAAAAATACAAAAAATTAGGCAGGTGTGGTGG CAGGT ACOT C TGAG ACAG GAGA 

5 5 ATCGCTTGAACCCAGGAGGCAGAAGTTGCAATGAGCCGAGATCGTC^ 

TAAAAAAACAG AAATTT CACCACCATGTGCACATCTG ATGTCTG TTATACTTTTAAATATTAAT AAAATGAATG ATAATATTT AT A 
GGGTACTTACGATAGGTCAGGCATTATGCTGTGTTAAACAGC^ 

AAACCAGGTAGGCGTGGTTAATGCAGTTTCTCAAGGTTACACAGTTTGTGAGTGGCTGTGCTGGTGTTAATTC 
ATGGTGACATAAGGTTTCTATGAATTCAATAACACTTTTAAATACATTT'ATCT 
60 TCTGTCTTATACATGT AAGTTT AAGG CATAT ATTTATGTTCTCT AT AGGCATG CTTG CTG AG AATATAAATTCAATAACT CC CT AT 
AATAATTCCAAGC C CTTG CC CACCTCTCACATGGTATACTG T TAAGGACTATGAGTTGG ACCTTTT ATTTTACAGG AAAGAG AAAC 
TGAGGG CAAGGCAG AAACTTTATCAGGG TCAC CTAAAAAACTGACAGCACAGCCAG AACTGAGACC CAG GG CTTGGG ATT CCCAGC 
CCAAGTCAAGGTCAGCATGTGX5CTTGGGCC^IGCACCCTAACCT 
AGGTACCAGAGTAAACCITCTAGATTCTCATGGTGCCAGCCT^ 

6 5 CTGGCTCCTGC^CATTCCAGCATCCTGGCACCCTGCCAAGGGTGAAAACAGAGCTGCAGACT 

TTTTGGGTTTGCAGCTCACTGCTGGGTTTGCAACAAGCCTAATCTCrG 
TGCC^GC^CTTGGAATCCTCC^TGAAGCCrc^ 
CAGAAAAGGCAACIAGGGGCriTACCTGGGCCAGGCCTTAG^ 
CCCTCTCTCTCTCCCC^GTGGAAGTT<^TTTGGTCCTGCCAC^^ 

7 0 ACAAAAGTCTG TCT ATTTCAAAGTAGAGATGT C T ATGGGCCACTTG AGAGTTGGAATG CACATTAGTCC TT TGAGG CTTG ATCCTG 

GTTCT CCGGGCAGC CAG CTCCAAGAGT AG AAAGAAATT AT ATGTG CAT AT AGT CAAT AAATAGGTGTCAGATAATAAGGACTCGGT 
AACTCATTATGTTTCCCITTATGCTrTGGCT 

TTTGTTTTATAAAACATTGCAAATCTTTTGTGTGGTGAAGGTGGGATGTGAGAGATAGGACATAAATGCAAATC 
7 5 CTCTACTTTGAGATAGAAAGTAATACTTTGCCCACA^ 
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CGGTCCTTCTCTTTCAATCTCCCTTCCCCT 

cctttattctcttactattgcctatcctccctccaaaccccttc^ 
acctcctcctggcaagtatcagtatttccattttgcaaaagaagacactaaggccctaga 

actgattatcctc cccaacittatcgtcagaatccggctctgagtctcatgagtctaattc catcct c c cctagtg ctccatcct 

accccccmtgcacctctgcacccttca^ 

agagctccatagggaatcaaacccccaccctgtttttcctgctc 



TCAGC CAAG CAAGGTTCTCAGGGTGCAAAATTGAAAG AGGTTCTCACTCT CAGG CCTCAGAGT 
G AAAGCCTCCTAGTACCTGG CITGTCTITCCCTAGTCTTGGCCCTGGTGC CAGG CACTGTTACAGGTG CTGGCAGTTCAATGGTGG 
1 0 CCTGCACCCTGTCCTCGGGGAGCTTACAGTCTATTAAGGGAAACAGATATTAATC 

GG ATAGGAGTTCC CCAGGTGTT CTG CAGGCTAGGACTCAGCAAGTTCACAGGCC CCATGGCAGGATGGAGAATGG CATACAGGTGG 
GGCTTGTGGAGGGCAGAGGGGCCAGAGCAAGGGAGCCCCATGCAAAGCAGTCCTCAGCA^ 
CCAAATG ACTGACTT AAAAAAAATAAAT CCCAGCCAATGGAGAG AATT TG AAGC 
15 CCTGATCAGCATTCTTTAGTGGGATGTGCMAGACTGGCCCCTCCACACTCGTCTGTGTC 
GAATCTATTATTAGCCACATTTTCTAGATGAA 

CCATGACAAAATACCGTAGATGGGGGTGCTTAAACAACAGCCATTTATTTCTCACGGTTTTAGA 
ATGCCAGCCTATTGGTTCCTGGTG^TGGCACTCCGCCTTGTGGATGGCTGCCTTCTGGATGTGTC 
GTGTGTCTTCCTCTCCrrTATAAGGGCACCTAACCCTTATGACCTCAATTAATCTTGAT 
2 0 CAGATACTCTGGGGCTGAGGGCTTCAACATATGAAGGGGAGTGGGGTCCATAACACCTGCCCA 
ATTTGAACTCAGGCACrGCCTGTCCAGTGCCCAGGCACCTAACCACT 
ATCCCAGTGAGAGTTCCTCAGGGCAGCAGGTGTTCAGGGATCCITCACAAATCCACAG 
TCTGAGCTGGGACCTGAACCCACTTAAGGGTGATCCCAG 
AACTTGAAACACITGAATCTGGGGGTTCAGCC!AGTTTCCAATT^ 

2 5 TCGCTTGACATCTTCCAGGCCCCAATTC 

CAGCGGGGAGTCCCTGGCATAACGCAGCCAGGAGGCAGTAGAGTGGCAGGCGCCCCAGGCGGTGC^ 
AGCGGGTGTCCGGTGTCCGCACGGGTGGCAGCGCAGGCC 
CCCGGCCTCCGCGGCGCTGTCGCCCAGCCCTGGC^ 
raCCGCCTGACCCGGAGCGACCCCCCGCGGCGCCCTCT^ 

3 0 CCCTCCCGGCCGCCAGCCAACTGCGTGCCCCCAGGGCCGGCAAGAAGCCACATGCCCCCGCAAGGAATGCCGGGCCT 

GGGGCGGGCGCTGGGCAGGGCCGGGCGGGCCGGGCCGGGCAGGGAGCTCCCCGCCGGGAAGGCTGGCGAGAGGGGGAGAGCCCGCC 
CGCGCCCGCCCCGGCCGCAGCCTACACCGGCCCXSA 

CCCTTCCCCACCGCGCGCCCCCGGCCCTGCTGGCTCGGAGGAGGGGGCAGGCCGTC^CGTTTCCCCGCACCCCTCC 

CGCCGGCTTTGGGGCCCCGGTGGGGAGCAGGGGGCTGATTAGCCGAGGAGCAGGGGTGTGGC 
35 ACACACGCCGCTCGCCCAGTCCCAGGGCCCAGCGCCGTCCCM 

CTTGGCATGAGGACCCCGGGGTGCCGAGGGAGGGGCTGAAGCCCGGCGCGGTGGTGGAGGGCGGTAAGCGGCGGGCTGGGGAGTGT 

CCCCTTGAGAGAGTCGGGTGGGGCTGGAGCCCTGGCTGGCCCTCGCCAGCGCCCCC^ 

AAAGGGGCCTGGCTGCGCTCCCGGCAGTCCCAGGGTGAGGGGCTGCC 

CCTTGGCCCAAGTTGGGCTTGAGTTGTGAACCCTC 
40 CCTTTGCCCTTTGGT CTAGGGGT CCAGGCTATGCTTCCCAGCGCGGC CTCCTGGCCTGAGCTGCTCTAGGCTCCTGACCCCATGGG 

GCTGGTGGCTCCA.GTCGTCCCCAGGACCAGCCCGGGATAGAAACGGCATTTCCTCT 

TGTTCTCAAGTGCTT CCTCTGGGGATGGCAGGGAGGGTAACCAGACT CCGCGAGACCCCCACTTGCCCTCTCCGCCX3CTCACTCCT 
CTTCCCCTGTTTCTTTGATCCTCTTCCTGTGCTGGTCCCACCT^ 

TGGCCCAGACCAATGCAGCCGTGGGGGATATGGTGACX3GTGGTGAAGACGGAGGTCTGCTCAC 
45 CAGCCCTGGTGAGGCCCCTGTGTGGGAACTGGAGAGGGAGAAGTGGGAGTGGGAGGTGCCTGGGCT 

GAAACGTTCTCACTCCTTGCCCACCTTCCCTAACCCTGCTCCCGG CAGCTCCTGGGCGCAGGCTCTAGAAAAATC CAAG CAGTGGC 
CTAGGGAGGTCTGTGCTCGCATCTCAATCCTCCAGTTTTTCCACCACCTCCTGCACTGAACAGCA^ 
CCACCCCACATCTTCTCGGTCTCCTGGCCAGGTCCCCAGCTTTTGTCCCCACTGGGCTCT 
TTTGCGGGCCTCTGCCCTGGCCAGCrCTAGGAGACCGGACrCCTCGGCCATC 

5 0 ACCTCATCGTGCCCAAAAGCTTCCAGCAAGTGGACTTCTGGTGTAAGTGGAGCT 

ATCGCCCCATTCCTGGCTAGGGAATTCACAACAATGCTACTAAGATAGGTCCTCACCTG 
TGGTTCGTCATCATCATTAGCCATCCAAATGGCT 

ATCCCACGCCTCGAGTCCTCTGGCCCAAAGGGGTACAGTTGCACTTGGACTCCAAAGCCATC 
TTGAGACAAAGGTAGCTGCTGGCCTCCrTGACGTAGGCAGGCCTGTTCTTGGAGCCCCCAGGGAGAGTATGGGT 
55 TGACCCTGGGGCCrGCAGCTGGCTGTCCATGAGTGGGCCTCCTGTCAGCCCCTCTACCTCCCCCATGGGGGTCTTACCTCCCTGCA 
ACAACTGAGGCCCTCCCTTCTCTTTCCATCCOTCCAGTCTGTGAGTCCTGCC^ 

CCCCCCGGTGTTTGTGTCTGACACACCGGTGCCCGTGGGCATCCCAGACCGGGCGGCGCTCACCATCCCACAGGGCATG^ 

T(^AGGACACTAGTGGAGAGAGTGACGTGCGATGTGTAAACGA 

ATCTCCACCCAGGACAAATCAGCTGGCTTCTTCTC^ 

6 0 ATGGAAGGAAACCACCAGGGGGAGCC^TACTGGCCCAGCTTGGCCCCAGAACTTTTCrrACCATCGCTTCTGGAC 

AGCAGATCAGGAAATCACCAAGCAGAACTCAGGGCAATTTCCCCATGAATC 

TGAACCAACCGATCTTCTAGAGTGGGTGACTCGGAGCTGGCCTTTGGGGATGCAGAGCATGACTTCCCACTTCC^ 
TCAGGCTCTGGGCTAGGTGCTGGGGGTAAACCCTTCAACAAGACAGACATGGT^ 

6 5 AG ACAGG CATTGAACAAGCAG GTACACATGTG ATATCAGTTGCAACACAG GG TACTATCAAT AG ATGGG TTG CAAAGATGAGGAAG 

AGGGGTTTTCAAGGGTATGAGGAGXSAAGGGTTACCAAAGCAACTT^ 

GTGGGGT AATAGAGAAAAGCATT CT AGAGGGT AAG AAGATCCTTATT CACTAC1AAAAACACTTCTTG AGTATCTCCTTTG TG CCAG 
GCACTGGGCTGGGCCTGGAAATACAGTGGTGGGCCAGATAAGGCAATCAGATAAATC 

ATAGAATTAATCAAAGACTAAGACAAATCCATGTCAAATAATAACTCTGAGGGTGATGGAGAAACAGTGAGACT 

7 0 ACAACCAGGGGAGGTAATATGGGTAGAGGTCAGAGAAGCCTTCCTGCT 

TGAAGATTGGAGAGG TGCAT CCAGATG CCTTGGACAG CAAGAGCAAAGGC CCTGTGGCAG G AG AG CATG GTAC ATCTGAG C TACTG 
GATGAAGGTCAGTGTGCCTGGAGTGCAGCACACACAGGGGCCTGGAGATGATGGAGGCTTGATTTTC 

CCTGGAGAACTGATTGAGCTCAGGGGTGCTGGTCAAGAATGGGTACCCACACCCCTTTCCCCAGGTAGAGCGTGTGGGAGTTCTGT 
7 5 AGTCGGCAATGGTGGAAGACTGGGCTAGGGTTGGGGCAAAGACTTGGAGCCAAACGAACTGCTTTGTGAACTGTATCAGAGAGGAG 
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AG CTCTG ACTT AGACAT AGACT AACTTGGCCTAAGG AAGGG AG CAGGAGGTATCAAG GACATCTGCAGT CTT CCCAACAGGTTGTTAA 
TGGTGTCrTTCTCTGAGAAATGGAGGAGGCTGGAAGGGGACT 
TTGGAGTAACAACAAGATATCCAAGTGGTGATGTCAAGTAGGCAGTTGGATA 
TGTTGCTTAGTGAGTCACCTGTGCATGGGTGGTCAG 
5 AGGTGGAAAGGCCTGTGAGCAAGCTTTGAGGGACTTTAGC^ 

AAGGTTTGCAGCCAGAGAGGTAGGAGGGAAACCAGGAGAGTGGGTGACCT 
ATGGCTGTCAGTGCXZAAGTGCTGCTGAGAAACCAGCCAG^ 
TGACTTTAG CAAGAGCTATTGTAGAAAGCAGTGGGTCTGAACCT 
CAATGTCTGGAGACCrTTTGGTTGTCACAGCT 
1 0 ACAT TTTACAATGGACAG AATAGTC CCTCATAACAAAGAGTTAT^ CTGAT 

TAGCATGGAGGTGACCAGGCATAGT^TTTCCTAGAAGTTC 
TGGGAGGGTGTGTGGTCAAGAGAGTTTGTTCATTAGAAAGAC^ 
TTGAGTGTAGAGTAGAGATAAGGCATGATTCCTAACCTTTT^ 

GGTGGAT CAGCATCAITT CTGTCCCT ACAGGGTTAGC CCAGAGG AT T AGAGCAGATAG AGATAGTTAGGTAT AGT AGGC CACATAA 
1 5 TGTCCCCCGAAAAATGTCCCTGTCCTAATCCCT 

TTAAGTTAAGGATCTTGAGATGGGAGCGATGATCTTGGATTACCTGGGTGGG 

GGTGGGAGGCTCAGAGTCAGAGAGAAAGGGATTGGAGGATG CTG CT CTG CTGGCCTT CG AGAT AGAGTAAGGAGCCGGGAGTCAAG 
GGATG CAGG CAGCTT CTGGAAGC CAGAAAAGTCAAAGAAACAC^ C CTGC CAACA 

C C TTG ATTGTAGGACTTCTGAC CTCTAGAACT AT AAGGTAATAC^ CAGTTT ATTG 
2 0 G CAATTTATTACAGG AG CAACAG GGAACTCATATAGGAGG CT TTT ATGTTTGTGGGGGGACAT CTACT AAAAT ATG CAGGGAGGTG 

GATTGAGGTGGCCTGAAATCGAGGAGATCAGAGAAGGTGAGTTCTGCT^ 
GGGGAGCTGTCCAGAGCACTCTGGTGTGTGTGGCTCTGCAGTAGAGAGATG 
TGCAGCTGAAGCCCGGAGAAGGGAGGGCATTTCCTACAGAAAAGGTGCITGG 
GTTTGAGGAATCCTTGGAGGGAGAGGAGTTGAGGAGGGATGGAGCAGTAGGAGGAGAC^ 

2 5 AGGAGCAGTCTTCATGGGCGGAAGCCAGATTGCCATGGATTAAGGTGTC^ 

GCTTGTTTTAAAAGTCTGGTTG CCCACTGAGCAATTATTATATG CCAGGTGCAT ACTGG ACACCTCTG G CAT CTCAT ACTCTTC CT 

CGAAACAGTCCATAGGGTGGGGGTTGCTGCCCCTCTTGTTCTGAC^^ 

TGGGTTCCTTCIAAAGATAGTGTCCCTCCACCATGTGAGGACTTCCCT 

CTACCCTGAGCCAGTTTTGGTTATGTGTCCAGGT^ 

3 0 GGGAGCTCTGGGGATAACAGGGCTGCGTGAGGTTTGCTGTCTGTTCTTATGGAGCCT 

GGTGGATAGGAGGGGAGGAAGCTTATTCCCAACCAAGTAATACAGAGGCAGAGTATTGCT 
GTAGCAGTAGGGAGAGGGGGATTGGTTTCAACAGGGCAGATTGGAGATGGATCTT 
GAAGCCTATGGCTTCACAGCCTATTGGAAAATTGCAGGAGA^ 
CTGCAGCTGCTAGAGTCTGTACCTTGGGACIAGCCCAG 

3 5 TGCCTGGC^CACGTCTACCGTGAGAAGAAAGGAGGAATAGAAGGCCCTGGTGTTTGCAACCTGGTTCTAGACTAG 

ACGAGTAGGCGTGGCCCCAGTTCAGGAGGTGGGACTGGTGGCCTGGAGGTTATTCCCT 
CAGCC!AGCAGCC!AGGAGGCAGCAGCCAGCA 

GTGTGTGTGTGTGTGTGTGTATGTGTGTGTTGGGCATGGGTTT^ 

4 0 CTGCTCrGCTGATGGGTAGGCCAAATGGTGGGAGAGTACGGGGAAG 

GAAGCCCCATATCCTAAATAAACTGCCCTC^TTTGTCy^GTCCTGTCCCAAAGCCATACTGGATAT 

CCTGCCTGC^AAAGCrCTGGCCCCCATTCAGCACAACACA 

AAATGATGGTGGTGGTCATGACAAGCATAGCTGTTAATGTTT^^ 

AGCATAATCTCATCTGATCCTAACAACGCTCCTGTGACATAGGTATTATTCCATCCATTGCACAGCTGAG 
45 GAGGATGAGTGATTCTGCAGAAAGCTAAACAGCTGGTGAGTCACTGAGCT 
CCCTCTCATTCCGCCTCTTGGCCCTCTAGCCTCCCTTACT 
TCATTCTTCCTGCCCAGC^CAAGTTCCCTCTTTGTCTTCGCTO 

GGCATGGGCCTGGGTCATAATGGACCGAATGGCCATTTCCACTCCACAACTCTCCTTCGTGGCCTGGCT 
AC!ACTCACCTCTGCCTGTCTAGGAAAGACATGCCCCT^ 

5 0 CCAAGATCACCCA.GGCTACCCn'CAGACACTGCAGGCCAGGAACCATGC^ 

TGGTTTGGTAAAGAGGCAGAAGACAGAGGAAGCCAACATGGG 
GCTCTGGTTACCCAGGCTGGAGTGCAGTGGTACGATCTCAGCTCACCGCAACCT 
CAGCCrCCCTAGTAGGTGGGATTACAGGCATGCGCCACCACATCCAGCTAAT^ 
TTGGCCIAGGCTGGTCTTGAACTCCCGACCTCAAGTGATCC^ 

5 5 . CACACCCGGCCGCGAGCACAGGGTTTCTATGTAAGTTTTTCGTTCAGACCTTGGC 

AGTAT ACATAAAAT AT AC CACAATGG C!AT ATTAAAACTGGGAG AG AGAAAG AAGAAAGG AAAAC AAGGGTGGAGAC CCT AAGTG^ 
G C CAGGAATGAAG CTAAAAATG CAT ACT ATAATGACC TCTATCTG CACTTTT AG CT ATACAGG CT CCAAAT CTG G CAG C CACTGGG 
AGAGTAGCTGGTACATTCTGTCCATAATGTGAAAAGGACCAGGGAGCTCC^ 
AGGTTTCCCAGGGGCCCTTATAAAGAGGACACTGTGTGGTGGAATTGTCCACT 

6 0 CTACCGTTCTGGGGAGAAGCAGATATGAAATGGTGTTTTAGTGTTTCCCATTTCAGCTTTTATAG 

AACTTGAAAGTGG ACGTGC CAGAG CTCATGGCACATCCTCCAGAAAAGC CTCT C TAG ATGAAATAA 

GCAGACAGTTGCAGTGCCATGACATGTGGTGGGGACAGGCAGGTGTTACCAA 

GTTTGTTCGAGAAGAAAGAAAAGGGGCGTTTGAGACAGGGTAG^ 

TGCTTGTTAATTTCrTACTCTTTATTTTGAAAGTTGCAAACTCGGTTGAAAAGTTGCAAG 
65 GCCTTGTAGATTCATCTATTGCTAACACTTTATACCCATTGCT 

C ACACACACACACTTIT'ITTTTTTTT CTTGAGATG GAG T CT CACTCT ATCAC CCAGG CTGG AGTG CAGTGGCATGAT CT CAG CTCA 
CTGCAACCTCCACCTCACAGGTTCAAGCAATTTTACCACCCCAG 

GGTTAATTTTTTTGTATTTTAGTAGAGATGGGGTTTTACTGTGTTACC CAGGCTGGTCG CAAACTCCTGAGCTCAGGCAATCTGCC 
CGCCTCGGCI^CCCAAAGTGCTGGGATTACAGGCATGAGCCACCGCGCCCGGCCATAC^ 

7 0 T TG AG ATAT AGTT ACAGAGACATCATG ACACTTT ACCT CAAA^ CAGCCAAATTC CTAAG AG CAAGGTGTT CTT CT ATAC CA 

GGGTGAGCAAACTTTTTTTTTTTTAAAGGCTACCTGGTA 

ACTCTGCCTT TG CACTGCAAAAG CAGT CATAGATGATACATG AATGAATGAGTGTTGCTATGTT TCAATAAAACTTTATATGAACA 
CTGAAATTTGAATTGC^TATTATTTCAAATGAAATTTGAAAGTCATAGAAATAAAATTACATAAAAA 
AAAATGCAACATGATTCTTAGCTTGCAGGCTGTATAAAAACAGACAATGGGGGCT 
7 5 CCTGAGCTATTTAACCACCATATAATGATTACACCCAGGAAATTTGACATGGATACAATACT 
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ATTTCCCTAATTGCCATTAGTAACATCCTTTGTGGTTTTTTCC 

CA(^TCTCATTAGTCTCCTTT ATTCTAGAAAAGTCT C C (XTGT ATTGTTTTTTTT CTTCATGAT AT TGATACTTTTGAAAAGTTCA. 
GGCTAGTTGTTTGCATAATGTTCCACAGTTTAGATTTGTCT 

GCAATGTT AGGTCT TT CTCACGGTGTG CCATCAGGAGG GRCCTGGTGGCAGTTTGTCHTGTCiT i"i ACTCAATGGTTTTAACAACT 
5 ATTGATGATl' C TrGTCX'GTATCAATTATTATTCATGGTTAAAATGAT^ 
CTTCTGITTAAAAAAGATAGTGCTCAACCI^CACCCCCACTCAAT 
TCTrrriTTATTCAATGTGTTGGAATCCATGT 

CTTTCAAACTGGCTCCTTGTTCTTTTG AC^ CACAAAAAGATATT C 

CCACreTCrrCTCGTAGGAGCTGTGATTCCTT^ 
1 0 CTACTGGGGTGTC CTTGCTTCT AAG CCAAACTAGGTTTTATT ATTATT ATTATTATT ATTACTATTATT ATT ATT^ 

TGAGTTCTGGGATACATGTGCAGAACCTGCAGGTTTAT^ 
TCATCTAGGTTTTAAGCCTCCCATACATTAGGT 

GTGTGTGACTTTCCCCTCCCTGTGTCCATGTGTTCTCATTAGCCAAACT 
TCCTACCATGTCAGAAACTGAGGGAACGCAGCATAGCAGTGG^ 
1 5 GTGGGGATGGTATTAATATTAGGGTATAACAGAGGTACCAAGTGTGTTTAGAAGGAAAR.GTGGAAGA 
ACTTTT CTTGTGTTCT CAGATGCAATGGATCATAAAACACTGTTGATTCAAT AT CAG 
ATCCAAAGATTCTGAATATCAGAAGCATTAAATGGGCTCAGCTTGTG 

ACAAACCAAACCAAAGATTTCCCCAGG AAAATGTAAT ATG CT CATGAAC CAGTT CTGG CTGGAG CCAAGTCTGTACATCTATTGT A 
TCAGAGCAAAAGTAGAACATGAAAAATTATTCTGAAC 
2 0 TTCTTGGCTTTCCAGTCCATTCTACTGTAGCTGTGTTTCTCACACCTGAACAGT 

GTCTT CATGTT ACATT AATTTTATTTT CATGTT ACTT AAAAAGGT ATAAACATAT AGCTTGGT AAACACAAAGTGT CTTGT ACTTA 
TAACTTTTAAACAACTGTAAAGCCAAAACAAGTTTACACATA 

T CATT CT AATGGG AT AGTTG ATGATGTGTCACTG AAACAAAATTG CTATTTAT AAGG CG AAT ATCGTAGTGACTACTG AGACACAG 
CAAGAGTCTTTGGAGTTTGGAATGCCTGTACTTTTGTGT^ 

2 5 CTGGGAAATCTTTTATTTATGTATGTATTTATTTATTTAGAAATGGGAACrrTACTCT 

CATAGCTCACTATAACCTCAAACTCCTGGACTTAAGCC!ATCCTCCTGCCTCAGCCTCCAGAGTAGCTGCGA 

AT CACACC CAT CTATCTTTGTTATTG CAACTGCCATG ATGTGT CTTGG C CTTTGCTTTGC CGG AGTG CAACATTTACAT ATTAAGT 
TTATATCTGTTCTTTTTTTGTTGGOTCATAGCAAGTAGTCATAAT 

AAAAACCCAGTGG CAT AG AACAAGGGT CAGCAAGTGTT CT C TG T AAAGGGCTAG AT AGTGAAT ATTTTAGGTTTTG CACACT ATAC 

3 0 AGTGTCTGTCTCAATGACTCAATTCTGCCATCATGGTGCAAAAGCA 

ATAGTGAAGTTTGAATTCCATATAATTTTCACATCATGAAATATTGTTTT^ 

TTAGCCTGGTGGGCCATGCAGAAACAGGTGGTGGGCTGGAATTAGCCCACAA 

ATClATTTACTCnTGCTCACGTATCTGCTGGTCAGCTGCCGTTCCTCAT^ 

GACAT AGCTGGGCTTGCTGGGCTGCCCGGACT CCAGCTGCT CTGGAG CCAGGTCTGCTCCAGACA 

3 5 CTTGGAGCATGTCTGTTCTGGGCCCCAGCTGAAGGAGCAAGGGCTCCCTGG 

CATGTTTGCTAATATCTCATTGGCTACZATCAATTTCACAGTTGAACGT 

CAGTCTAGTTCAGAATCCTGTGGCAAAAGGTGGTACGGATGGCTGAAGAGCTGGGGCCAGGGATTCAATC^ 

TCAGCCATAAATTCAGTACTCTTGCGGGTGCAGAAATCTCATGAGGATTCCCGCATATTGATTTTTTTAAATGAAAACATGGAATT 
AAAAATTCTTAGAGAAAACATTCAGATCCTGAGAATATAT C CAAAGACCCT AGTTTGAGAGACACTAATTTTTAGTAAACTTCCTG 

4 0 GCTTTGCCGCAGAAGATTTGGGGTTTCTTGGTGTTTGAAAATTCCCCAAGGAGAGCTCTTGTTGAACTAGGCT 

TCAGCACTCCAAACCCCAGTGGGTCCTCCAAAGATGCTAGTTAGGG 
ATAGCTGCTGTTCACTGAGTACTAAATGATTTGTATATCTCACAACA^ 
TTTTTTTTTTTGAGACGGAGTCTCGCTGTGTCGCCCAGGCTGGAGT^ 
GGGTTCACGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCT 
45 TTAGTAGAGACGGGGTTTCACCGTTTTAGCCGGGATGGTCTCGATCTCCT 
CTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCAGTTATTATCT^ 

TACTTAT CT AAGGTT AT ACAG GTCATTTATGGGG C CAGGAC TTTT AACTAG AT CTC CAGTTCATCATCTCCAGG CCTCAGATGTGA 
ATATCTAGAATGCCTTCCTGGCATCCCATTGCTGGTCTTCCCACT 

TAGGATATGTAGGACGTAAGATCGTTCCCITOTCCTTGTTTGATTGGTAACAATGAACCT^ 

5 0 TTATTG AGATGTAATT CACAT ACTAT ATAATT CACC CATTTAAAATGTATGATTCAATGATTTTG AC T AT ATTCACAG GT ATGTGC 

AACCATCATCACAGTCGGTTTCATCAAATCCTGTATCCTCTGGCTGTCGTT CCCTTAACC CACATCTTCAAC 

CCCTCACCCCCACCCATCCCTAGGCAACCACTAATCTACTTT 

ATTATGTAGTATGTGGTCTTTGTGACAAGCTTCTTTCACTTCGCTTCATGTTTTCAAGG 
CCCTTTTACTTTOTTTCTTTTTTTTTTTCTTTTTTT 
55 ATCTTGGCTC^CTGCAGCCTCTTSTCTCCCGGGTTCAA 

CCACTATGCCCAGCTCTTTTTTTTTTGTACTTTAGTAGAGATGAGGTTTTGCATOT 
AAGTGATCCACCCACCTTGGCCTCCCAAAGTGCTGGGGTTACAGGC^ 
CTAAAAATGTTCCATATATTGTTGATCTCTTCATCCATTGATGGACAGTCT 
ACATTTGTGTACAAGTTTCCATGCGGACACGTGTTTTTATTTCTTTCGGGCAT 

6 0 CATCATTTGAAGATCTGCCAGGCTGTTTTCCAAAGCAGCTGATG^ 

TCCACATTCTTGTCAACCCTTGTTTTTCTCTGACTTTTTTATTCTAG^ 

GTGTACTTAACAGCTCCATGGGGTGCTTTTC 

ATCTGCAGCATGATAATTAAAGTGTATTTCTATCTGATTTCTCTCTC 
6 5 CAGCAAACTCCTGATCCAGATACTCCTGGATCTCrTCTCTCACCCT 

ATTGAGGCCAGTCTGCTTGACAGCTGCTTCTCTAGTTCTTCCCCAAGGG 

GGAGCCTGCCTCCTTACCAAGGAGAGCAAGGCTGTG 

CACCTTTTACAGGCAGAAAAGAAGGACTTGTGAAGGGAAGAGCTGTCTG^ 

GCTCTACCACCTGCGAGAAATGTGTCTTTGGGCTGCTGACCTATTCT 
70 ACAGCCTAATGGAACACACCCCTATGAGTTGTTGTAAAT^ 

ACAG CCAG CTTTAGTATC CACAGTT ATCATCAGATGGGTTT CAGG AATGGTTG AGGGTGGG AGGTG AAATCAAAG C 

TG GATGTGGAATTTAGGATTGTTTGTCAACATGCCTCATTAACT AG GT CT C CAGAG TCT TTTT AAG CAGT AAAAAGAAGGAAAT TC 

TGCCATTTGCAACAACATGGGTAAACCTGGAGTACATTATGCTAAGTGAAGTAAGCCA 

GTT ATAGGAGGAATC TGAAAT AGT CAAATTCAT AG AGACAGAGAGT AG AATGG 
75 G AAAT ATTAGT C CAAGGGT AT AAAGT TT CAGTTATGCAAG ATGAGTAAGT CCCAG AGATCT ACTGT ACAGCATAG AG CCT AT AGTT 
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TACTGTATTGCATGCTTAAAAATTTGCTAAAAGGGTAGATCTT^ 
GAATCTTTTGGAGGTAATGGATATGTTTATATCATAGTTTGTGGCGAC 
TTAGATATGTrAAATACATACAGCTTTTTGTATGCCAGTCATACCT 
CteAGTTCATGTCCTTTGTAGGGACAT^ 
5 GCATGTTCTCACT CATAGGTGGGAATTGAACAATGAGAACACTTGGACAC^ 

G^ntSCGGGGAAGGGGGAGGGATAGCATTAGGAGATATACCTAATGTA 
TGTATACATATGTAACAAACCTCCATGTTGTGCACATGTACCCT 
TTAATAAAAGGAAGTTCTCATTTCCTTATCAGTTAAATTAGGGC^ 
, AGGAAGTT AT ACT CACGTGCCAATGTT C ACAAAATGTTG CACACATTTT AAGGGATGCC CCCTTT CC CAGACACCAGGTT AGAAAC 
1 0 CTAGAGCATGGATAATCTCAGATGCCCTCMCTA^ 

AGTTCAAGTTCAGCTTATCTCTATCTCTCAGGTCTTCAGTTGG CTGGG CTGGGGAGT AG CTAG AGAAAAG AAGTG ATG CCATCTGC 

CTCCTGGGGCTTGGGAGAGGAGCATTTGAGGTATGCCAAAGTGC^ 

CTTTGCTCTATGAAGAAATGCAGGATTGAATTATGA^ 

TTTG AAAATCCAT CTCTG CAGCATCTGGTCATGACACCATTTAGAGGAG C CCAACAGTG AAT CTTCGACTGGGGC1AG ACAGACATT 
1 5 CTGTTGCAAAGCAGAGCATT CTTCC CTATT AAAATGCCAACCTGCAGAT AG CAGATTTGCTCAGAGG CAAAGCTGGAAGTTCAGAG 

AGGGGCCCAGAAGAGTCCTGGTTGATTCTGGGTTTCAGGAATGGTT^ 
ACATCCATTGCTCTACAAAGTGGTTAAAGCATGGACTGTAC^ 
CCTTTGTG^KZCTCAGGCAGGTTAC 

ATTAATAGTGTGT CAAAG CACTTGACAT AGTGAAGGT ACTCCAT AAGTGGTAGCTGTTAATGAG G CTTT ATTTTT ATTGTCTTTAT 
2 0 TAATTATTACCATGGTCTAAAATAGCATGTGGATCCACCTGGACCATGCTTCGCT 
TTTT ATGAGCAT CTT CT CAGTCTTCATTTT A 
TCAGCCTCCCTGCTAGCAAATCATCCCCTCGTTGAACACT 

TAGGACTCTT CTG CATGTT AG ACCATGACTTTTTGTGCTGAGT T CAGGC CTGT ATCCCTCTAGATGCATG CG CGTTCT AACCAT CC 
TGTCCCCACX1ACATGGTCGTCCTTCCCACATCT 

2 5 TOTCZACCATGCAGGTTCCCCAGGTTATCAGCCTGGTTCCGGGTTATCAAAGGCCT^ 

TCCAAAGTGCTGAGTTCAGAGG^IACCAAGGAGTTGACCTTGAGGGAT 
GCTTGTCCTGAGCATCTGGCCCTGCTGTCATTCCTGGACCCATGTCAGTCCCAGAAT^ 
GATGGCCTGGGCTCATCAGTGGCATGTAACGTGGTCCACAGGCCCTGCGAGATCCTGCCrGC 
ATCTCTCTCTTCTCTAAGATCTCAGCTTAGACTCCATGTCCCCAGAAAAGGTCCTCAGTAGATTCC 
30 AGAGATCCCATAATCCCACATAATTCTCCACCATGGGACTG<3CCATGCCATATCAC^ 
GTGAGCTTTCTAAGAGCAAAGACACTCTCTGCCTGGGTCAACATTTTAT^ 
ATTTGTTGGTTCAATGAATGAATGAGTGAATAAATGCTTGATATGAGACAATCCCA 
GTCAGAAAAGCTCTAGACCTGAACCCAGCAGACAGTGGTCT 

TCCAGATGAAACTGCCAAGACTGATTACAACTCAGTGTTGAACCAO\GGCCTCCC^ 

3 5 TCTTAATAATC CCGATT ATGAT CACAGTGCT AACAGTGACTGTTCACTG AGTGT CTT CCACTTC CAT TATGACAG CAGTG CT ACAA 

GGTTGGTTTTTATCCAATTTTATAGATAAGGAAACTGAGGCATGGT^ 
GAAGCTGAAATTCTAGCCAATTCCTCTGGCTCCATGGCTCTTACTACCCTCCATGTGCTCOT 
TGCTTCTCrTTTCCCrCAGGAGACAGGGGAGCAATCAGW 
TGAGATTCCAGGGGAGGGTCCCTGCAGACTGGAGCATCACTTGGGGA 

4 0 CTTTTTAGTTTT CTGAG ATTTTT AAAACACAAAGGC^ 

CCTCCCCTCOVGAGTAATGAGAGTTGAGATTTTGATGTGTATCTCCTAGACT 

TGGTAGATGTTTACAACTGATGGTaATGGTGATGGTGATGGTGAGGCAATGTG^ 

(TTGATTTCAAGCrAOTAGCATGAAATCACTGAATGTGCAATTC 

TGCAGCTACAGTAGAGGCCAGTACCCTAGAGAGCATAGAAAATCATCAGTC1AAATGTAGTA 

4 5 GAGT ATTT AT TG C CTTTT AAAAAT ATATC ATTT ATTG AATTGTAAGTTT AT ATGATTT AG TT TTTCAAT AATGG CTGTGTTTG ACA 

ACTGGTTCACAAAATTCCTGAAAATTTAACAGTTGGC!TCTCAGTCAAC^ 
TAAAATCTTAACATTTTATAATTTTTTCTTCTTTCrTTCCTTC 
TTTCTTTCTT CTTTT CTTTTCTTTTAGAAACAGGATCTTGCTTTGT(IACCCG 
CAGCCTCAAACTCTTGGGCTTAAGCAATCCTCCTGCCTCAGCCTCCTC 

5 0 TAATTTTTT AAAAAATTCTTTGTAGAGAGGCTGGGCG CAGTGGCTCATGCCTGTAATCCCAG CACTTTGGGAGGCCGAGGTGGG CG 

GATCACGAGGTCAGGAGATCAAGACC^TCCn^GCTAACACGGTGAAACGCT 

AGGCATGGTGGCAGGCACCCATAGTCCCAG CTACTTGGGAGGCTGAGTCAGGAGAATGGCGTGAACC TGGGAGGCGG AGCTTGCAG 

TGAGCCGAGGTGCACCACTGCACTC<^GCCTGGGTGATGGAGCGAGACTCTGTCTOVAAAAAAAAAAAT 

TCTCGCTATGTTGCCCAGGCTTGTCTCAAACTCCTGGCCTCAAGCAATTCTCTGGCCTCA 

5 5 TGTGAGCATCACACTTAACCAATTTTTTTCTCTTTAACrTG 

CAGTATTCCACC^TGTGCCATGCCOTCCCTGCGTTAGGAACCAGGGC 
TAGCTTACGGTCTAGCCAGGGAGGCAGTCAGTTACCAAATCGACAAATAGATAT 
AGGGAAGTTACAGGATGATGTAACAGAGATAATGCCATGATAGACCAGGGATGAGGTATC 
AGGCTAGGAGAGGCTTTCTTCTGTCAAACTCACATAATCTGAGACCCAAAAGCTA 

6 0 AAAAATTGAAAAAGGATATGTTCACTTGGGTGCCATTTCTGTAAACTTTACACACATGCAAACAGTCCT 

AAGTCCTTGTGTCATAAAAATGGGAAGGCTTCCCACCATCTCTAGGATGGTGGTTACT 

GTTGGAGG CAGAGAGGGTC TTCCACTG CAT CTCAGCA1TTTCTT GAAG CAAAAG G AG AGAGAGG C CTGAG ACAAATAT AACTCGTG 
TT AATTTTTCTTGGCTTT AG C TG ATG GGC ACAGGGTTGT CAGCTG CATT CTT CTTT AT AATTT CTGAATG CTTG AAAT ATTTCATG 
ATTATTGTTATTTTTAAG<MTAGATAAAAGAGGCTCTAGGCT 
65 GAGGCCCAGGGTAAGTTCGGGGGCAGGCCTGTAGGGTATGAAGGCTGTGG^ 

CCATTTTGAG CTGGTGCGGGACATGATCTGATTTACATTGATAAAGGG CTTCTCTG CTGCTTTGAAGAGAACGATGGAGGCTT ATG 
AATGGCACTAGGTTAGGTAGGAGTCTAGGCATGGTGACTATCCATCT 
CCAAGTCTTCAGG^GACCAACCCTGATCTAGCTCATGTGTC^ 
AGCTGTTAGAAAGGAAAAAGAAGGAACATGTCACTATGAAACAGAAAGAGGCAGGA 

7 0 TAATTAAAAATT CTGGG AATGAAAAAT ATT CATCAAAATAAAGGTACT CATTAAAAAAAAAAACAAAG AAAGTTGGAAAAACCTCA 

GTAGATGGGAGAAACCCCGGATTAGACACaGCCAAAGAGATAATTAGTGGA 

G C C CAG AAT AAT AAAG AGAAAAATATGGAAGAGT AGT TAAAAGACATG AAGGAT AT ATTGAGCTACTC C CATAAT TTTT TTTTTTT 
TTTTTGAGATGGAGTCTTGCTGTGTCACCCAGGCTGGAGTGCIAGT 

ACACCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAG<3TGCCCGCCACCACACCCAGCTA^ 
7 5 TTTAACAGAAATTGTACTGAAAGAGAATGGAGAAAATGGCCGAGAAGCTGTATTC^ 
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TGAAAAGGATATGAGTTCTTGGAOT 
" CATAG<^AAACTGCAGAAAATCAAGAGTAAAAGGAAAGTCTTAAAA^ 
CTGACAGCTGTTGCTTTTGCGCOGATGGCAGTCT^ 
ATACCCAGCTXaAACTCTGATTAAAGAGTGATG^ 
5 GAGGTTGGGGAGAGGAGGAACCCTCTCAAATCACATATAT^ 

G AGAAGGATTGGTTTAGCCTTTGTAGTTGCAGTG CGAGG C CAGTGGAAAAG CCCCTTGTCAG CCATGTTG CTGAGTATTCCAT CAA 
GACAGAGGAGGGAGATGGTCCCATCTGCTTCCCATCTTCCAGGAATTCAAAACA 

GGATCTCCAGTGTCTCTAGAAGAATGGGAATGGGTGGAGGG CAGGTAG CAGAGATTCTG AAC CCAGCCTGCATTT CAGTTCTGT AG 

10 ACCCCTCTAGAATGGCCCATCCTAGACTCTTTCrCTT^ 

AGACGAGACCAAAGCCAACTGGATGAGGTGAGCCCTGCTCTGC^ 

GGAGTGTGTTGGACCTGTGGGAGCTCCAGTGGTGGAGACrrCTGGAGGCTGCC 

GACCCACCTTTTCCCCTAGGCCTCCATCAGGGGGCTCT 

TGCTCACTTGCCAGTGTGTCTGGAGGTCCTGGGCCCCAGGCTCCrG 
1 5 AGGACTGTGACACCTCAGAGCAGGCTCCTGCAGGGGTGGTGTGAGAAT^ 

CATGGAAG AAAGGGCT CTGACAAAGTCTTCAGCTCAGl"i"riTG ACAGTGTTGCTGC CT ATGTTTG AGT Aei'Tl'G AAGTGTCAACTT 

TACCCTGTTCCGGTGTTTGGGACAGAATTTTGCACCAGAAGT^ 

AGGGTCACCCAGGGATGGCTGAGCTGAGCCCAGAACCATGG^TCCAGCGTGATGCT 

CCTGGTGGG<3TTTGTGGACAGGGGTGGGAGATCTGACAT 
2 0 GCTGGAGATTTGGATCTGGGTGGGAGGGGAAGGAAGCCTCAGATCTCT 

TCTGCCGGCATTTGCACAGTTTT CTGACCCATGGGGCCTTGAGGCTGGTGGTGGGATTGTAC TTGGAGAGAGGGCAACTTCCTAAA 

AAGGAGTGTG CTGT CTTAAACT CGG AG ACTTTCT AAGCAAATGAGG AG AAG CAGGTTTGAGG CGT AGAGT ATGTG ATGAGG G CTC C 

TGTGTCCTTCCCAGTGGCTCCACTGCCCACCTATACTCCTATCAGAGAGTGGCTGAG 

GCGTTCGCCCTCTTATGGGTTAGGGAAGGTGTTCTCATTGC^ 

2 5 CTGCCTTGCAGTGTGGAGAATAGCACCGACTGCCCCCAACCTC 

GGAGGTGGCTTGACAC(XrrGACTGCCrTCTCTTCTCACTGCCA 
ACTGAGCACCTAGTTTGAGTCAGACTGTGAAATGGGTACCAGGGTGG 
CCGCTGGCCCTTCCCGGGCTCACACCTCCTACCACAGAGGTGACTTTCCAGTG 
GGCCCAGATGAAAGAGGAGGGGTTCGTTGACA.TGGGCCTTCAAAGGGGCCA 
30 GTCAGGCrGTT^GCGGCTGCTCAGTTTGGACCCCAAGCCTGTGCTGCTGTGCTGCCTGCCC 

AAGCAGGGGAGACCGTGCGTGTGTCCACTTGGGAATCAGGCTTTCTGTCCTATTGCTTTTAAGGCGCC^ 

CATCCAAACTAACTTATCTATCCTCTGGCACC^ 

AAAG CTAGTG AT TCTG ATGTGCC ACTAGC ATTGAGAAC CACTGCTCTGT ACAGAT CTTCATG C CACAGTGGTCAAGCT GAG AG CAG 

3 5 GGCAAGGTCTTGCTTAAGAGCATACAGTCAGGGTAGCAGAATCAGGATTTGAAC^ 

CCTGCTTGTGCACAAGGGACCAGGTGAAGCGTGGCTTCCTG^ 
CTGCCACCTTGCTAGCATGGTTTCTGTCCTGTTCTGCTCCCAGCOTGTC 
GGOTCTCATCTTATGACCACTCAAAGAGCAGGTTTGGGTTAGGGATGCAGTTTGTTGCCACG 
GAGTGACGGGAAAGCTGTTAATCCTGGGAATCCTCCTTTCC^ 
40 GCTCTCCTTTGCACTGTCATGGCA^ 

GGCGAATGGGCACTGTCGTGTTGAGCTTATCTTTATGACAGCCCCATCTAGGTGATCTC 
TTTTGTGCCCCTGGGGGCATTTGGCAAGGCCTGGAGACATGTTTGATGGTTGGATGCTATCA 
AATCCTGCATTACACAGGACACCCCTTCCCCCCAACAAAAAATTTTCTA^ 
CATTAAGTCCTCCTGATAGCCCTGGACAGTGACACTAGAAT^ 

4 5 CCATGGCTTGGCCCAAGGAGACTCAGTCTCTAGTTCCAGGTOT 

CCCAGCCTTCTTCCCATCTTCTGGGCC 

TTTTTAACCTTTTATCACTA^ 

GGATGGAAGGCCCATCCTGTGCCCCTTGCCCAGGGCCTCTGGCTGCTCAGTTGAGAGGGA^ 
50 AAGTGGAACTTGACTCCCTCAGGACAGTCGTGGGCTTCTC^AATTCTACC^ 
CCCATGGGAAGGAAACCTGTTTGCCTGCAC^TATGCAAA^ 
ACTAGGCAGAATAAGCTTTCTGTGGCCACACCCTAATCATGGCATTGTCACTG 

C CT GGTT ACGT ATT CTTCCGAG CCATTCATTCAG CTGTAGGG CTG T AGGTAAATAT CGCTTT ATAG AGGCAATGG GCTAATCCTAA 
TTTCAATAAGATCCTTCACTGCTATCTTTAAGAGATTT^ 

5 5 AGAGTTGTGGTTGTTATTCTGGACAGGGAJGAATAAGGTGTGAAAGGTTGAAC^ 

TAAAGCAGGCCCTGAAACCAAGGCCTTCGACTCTGAGTTCAGTGCT 

CCGAGGAGTAGTGGAAGGGACCTCTCAGTAGGAGGAGGGTGTGGTGGGGGCCAGGTTGCCCCAGGTGCCTCCTGGCTCC 
GGGGCCAGCAATTAGGAAGGAGAACCACCTGCATTGACGGTC^ 

GGCACACCTGCCTGAGCXSTAAGAGGAAGCCCAAGTTCTCCAAGGAGGAGCTGGACATTCrrTG 

6 0 CAGTGCTCTTTGGGAGGGAGACCATGCGGCTGTCCCATGCTGAC^ 

GTCAGCCAGGTGCCCCGCTCCGTCAAGGACATTAAGCACAGATGGGATGACA 
GCAGCAGTCCCTGTCGGGCCCTGGGGCCGGGGGCCGGGCCC^ 

TGCTCACGGCCCGTGCAGGG CGCG^CTTCCCCAGGGCGGAACTGGATGGCACCGACAG CCCTTCGACCAG CTGTGAGTATCACCAG 
TCCCCCCCGCC^CCCAGCCGGCTGCCTGCCCCCCAGC^ 

6 5 CTCTCTGTCC^GCCCACTCTTTCCATCCTCCC^ 

TTACTGAGCATCTCCGTGCCAGAGCCGTGCTAGGAGCTGGGAAGA 

ACCCTGGACTCCTAGGAATAAACCAACGTTAAATAATTGCACGAGTAAACTGGAAACGAAGGGATGGAGG^ 
GGTCCCGAGAGCAGGGCTGACTTCATGAGCATGCAACCTGAAC^ 

GGACATTCTTCATAGTTTTTGAGCCAGGGGCCCCACGTTTTCATTTTGCATTTGACCCT 

7 0 AGTTAAGGAGTGGGTAAAACTGGCTCCACTCGGGACTGGGGAGGGCTCCC^ 

AGTTAACCAGGCCAGGCAGCAGCGTGGGGCGCX3GAGAAGGGGAGGGGAGGCG 
TCAGGAAGGCGCTTGGCACGTTCCAGGAACAGTGAGAAGCCAG 

GGCGGCCAAGCTGTGAAGGCCCGGGCCGGGCCATGGAAAGATCAGTTCCTTTCTGTGTCTGGTCCT 
ATTACTCCCCTGCCTGAGCCACGCTGCCTGCCCCCACTTTTCrrGTGTGOTGCCT 
75 GCCACCCGCTCCCC^CCCCTCrrTCTCCCTCTGTACTTGTCCCTGCCTTTCCT 
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CAGGGCCCCAGCGGCCGAGGCGGGTGTCCCTTGGGGCTGGAGTTCT^ 

OUSGGACTGCAACCCTTCCTTTCCACTTC^^ 

CCGGCTCGTGAGTGGTGGGAGCC3VGGCAGGAGGGGCCCCGGGGLAAG 

TC^CTTGCACCTTCCCTTCCCGGGCCCCCTGCCATTGTGATCCACTTGCT^ 
5 ACAGAGTCAGATGTTT AGGACTGGAAAAGAACATGGAGAC CATTTAGTTCAACTTCCTTG CAATG CAG AAAGTAAGCTCTGAAAAC 

OVAGGTGGCCCTGAGCrrCAATTCAGAAGCAGGACTTGAACCCGGGTCT 

GAGGATTCCACCC03TGATAAGAAAAAATGGGGGGCT 

ATCTCCCTGAGACTGAATGTAATt3AGGCCACCAAAGGTAACCTG 

CACCCCCATGGCAGGCTCACAGCCCTCCITTTCCTG^ 
10 CCTGGGAGGGAAGGAGCAGCATCACCTCX3CCGGGTCTCACrCCT 

G ATGGCTG CTGTCCCAC CAAACACTG CCAAG AGACACAGGACTCTTAC CAAG CTGGCC CAAGTGAGC CCCCAGGACCCCTGGGAGG 

CATGAGGTAACTGTAAGATAAACCTCTCTCTCCTCCCCATCACCTTCC^ 

GGGAGGGCTTGCCTGTCAGTAGGGTCAGCCCATACATCATTTAATCCTGCCACAG 

C^GACCCACATGTGCAGAGGCTGGGCCCCCTCCCTGTGTCCGGCTGCAAGGAGCAGTCro 
1 5 GCTCCTGGATGGGAGCACGATGTGACTGAACTGGAAGCCTGCCTTGGGGGACAGGGGGACAG 

TCCTGCTCTGCACTGTCCTGCCAGTTTCTTCAGCCTGTCAGCATGTGCTGCAG 

TTTTATCCCTCACCCCCTTCCCACTCTTCCCCC 

GTGTATACTGCITGGGTCATGGGTGCACCAAAATCTC^ 

CCCAATAACTT ATGGAAAAAAAAGTGTGCAG ATTC CAGCAAGG CCACAAG ATC G CCC CAAGCAATGCAGACGTGG CCTCAGAAAGT 
2 0 CTGGTGCTCAGAGGGCCAAGAGCGTCTTAGTACCTTTAGGCAGGTGCCGCTCCTTCAG^ 
CTTGTTTATGACTCTTCTTAACAACCCAATTAGGT^^ 

GAGGTG CAGAGAGATGAACTGACTGCTGAGGTCAGGAGCTG CTGGGTTGGCACGCAGCCGGTGGGCCACGGTCTCCAAGTCCCCTG 
GGGATTTGGGGAACGTGGATl^CTCTATGGGCGGGGGCCAAACTAGACCGCA^ 

AG ATGAGGAGGCGCCTGGG C C CT CAAGG CAGC CT CTT CGGGTGCCT CTGCAG CGG T CTC CGG AGGAAGAGGC C CACCTGG CCAGGC 

GCCCAGGCTGCCTGCAGGACCCCTCGGCCXSC^CCCCAGCCCACCCACCACGGGCCTTGACTGGC^ 
GCAGACCGAGGTGTTCCGGCAGTTCTGCCAGGAG 
30 CCATGGCCGAGCTGACCAGCC!GTGTCGGTCAGATGTGCCAGACGCTGACAGA 

CCAGAAGGGGCAGACCCTACGGGCTCCACTCCCCAGGCCACCCAGGCCCAAGCCCCCCTGCCAGAGCCCCCACCAGCTTCCCCAGC 

ATCAGCCCCCACACGGACTACCAGGTCTCGGAAGAGAAAGCACAATTTCTAACCCAGCTAGTCTGCACTAGGAGGA 

AGGAGGGATGTCTGGCCTCACAACAGGGGCCAGTCTTTCGTGTCCAGGAA 

GTTACCTCCTCACCGAGCGCAGGTCGCTGCCCTGTGGTCTAACAGAGCATGTATTCAGACCCGCACCCTC^TCTTTGGCGGATGCT 

TGCAGGGAGGGGGGCGCAACAGTGGGGAGCATCCGAATGGAGTCCCACTTTCCAACCTTGGGGCTGCTGCATCGGCCTCCCACATG 
TGACCTCCCAGGCCCTGATGCrGCTTCTTTTGAAGGGGGTGAGGCCTGATGGGCAGGTCAAG^ 
CCTCCCTTGCCCGGGGGTGGTGCCCGCCTTGGCCCTCCGCTTGCCCTTGTGCG03TGCTCAGC^ 
TGGGGAGAAGAATGGGATGACGTCGGAGGAAGGAAAAGACTGTCTTGCATTCA^ 

4 0 AGAGTCTGGGTGGTCTCAGTGGGGGCTCAGGACAGTCAGCTCAAGACAGCCAAGTCAGGTATTATTATGATCATTTTTATC^ 

CATCTGTATAAGACAAATGACACAGCCAGTTAGTGAGAGGGCCAGGACTGGAAGCCAGGGTATCGGTTTTCC 
GAGGCCAGATCTGGAGTAGTGGGAAGAGGAGGCAGGGGGCAGGTCTTGCCCCTTGCCAGGCCTCTGCCCCTTGCCAGGCCTCTGC 
CCCTGTATCTCTGCTTCTGCCTCAGCTGTCCTGGGTGAGGTGGGAGG CACCTAGCATTTAGCAGGTGATT CTCCCACCCTGC CCTG 
GCTCTATGGOTCATAGCCAGTGGCTTGGAGAGGCAGGGCCAGATCTCAGGAGTGGAGGGCAACCTCTTCCC 
45 TCCAGAGCCCAGTGGGGCTOtfGTCCCATTCCAGGTTCC^ 

TACACCTCTAGCTTCATGTTTC^TGTTGGAGTGTGGGGAGAAAAC^ 

ACAAGCTTAGCTCTGACCTGGGTGCACCTCAGCTCTGTGTCCTTTTTTT^ 

GCCAACCCCTCCACCCAACAACCCCCC^CCCTCCCATCCTGCCC^ 

TTCCAAGGCTGTTCCTGGAGCGCTGCTTGTAAACCCAAGCCAAGCTTTGTCCCACGTAGGGAGAATGGAA^ 

5 0 CTGTGCCTTTGGCTCCATCTCCCCATTCTGTGGTTCTTTGGGGGTTCCTGCTTCCCTCAAACTGAACACT 

TGATGGCTTAATTATTGCCTGGAATTGCCCGGCCTCCGATGCTGGAATCAAAGATTGCCTTCCGAAGTATTTTTGTTA^^ 
ATAAAAAGAAATCAATCTCACTCTTTGAACAC!ACCCAG 

AAAACCTCGAGGACCCCAAAGCAGGGGTTAGATGTTTAAGGAAAAGCAGGTTGGTGAGAAGGGCTCAAGTGCCC 

G ATT CT ATAAACTG C CAG ATTCC CTGT AT CCAGCAC CATGCCAGG CCCTCX5G AGGAAAGGAAGGG CATG CATGC ACGAAAC CACTC 

5 5 AGTTATATTTCATTTCTCTGCTCAAAAGTGAGGTGTTGCTA 

GAAAAACAATCACCGTTTCGGACAGGAGGCTCTAGAAGGGTC^ 

CATGACCCTCCCCACCAATCCAGGTGTCTGTGTGTCTGGCACTGGCAAGGCTCTGCCCACCrAGATCTGAGG 

CAAAGCTGCCTCCTCTGCCAAGAGGAAGCCTCCTGACCTGTGCATTGCCCT^ 

AGGAAATCCCCTAGTCTTGCCCACAGATCCCCTTTCTAGAAGGTTCTAACT 

6 0 CCATAGAAATGGAGGACTACCCAATTTCCGAGAGATGCAAAGATGTCGTGAAACCATTCCCAGG^ 

TATGAAGCAGTGACTGGCTGGCTGCTCTCTCTCTTCCTGGGGAGGGAAGGAGCCAAATTGGAAGCTGTAA^ 

AGAGCAAAAGGGACAGTGAGGACTTGTCAACTTGGGATGGAAAAGATTTTC^ 

GCTAGCACATTAAAACAGGAAGAGGAAACTTGGGTCAGAGACAGAAATGAATTG 

CCATCTGGTGGGTGGATGTGTGACCCAAAGCCCCTCCTAGCCCGGAGACCTATGCATGTGAGAACACAGAGC 
65 GTCGGTCTTTTCCTAGTTCTGCCTCCTTAACTGGGGCAGCTTCCT^ 
GCCTAGTCAATTCCCTCCAGCAAAATGGATGCCCTGTGTTATGT^ 

GAAAATATG ATGCIACTTGAT AAGACAC^U3TGGGTG AGGTCTGAGA CTAT C CAGT C CCCAAGGT CAG 

CTGGGAGGTTGG AAT AAAAACAAT CCAG GAT CAT CATG ACAGGTTT AGT CCAGAGCTCTC CAC CGCCATCAG CAG CACCAC CAC CT 
TCCCC!TTGATTGCTCaVAAGGTCCCAAATTCTGTGGGTTGGAGTTGAAATGCCTCAATAAG 

7 0 GCCAGGCATTCTCTGTGCCTTCTCTCAAGAGGTAGTGACCTGG 

GATAGATTTTGCCCCTCCTGGTAGCCTTCCACTCCTTGGGGAGGGCTTCTGATTTCTGCAGCCATGTAATGAATC 
GTTCTTCATAAAGGGGTTGTTGTGAATGGGGACAGAA^ 

AAGCAGGAGATCTAATCAGAGGTAACAGGTCTGGGGACCTAGCTGGCTTCATTTA^ 
TCTTGTCAGG&GAATGGGTTGTGATTCACCAAGAGTT^ 
7 5 CAAAATCTGAATAAGCATTGATCTGCCTTTGAAAATACTTTGTGATTTATAGAACGCCT 
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TTGATGGATAGAAAAAGACCAGAAGATGGAAAGTTAAATTGACTTGAGTCTTT 
TTTGAATAAATAAATCCCAGATAGGAATGTATTTCTCATCTAGCTTTT^ 
TTAGAGGTTGGTGGTTGTTTAGAAACTGGAAGAAAAAAAG^ 
TTGTG AGAAAAAAG ACGAATGAATGGTTTCC CITTTGTA 
5 AAGGTTGATTATTTTTGGATGTCAAATCCTAGACTCTCA.CTAAA 

CTATGGAGAAAAGACTGACTCACAGTTGTAGAATTGAGATTGGGAATTA 
AAAAGGGGCTCTTTTCCTTTCTCTTAAGTCCTTTC 

TCAGGG AT AGCGATG CTTATATCOT ATGTGATAAGAACACCGATGTGTTGGGG CTAACAGAGAACACA 

AATTCTGGCGGATCTAAGCACCGTAAATCTTGCAAGTTCTACACCr 
1 0 AGCAGCCAAAGCCAGCATTTGCTCTCCCATCrGCCTTGGTCTT^ 

GTTCCCTGGACTCCAGTGATCCTCCTTGGAAGGCAACAGCCTCCTCCCAGGCTCTCCAG^ 

GCAGCTGGTGCC^TCCCTGGGACCTCTAGAATGTAGATTCTCCATGCTC 

ATTCXJTATATTTCTTTTCTTTATATGTGTGTC^ 

TGCAAAATTACAAAGATGATCCACCTGTAAGGGGTTGTCACTG 
1 5 GTTTTTAAAGCATTATTGGAACCATAGTCTACTTCCAGTCCTATTC 

TAAAAGATT CTTTG AAAACATTTTT CTATAGATGAAT AG CATTCCATT AT ATAAATGT AACATCATTT ACGAAATCTTATCT AGT A 

TTAGACTATTTAAATGTTTTGCCITTGTACrGCTATA 

CATAACTT ATGTTTCTTC AGCCAT ATTGCCAAATTGTTTC C ACCTGTTAATT CTG C AT CATCAGTACAGGAGACA 
CCTTTCTTACCACCTTTACCAATAGAGATATTATCATCTTTTAGAAATCTC 
2 0 TATT AGTGATGCTGAAT ACTTTT CATTTGTTTATTATCCATCTGAATTCTT CTGTG AATTTTTGTTT CCC CTGGACTTTTCCAGAT 
ATAATTCATTGGGC TACACCAAT CAGACAGACTT TG ATTTTTGT C CCTTGTTTT TG AG AC AAAC CATAACATAAAGAAGT TTATTT 

CACTCCTCGGCAGGTGTTCCCTCATCCCACCTTTTGGAGGTGGGT^ 
ATGGGCGAGGGCCATAGATTCTGAGGGTTTGACAGAACT 

2 5 TAGGAGC CATGAG TTACAGTGGG AAACGTGCAT AGGACTCAAAG CTCCATCTCCCATTTATTAG CTGTGTGACTTTGGGCAAGTTG 

CTTGGCCTCTCTGAGCCTCAGT CT CCTTACCTT AAAAT CATTTAG TAAG ATTAACCAGTGGTTGTCAACCTTTTGTAGT CTGTT AT 

TTACCTTCCTCCCCTGTCCCCACTGGAGCCCCCATTTTTTAAAA^ 

TTGTCACAGGGGCTATTTCAGC^GGCTTTGTGAAATAOT 

GGTATCrGGGAACCCCAGTGGTTAAAAGTTAGTGAAGAGTAGAAGAGATAACCTTTCAGTTTTTCAATC^ 

3 0 GATATATTTTCCATATGGTGTGAGGCAGGCAGGGCAGGTGTCATATCCAGTCTCAGAAAACAGAGAT^ 

TGCTTGAGGTCCCACAACACGGTGACGTTGGGACAGGGATGGGTTCTGGCTGGTGCTGCT 
GATGTTCACATGGGCCCACGTGCCTGTGGACrrTCTAACCATCCACACTTGCC^ 

AGGGTGGCCCGTGCGTTCTCACCTGTCTCCCCTCCCCACCAGGTACGTGGTCATCrCCCGGGAGGAGAGG^ 
CGTTCCAGCACAGTGAGCGKLATCTACTTCCGGGCGTGCAGGGACATCCGGC^ 

3 5 TACATGAAGCGCCTGCACAGCATGTCCCAGGAAACCATTCACCGCAACCTGGCCAGAGG 

CCCACCTCTGAGCCCCAGGGGAGGCCTGGATAGCTTGTTTTGGAGCTTCCT 

GTGATTCGGGGGAACAGATGGTTGAGGTCTGAG CAG CAGCTCTGGGCCAAGCAGGGGC CACCCCAGCATGTTATCX5ACCCCTAGAA 
ATGGTCTCAGAAATCCTTACCACACACATTTCA^ 

TGTACCCTATTTCCCCAACTCCAGTTGATGAAAAGATGTTACAAACTGCCACTGCCTCAATAATGATCGTAA^ 

4 0 TACTAAATACTGAATATGTGAAATACTGACAACAGTCCAAGGACCCCAGTCTACCACCG 

CCCTGTGGTTAAAAGTTTGTGAAGAGTAGAAGAAATGACCTTTCATTTTTTC^ 
CATAGCCTGTGAGCAGATTTTCTGCATGTTCCCATC 
ATGGAGTCATGGACATGTGGCATTTAGGACAATGCCCACCTCCATCATC^ 
CCACATGCATTCTTTTACCCATCTTACAGATGTGCAAACCGAA 

4 5 AAGGACAGAACTAGGTTTCTAATCCATATTGGTCTGACITCAAGGT 

AACCAGTCCCCTCCTGGGTTCCTCCTGGTGTCTGTACTGCAGTCCTAATGAGATGA 

GCCCCTAGTCCTTTACCCTTTTATTCCATCCTCATCCTGACCAGTGTAGACTC 
TTTGTTTGTTTGTTTGTTTTTTTCCCCACTAAAACCACGCATCCAAACACCTAGAGTCTATCTGGTC 

GGGTGGGAGTGACTTATTCAAATTAAGCTATCTTTA^ 

CCTGACTTGGTAAAAGGTATCTGGAGATGATGGGCAGTAAAAGGAGACCC 

AGGGTTTGGCACATGTTGGTTTCACTTGGCAAGGTGTGCTTCCATGATGAGGCT 

T CATGTAAAAGTGCCTG ATACAT AGTAGGG ACTCAATAAGTCT TTGT TG AATGGAT AAAAG AAG AAATG AATG AAG ACAG CCAAGA 

5 5 AGCTTTCCAAAGAATCATACCACTAGAAGAAAAGAAAACCCAGA 

CCTTCTTGGATTCTTCTGGATTTGTGATGTCCTTTTCTTCCCCTCRGTGGTC 

TGAGCCCCAAGACCACAGGGGATTGCTCAGAGAAGGGTGTGAGCTTTCTGGAGTGGTTCAAAGGAGGCGTGT 
AGGAGATACACCGAGACATATGAAGATACAGGAGGGAAGAGGGTCTATTCCTGGTCAGGCTGAGCAGTAAGAAATAGGAAGGAA^ 
TGAAAGTCACAGTGGGGCAGTTTCACAGCTGTGGGTGATCCTGAAGCCCTTGGGAATGGTCATGGAGC 
60 GTGCCTCTATAGCCTGTAGCTTCAGCGGACTCTGGTTAGTGGTCT^ 
CTATGCCCAATCACTCTTGGTCTTATATAGAAGGAGGTTTCTTTC 

TATT AG TT CTTATTAAGT AGGG ATATTG GTTGTTG CCAG AAGT CTTC CTG AAG AGAGTGT AAACT ATTCCTTTGTG^ 
AAGAATAGATTG CATCT AAAG CT C GCATTCAAACAATTT ATGTTATTTGACCT AC I'TTG ATTCACT AATTTACTTTGAGGT TTTCT 
TTGTTTTTGTTTTCCATGTTGTATGGAGGCTTTGGAACCAATTTCATGA 

6 5 AAG CCTT AAGTC CACAAAGAT CATACT AAGAAATTTGAACAACGTTGTTTTCAAACAATG ACGAG GAAT CTCTGGTGATTTCT AGG 

CTTGATTTCCCGATGTCCTCAGTTGTTTGCTG CCTGATTGTCC CATGAGGAAAGX3AAAATGTGGKrrCTTGATTTTTAGAGATTTTC 
AATGGGC^GTGCTGCATCreATCTGTAAGACATACAGGTGGC^ 

GAT ATAGAAGTGCAG GCTGTGGG GAG CTG CCTGGGTT C CAG AAG C CTTGG AG AT AAAGTTGTTCCTT AGGTAT T CATGTG AGGACA 
GAAAAGGTGCATCCTGAATAGATAACTCCCTTTTGTGCTGTAACCT^ 

7 0 C CATACCAGCTATGGGAAACCAGACGTGGAACAT CTGGACTGCTTATTGG CAAACCCTTGGCCTTCAATCAGAAGTCTTTTGCAAA 

TGGAGCCATCAGAAGCCTAAGTACGTTTTAGTTCAAGTCATTGTTCAGGGGCA 

ACCTTTGAGATTTCTGAGTTCCATAGGCTACTTTCAGGACCCTCTAAGGGCTGAAGAGATTCCTCTGC 

CAG CAAGCAT CAGCACT TCTGTGG CAGTTT ATGAAACTATGTTGGT AAT^riTTAAAG AATCAGG CTAG CTGGGTG CCATGG CTCAT 
GCCTGTAATCTCAGCACTTTGGGAGGCCAAGGTGGGCAGATGGCT^ 
7 5 ACCCTATCTCCACAAAAAGTACAAAAATTAGGGTGTGATGGCT 
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TCACTTGAG CCTGGAAGGTGGAGGCTG CGTG AGC^CCACTGCACTT CAG GACAACAGlAJGTGGGT CTCTGTCTCAAAAAAT AA 
AAATTAAAAACAAGGAATCAGT CTAAAATAATTTATGGTTGAGGAG CTCAC CAAAGT CITTG AAACAAITGAAAGTAATTCAAAGT 
GAATTGAGGTAATTCACATGATAGAAAGAAATAGGCAAGGAAA 
GACTG^TTTGGGTCATTATGGCTACCTIX3TCCrrTAT 
5 TCTTCCCCTTTCTTCK5CCCTTAGTCGTCAGTGAGAACAA 
GCTGGGAGAATTAGAAAGGACAGTTGCCATAGGATGGA^ 
GTCAGGCCTATGGTGTTTCATGTGTACCTTTTACCAAGTC^ 
GTTGGGGGAGTTGATACTCACTTGGCTCTGAGGTATGTTCCCACCA 
GGAAAGAATGTTCAATTACTCTCCCTTATAGATTTCTCTTAACTTATAAC 
1 0 TCTCTTTCTAAAAGGTCITTCCACCGTTCAGGTCCT 

ACACTTGCCCCaAAAGCCCAGATCTAGAGGGATGTCTCAGGTGACCT 
TTGGGGGTTTTCCCCATTTCCTTACACCCTGGTTTTCCTTCCTAGGAGAG 

GATAACCCAGAAGACCTGAGGGGTCCCATTCATCTCTCTGTGCTGAGACAGGGCAAAAGTCCCT^ 
GGATGTACACCCCCAAGCTAAGAAGAAGAAAATTGACCTGATTTTCAAGGATGTTCTGG 
1 5 AAGCCCACCAGTTGGCCCTGAGCACCTCACTGGTCATCAGGAAAGTCCCCAA^ 
ACAATGACCCATGGTGfTGCAGAATATAGGCCAGACCCAGGGGGAGGGG 
CCAATTGGAGGATGAAGAAGAGGAGCCTTCATCATTCAAGGCCGACAGTCCT 
TTCCCACCACCTCTTTTTGCCCTAACTGTATTCGC 

GGGAAACTTCCTGAGCCCCCCGTATTGCCACCACAGGTACTGGAGCTCCCAGAGTTCT 
2 0 GAGATTATTGTCGGAGGGCAGAGTACGCAGTGGGCTGTGTGGAGGGTAGCCTAAAGCTCT 

AGGTGCTCCATGTGTTGCCCTTGTATCCTCCTTGTCAATAAAGGAAGTTCCGCTC 
TTGCCTTTCTCTGCTACTGGTGTCITACCCCAAAGCCCAATT^ 

2 5 TGGTCCCAGTTCCCTAGCCTGGAGTTCTGGAAGGATGGAGAGTGAG^ 

GTTGGCTACCTGGATTTCTAGAGTTGGTCGACAACTAGGCAGGTGTTCTAGTTCATA 

CACTTACTCTTTTAACCACATC CCTTAGCTCAGAATG AGGTGTTTCTTGTATTCAAAG CATGCGTCTGAACTAAACTGCATTTTGA 
T CCT GAAATCACTT GGGG CCATAT C CAAGATG C CTTTGTTCACATT AATG AAGG GCAAATGAATCCCAAGT CCTTG C CAT AT AACT 
TTGGAATGTGTGATGTGCTTTTCCTCCTCCATTAGATCTA 

3 0 TCAAACACCCACAGATG<3AGGGCAAG 

TAAAAGTCACATGGGCAAGCAGGAGTGCTCATAAACTAATTCTGGGCTGGGATTCAATC 
TCCCATGGTGTGGC^CCTACCTTCCAGGAGTGTTTCCTGTO 

TCCCATTTGTTTTTCAGCCTCAGAAAGCATX3GTCTCCGGCCCCGCCATCATGGAGGATGAT 
AATCTGTCTCCAATGATATGATGACAGCGACGGATGAGCCCTCCAAGATGTCATCGGCCAC 
3 5 CAGGAATGGCTGAAGAAGTTCTGGTTCCTGCGGTACTCCCCAACCCTCAATGA 
GGAGTCCTCACGCACCTCGGCCTTGATCATTGGCT 

ACAAGAAGTGCCTGCAACTGTACAAGCTCCGCATGCACCCGGAGAAGACAGAGGAGATGT^ 

ACCGCCTACCACCTGGCCTTGGAGGGCAGGCCCTACCTGGACTTCCGGCCCCTGGCGGAGCT 

GGTGGACCAGTACATGAATGAGGGAGACTGCCAGATCCTCAT^ 
40 TCCGCCAGTCACCTTGCCTCAGCGTCATCCTGGATGGGC^ 

ACCAGCAGTGATGGGCCCCCGGCCACAGAGTTCCTGTCC 

TGACCGGGCCTTCTCGGCCTTGGGCATCCGGTTGCAGGATGAA 

CCAGCCTCCGTGCCAGCATGTTC^TGACCATCCGCAAGACGCTGCCCTGGCrG 

CTGGAGATCCTGGATGCCATCAGCGGGAAGGAGCTCCCATGCCTG 
45 CCGCTACTCACCGCGCCTCATGTGCGAGCTGCGGTCCACGGOT 

C^GTGCGGTGGATCATCGGCGAGCAGAACGTCCTCAACGCTCTCATCAAGGACTACCT 

AGCAGCCAGACCCAGCGGGCAGACGCCTCGGCCATCGC^CZTGGCCCTGCTGCAGTT 

CTAC1TCCTGCTGGACGTGATTGCTGTGCTCTCGCGTCTGGCCTACATCT 

ACAAGATCGAGGAGGCCATCCAGGAGATCAGCCGGCTGGCTGACTCCCCGGGAGAATACCTGCAGG 
50 GAG AG CTTCAACGGGAT CG C C ATG AAGAACCT CAGGGTG GCTG AAG C CAAG TT C CAG TC CAT C AGGG AGAAGAT CTG CCAG AAG AC 

CCAGGTCATCCTGGCTCAGAGGTTCGACTCCCGCAGCCGGATCT 

GGAGCAGTGAGGAGCTGATGAGCTATGGCAAGGAGGATATGGTGCAAATATTTGATCACCTG 

GATGTCTGTAGGGAAGGGCTGGACCCCCX3GGGTAGTCTGTTGATGGAGTGGCGAGAACTCAAGGGCGATTAOT 

CTTCAAAGACCTGATCAGCCACATTTGCAAGTACAAACAGAGGTTTCCACTCT 

5 5 CCACTTCCACCGCTTGCTGCGAGAAAGGCCGCAATGCCCTCCAGCGAGTTC^ 

C!TTAGCGACCTGTTGACAATC!GCTGTAAACGGACCGCCAATCACCA^ 

GAAGTCTGGGAACAGTTACGCGCTGTCTGCAGAAGTCCTCAGTAGGATGTCTGCGCTG 

ACCACGGGACGGAGTTTTACCCCGACATTTAGGGAGCTGGCGCT 

CATAAGCTTTGATATATTATATAAATATATATTATATTATATTATATTATATATATATATATATATAAACT 

6 0 TAAAAACCAAGGTGACGCGTCCACCAGAAGCCACTGG^IAGATTTCAGAA^ 

TTGG CAGCTG CAACATACATGGCAACTCATTTTCACJTCACAGAAG CACGTGCTCGGGCCTCCTGTGTTCCCACCTTACTGTCCACC 

AACAGCATAAGCTAAAATGACAGGTCTCTGTCATCACCTT^ 

CTGGGTTTGGGTTTATGTTCTTGCCTTTTCI^TTTTCATTTGGTTTTATGA 

TGGTCCGGCTGAATCAGATCTCTGACITAAGTCAGGGTGGGTTGTCTGTCTGCATTTGGGAGGCAGG 
65 CCCCACCTGACTTCAGCTTGAGATCTTTTTTTATTCATTTCCTGATGAGGGTTCCTTCA 

AACTGTGACACTGCCACACCrCACCTCTGTTGCCTCGTCCATCCCTGGGTTGTGGATCCCTTCCT 
ACAATATTACCCATTATACTGAAGGCAACATTGC 

ATTCCTGTTTTTACGAGGTGGAGGATAAAACAATATAATT CCATTCCAAT CCAGGGCTTTTGGGGAGATGAAG AGCCAAGAAGTCC 
AGACCCCAACAGGGGAGTGATTTTTGGCTAAAAAACAAGGAAAATGAAAAGTA 

7 0 TCTTTATAATCATATCATGTGTTGAGGGTATTTTTTTTCCTTTAATAATCAAGAAATGCCT 

CAATGCAAATTGTGCCCTAAGTTATGCATAACTCAAATGAGAGTCTGTAGAGATGTGGTC 

TGCTACATGGTCATAGGAAACTGGGGAATGTGTCTCTGCCTGTAAACTCTTCCTTTTTTGAACAGGGTAGAGATGTCCTAAAGA 
TGGAGAAAAGAAGAGAGGACTCTCAAGGCATCAGCCTACACAGAC^ 
CACACACACACACAATCACAATATACAATATAAGCT 
7 5 GAGGAGTCTG ATCAATG CTCTTTCATTCATTTAACTACCt^T^ CTGTTAAA 
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AACTTCTGTTCATGTT CCT ACATCTGATT TATGCATATTTTATATG CAG AG ATCCTATCAC 
GGAGGAAGATCTGAATTATATACATGTGGTCAGTTCTGCTGAGAG 
AGGCAAGGCCCCAATGTCTTCAGCCTCCTTGGTTCAGACATT 
TGAGGCCTCATGCCACCGTTCAGAACC^TAGTGGAAATGTCATCAACACT 
5 CTTCATCCCCTACCAATCCCTTCCCCTTCACCTCCA^^ 
TTTCATTATATAAATATATTATATATATTATATATTA^ 
AACAAAAAACAAAAAAAAATGGGTTCCTCCAACTGTCCCACTGCCAG^ 
TGCCTGGTGACATCCACCCCACCTCCCAACCCAGTTGCACACATGCrTCCCT 
CTAGCAAGGAAAGGCTGAAAGCCTCCTTCra 
1 0 TCCATCTGCTCCTCCCATCAGGGTCAAAACCTAACTGTGGTC^ 

CCCAGCCTTGGGACTTCCTCTCCCCrCACACACGCAAACTGCTGTGTCT^ 

GTTGAGGG CAGAGATGGCT AJ3CAAGACAGGGCTTGGTG AGGG CAGAAACAGTAGGTTG AGATCCTTT CTTCTAGCCAACAGTTG CC 
TTACACCTTACATTGGGTAATGGGTAGGGAGGAGCAGGCTAAGGCT^ 
CCTGAGCACCTGOTCAGTTGGAGCTACTCTrTTTCCTCT 
1 5 AAAGCTTATGGGCCAGTG CCAGTG CCTACC CTGT AGTTCCTG AGAAAG CTGAGAGCAGGTGACCCACTT CTGGCCTAG CAGAATGA 

GCTGCTATGCACAGCATGCAGCn^CAGGGGTCACT 

AAGTTAGGTTAGGGCATCCCTAAAACTCTGGGTGCITGTGGCTTCTGCTGAAT^ 
AGGCCACACTGCCATGGGACAGX3GAATAATTTGGGTGATACACCACTC 
TGGGAGGCCCAGTGCTTCTGTCCTGTGAATTCTGCATGTGATTACCCATGATTTCTGTCA 
2 0 TTGAG AAGGACTTGGACTGATGGG ACACT CAGGGTCTAG C C C AGG GAG CAT ATG CTTGG CT AAC CT AAT CT C C CCTTG ATGTGT AT 
ACAGAACTGTGGAAAAGCAGTTGGTGGATCCCAAATGTTGTTACTTCAGACAAGACAG 
AGTGATGACTACAGGCTTAGAATGAAGTGTGTCTCTGGGGCTGAGACTTGGCATAACT 
CTCGTAGGGTTGGTGGGAAGTGAGATCAACCTTGAGGCCCAGG 

AGC CAGAGC C CTT ATGATTAATATGAT CTG CTTTTCCTT CATGGAGGACAAAGAAAAAATC CACTGCCATCT AGTATCTGTGAAAC 

2 5 ATGAGGACAGCGCAGTCAAGTCTGTAACTCTTGCCATGTCAGAATCCCCA^ 

AT CAGAAACAG CAG CCTTATT T AATTTAATTTTT CTAATG ACTGG CCTATGACACCTTGTGATG CT AG G CACATCCTCATTTCCCC 
ATCCTCACTTGGG ACTGAGAGCAGG CTCAAGTTCCAGGGTCCCTGG ATGGCAAGGTT CAGTGCTGGGCCCTGGAATCTATGGCACT 
TGGGGGTCTCTGACCTCAGCCTCTGCCACATGrrTCCAAG 
AACCCTTTAATTTAGGAAAAGTACCTTGTAATTACTTAAGGTAATGTTTAAA 

3 0 GGGGAAAGCATGCAGAAGGCTGGAAACAATAGCTGAGACCCTACTGTGGGCCCACAGCCCTG 

TGCCATGTTACTTGATCACCTGGAGCCTCATGGGACCCAGGAGGTGGCCTGAGGGGATGAA 

CCCTTTCATTTTCCnTCTACCCCCTCTAGGAATGGGAGTTCTAGACCT 

GTCGAATTCATGCHTTTTCTTGGCCCATAGGTC^ 

ATCTAGCCAGGCCAGGCTGAAACAGCTGAAACAGGCAGGCCAGTCCTCCT 

AGAGACTGGCACGGCCCITCrrCCTGCrTGGAGGGAAACC<^ 
GTGCTTGAGCTGAGTGGATGGTGCTGTGATATTTTTAAGGAGCCIT^ 

CCCTTTTGGGAGG AAAATGGTACGCCCCTCCACTTCCATCTGGCACTCC CTTC CCCCCCACCCCCTCAAGTGGTATCACCCTGGAA 
AT ACCT ATG CAATC CAGTCT C CCT AGGAG AG AGTGCATGG AAG CAGGGGT ATGTG CAGTGT AGAAT ACAGATG CT ACAG CAT AT AT 
40 GTTGTATATATGQACATATACAGTACGTATACACA 

ATTTATA(^TTTCTATGTTTTAAATAGATATAAAAATAGAGTC!TATAGAGCTGGGAGAGCAGTGG 

CAAAGGGGAAGGGAGCACGGCCTTCGGAGAGGGAGC CGGGGAAGGCCAGCAGGCAGGGTTGGCTGG CAAGGGGGCCTCCTACCCGG 
AGTGTTGGAGAGGAGAGTGGCTGGGTCCCGGCTCGCTCCATGCACTTTCTCT^ 

GATACCCCCTGAGCTCCAGCTGAGGTGCCCCCTACCTCTCCCCACCCCCACAGCCCACGCTTAGGCGGTC^ 

4 5 AGGACXTGGTCrCTGACTCCrGGTCGAGGGACCACTGCACAAAC^ 

GCAATAGGAAAG<3TCCAAACTGGATCAAGAGCTGGTCCAGGAAAG^ 
GGCCAAGCCCCTGAAGTGCAGCCGTCCTGGCCTCCCTCACITTG 
CXSCa^C^OSCTGCCC^CCATCCACAGGCTTT^ 
TGCACACTCATAAATTCAATGGCTACTCCAGTCCAGAAGCAGG^ 

5 0 ACPCAGGACAACGCTGTTCCCCCTCCCTCACAGAGGCCCCGTGGCCCCTA^ 

TACACTGTTCGCATTCTGTGACITGAGGCAGAGGCTGAGCTGGGATACCCCAA 

TGGGTTTATCACCAGCTC CTAGCCCGGGCCGGGCGGGG ATGTCTGGGGGC CACGCGGGGGCACTGGTGCAAG CTGGCAGCATG ACA 

GAGGGCTTGTGCAGCCCTGACGGGACCCAGAAGCCCATTTGCTCGCAGTT^ 

GAGGGCCTGACCAGGCACCAGATGATGGAGCAAGGGCAGCTGCATGCTC^ 

5 5 AGGGGCATTTGCTCCCATCCTXSAGCCTCCTCTGCCCCTGTCTTGCTCrTCCCT 

CCACCCCTCCACCTGCAGCCCAGGGCGGGTCTGTTCTGCCAATGCCC^ 

GACACCCTCCAGTACCTGGCTCAAGAGAGACCAGGCCGGGCCGAGCCTTCTTCCCACTGCAGTGGACTAGACCCAC^ 
TGGGCATCCCCAGGTAGCJAATCCCIACAATGCACTGTACCT 
CMAGGGGGACAGCGGTCACAATACTGCTCACCAAAAGA 
60 CTCCGGAAGAACCAGCAGTCTGATTCCGTCTATTTCAGCCCCCGTCTCTCTCT 

TC AAT CACAAAGGAAAAATAAGTGGGG ATGG GGGG AAAT AC CT AGGAGTCT ATTAT CACATACAT ATTAATATGTT AATACTTTCT 
TT AAAARAAACCT CITTGATGTTATT ATT TTG CAG ACTACG CTTT AT AGT ACCTGTGTG ACGGGAC CT AGAACACTGG ATACAAATA 
GAGCTATG TTGGTTT AT CATAAT ATGTACG CAGAAACTTT CTTTTTGTCAT ATTAT CCTTGTAATGTAAGAAGATTGTTAATAAAA 
GCATTTAAATTTACTCACCACTGTTGAATTGCTGCCTCCrTGCCT 

6 5 ACCACAAGGGCTCAGGTGTCAGAATAGGAGCTTGCCTGCTGCCAGGTTC 

CTCTCATTGTTCCCCCATTAGAGTCTCTTTCCAGCAAGTAACCT^ 

AAC CGTTT AAAAT AAATCT CT CTG CCCC CATATGCATTCATCC CACAAG AAT CT CCTGGAGGCAGCACACAAGG GCT GTT 

HUMAN SEQUENCE - mRNA 

7 0 GTCTTGAGGACCATCTCTCCCGGCAGCATACCGTGTGGCTTCACACTGCT 

TGGGAATAAGAGTAAGCCACCTCAATGGACTGTGGGAGGCTTAAGTAAATTGAAGTGCCATGCAAGTAGCTAGCATGC^ 
CTCAATGAAT ATT ATGATGGCCGCAGATACGATGG CT ACAG CTG GGGCAC CCATTTCGGGT CACAAGGTAGGGTTCAATGTTGAAG 
ATGGCAGAGCCAATTGCATCCCTCATGATCGTGGAGTGCCGGGCCT 

AGACAG AATGAC CG AGAACATGAAGGAGTG CTTGG CC CAG ACCAATGCAG CCGTGGGGG AT ATGGTGACGGTGGTGAAT CCGAGC C 
75 AGGAGTATGGCCAGCCCTGCrTCTAC^AGACCGGACTCCTCGGCCATGGAAGTTGAGCCCAAGA 
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ATCGTGCCCAAAAGCTTCCAfiCAAGTGGACTTCTGGTTTTGTGAGTCCTC 

CCCCCCXMTGTTTGTGTCTGACACACCGGTGC^ 

TCAAGGACACTAGTGGAGAGAGTGACGTGCGATCT 

AT CT CCACCCAGGACAAATCAG CTGGCTTCTTCTC CTGAT TGTGGACAAGAACAACCG CT ATAAGT CCATAG ATGG CTCAGA 
5 CGAGACCGAAGCCAACTGGATGAGGTACGTGGTCATCTCCCGGGAGGAGAGGGAGCAGAACCTC 

G CATCTACTTCCGGG CGTGCAGGGACATCCGGCCTGGGGAGTGGCTGCGGGT CTGGTACAG CGAGG ACTACATGAAGCGC CTGCAC 

AGCATGTCCCAGGAAACCATTCACCGCAACCTGGCCAGAGGAGAGA 

CCO^GAAGACCTGAGGGGTCXC^TTCATCTCT 

TACACCCCCAAGCTAAGAAGAAGAAAATTGACCTGATTTTCAAGGATGTTCT 
1 0 CACCAGTTGGCCCTGAGCACCTCAjCrGGTCATCA 

GACCCATGGTGTGCAGAATATAGGCCAGACCCAGGGGG 

TGGAGGATGAAGAAGAGGAGCCTTCATCATTCAAGGCCGACAGTCCTGCCGAGGCCT 

ACC^CCTCTTTTTGCCCTAACTGTATTaXZCTAAAGAAGAAGGTTCGGGAGCT 

ACTTCCTGAGCCCCCCGTATTGCCACCACAGGTACTGGAGCTCCCAGAGTTCT 
1 5 TATTGTCGGAGGGCAGAGTACGCAGTGGGCTGTGTGGAGGGTAGCCTAAAGCTCTCrrGTGGAAA^ 

AGTGTAAOTTGGAGGCGGCTACCTCCGTGGGTGGGAGCCCAGGTCCTCAGTGTCTCT 

CTCCATGTGTTGCCCTTGTATCCTCCTTGTCAATAAAGGAAGTTCCGCTGCAGAAGG 

TTTCTCTGGTACTGGTGTCTTACCCCAAAGCCCAATTTCTAAACCCAGTCTTT^ 

GGAGAGATCTCTTGGCTTCCCTAACTTAGTCCAGGAACACAGCCTTGTTCT 
2 0 CCAGTTCCCTAGCCTGGAGTTCTAGAAGGATGGAGAGTGAGGGGATCCAGGCCATTCA 

HUMAN SEQUENCE - CODING 
ATGTTGAAGATGGCAGAGCCMTTGCATCCCTGAT 

GAGAG AG AAAG ACAGAATGACCGAGAACATG AAGGAGTG CTTGGC CCAGAC CAATGCAG C CGTGGGGG AT ATGGTGAC GGTGGTGA 

2 5 ATCCGAGCCAGGACTATGGCCAGCCCTGCTCTAGGAGACCGGACTCCT 

AAACCATGGCCCCCCGGTGTTTGTGTCTGACACACCGGTGCCCGTGG^ 

TGGAGGTGGTO^AGGACaCTAGTGGAGAGAGTGACGTG^GATGTGTAAACGAGGTCATCCCCA^ 
GAGGGGGAGATCTCCACCCAGGACAAATCAGCTGGCTTCTTCTC 

3 0 TGGCTCAGACGAGACCGAAGCCAACTGGATGAGGTACGTGGTCATCTCCCGGGAGGAGAGGGAGCAG^^ 

ACAGTGAGCX3CATCTACTTCCG<3GCGTGCAGGGACATCCGGCCTGGGGAGTGGCT^ 
CGCCTGCACAGCATGTCCCAGGAAACCATTCACCGCAACCTGGCCAGAG^ 
TCTGGATAACCCAGAAGACCTGAGGGGTCCCATTCATCTCTCTGTGCTGA 
AGGGGGATGTACACCCCCAAGCTAAGAAGAAGAAAATTGACCTGATTTTCAAGGATGTTC 

3 5 GTGGAAGCCCACCAGTTGGCCCTGAGCACCTCACTGGTCATCAGGAAAGTCCCCAAA 

AACAAGAATGACCCATGGTGTGCAGAATATAGGCC^ 
CAGGCCAATTGGAGGATGAAGAAGAGGAGCCTTCATCATTCAAGGCC^ 

GAACT^ CCCACCACOTCTTTTTGCCCTAACTGTATTCGCCTAAAGAAGAAGGTT CGGG AGCTCCAGGCAGAATTAGACATGCTTAA 
GTCTTGGGAAACTITCCTGAGCCCCCCGTATTGCCACCA.CAGGTACTGGAGCTCCCAGA 

4 0 GGATGAGATTATTGTCGGAGGGCAGAGTACGCAGTGGGCTGTGTGGAGGGTAG 
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Table 111 

MOUSE NOMENCLATURE 
ICSGNM N/A 
Celera mCG17918 

5 

HUMAN NOMENCLATURE 
HGNC N/A 
Celera hCG23764 

10 MOUSE SEQUENCE - GENOMIC 

TTGAACTCCTGTCTCTOTGTACCCAGCT^ 

ACAAAATACCIGTGACCAAAAGGCAAGCTAGAGCT^ 

AAAGTCATTGAAGGAAGCCAGGACAGGCACTCIAAGCAGG^ 

CTCCTCrrrCTTTTCri'l"ri v rfC"lTTTTTTGGTGTTTTGA 
1 5 TAGACTAGACTGGCCTCZAAACrrCAGAGAGATCTGCCTGTCTCTGCCT 

GACTTCTCAGCCTTTTTTCTTAa«3AA 

ACCCATOUVrCACTAATTAAGAAAATGTTCTACA^ 

TCCTCTCAGATGACTGCAGCITGTGTCAAGTTGACAGAAAT 

GGTTTTGTGAAACAGGCCTGGTGGGGTTGCAGAAGACAGCTCTCCT 
2 0 CAGACCCTGTGACAGTGCTGGAGAGAGTGAATGTCCX3TTCAGGCTT 

CCTGGCTTCTTCTGGGACATTCCATCTTTCT^ 

CAGGCAT CCT ACTGGGACTGCAC CTGGGG AAGTG AGG GGAGG ACGGAAGGAAGT ACTCCCC CTTCTTTGAACT ATG CTTTTTGA 
TATTGTTGACTGCTATGTCCTACATGGCAAAGATC 

2 5 AACTACTGCCGGGTGTGCTGCCATGACAGGTGTGCTGCCATGTGGGTGTGCTGCCATGCAGGTGTGCT 

CATGTGGTTGTGCTGCCATGTGGGTGTGCTGCCAGGGATGGAGACGGACT 

GAGCT CAGCCAGGTGGAGGTGCGT C CATTG CACGTGAATGCG CACACATGACTAGTGAAAAG AGTACAGGCTCC^ 

CTCTTTCTTTGGGGTCTTAGCACAAGACAGCTTCAGACT 

ACACGAGTGTGGCCCCCTTTTCrGACAATGAGTTTG^ 

3 0 TGAGGACTCTTTTG CTTGAATC CAAAAATGAAATGAAGGGCATC CCG CACAAATAACTTTC CT T AATTGTTGTTTTTTTTTCCT AG 

CACTCAGATTCTTTCCCCCTGCTGAAAATCACT^ 

CCAGGGTGGTTGAGCCCTACTTTGTAAGCCAAGAGCCTGAAACATTTCTTTTCT 
3 5 GACAAACTAAAAGCATGATGTAGG(^TTTCTTTTTTGTTTGTT^ 

TCCTGGAACTCACTTTGTAGACCAGGCrGGCOTCGAACTC^GAAATCCGCCTGCCT 

GTG CCAC CACACCC CGTG ATGT AGGCATTT CT AT AATTATTTGTAACATAAC CACTTAAAAGCAAAAC CATTTTGTTGTTGTCATT 
CTCTCTCTCrCTCrCTCTCrCTCTCTCrCTCTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG 
GCCATGCATGCACATGCCACAGCCTGCTTGTG^ 
40 TCCCGGGCTCGCACGGCAAAGACTTTCACC^ 

GCCAGACTTGGAGCCTATACTTGTCTTTTGTTGTTTCTTGCCATCAACCAA 
TCTTTACCATAAGTGACAGAGGTTTATGGGGTGGGGAACAACTTTGTCTACATTTTTAAAA 

CACAGCCTCTCTATTCATTCAGCACAGGTCrTTTCCCCT 
45 ACTGATGACTGATGATGTACATCATAGAAGATCCAOSTGTC^ 
CAGCCCATCTCAGAATCCTGGATTTCTACACCT 

GGCCTGGGAATGCTCAAGTGTTTGAGTGGGCCCTCCTTACTGGCACCCTTCTTCTTGGGGTAGAATGA 
CATGGGCTTCAGCCTGACTGAGTAATAAGAGGATCTGTTTGTGCAATTGTCTCTGCAATGTC 
AGTCAACTTTGTTTGGAGGAAAGGGACAGCATTGGCTCCTGT 
5 0 GCAGGGACAGCTGCTCCAGCTTCTGTCTCAGTTTAGCTC^G^TGACATGCTG 
CTCCCAGTGCAGGATCTTCATTGTCATTGGGTTATTGGTCCAT^ 

CCCGTCTC7UATACGGTTCCCGAGGAGCIAGAACTGAAGCGTGTTTATTCAGGATCTTGCAGTTG 

AGGAC!ATAGACGGAAATAAAAATGGCATTAATTCTATGCCTTCT 

GCTCCTAGCTTGCTGGGTTAAAGGAAATATGTGTGAGCTCTAGGA^ 

5 5 TGAGTTGGGCCAGCAGGGTTCATCTAACTGTCCTCCTGGGGAAGTGTTCTGAGATCOT 

TCAAGTGGGACTGGGCTAGCTC!ACT^TGGGGATGAGCACTGAAC 
CTGTGTAGGCITGCA.GGTATCTGCAGAGATCCTGAGAAATGTGGGTCTCT 

CAGAGAAATGAGAGACACGTGGATATAGAAAAGCTAGGGTCAGGTGGGTCGTGTGCACCCTGGTGGA 
CAATG GGGAACCTCAGT CATTTATT ATG TATG GCAGGGTTAG CT AGT AT^ 

6 0 GAAGTCATAGTTGCTGTGTTCATGCAGACTGGCCAATGGCT 

CCTGACTCAGGGAAAAGGGTTGGCTACTACTTCCCCCATGACATTGC^ 

CT C CCTCC CT AGAGGTGTCCGTGGGAAG AAACAGAGAGAC CTTAGGGGGARG ACAGACTAGGCAG AGAAG AGAGGAG CCGGCACAGG 
ACGCTGCTGGAGTTCTCTCAGGAGGGCCIGTTGGAGGACTGATCTCACCCCGCT 
AGTTGCTAAATGTGGGCTGCTAGGTGGAGGTGGGAGAACGAAGAGAGO^CCTC 
65 ACTGTACCTGGTGGGATCCGAACAGGAGGATGGGTGGGTCACATTAA^ 

TGGTTGGGAGTACTTGCCGCTCTTGCAGAGGACGTGGGTTCAATTCCTGGCACTCACAAGGTGGCT 

G TT CCAGGG GAT C CAAG G CACTTT TCTGGCTTTCAGGAACG ACAGGT CCACAC ACATGT ATG CAGG CAAACACTTAG ATACAT AAA 
ATCATTTTTAAAACGTCT CACCAGGCCAAGTGTGGAGGCCIAAAGTCCTTAATCCCCATACTCAAGAGG CAGAAG CAGGAGGATCTC 
TGTGAACTTGAGGCCAGCCTGATCTACCTAGAGCGTTCAGGCC^ 
70 TCTAGAACTAGGTGTGTGGAGATGGCCAOSGGAGGAATGGAAGCCGAGCCCCAGTTGCrTTGA 

ATTAGACTTATTCTTCTTGTCTGTGTTGGGGGTGGCGTTCAGCCTTGCTTAGCTTTAGGTGGCTTTGCCCATCTCA 
CTCTATTCCAGAAGAAAGCTGCCTGTCTCAGCAGGCATGAGCT 
GTATCTCTGACAACAGGGAGTGAAGCTAAAAATGGCGAGTCTTGTCT^ 
CTTGAAGGCAACGGTGCGCAGAAGTTTTGCTTTTA 
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ACCTTGATAGCTCTCTGACATCCTTTC 

GGTTATGCCTCTACACATGAATTTCTCTGCCCATTGTAAAGTC^ 

CCAGGAAG CT CTCCT AGCATCTCAGGCTG CTTGAGGAC^ CTTAGCAG C CA 

CC^GGCTCTCGTTTTTAAAATGTGAATCTTTCT 
5 CCTGCCrTCCTTGGAGGTAGTACTTCTCAGTC 

AATC^CTTCCAGGTGTGTCTAATCTTCCCTTTCCACACAGTCAGTATTGG^ 
GGOA.CTGCTAAGGGGGCCATGTTTGCTTTTCTTGCrrCAATGGTCAGGCT 
AATCTGATCTACTTCCCAAAAACTTAAACT 
TATATAAAATGCAGTCACATGGTAAACTCAAGCTTGGCTG^ 
1 0 CAG CGCTG CATAGCCTG CAAGG AGAG CAGTTCTTAAGGCATTT ATGGACTATG^ 

TGACACGGACAGGCTCCrrcTTCCTAGTCTGTCAGAGGCT 
CCCCTTCAGGACTTGGTCCTGTTGAGGGTCCTATATTTAGCCAGTCATC^ 

CGCTCCTCAC CAAGAGGGACAGATGCCCAGAGCT ATTCACAGTCACATGGGTTTCAGG ATCT CAGAAT 

TCTTCCACATCAACACCCAGACCACrCCATTGTTTCTGTGTC 
1 5 TGGAGAGAGGGCTCAGAAGTTAAGAGCACTGTCTGCTCTTCC^ 

AACATCT ATAATG AG AT CTGGTG CCCTCTT CTGGCCTTAAGG CAT ATGTGCAGG CAGAATG CTGTAAACTTAATAAATCTT AAAAC 

AAAACAAAACACCAAAGGCCATTCCCTCAGACCAGTCCTTAACACCC^ 

ATTTAITTAGAAAAAAATTAACATGAGCRAATGAGATACCTCA^ 

TTTGATCTTTAAATACAAAATAACCACATGAACA.CCT 
2 0 TCACATCAT CTAACTGCAT ACAG C TTTGCAAGACTAGAAATCACAATT AGTTTT TTTTTTCTGACC^ CAAAAG TATGAAATGAT 

TGCAGTGTACATACGATGTACAAAGACAAGGGCGGGTTCTGTGGACGTCACTTCAG^ 

TGTGAAT CAC CTGTGAT CATG AT AT CAAAAACTT AT ACAAAGT AT ATGAATTTGGTTACTU^TTTT CTTT CTGAAAT C C CCGTTT CTC 

TTCATTGTTTCCATAGCACCCTAAAAATACACAGGTGGCAGGGCCAGGACAC^ 

ACAAAAGGGAACAAAAACGCCTCTGCACACAGGGTCAGTATATTACAGG 

2 5 CTAGTATTCCACGTAGAATOAAGGAGTTCCAAGCCrTTTGTTC 

AATGAATGT ATCTTGTTGACATGTCTATTTC TCATT T ACATATGTACACACGGCC CTTGAGTCG CTGCTG CTCTCTGC CTCGT CTG 
GATTGGTCAGGCCCGAGGGCCCATGGGGAGCAGACCTGTAGCTCTCTGGGATTTAGGGC^ 

C TTTT AAAAAATAAAATGGCT ACAG G ATACGTGAG ACATGAATAAAGCTT TCAAACCAAGAAGATGAGGGTG ATCX5CCTGTG CGGGG 
GGCGGGGCOTTTCCCATCTCCGCATGCTCTCrrCCCAGCCCAGCCG 

3 0 CTCGTTGTGGTAAAGTGGCrTGCTGAGACCTCATTTCGGAGGTO 

CCAGAGCCTAGGCCTGCCTCTGTTTATGGATGTCACCTTTTACCTGCGTCT 
TTGTAGCTCTCTCAAACCCTGGATCGGCTCCAACATTTCT 
ACTTGCTGGTGGGCACTTGCCTGGGAGGGAGCCITO 
GAGGCTGGGCTACTGGGAAGTGTCTAGGCCTTTT<KlACrGGA 

3 5 ACCCCAGGCTGTTCTGGGGCACAGTATCACGGCAAGAAG CAGGAACTTG TGAGAAAGG C ATTTGTCAGTCTGAAG CTG AACAATCC 

GTACAGTCCtSACAGAGCACGCCCCTGTGCCGGGCCACCGACACTGGGTAGGGATA^ 
AGCTGACTGAGAGTGGTTCTCAACAGTGGGTTGCCTGGTCrrcrrCAACTCGAGCACA 
CAAATAGCCCnXSGTTGGTTTTTGTTTGTTTTTGTTGTTTGTTTTTGTTTT^ 

AT ATTTTAAAAACAC CAG CAAAGGCT AT AACCAGGTCATGAAAATGTTGAAT AT TAT CAAAGACAATACT AAAAT ATT CACAAAG A 

4 0 TATTTACAAAAG CGATT CTT AAAAT AATTGAT TTG CATACTGAACACACTGCTTGAAAAAACCTTAA 

GAGGGGTGGGGAGGTGGCAGCTTTCCTTGTGCCTTTTTTTT^ 

CAATCTATATTAAGAGTGCCAGGGAGCACAGAGGGAGGCTAATGACaSATGACAAATGAGGACGCT 

AGAAGGC CTTGGGTGGCAGCATCGGAGAAGGGAGCAAAGAATTTACAGATGGGACTCAT^ CATCACCACA 
GGCATCACCGTAGGCCATGAGG CT CAG GG GG ACACAGAC CAG CCTGGCACAAAGCAGG CG AGTGACAAG CCAGCAG GGG ACT CT AA 

4 5 GCTGGATGGAGTGAAAAAGCAGAG^AGGGCCAGAGGCTGCACTTGCTGG 

GAAATACAGGGAGTGACAGGCCATAAATGCCACAAAATGCCTGCCTGCITGAGATGOT 
AGACAGGCAGCTTTGAGCAGAGAACCTGCTCGACa^ACTGAGAGACTTGGAGATCCAT 

GT CTCAX1AGAAGGAAGT ATGGGT AGGAGGAAG^AAAG AGAACATGG GGGGAAATG GCTTGCTTG CTGTC CAGATACAATGTTCCTT 

5 0 TCCTTCAGAGCTCCTTCTITCAGAGCTCCCCGCTGCGGGTCATTTAAAAG 

CCCCGCAACGCATGGCCCTGGTCTGAGTTAAGGAGCCCTCCT 
CAGTCCCTACAAGGTGTGGCACTCTCTCTCAGACACGGCA^ 
TAGAAGTCAGAGGCCAAAGCCACGACCCCCGGCCTTGAGGGATGGTCTCCT 
55 GTGTGTGTGCGTGTCCTTCCTCACTGCTGCAAACTGTCACC^CTCCTCCAAGGCTTGTCCA 
CAGGCTTCITTCGGGTTCCAGTCATCTGAAGGAGGCTCTGGGC 
ACACTGGAAGGTCGGTTGTCTGGAGTCAGCACACATGGAGGGACCTACAGTCGCCA 
CTCAGCTTGGGTTTGTCTTCCAGGGGCCGTCTTCAGGCTTGCTCTAGGC^ 
CCCACIATGGC!TTTCCCTGCCTAGTGTGCTCTGCTCAGCTCT 

6 0 AGGAATAGGATGTGGGACAGATGGGGTAGGAGGCAGAGTGGGGAGGGATGGCCC 

AGAGACAGGAGGTCATCGTTGCTATTGCTTGGAAGGTCGGGGGGGTCCAGGTAGGAAAGAA^CTCATCCGGGTTTCTC 
CAGTAGCTAGAAAAGAAGAAGCCTCGTTGAGGTGTCTGAGGCTGCTCCAGGAGGAGTGAGGGCTCTCTOT 
TCCGCTTGCCTCCTGG&GTTCTACCTCAAAGACAGAAAACTTAA 
CATCTCAGCTGGGGGCAGGCATACCCACACTCAACAAATGCC^ 

6 5 GCCCCATGGCAGATCAAGGGCTGGATTCCTTTTGATTGCTCCTCTGTC^ 

CCTGCTGGTATAAGCCCTCGGTTTTCATCCATGACCACAAGGCAGGCCGTGG^ 
CTCTCCTGTAATCTTAAGTGAACTAGTGGCTGGCCATA 

TGAAGTATG TATG TCACACAGGGG CAGAG GAG C TCCTG AATC TAG AAG CATCCC T CAGTCT GG CTAAGG AAGGAG AGTTGG GTG AG 
AAACCAAGACATCTGGAAACTTTAGACAGCACTGCAGTTCTC^ 

7 0 AGGGCCAGGAGTTCAGTACCTAGGGATGGGGGCTGGCTACAGCTTCCAGCCCCTC 




GGAAGCAGGGAATGGAJ3ACACACGGACAGGGAAACCGCAAGCAGACAAG 

CTCTTCCCCTGCCCAGGCCAGAAGGTAGATACAGAAATAGCAAGCTTGACATGGGGGTGTGAGGGAGGGG^ 
7 5 GGACCCTCITGCACTATCTCCAAGGCTCTGAOTGGAGGGATTGGTCAAGG^ 
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CTATCGGG CCAAAGGG CTGATGGGTAGGGGTGGGGCTTG CATGACACACC CCCATATGGGG CTT CCTAGTGCTAGAGT CAGGGGAG 
ACTGGCTGGGGCCCATCCACC^CCCACCCACCriTTCCACCCGGGGCCTT^ 
GGTGCCGAGCCTGGCCCCAGACCAAGGTGTGTGGTG^ 
CrCC^GCCTGACCCTCTAAGGCTGAGGAGGGGTTAAAGGTCAG^ 
5 CGGGGAGG CTGGGAAGGAGG AGGAG C CCCACTGTGATGTAATGGAGGC CCTG CCTG GCTG CTGGGGTGTGGTACGTGCAAACCTTG 

T TGT ATGGAGGGATGGGGCTGGTCAGAACTGCCAG CGTGGGG CATC TGTGGAGGAGGAGAG AGCAGGAGAAAAGAGAGGTGAGGTG 
ACGGAGGCACAACGACGGGATGCCTGGAGAAAATGAAAACCGAA 

ACAG AGAACAATGATC CEX^CGTTGAGGGACGGGGTAGAAAAAAAACAAACA^ CAGAAAC CAACAAC CAGGAAGG C 

TGTGAAGGGACAAGTGGAGAAGGCCTTCACAAAGCCTAGGAAGGGCCT 
10 . TGT CACAACAAACAAC AGCGAG GCCAGAATGGACAAGC^ 

GGCCCTCCCCCGAGGACCAGGGCAGTGCAGAGCAGGCGTGGACAC^GCACGGGC^ 
AGGAGACAGGTGCTCAGCCCGCTTCCCTCTGGGATGAAACT 

AGGGGCATTTCAGGGTAC TG CCTTCT CCTTGATTAGAAATGGG CTT CATGGTG GGGGTGGCAAGGAACATGGAGGGACAACTCTAA 

AGGTTAGCTAGGGAATTCCATCCTTTTAACTTACrCCATGCT 
1 5 CTGGTGAATGCTGTTTGCTGGTCTCAGCAGACCCTTCAA 

CCTGCTCrTATACATTTGCACAGGGCCACCAGCACCCT 

GTCTCGGATGATTTCCATCTCTGCCCCATCTCCATTTCT 

TCTCCCAGACCCTAGTTCCTCTTTCCATTTCTGCrAAAAGCTGTTCCCC^ 

TGTGCCCITTCTTTGCAGGGTGCAGGGCTCCAGTGTGACACC^ 
2 0 AAGTGG CTGG AAGACAAGGC CACATAGTGGT AAGAGG C CAGCTGAGGGT AC CCACTTAG CT CACAACAGCAGATGTCC CATGACAC 

CCACCCAGAGGAAACAGGGAGGGACTCCACACTCTCATGACCAGCTTAGCTCAGGACAGACATCTCGGATCCTATGATATA 

TCCCCTAGACCTCAACAAGAGTGTCTC^GAGCCAGAAGTACrGCTGTTG 

CTGCTTCTAGATAGAACAAGGTAGATACACCAAGAATAGACCT 

2 5 TGCTTTCAGTGAG ATGAAAG CTGAG AAAG TAG G CACG C CTTTGTGGAATT CTGGG GGGG AG ATATG AGCTTGATT CAGGT AG AAGA 

TG AACCAGGAAG CAAAT AGCTTGG CTGGTAAGAG AG CAAATGGCTTT CT CTATGTGG CATCTTTCAGG AT TTGTGAGT ATGAGG AC 
TTGGGGTATACAGTGTGTGAGGGGTGATGTCGGTGACATC^GCTGTGTTAGGCCG^GGTCCTGGTGAGCTTAGAGGAGGAGAGCAG 
TGGCTCCAAGTTGAGAACTTCAAAGTCAGAGTCTATATGTACTTTTCTTTCT 

AGCATCGCAAGGCTGACCTTAAACTTGCTATGTAGCCAGGGAATGACTCTGACTTTGGACACTCCTTC 

3 0 AGATCACACGTGTGTACCCTGTCCTGTTTGCTTAGGCAGCTCTGCAGAAGCCCCCTACCCAAGGGAGCT 

ACTCATTCTTGGTGGGAAATCGGGGATGTAAGAGAAGATGAGAAGCCTCCAACCCCATTACTAGGCACTGAG 
TTTCTGGCTGGTCTTCTTATCTGTGCCCC!TTCCCTCCAAGGTGAGAGAC^ 
CTTCCZATCTCGGOTCGGTGTGCTTTGTGAGCACAAGGCCCGATGTGCCC!ATTCATCCCGACCC^ 
AGAGCATAACAGTTGAGCATTCAAACACAGAAGGGCTCAAGAGG^ 

GACAAAO^CCCTCAGGTAGGCCAGATATCCTGGGACCGCCT 

CCCAGAGACTAAGTGC CTGGGAGTGGG AG CTAATTTC AGGAGGC CAG AAACACAGG CATCAGTG G CAGAGCAT AGTTG G AGCTCAA 

GGTGGGCACATGTTGATGCAGGGGGCAGCCTGGAGCTTTC 

AGGTATAGAGCAACCCTGTCAGCCTCTCCCCATCTGAGCCCCTG^ 

4 0 AGGACTCACACGACAAAGAACCCGAGGGAAAGCCCTCTGCCCCAGTCAGCCA 

GGTTGAGGTACAAGCTGAAGGTACTCACAGTTTCCTGCATGGGGTGACTGAG 
TCCGGTGGGTGCGAGAGCTGGGGGGGACCGTGCATGAAGTCGTT<ZATGGATGTCCCTCC^ 
GCCATGACCCTGGTAGTTGTTTCCTGGGATGGGAATAAGATGGGGATGGGAAGAGGTCAAC 
TGATAGTATAAATGCCCAAAGGCTTTCCGTCCCTTGGGACAGGCAAGGCTCCTGAAGCATTTGCTTTGGGCTAC^ 
45 TCATGCCOTTATCTCATGGACTGGGGCACTCCAGGTGTGTGCT 

CTGTTGCAGAAGGGCAGAAAGGGGTCTAGCCACCAGGCTCTAGATGATAACCCAGCCTCTCCATCTCACACCT 
AGACAGGGAGACTCAG AGTGCTG GGATGCCGGGG CC CTTGG CAAAGGCTGTG GG ATTCCAG C CAG TAAACTAATGACT CTGGT CCA 
CTTAATCAGTCACCTGGGCACCTTGGAGACCTAAGGCCATGGATCTGA 

ATTTTCTAGGAACAGAGTGGAAAGCAGCAGGGATGATGCCTGAAGTCTGGGATAGCGGAGAGTGAGACCAAGG 
50 GTTAGCCAAGGCTGGGCCGCCAGTGCCTAGGTGTTCTCTGTTAGAGGGTGGGAGAGCCTAAAGGG 
GCAGGCTGTGCAACCTCTGAGCTTCCATCATCATTCTCCCAAGTCCAGTGA 
TTCrCTTTTATTTTTAAGATTGACTTTTTCTGTGTATGAGGGGAAT 

TGCTTCAGGGGGAGGTTAGAAGAGAGCACTGGATCACCTTGAACTGGAGCTATGGTTGGTTGTGAGTC^ 
ATTGAACCTACG CCCTT CTGCAAGAGCAACAAGTGCTCTTAATGAGCJG AACCATTTCTCCT 
55 ATCTGTACCCX3TTATCATCAGCACAGAGATCCCA^ 

GGCCAGCATCACX3TTACACACCTTGGCTGCTGTAGTCGTTGGAGCTGGTC 

CCAGGGCCCAGAGCGGCGATCATCTCCATGACATTGGGCATGATC^ 

GCCATCGGGGTCATCCTTGATGTGGAGGTCTGACTTGATGGGTACTGGCCGCC^ 

ACTCGGAGCTGATGGGGGAGAGCGAGGGGGAGGAGCTGAGGTTTAGAATGTCTAGCCAATGTCTTCACTCCT 
60 CTGTGCATGGGACCATCTTGTTTGATGAGGTGAGAATATTGAGGCT^ 

GACACCTTCCCCGCCACCCTAGAGGAGTTCCAGCACGGCTGCGTTCTGTCTCATCTAAGAC^ 

CCCAAGGTCAAGCTCCAGATGGGTGGCTGCCCAGCAAAGTACTCAGGACTGTCTATGTCTCCCTCCAGCCA 

GTCGAGGCCACACTTACTGTTGGATGGCGTTCAGGATCCC 

CTGGAAGAGAAGAACAGAGGCTTGGTCTTTGCG CCTX3G AGGCAGGACACAGCCCG CC CACCTCCACTGTCCTGCCACTGTGGATAC 

6 5 AATGGAGGCACAGAGTGCAATATAGCAGTGAAATCCAAGGCCCATGCCTCTGTGGGGCTGGTTTTG 

GCTAAGAGGCCCAAGAAGAGGCCAGAGGGCACCTCTGCTCTGCAAGGTGGGTATTG 

CT CGCTCACAATTCAGTTGG AGGT ATG ACTCCAGGTCAAAGCACTG CAG AAGT G AGAGACAGAGAGAT AGGTAAGATCAG GACTGT 
CAGAAAATGCCTTG CTGTGGCTCTGGACTCTTGGGAAGACTT CAGTGCTGGGGAAGAACTCrTCTCTTACTCCATAGAACCACAG A 
ACAGCCATGCTGTGTCTGCCIACGGCCCTCACAGATCTCTT^ 

7 0 CTCACTGCTTTTGCCTCTTACTGAGGCCGGGACGAGTGTC 

TGTTGATCAGCCTGTATCAGACAAAGCAATCACATCATGTCCGTACCTACAAACTCAGAGGGAGGCAGTC 
TTTCAGGATGAACrrAACTGTGCTTCACAGTGTGGCGATGATGGTGTAATTGGC 
GTCTTGGCCAACCCCTACACAACCGGTCT/TTGCra 
TGGCCTACGATACCIGATTGTCTATTTCTGCATTACTAC^ 
75 AGAGAGCACGAGATACTGGCTGCCGATGGCCGCAATGTCTCTCAGGCAGCA^ 
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10 



GTGATGGGGCACTTCAGAGACACCTTG 

CACACTGCTGAAATTCCGCTTGACTGTGAGTGA^ 

CTCCTAAACACGGCTTGGATTGCCCACCCCTGC^ 

ATCCCTTTTCTGAAGAACAAACCCTACCCCTGGGAGCAAGAACr^ 

TTTAACTGGGTCAGCAAAAAGCCCCCTTAGGCGCITT^ 

TCAGGGGCTGGGGGGGGGGGGGGGGAGGGGAAATGTCCTAACCCC^ 

CAAGAAT AG CCACCAATCTCAGAAGTCCCAGGT CCGCTG CTGGGC CTCACAC CCACAAGCTAGGTGT CTGGAATTCATG^^TAT AA 
TCCTGTAGCCTTCCTCCTGGTACTGGGTACATCCCAGAGTCAGCTAATGAGTTCTCT 
TACCCTGGGGGTCCTAGTGCCCCAGCAGGCATGGGCATCC 
TGCTG<^GCAGCTGCAGAGCAGCTGGCCTCCCTGAATTGTGATTC^ 

CCTCAGCAGATGGCCACTCCCCCACCCGTGGACCCCTAAAACCT^ 



15 GAGGAGAAGGTGTTGTTCTGTCCATTAAACTGTTCrGGCTC 

CTGACTCCGCTGGAGACACGAGGGTTCTGGGGAAGAAGGCACCAAGACGGGGGTCTCT 

GAGCTTTGCTCCTGTTAACAATTTAGTAGACATTTTCCACAGCAG 

CAAACACGCTGAAGGGTAAACTCAAGACGGGTTTTCCCTTCTGAGAATGC^ 

TCTTCATC AAAG CAG CATTGTAAAATCGAAAAATGAGGAGTGAATATGGGAAG GATG AG CT AGGG TGAG ATCAT CAC CT CCATG GG 
2 0 CTGCTGGGCAG CCATTGAGACAGGGACAGAGACATTCTATAAG CAACAAGCAT ACAA 

CAGTGGTGGTGAGCTTTGTCTTTCAGAATTGTGCCT^ 

TCTGAGCCCAGACACAGTTGAGCACTGAJCAGAAAGTCAGGACTGCACCCCCC^ 

GGGTACCCTGAAGATGGAACTCACCTTGTAATACTGCCCCATGTTGACTGTGGGGGGTGGGTACTGTCCGG 



2 5 CTGTTTGGTCCATATTGCTGGTTTCCATAGTTGGGCTC 

GATTCCAGGTCTGGGGCCCAGCrTTTrTTTTT^ 

TGTG CATG C CT AGTGCCTGTGAAGGTCAGAGGAATGT AC CAGAAT C CTGGAATCAAGAGTTATGGATGATTATGAG CCATCATAT G 

GATCCTGGGAACrrGATCCTGGCTCCTCTGCAAGAGCAATAACCACCGAGCCAC 

ATGGTCTCATGCACACAAGACGCCCTCTCTACCTGT 

3 0 CATCCAGGGACACTACATGCTGCAGGTCCAGGCCATATGCCATCACTCCATAAAACTGTTTT^ 

GCTTCTCAGCTGTGTCTTTGAGGGGTTCAATCTTAAAGTACTCCTTTGTTTCTCCCACTGT^ 

GG AGATG CAGGTGACAGAATTTGTTACCAAG AGCTGACAAGCCATG CAGTG AGG C TGAT CTGGAT AG CACTGTC CCTACAT CC TAT 

ACAGAGCAAACATCTATCTAGTGCTCCATTCTTAGCAGCGTCAACTATA 

ACGAGAGCCCCTCCTAGAGAAGTCACTGAGGAAGGAGACAGTGCCACCT 

3 5 GGGCTTCCTGTCACCTAGGTACCCAGAACATGGTCTCCCAGAGTCAATGGTTCT 

. GAGTCTCATCCTCTAGG CTTAAATCTAGTTGGTGAGAGTGCGC CAAGTG C CTG AG AGTG CT CAT CAGAGTGGACTGTGCTCAGGAA 
CTTTCAATGACTGTACGGTGGGGGGACATCTCCTACT 

TGTTCACCAGGCTACAGGAGTGTGGTAGGGGACCAGACAACATGATTGAAGCCTGAATTAAGG^ 

TTTGTTG ACACAAG ATTT CT CTGTGTAGCT CTGGTT AT CTCAG AACTT G CTCGGT AG AC CAGGCT AG C CTAG AACT CATAG AT CTG 

4 0 TTTGCCTCTGCCT CC CAAGT ACTG AG AT TAAAGG CAT AT AC CACCAAT G C CTGGCT AG G TTGTAC CACTT CT AACAT AGGATG AAC 

ATACTGCC!ATGAGGTAGAAGCCAGAGCTCCCAGCATCTTATAACCTGGACCAG<ZTGGGA 

TTAGTAAAAGCAGAGGG CTTGGCTTCAATGTAGGGCTTT CAATATGCAATTCTTCTGCGGCCTCATCTCAGGTGAGCAGCCTG AAG 
CTGAGACATGCATGCCCAGTGCAGCCAACTATCTCTCAGG^ 

CCATCAGCT TAAGGCC CCAAACCAAGGACIAG GCAGG GCTGGG AGGAGC CT^ CTTTG AGTTT CT CT CC CTAGTGCCCACTAG 

4 5 GTCGAGGCTCAGGTCTTGAGTTGGCCTCCTGGGTGTACAGGAAGCCTAATCACTATAGCCTAT^ 

CCTATACCAATTATGACATCATCCAACCTCTGTGAGAGTGGCGGCTGAAGGGACCTGT 
ATAGAAATCTGCTTCAGCTCAAGTTGAAGGCCACAGTCTTG 

CATGCCCAGGAAGGGGCATCATGGTGTGCCAAAGACAAACTTCA.CTGGAGCGAGGGGGG 
TCAGTTGTATTCTGAGTGTCACTGAGACCTGGTGACTTCCCTTTTCCCCT 

5 0 GTTTCCZTCTGG CACCTCTGCTAGGAGACCCCAGAGAAGG CACCATCCCCAGGaGAGCAATATGATGGAAGTAAGCCCTAGGGATAC 

TGTGTGCCATCCGATGGCCTCCCTGACACACACAGCCTGTAC^ 

CTGAGGTCCTCATCCAGGGTCATCCATTAGTATCTGAGTATGGACAGGGTTAAGTGGTC 

TGCTCTAGGCAGGGTATTTTACAATGTCACGGCCCCCTGTCCTCT^ 

TTAAAGTCATACACACTCAGGTGTAGTGACTTGTTTGGTGTTTTCCA 

5 5 CCCTTTAGTTCAAGTTCTCTAAGTGGCCTAACTGTGCCTTTTCCAAGGTCCCTGGGTCCAT 

CTCCCATCCCCCATCATAGTCACACITACCTCTCCCGGGTATGGCCT 
TGGGTAGGTCTGXTTGGGGCATCCTTTGCCCGTGTGTGCCAAAGGGGCTGATGCC^ 

GG CCGCTCATG CCCGAGGGTGTCATGCCAGCTGC CATG CCGGCAGGGTTCAGGCTGCCCCCCATGGAGGCAGGCCCCCGAGGCCCG 
GGTTGGTTCATGAATTGGCTGTTATACGCCTGGGTGGGACCCATCTGGAAA^ 

6 0 AAAAGAATAAAATGCCACATGCATTGAAAGTGCAGGGAAGGAAGGTAGGGGGCTAAAGGGAGAGCAGCACT 

AGTTAAGAGCGAGGAGCTCTTAAAGGACCCAGTCCTCCAGGGCTCAOTACTGC 
GGGTTTGGGTAGGGCAGGGAGCAGTCACTCCAAGGCACAGGCAGCATTCCCAGGGAC^ 
ATGGGAGGGACACAGGGAGGACACAGGAAGAGGAGGCGGATGAGGATGGAGGCTGAAGCCOT 
TTGGTCAGTAGCCAATGGAGACACACCCTCCATACACrAAGTGTGATTTACCC^ 

6 5 CACCCACAGR.CAACACTCTTTTAAGGGGCTTCCAA 

GCCAGCGGGCAGGAACCCTTACCGGTCCATACTGGTTTATATCCTTG^ 

GTGG CTGTGGCTGTGGCAG CG GCGG CTG CT ACTG CAG CTGCTG CAGCAGCGG CCGCTGG CTGGGTGAAGT CAGCAGGAGG C C GAGT 

ATGTGGW3GGATGCCCATGCCTGCTGGGGCACTAGGACCCCCAGGGTAACTGTGGAGGAGAAGCAGGGAG 

ATTAGCTGACTCTGGGATCTACCCAGTACCMGAGGAAGGCT^ 

7 0 GAGGCCCAGTAGCGGACCTGGGACTTCTGAGGTTGGTGGCTATTCTTTGCTACTA 

GCTGGGGCTAGGACATTTCCATAGCTTCTCTGCTTTTCCTTGGTCA 
TATGGTGCTGTGTGCTGTAGATGTGGATTGAGGTGTGTGAGACTCCTGTC 



75 



GA1TTLTTC1TCATTCAGGTCAGTGAGCTTC 

TCACCCCACTGTCTTAGCATAAAGGTAC^CGGCTGAGAAGGAACTGAGGAAGTATCCCCTCTA 
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TGCTAAGACACATGG CAGAACTGTCCTCATATCCTAGACCAGGCCATGTGGAACTGG CATTCAAGTAAGAG TCCTTCCTGGATGT A 
GGTAGGAAGGCAGGGCTAAC^GAATGCTGTAGAAGGGCACC^TCT 
GTAGCCAGGTCGGCCGTACATGTTGGGCTGGATAT^ 
ACTGTGGGGAGTIX1ATGCCGGGGTTGCTGGTGCTCATGCCTGATC 
5 GCCATAGGGTTCCGGAGGACCTGCAAACAGAGAGAAAGAGAGAGAGAGAGAGGGGGAGAG 
TAAGGTCX3GTACCTGAGGOVGCGTGGAGAAAGAGAAGACAAAGACTGAAACATG 

GGCTTGAAGAGTC C CCAGAAC^CGTAAGGGATCAAAGGGAGTGGGCTGAGAATGTCAG C CAGGAAGGAGTGAGGC CAAGGCT CACA 
AAGCCAGCCCAAGTGGTCCCCATGGCrGCAGAGAT^ 
CTCTACAATAGGGCrTGGTGGACGCCCTGGGCTGCTGCTCT 
1 0 GAAGACCCAGGGAAGGATGACAGCTCCCCCCCAGGATGGC 

AGGAGAGAGAAAAGAAGAGGAAACAAGAATAGCATGTCTGGGAAGACAGCTGGG 

TGACATCTTAGGAAAACCTAATCCAGACCATTCAGGGCCAAA^ 

TCTTGTTTGC^CAACTGGAGGGAACAAGGTCTCAGAACAGTACT 

AGGCCCTACAATGGGGGATGATCCTCAGGGACCCAGCTGTGTCCCTACTGAGACAGCTGTGTAC 
15 GCAGCTGGATATGTCCTGAGACTGTCTTCAGTGAGGAGAGTCT 

GGAATAAGAGACACTTGTGCCTAGGAGGGACCATGGTAGAGGATGGGACAAGT^ 
CCAC^CCGTGGTGACCACGGAGAGGGAGCCGGGAGGCTGATTGGTGTTCTGCIGCCAAGG 
CACTGCAAAGAGAGAAGAGGTCCTTAGAGCTGGGGCAAGGCCTAGGTGGCAGTC 
CCAGTGAGCTCTGGGGCTCCAGGCTCATGACAGCACTT 

2 0 GGGGCTACCCCTGTGTTAGCTGTGTAAGCriGGGTGGGTCTACAGTCCAGCAG 

AGCTCACTCTAAGTCCCriTCTTCAAGTATACCGATTTCCCATGACCCC^ 
GGGACAGCCTGGTTCCAACTTTAATGAGGCTGGAAGGGCCTTGGTGGAA 
ACTAGCTGCTGAACTGAGTCCCACTGACCrGGACTTCGGAATTCAGGCTTGGGTCACCC 
CCAGGTGCCCCGCCCCACCTCCCAGTCCTGTGCAraTCCCT 
25 TCCAGGGTTTACTGGGTAAAGTCCAGATGCACTCCAGACAGAACGATCTATCCCACTGGCTCGTC 
CTCACTCCAGCCCTGCAGTGGGCTGCTGATACAGGGAGCTCTTCCCAA 
AGTCrrCAGGCl^ACAGAGAACACTCAGAAAGGCCrrCTAGCCAGCCA^ 
AC^GGATTGAGACTTGCGAGGGTCCCTAGGCCCIACT 

CCCCAGAGCAGCAGAAAATCTCGTGTGGCACCCTCTTCCCTGGGTCTGCCTCCACT 

3 0 GGAAAGTCGCCTCTTTGGAAAGCCTTCCCACCTCrCGGAATATA 

TT ATCACACATTTGATTATT ATCTTC CTAG C CT CTACTCCAGAACCTGGAGTGAGAGGAATCCAG CAGTTTT ATGG G ACAGAGTGG 
ATTAAAG GCCAGGG G ACAAACAAGCT AGC CAGG AGC CAGGCCCTG AACTGGC CCCG AGAGTGG AGAAG CACT CCTG AACAAG CCCT 
GTATTTTCTAAGCTGGGGTGACTGAACACrGTGGCCTGCGCTCTC^ 
GTTCTGGCCCCTGCTGCCTGGTCAGGTATCCGCCGACCTGGAAATTAGTCOT 

3 5 TCCTTGGGGCACTCTCAT1TTTGAGCACATGC 

ACTGCAGAGACACG CCCGG AG AG AAAT ATTT ACAGTTACC CT ATT ATCT CAGAGG AG AAGACG GG AG AATCGG AGGTT C AAGGTT A 
TCCTTGGCTACATGGAAAGTTTGGGTCTAGCCTGGGCrrGTATGAAGCCCCGTCTCAAAAA 
TAAACCCAAACTAAACAACACAAAACACAGGCGGGAGATGGCTCAGTAGTTAAGAGCACTGACTGCT 
40 CAAATCCCAGTAACCACATGGTGGCTCACAACC!ATCTGTAATGAG 

GTATTTACATATAATAAATAAATCTTAAAAAAAAAAACAACCACAGTGCAGATG AATAAATA 
TGTGGACTAAGAATCIAGTACCAAGAGGCTCAGTCAAGAACACAAT^ 

AGG ATAAGAG TGAAG C AGGCT CAGTATT C TTGGTGT CCAAGG AGGGGG ATATCAGGGCT AGACTGG AGAG AGAAACGGCATGG CT C 
AGGAGCAGGGCCCAGGAGCACAGGTGGCTGACTGAGGACTAAATGCAGCTTGTGAAAOT 

4 5 GGGCTGAGCCACAGATGGCATAAAGTACCTGGCrTGGGACACCCTGGAGAGCGTGTTTTC^ 

ATTATCCTTCTGGATTCCCAGCTCTGGTATGCTGGACAGAGG 
CTGTCTTTCCCCCACCAGACCCGACCCTTCAGAACTGCT 
TTTTAAOVAGCAGCTGGACAGGTCCAGGGGGGAGGGGGGCCCC^^ 
CCCTACTACCCTC^GCACCATGGCTGGGCCCTGTGGGAGGAAACC^ 

5 0 AGCCAAGATTTCCCAAGTGGGCTCTCCACIGCCTGGCCAGTAGGTTCCCTCT 

CAGGGCCCGGGGGCTGCTGGACGCATGCTGGAGAAGCAAGTGACAGGCAC 
ATCCACCAGCACACTGTGACTCATCGCCTTCTAAGGGCTCrGACT 
TGCCTGGACTCTCACCCCTGTCTCTGCAAACCAAGGACAAACCAAGCTCCTGTC 
ACTGAGTGGCAAAAGGTGTCAGGGAGAAGGGCTCTGGGAGATGCATGCCAGGCCTTTGGGAG^ 
55 GTGATCAGCACTGTTAGTGACCTGAGGCTCCTCCTCTGAGCTTTTCTTATTAA 
GCTTAAGTTTAATCCCAGCACTCAGGAGGCAGAGGCAGGTGAATTTC 
CCAGAGCTTACACAGTAAGACCCTGTCTTGGAAAAGGGAATTGTGTGTGCX5TGTGT 
AGGTCAGAGGACAACCTGTGGGAGTTGGT^CTCTCCCTCCATGATGTTGGT^ 
TTGTCCACTGGGCTACCTCAGTGGCCGTTTTTGCTCTCTTGTTAA 

6 0 ACAGAGCCTGGTACAGCTGTGCAAAAAATAAO\ACTCATACAACTTGTGTTTTTC 

GCCCAGGTTGGTCTTGGATTTTTGACCCTCCTGCACACACCTCTCC 
CGOTTTOTGTAGTGTTGGCAATCGAACCCAGGGCCTTGGGTTTGCAAAACAATACT 

GCTCTGAAGGGG CTGGGAAGGG AGGAAGAGCCGGTCAAGAGG AAATCAGG CTAGTGTTGGGAGGCCATACAAGTCCCTGGGGATCC 
CCGTGCACCAGGCTTGGTTCTACTCTTCTTCCCACAGATGAGGAAACTG 

6 5 CCATGCCATGCCTCCACCTAGCATTCCGGGGCCTCGCTTTAGATTGTAGCA 

TGATCT CAGAGG CT CCAG CGG CTTTGAT TG TAATCAAATTAATAGATAAACAAGATTT ATT AGACT CTTAATTAGCTTTG CG ACAC 
CGATTTGCAT AGAGACAATGT G CTAATCACAAATGGCTT TTTAAAC CACTCATTAAAAAC CTGGG ATG AGGAAACAG ACGTTGTTG 
GGATAAAGTCCAGATCACCTCAGGGACACTGAAGGTGTCCTTCTGCCCAGGCAATGG 
GTCCTGGAGTTACGGTGGGGGTGGGGGGATATAGGATGTTTCTGTGGCTTCTGCT 

7 0 TGAAG CTAAGACAGAAG CTG AGGACTTTTACAAGCT ACAGTTAG CTCAT ACTCGTCACAGGGTCACAG CT ACT CTT ATCAAGT GCT 

TTGGGAGGAGCCTGGATCTTCTCTGCATTTCTCTCATGGTGGTGGGCAGCACCATGTCTT^ 




CAGGTAGGGTTTCCAGCTCCAGTTGAGGACCC^GAGTCTCTAAATGCCTGGCTCCTCTAGGGACCAAACT 
7 5 ACAGAATACACCCCATCCTTAGTAAAAAGACAACAGCCC^ 
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TTCTCCCACAGGGACATGCACGAAGGT^ 

CATCCTGGGTTTGGAGAAGGGTCAACAATTGCTCC!ATCATAATGTGGC^ 
AGCAGGGCATAGACTGTTTCAGAACCCTCACAATCAGCAAA 
5 TTCAACCCTATAAGCTGTGTGTTCTTGGACMCTTACTGTCCCACTC 
TATGCTCACAAGAGGGACCATTGCTCAATTCAACTGATCAGC 

CCTGCTGGATCTTCAAGGCCCTGAACCTGCCATGGTAGCAGA 

AAAACT AGATTGAGGTTATAG GATGCAGGTGGCTTCTCTTCAG C CTTGGATCCTCCATGGTCAAG ACACTTT CAGAGCC CTC CACA 
1 0 ATCATCTCTGTCTGCTCCTAGACACAGGACCCGGGCTTCCCTCTTTACCCTGC 
GCCTTCCrCCTCTGTTGGCAGAGTCAGCTTCTAAGTAAAGAGACTG 
CTTGATGAAGACCTAGCAAAGGACTGGCTGTGGTCAAAACCT^ 

ACAAGTCT CTAGTGGGAAGTCACACTATTCTGGTCTATCAC CATTAC CAACAGTGAAATGACTCAGG CAAGAAAGTTGGGAGGGGC 
AATGTT AATGGGGTGATGAG AGT CCGTCCTGATGGT TCTG AAG CTCTGAGGGG CTGGG CAGAATT AG CAAGACACAGCTGAAAATG 
15 ATGG CAGGAATATGATGG^GG CTCGCACGCACACACG CGCACACATG AGTACATG CT CT CACACATGT ACACACATACTCACACA 

CCTGTAATCCTAGCACTAGAAGGCTGAGGCAGGAGGATC^ 

C CAAACAC CAAGGAAATGGTCAACCAC CATCAGAC CAATTG CCAAAGTCCTATTCACT GACAG CTTGGCCAGGGAT AAGGAACAGA 
AGGCAGAAGGAAGCAGGTTTACCTCTCTAGAGCAATGAATGGTTTTC^^ 
GTTCITACTGCTGCAGTGCTTAGGGAGTGGGGAGGCAGTCTTCAGAAAGGTAAG 
2 0 GCTACCCAGGTCAAGTGCCACCCCCTCAGGAAACGCTGGCACAGGAGATAGAGGCTGGGTTAG CAGAGCC 

GCGTGCTGTATAAGGCCTTGCTATGCTCTGAAATGCTTTTGACGATGCC^ 
ACTTGGCTTTGATGTGTGCTTGAGGTCCTCTCTGGTT^ 
CCAGAACTGACTACAGXSATGTTCTGGGCTTCAGGTCTCTC^ 
CATTAGCGAGGACCCACAAGGGGAGTGGAGACAGCATGAC^ 

2 5 GCTCAGCAAGGCGCCTGCGAGATAATTAACAGGGAATTXjCCAGTCTCACCAC 

CAGTTCCTAGGTGCTTGCCAGTCTTGCAAGCCTCnTCCTGCCTAGGATOT 

CAGTATAATGGGTGTGGCCTGGTTCTGCTCTACAGTGAGGTGAG<3CCATAGCCCTGGAGTGTCCTGAGAC 

TTCCCTGAGAGTGCTTAAATT CCTG CTTTGGTT AGAGGG AAGGCTC CAAACT CATGATTCTTAAT GTTCT AT AGAGG CAT CTG AAG 
GACAAGGTGGGGAAAAACGTCTTCrrC^TCCCTGACTGACGGGAAGACGCA^ 
30 AGAGGCCCCGCCCTAGGTATTACACCACAGGAGGTCCCCA.CCCCCAGACCCCAGG 
CAGAGTGGGTTTC&TGGAGCTCATGGAGCTGAGGGG^ 
GTCGCAGCAGCAGCAGGCGGCCAAGTTCTTCACACCA.C4G 
GAGCCCCCTCTGCATCCTTCCCAGACGTGGCGTTTATGAGGCCGGTT^ 
AGTGTTTTAAACTATAAAATGGACCAAAGGCAAAATCAGAAT(^ 

3 5 CCAGGTTGAGAGGCCCC!AAAGAACAGTCCCGACAATGAAGAGAAGACG^AGCCAAGGG 

CCITGGATGAGGGTATGAGCAGGC!ACTTCrAGGG 

AACTGT AAAGTGTAGACAT AACT TAG AGCTTTCITGG AGTTG CTG AAG AC CCAGAAG CATTCG CT AG ACT CTGAAACC CCCCAG CC 

AGAACTCTCCTGTTGAGTACACATCAACCTTAGACAGACCGACCGACAGACAG 

ACTC1AAATGCTTGGAAGGATTGAAGGTTTCCATAGCAGAGGGTGGTAGTGTG 

4 0 TAGAGACAGCCAAGGCAGCATGTGCAAGAGTCTCTGCCACCAGCCCAA 

TCTTCATCCCAGTGTCACCACTAGGGACTAAAAGTTTCCTTAAGGTCCTTTCAGGA 
AAC^CTGGGCrTGTCACCACATCCCCTGTAAGTCTCAGAGTAC^ 

TTGTAGGACAATG AG AGGAG AGG C CCTAGGT CCTG TGAAGG CTCT ATG CC CCAGTGTAGGGG AATGC CAGG ACAGGG AAG CAGGAG 
TGGGGGTGAGGGGTGGGATGGGGGGTAAAATGTCTATTAAAAAATATTTTTC 

4 5 AAAAAAAAGAGGGGTATGGATCTTTCTCACTCAGTGCTOVAAG 

AAAATCTGCCTTCAGCAGATAAGCCGCAGTTGACCTTCCTCTCT 

TAACACTGGGGGACATGCCCGATGCAATCTGACCAGGAGTGCTGGGTGC^ 

CCACAGCTCCGACGGCTATCACTGGGGCCGTGCCACCAGGTGCTTGCTCATCTT^ 

50 TGTTCTCAGTGTCACCAACACAGGGAGGA 

TGTTCAAGCGTGCCAGGAGGGAAAGCCGCCTATCCAGAAAGGCTGGATC^ 
CTC^GCCTCGGTTCGGCTTCTTGCACAGAGGAGCAGGGACCT 
CTCTAGGAGGCTTGCTGCCAGCACAAAGGATGGATGCCGACCGGCT 
TACCAGCCAGGCTGCTGTGGGATTTAGGGTTTTCCTTGGTTACCGGAGGCT 

5 5 CAAGTGGACTGGTTTCTGGAAAGATAATGAAGGCATAGGACTGGCAATGGAAATGGGAAGGAGGTGTTTGTTAACTATGTCTTCTC 

ATGCCTGGAAGGCCAGAATCCAGTGACGAGGAGGGAAAGAGGGGTGACACAGAGAACCCTCC^ 
GGGGAGCACAGAGCTTGAGAGCAG CC CCCTCACAATGGGCAGGAGCAATGTGCATGCCTACGAAGG 

GTGGAGACAAACACCACATGGAAATTTGQ^ CCAAA CCATGGGGC 
TGCAGTGCTCCAGCCTGGTGGGACCJAAGCACACTTTGATCTGGGAAGGCCTAGCTGCTTCC 

AGGTCTCAAGTGCTCTATCAACTGTCTAGACAGCCAGCACTGTTGCCACrCGTC 
CAGCCAGGAGGAX3TGTCCACAGAAGCCCX3TGAT 

CTCCCG^TGGACCCATGGCAGGGTTCGACTGTCCCGAGAGCAGAACACAT^ 

65 CTGGGAGTCTGGCCTCTCAGCCTGGAGGGAGCCAGTTTCCTCACTAGCTC^ 
CCACGAAACAGTGGGACCCAC^TTGGAGCCCAACTGTCTACT 

ATGGTAT ACAAAGATGCATAACC CAATACATCT AAAATG AT CAT CAAAT GAGGG CTGGCAAG ATGG CT CAAGGGTTGACGGCTTG A 
GTT CCAC C CTTG AGAC CCACATGTTGGAAGG AG AACAGACT CCTT C GTATTG T CT ATTGT C CT CCT CATGT CACACACACACACAC 
ACACACACACACACACACATGGATATATATACAAA 
70 CACTGTGTAGCTTGGATGACCCAGCACTCACTATGTTATGTAGAT 

TTGT TGAGATTAAAGG CATATGCCATCAAGTGTGTGGCTCTCCACAAAAATGAACTAG CATTTTGTTACTAGGTTCCCTTCCTATT 
CATTCTCACCTTAGGTGCCAAGCGTCTGATCGCCCACCTATATTAATTTGACAAAGAGCT 

TCCTGCACTAGATAGGATGC C CTGTATAAGC CAG CAAG CAAGG AGG CAACT CTGTG CACTGTG CACAC TCC TCTG G CAGT AT TCAG 
G GGTCAG C TGTATT CAGGGT AAGGGACAATCACAGATC CAAACAGTTCC AC CTG ACTT C CACG CAAC TGTT AG CT AAG CACTCACG 
75 TGCTCTCTCTGAACCT TTATCTGTTGATGAGTATGACCCTGATGCCTGCTCTT CAGGATATAAGGTGCTTAAAACAGTAGCTTGTA 
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GGTAGAGGGGTCTATGTAGCATTGTGGCCCCAGAAACCTATCACTCA 
CCACCACTAGGTGACACCTAAACAAGGTCCCIAGGGCAGCTGTGAGGGTGGTCT 
ACCCATGGTACGCTCATCAGATGATGAGCATCAGCCCCCAG 
CTGTCTACTTCTGATCTCTGAGCAGAGTGC^ 
5 AGACGGAGGCCTCGGGGCAGATGATGACGCCACCTGCTCACTCAGTACAGGCTCTO 
AGCTGCCAAGGGGCAGAGGAAGGACAGCCAGTGTCCrACTGTGAGA 

GGAGAGCAGTAAGCACGGCATGCCCATGACTACTGAA 
AGGAAGTGGXSTCTCAGGGGATATAGAAGAGCCA^ 
10 GGTACTATCACAATGTCCTTTAGGGCAAGGAGGGGAACAACGGGAGAGGCCCCGAAGACTGG^ 
TCGTGACCGACCCTAGGTGTGCCTCCCCTAGGGGCAGGGACAGGCCCIGAATCT 
GAGGAGGCAGAAGCAGCAAGGCTCTAATTTGGGGAACCACCTGGACACCT 
GCCAATGAGGATAGGATCCAGTTGGGGTACAGGGGAACAGAGGCTGGA 

AAAGGTTGAACCTCACTAGTCTGGGTTGGG CTTGAGCTTGAA^ CT GGGCAGCTTG 

1 5 GCAAGGTTTAGATACAGGCCCAGGCrrGCTCAGTGCTTTAAGATCCAGGAACT 

CAGTCTGGAGAGGAGGGAGGTAAGAGACAAACAGAGGCTTTGTCCAC 

TCTGAGGAGGACAAAAGATTTCCATGGCTAGAAGGCAC 

TGGCTTCTGAGTCCCTCAGGCCACCAAGGATGGACTCTGAAGTAGCCAG 

ACATCCTAGGAAGGAAAGAATGCAAGGAAGGGCCATTCCTCCAATAAATAGCTGA 
2 0 ACACAGCAGCaGGATGAGTTAGGCACTTCCTGGTG AGTCAGAAAG CT ACCAACATGGC CACTCT ACT CC CCCGAGCACACCT AC CT 

ATCrrCAAGAATATGTACCAACATGGTGTGTTGATCGTCTCAGCAGCCT^ 

CATCAGAGCAGGCCCACCCATGATGTGCCT 

AATGGAAGCAAATGGGGCCAGGAGACAGGCTACCCAGGGGTTAGACATGAGACT 
CATAGCATTTAGTAAGAACTACGCAGCCAACACATC 
2 5 TATGGGAGACGGTCAGTCATCAAGCACTCCATCTGAAAGCCAGGCTGTAGAGCTGGTGC^ 
CTCCATGGCACTGTC^CCAGATAAAACCTATCACAGAGTCC^^ 
CTTTATAAATATAAGCCTCTTAGAGAAACAACAGCGTAGGTGA 
GTGCTGATGGTGTCTGTACATCCACTATGTGGTACCCACAGAGGAAAG^ 
CTCACCITCTCCTCrTTCATCTCC 
' 30 CTAGAACGTATCAGGGATTCACAGGCCTCGGCTCACCCCTrCATTCTGCGTGTGAAATGGTGAGACAG 
CAGACAAGGATCTGAGGATTCTCGAGGCAATAGACGAGGCTTAACT 

GAGGTCAACIATTAAACAAATT C^GATTCTGGCAAGTGGAGTAT CCTGCAGGATAAATT CACTGGTTTTCCCTGTTGCCTTCCTCTG 

ACATAAACATACACAGATGCTCAAAGCTGTCGACTTC 
35 CCCAACAGTGGCCACATCCAGTGCTTCTCTACAAGGCTGACCT 
CTCTTGTGCCAGCCTATTGAGGCTCACCCTCTGTAG^ 

GGCATTTACCCTATTGTGCCCTGAAGGGCGTGGGAGGGCACCAGGAGCCAGGCCTCCTGCrCT 
GTC^GTAGGTTGGACTTGTGTCCTqAGGAACTGAAATGTCTGCAGAGCTO 

GCCCCAGCAATGGAGGAATGGTCCCACTCTCTGCCTTGGGAGGAACCCACAAAGGAACCCCGTGACCA 
4 0 TGGGCTGGGGC^GGTAGCTCTGAGTCACTGATCCGGTAAAGCAGCCTTAGGCTCTC 
ATTCTGACCCTACTCACCCTGCTGCAGTGGATGGGGAAGATCTGCCAGCC^^ 

TCACACT CTCAGTTTGACTCCCATGCCTTGAGGTCTG CCCAAG AGCTATACTGTATTGGAGGAAGGGGGCAAGGGGTGTGGGTGG A 

TTCAGAAGATCAGGACAATGACCCTGCCACTGGTACCrrAGGACTTCCCT 

GGAGACACTGAAGTACCATCCTAGTTTCTCATCTGAGCTGAGCTCAATACCTGTGTGGCC 

4 5 TTCAGGAGGGTGTGTGGGTAGGTGGGAAGAACTTGGCAGCCTTAGCAACAGGGGGTCAGGACTTCTTGGTGGTCACAGGGAGCTGG 

ACTTTGGTGGTGCTGACAAAAGCCCCTGGTTTTCTGTGGAGGAAGTACTCCTC 
GTAAGGTGI^GGCCTAAGCTTCTGGTTTTCCCAGATCACTCTCACCTTCTGTTCCCTGCTTTGCCA 
GAGAGGCAAGATGGCAGAAGCTGCCTGCTGACACACTGGCrrGGGAACCCTTCCAT^ 
CACAGGCCIAGGACTGGTCAGGTCrGGTCCCCTAACTGCTCCT 
50 AGCCTGGAGTAGCTTC^GGCTACrCCTCCC^ 

AAGCTGGAGTCTCrrAGGTGTGTGAACACACCCTGGGAACTGAACTCTGAGGAACCTTA 
TTTGGCTGTTCCTTGCCTTGTGGAGGCTGTCC^GGGCTGAGCAAGAGGAGACTC^ 
GCCTCAGAAGAACCCAGTCTGAAAAGTTCCCTGCACIATTTCCCATTACCATCAGGTCAGCCTATATAG 
GACTGCAGTGCCAAGTCAGGGGAGAATGTTCATCAGGAAGAGAATGTCCTACCCTTAGGATGGTTTCATCrCT 

5 5 TCAAAGTACTGTGGACCCGAGGACGGGCTTGGGAAGTGAGCTGGCAATCTGG 

TGTCCCTCTTCAAGTAGTOGCTCCCTGGTCCTCCCACAGATCCrrCCTGCGGGTGG 
GAGCCTGACTCAGGCCACTGCTCAGTGTGTGGCAGGTCGTGACCTTTGTCTTCT^ 

G ATTG CAG ACCATT ACAG TAGGAGACG AAAG AG CTTTATG AACAAAG CATGGGGGAC CAGTG AG AAGG CTCAGTGGGTGATCGTG C 
TTGGTGCCAAGCTAAAGTCTACTCAACCTTGAGTTCAAATCCCAGGACCCAAATGGTGGGAG^ 

6 0 TCCTTTGATCTTCCAGACACAAACATAGAAACGTAAAAAAT 

GTGGCAGAAGCAGGCCAGTTTCTCTTGAGTTTGAGGCCAGCTTGGTCCACACAGACCTTATCTCAAAAACAA 
AATACCAATAATCCAAGTGCTGGGGATGGTAGCTCAGTGGCAGAACTTTTGCCTAGCATGCATAG 

TACT ACAACGT ACAG C CACAG AAACAAAGTATG CAG CAC TGAGGG GAAG GGG AGGGG AG G AGGGAGG AGGG AGGTAACT ATAGTCT 
TCCJATTTAGAGCCTTGGACCTACGCCCATAACAAACTT 
65 GGTGCTCAAGGACACACACTGGTGTTCCTAAGGGGCTATTTGAGGACT 
C^CCTGAAAAGGCCAAGGATAGCCCAGAGGTGGCAACGGTAGTA 

GCATTGGCCTGAGTCCCTAAGGTGAGAAGACACCCCCCTCACAGAC^CACACACACACACACAC^ 
TCGTGGAGCTGCAGCAGGGACIAGCCTCAAGATCCATTTTCCT 

TGTTCZAAACTTAGGGAATTGGGGGGCCAAGGAAGTCCTATGGAAATCCTTTGTCCCT 
70 TCACGGTCTGATTCCTCCAGGTAAGAGGCTGGAGTGAGCCCGACTTCT 
GGCAGCCAGATGACAGCAGATCACACCTTTATGTGCTGAGCAACCCA 
GCTGGTGACCZACTCTCAGGCCCCTCAAATGGCAGAGCrCATGA 

TGATTGCC CCACAGCAAACTTGT AT CAACAAAGGAACAATCCAAG CAG TTCTC AAAACT CAAGGGCGTGGGG CATGGTGACAGGTG 
TGACAAGGATGGAGAGGCAG CATTCAAAGAGTGACCCACACCCITGACTACG^ 
75 AGACITCTGTCCTATCTGCCCTCTGTCTTTAG 
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AACTGGATACATTTTCAGACTTTCAG^ATCCTC 

TG CAGTCACTCTGGGACACACAGCTCTTCTT CTTT A(^^ 

AGGTTCCTTCCTGCCCAGTCTTCAAGTGGGTTCTCCACGTGGTTGTGGCTGTCTTC 
TCTACACCTTCCCTGACTGGCTATTCTGTC 
5 GTGACCACCTCCTCTCCAGGTGCCCATCGGGGCrrcGTAGGTTA^ 

CTGGATCGTTTTTTTTCCTAAACCACAGGGGGTCCCTTTCATACCCTGGAGAAA 

ACT CTTGGTTTTAGACTGATAC CACCTG AAG CAACT CTCAGAGACTCTGGATGGCACACCT AAAGGGTGGTGAAGGCACAGAGATT 
TCTGrrGTlXSTTATTGTTTTALVlTl'lTAACT 
AAGTCATTCTTTTGACACCAGGGCTGTGGCTTTCrrGGATA^ 
1 0 CAGAGCAAGACTCAACCATCTGGATTTC 

CCTAGACTCCCGTAACTGTGGGATCTCTGATGCTGCAGGACAGAGCTGACGAATCT^ 
GCTGTCCAGTCTCACTCTGAGAAGCACTGTC^ 

TCTCTCCTTTCTCCTATTCACTGACTTAATATTCAGGGGCTCTCAGGACCAGCTCTC 

1 5 ATGTCAGTTTTTCGAGCAAGGCCCAGGTGGGCTATGCACCTGCCTCACCT 

G GCATGACGGAGGAGCTACAGGTGGAGAACTTGATT CGGGT CAGAC CCTATT CCAGTGACT CTG CCTCTGG AGT CCT CTGATTG AA 

GGTCTTGT ATTTGGGAACCTGGGTCTGTCCCCAGCAGCTCTCGGGG CCAG CAGCTCTCGGGGCCACCATCAACAGCTGGTGGGCTG 

GCTGAGTGGAGCTGAGGACTCTGCATCTTGTGGCCATAGGGCTGAGCCAGCrrGTGGGATCT 

GGATGTGATGGGGGAGGGAGGGCAAGGACAAGGC!AGAGAGCAGC^ 
2 0 TTCAGAGCCTCTGGAGGAGGCGACCGTCATTTGTTTAGGC 

CCATCCCTC CAGCTCCTGTTCTCTCTCTCTGCAGGAAACAG CTTTTGCCTrCTGCCTCTGATGG AGGCCATGAAAATTAACGGAGC 

TTGAGGCGAAAGCACGGTGGCAGGAGGGGCTGGGGTTGGAC 

GCCTGAGGGGCTGGCOTGGCTAGACAATGGGCAACCCAGCAAGGCGACAGGACC^ 

2 5 TCGTACAGGCATTGTGTCCTCAGAAGGTCAGCTGCATATGCTAGAC^ 

TGGGAAT CGG TGATCACTATGTC TAGT CT CCTGGCCCCCAG AGACACTGAGAGACTT^ CAAATGG 
CAGGGGGTTCTATACTCTTTGAAGGAGAAAAATTACTTATGTACAAAGTACAT 
GGGCAGGAAGAGGGAGCTGGCTCCAGATCCACAGTTCTCCCAATACCTGA^ 
TGCATACAATTAGAGTAGTGGAAAO\AGAGTCTCTCTCTCTCTCTCTCTCTCTCTCTCT 

3 0 ACACT CCACATTGTC C AG ATG ATGAG ATAGGAAGACT C CATCATCT AAG G GC TACAG AAGACTCT CAGGG AAGG TT CCTG CACTAA 

TGGGAAGCTG ACCAG CTGTCAT ACAACATGGAGAAAAACTAGC ACTG AAATG TC C C ATTAGGAAAGT AAAT AGG AG CT CAGTGTGC 
TGGCACATACCTTTAGCCTCACTAAGCACTGGGGAGGCAGAGGCAGGGGGA^ 

CAAGTTCCAGGACAGCTAGCA.GTATAAAAAAAAGAGAGGAAAGAGGGAGGGAGGAAGGGAGGGAGGA 

TGAAAT AAAG GAGAGAGCAG AAG AGAG AAG CCCAAG ATGAACAGCTTGACT C CC CAAGACG GTCCAATGACT CAGACTT CT CATG A 
35 GGAGACTCCATGTCAGCTCCITTCTAGCCTCCTCTTCAGCCTCCATGAGAA 
TCCACACAGATATAACCAAGCAAGAAGAAACAA^ 

AACGG CATTT AACGAAT CTCATG TCAT C CATCATCTGGGT CCAGC CACTTAGATAC CGGGCATTTG ATT AAACTAATATGGG CGGC 

AGGGCGCTTCCTACCOTCGCCTACAAATTAGACCGTCACACACGATCCXSAT^ 

CATCCATCIACTCCTCAGTGCJCCGACTCTACCCACCCCATGCTAAT^ 

4 0 GGTGGCGTCTACTCATTC^TTCATCCGACATTGGTTGAACACCTA^ 

TCAGTGAGCCCACGGAAGTGGCATGGCAGGAGAAAGGTCT^ 

GCCTGCCTCCATCTGAGACCACCCTGTCCTCAAGTGTCACACTTGGGAAATGGTTCAAGAGCATAG 
TCCCC^AACTGGAGAAATGTTCTAGATAGATCTATGCTGCCCiAC^^ 
ATGGCCCTGCAAGGGGAGCTGAGCTTAAGTCTCATTTAAGCTTA^ 
4 5 CTCCTGTTGCCTAGCATAGATCCAGCAGCTGCrrGTGGA.GAGCAAGCT 

AAATGCTCACAAAGCCTGGATCCCTGCACTTTAGGATCTCAGGCCAGAGATGGCTATTCATCA 

AGCCACACCATGGTAAAGGTGGAACCCTGGTTTGGGTCTCT 

T AGG AGAAAG T CAGCAGG ACTAG AGGCT ACCTGTT CTGTG CAGAG AGG C CACACAG C CAG C CTTC CCATGC CACAGCAAAGCTG GT 
50 GGTGGCTACCTCCCTCTACCCTAGGCCTGCTGTAGCCCTGGCAGTTTAGGACAGAGCTGGAGAA 
CCAGGTACAGATGGGACTCCATCCTGAGGCITAGCAAGAA 

CAGCTCTGCTATCTGGTGGGTAGGTGGGGCTCAACGTGGCTTTGGTTAACGGGGAAGTCrTGGGCCTGACTCTGACCCTCAGGCT 
C CCT GGAGAAAGC CACACTATT CCTTCACAAT ACTGG AGTCAAACGG AT ATAAAAGAG CAACTGTTTATTCACCATTG AGAACTGT 
CTG AGGT CATTCACAGGTCAAG T ACCAGG CAAAGATCACTGT CTGCC CT CAGG CCTGGAG AG AT G GCAACTAGGGACAAG CTATAG 
55 AATGTCTGTCTG ATG AC CTCAATCCC CACAGATGGACGGTATG G C CTTGG C CAAC CCACCTCAT CTGGCAG CTTTT AG AAT CATAC 
ACO^GGGCACAAGTAACAAGTATTCTTGAACCCTCCTGTGAGCATCGGACAAGATGTGCCTGCC^ 

GGG AGAG AAAG AGAGAG AGACAGAG ACAG AG AG AG ACAG AGACACACG G GCACACAT ACAGAACTG G AG ATT AAT AGTGCAC ACAC 
ACACACACA(IACACAC^CACAGACAT^^ 
60 TAGAAAGGAGCAGGTTTGGACTACTGAGGCTGCCTGCAAGCCAGAAT^ 

GTCCACTACTGGG CTAGGGAAAGAGAAGGAAGGATGTGGTGAAGCGGCAGGTCAGAAGAGCCTACACAGAGAAG CTGAC CTGGTGA 
GTTACAGTTGAAAAGGATGCTGGGAGG AGGGGTTACAGGTGAGTAGGTTC CCAGATTGCTCCCTCG CACAAGGTCCACCTCTTTAC 
ACTGTCCCAGT TT CAAGG ACT ACTT AGG CACT CACACATTGGT AT AAACAAGTTTACAACAAGTTGTG C CGACACAGTCAAGGAGC 
AGCAGTTG AAG AGGAGAAAACAGTGT AGAAAAAATGACGGGT AG CTCTTG C!AACG CACC CTT AATTTCCAAGAAGGCAC CG G GG AG 

6 5 CACCCCATCAGAGACCCTCTGCTACCAAGACCACCAGGATTCT 

AACC!AATTTGGGGGTAATTCTGCTGGCAATTGTCrTC 

ATAATTGTATGGTAATCACTTCTGTTTATGATCGATGCGGAAATCACTATTGTTTTCCAGCA 

GGGGTGGGGGAGGGAGGGTTTATGTCTGAGCCCGAGGAAGATGAAATTA^ 

ACACCCTTGTCTCAGAAGCACAGGCACAGTGGGTAGCAGGAAGGCAGCAG^ 

7 0 GAACAGTCTAAGAACTGTCTGGCACTCAGATCC^TAGAGATGTCCAGAGGGArc 

TTT AGTACAT C CAT CCT CATG CCTG CATTT ATG AGCT ACAG AG AGGC AG AGGAAATG CAG ATGC CAGG AAG CAGGTGGG CTACTGG 

TCTGAWICAAGGTGACAGATGGCACC!TGG 

TGAGGGGAACTGCCCACCTGGTGGGTAGAAACCTGAACCTCCCAGAGGCCACAGGGAATGGCCCTC 
7 5 AT ACCTTCCAAGGCTGTGGCAGACACCTTAAGGCACAGAAGGATAGAACCAG CCAGGCTGGAGG CCAACAGC CATT CGG AG CACGA 
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GCAGGTACCAGCTGTCATGTACAGCTCAGGCAAACCTGGATGTTGCAGGCTGAGC^ 
ACATTCTTCCTCCAAAGCAAGAGTGGAAGAGACCATGGAACTCT 
CCCCAGGGAAACACTCACTCCTGGACTCTTCTCTGAG^ 
GACTGCCCCCAGACTTGGTTCTATATATGGCCCTAGCTCAGGCTAGCTT^ 
5 AAGTTTTTAGTTGGAAAGAAGTGTGCTTACGGAGCTGGAGAGATGGCT 

GAGlTCAAATCCCAGCAACCACATGATGGCrCACAACCATCCGTAGTTGAGATCTGA 

T ACTGTGT ACTT ATAT AAAACAAACAAACAAATACAT CITTTAAAAACAACACACACAGACGTATG CTT ACACT AGCTGCCTTACC 

AATGTCAGCCTGGGCTCAAGGCCACCCACCTCATTCTATGGTT^ 

ACAACAAAAGTAGCCAGGGTCCTGAGATCATAGAATAATTTGAGTTTGTG 
1 0 GGTATGAATGAGCACTCrACCAGTGAGCCACAT C CCAGCTC CT ATTT ACTTCTCTTAAATTTATTGCAAGCAACT ATTT CAGCC CA 

TTCTTGAATGGAGAGGACCACCCCAGAGCCCACCAGACTCTTGGGGTTCAGACCCTAAATTTGCCOT 

ACCCTGTCTATCCTGTACCAGAAGGGGTTTGGGGTGTGTGTGTGTTTCAGAGCAC^^ 

CCAGAAGTAGAATGAACATTCTTCATTGAACTAAGATTCCATGGAACTTCT 

GGG ATTTGGTTTTCTGAAGTATGTCAAGCAAGC CATGGTTACTT AAGG TT CAAC^ 
1 5 CTACAATGAAGTTTTCrrAAATGCAGAGACAAAGATCAAGGTGGG 

TTGGGGGGCATTTGTAAAAGTAGTGTGTGCTTTGCTCrcrrCTCTCTCTCT 

GTCTGTCIGTTGTCrGT CCGTGTGTG AACAGGCCAGGGCATCCACATGG^ CTTTTG 
GGGTCATCGTGTCATTCTCTCCTTAATACAT^ 

TTCTTTGGG ACAGTTTCAGGG AATAGGAGATATGG CTTGCAAGG GT CT C CTG ATGGTGTG AGAG ACT CAGTATG AGAAGTG AGAAA 
2 0 GCTAGAGAGCTTTCACTCTTGCTGACAGCCCAAACT 
GAGAGAGTGCGGCATGGGCAGGAAAGTCCCTTCT 

TTGGAGGTAGCCGGTAACCTCTGACCTCAACCTGCAATCAGGAGCTTGAGTC^^ 

CCTTGTGCCTGGTGGTAAGAGGACTCAAGGTGAGAAGCTGAATTACCATAG 

TGTGACTGGAGGGACTGCACTTTGAGGGCTCAGAGACCAGTCT 

2 5 GACAAGCCCTCTCTGT^GGATGCCCAGATCCTCT 

AGTATCTTCCTATTCCTGGAGATGGC&GAGGTAACrr^ 

CATATACAATCAGATTCCCAAGGATGGAGCAGGCTAGGAAGTGACCCTGGCTTT 

AGTACCCTTAGAAATACrrCTACTGGAATGGCAGGCCTGCCTTCCrrGTGGGCATGGC 

CTAGCCCTTACCCGACGGTATCCACTGACACACCTACATGCTG 

3 0 CACCCACAAGAACACCACCCCAGAGCCTCATCrATGAC^ 

CT AT ACAC^CACAC!AAACACAAGTAATGCTTGTG TTATG CATG CAGT CT CGGCAACAGGTCTAACCACTTCGCCTCCAGATACGAG 
TGTTACCCTGCACATCATGTGCT'TTGCAGTCCCACTGGGGGGACTTTAAACT CCCCACCCAGAACCTGCT 
CTTTGTTCATTCACGGATTCAAAAACATACTTGC 
TGCCGAGAGTACTGAGCAGTAGCAGCAGGCAGGGTGGGG 
35 CTGGG CCCCCAGCAGAATGTCCGAGACACACCCGGGGCAGG C3ACAGCTG AGACCGTGG^CACTACTGATGTTirrACAAGCTTAAGG 

TAGTCTGAGTGGGGGACGGCACAGTT^CAGTTAACAGGA 

TTTCCTTTAGGGAAGAGTCCATGGGGCACGCCTAAAGGTGGGAGATCCCACAACCT 
AGAAAACGGTCCCGGGGCGCCCACTGGACAAGAGGGATTTGCCT^ 
CTGAGAGGATTACCAGGCCCTGGTCAGCCTGGCCCTGGCGAGACTTGATTCTAG^ 
40 GGCATTTCTTTTGTATCCCARACAGTCTG CTTACTG CTCGCTGGGCCTTTGGGTGG CAGCAGGACCAGGAAAGAGACAGAC CCTGT 

GCTGTGCCCACAAAGGAGGAAGCGGGAGGGTCCTTCTCACGTC 
AGTCTCCACTGTGACCTAGTGGAGACCCACTGGTCACAGCCATGTCT 
TGACTTCTGCCAGGGCATCCCTGGGACTGTCTGGGAACACTGGGGGATGGTGGTTGGA 
TATCCTTAAGTCCCTGTAGAGAAACCACAAACCTTTCTCCTGAGGTTTCTCIA 

4 5 GGGACCCAGGTTTCTCAGGTTCAATGGAGGCTGGGAATTAGGGTAG^ 

GAGGGTTTGTGGCTAGGCTGTGAGTATAGTGCCTGGGTAAGAGGCCT^ 
GCAGGTATATGCAAGTACAGAGACTTGTTACAGAGCTAGAAGACTTAAT 
CACCCTCTTTAGGTGTACTGAATAGCAAGAATCACCrCAGTGTAGCTCTVGC 
AGATGTTGATGGGCCACACATAGATCCCAGAAGGGAATGCTTGTGTGA 

5 0 CCCIAGCTCAAGCCTCAGGAAGGAAATGAAAGCGGCTGCCAGCACTTGGGTACTTCCT 

CAGCCCTGGTCAGTCACCAGTGTCAGTCCTGT^^ 

TTTCAGACCACCCCCTTACCTCCCCCGGGTGTGGTGGCCATTCACAAGGAAGGGATGGGCATTTCTGTAG 
CTGCTAGTAGTACACAACAGGAAACTGAGGCCCAGAGAAGGCTATGGGCAAGT 

GGAACTCCAGGAGAGTCTGAGTTAAGCTGTCATTTACAGGGTAACTACTATATCCCAGACACAGAGOT 
55 TG ATCTACTG CAG CCTGCCAGG CAAGCAG CACAG ATGTTTT ATC G AT CAAG AAACAAGCATG AATT AAGTGACT T AC C CAAAGTCA 

CCCAGGAAACTACAGATGTCGGTTTCATAATTACGTTC 

ATGCGTCGGGGTG ATGGTGGTGATGGCAACGGGGG CCAGG CAGG CAGGTGAGTCTG AGG CCCTG CTGCAGATCCTACCCCCACTAC 
AGACCTCATAAGAC^TCTTCAAGAGACCTCAGTGACTTAGACACCACACTGCCC^ 

CT ACACCATTGAATATGATATCCACCAAGAACTT CAAATGGTGGG ATT^ C C CAAATCATCT CATTTTAACT AA 

6 0 TACAT CTAAACAG CTACACACG CTGGG C TGCT CACACTTG AAG AT AG CC TG AG CTACAG AGTAAG ACTGTCT CAAAACATAAT CCC 

CAAATTAAAAAACAAAAACACAAT CAAATAGG CACAGGTAG CT AGCT ACTG AT AACTAG G TGGAGTTGTGT AAT AGACCTGTGAAA 
GGTGGGACCCCAAGAAAGAGAAGGACTCCTTAAGCTGTATACCACAAAGGCATCTTGACACCCC^^ 
TAAATATGCTCTGTACAGAGTGATTCTAACTGTTTGTTTGTTTTGGACATGGTCTCTGTAACTGAGGTTAGCAT^ 
AACCCCTCTGCTTCTGCAC^ATTTCAAGCACTTCCTAGATGACCACTCCTGTCAT 

6 5 CACAAAAATGAGGCACCCACTATTGACATCCCT 

TG ATCACAGG CTAGCATGTCTAAC CTTTTGACATCGTG ACAT AACACTGTCATT ACAAAGC T TACACTTG AAAACTAT AT AG CTGA 
ATCACACA(^CACACACAC^CACACACACACACACACAC^ 
TGTTGGGCTGTATTTACTCTGCTATCTTCAGCAGCC^^ 
TGGCATTGGGCTGGGTCTAGATCCAGCAGAACTGAGAGATGAGAGAG^ 
70 AAGGTOTGAAGAGTTACTCTGTATCrGGTGGACCCAGTCCTTTATTTACTG 

CTGCTT AGCTTGAGAGCAGAGG CCATAGACT CAAC CT C CAGCATTGCAAAG AATAG AAAAAG AAAGGAAG GGAAGG AGGT AG AATG 
GAG ATG GGGAGGG AGAAAGAGAAG GAATAAGGAAGGG ATT AAAGG AGGAAGGG AAGG AGGGAGG AG AAAAG AAGAAAAGGG GGAGG 
GAAGGGGGAAGAACAGGAGGAAGGGAAGGAAGGAATAGCAGGCAG 

CCACC CAC CACGACCATTTTT CWU^GTCCT CAG ATGTTGTAGAATGTT CTCTGT C T ATGG CT 

7 5 ATTCCACTAATCAGAGACAAACTCTTGGGAGACAGACAAGGTCCTG 
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GATTAAATCCTGGGGCAACAAATAGGATGTGAGAAGCCCTGGGGGGGGGG 

AGGAAAATGTCCCCATTG CAGTAAAACTTCCCAGACCTGCTTGGGACCCAAGGAG CTACAGGGCCACAATCTCTATTTTCCCTCTC 
CTCCCTCTCTCCJVGGAGAGCTGCCTGGGGAGA^ 
TTGACCCCTT^CCATGACAACTGCAAAACAGGAGCT 
5 CTGCAGGATGCTAACCTCCCCTCTTCCACGGCAG 
CCAAAGTCTTCAACAGCTCTCCCACTGCCCTAGG 

ACTGGAGGTTCTGGTCCGCTGACCTCCCATCCAGTACCACACCTCAAGCCT 

CATTTCTGGGAAACTTCAGGCTGATCCAGGCCCAAAGTCaCCATT^ 

GCAGCCCTATCAGAGCCCTCAGCCCCACCCTCACCCCCACCCCAAGGTGACT 
10 CAGATACTATTACTGGAGGGCTGGGCCAGAGGAATTGT 

AATaUVTCAGAGGGGCCTCGCTGCAGTAAAATGTCATCAGAGXS 

CACCTGGAGGCAGGGCTGCAGAAGAAGCCACACGATTATGCAGGTCTAAGAG 

AGGGACACAGCTAGGCTGGCTACGGTGAGAGGAAAATAACCTGGACC^ 

TGGCTCCCGGCATCCTCTGGGAGGGGTCCATTGGCAGAGAGACTCTGA 
15 CAGGGCTGTAAACTGAGATTGCAAACAAAGCCCCTCrrCGGCCTTCAATATCCCCCCrC^ 

C CCTTTCGGGTATG CAGAAAGC CACCC CTTTAGAT AGCC CAGGGT CTAGCCAGGAAGAGACT ACACAGGGACAG AAATGCAGGTCA 

CTAGTGCTGCCTGGCTTGAGTAGAGGATCAGGGAAGAAGGCGGCA 

CCTGTAGAAGACCAGGTGAGGGAAGAGACTTGCTCCATCCCACACCTGC 

CTGTTGTCTGTCTGTCCTTCCAGGCCTAGACGTACTGCTCAGGAGCTCCCTAGGAAG 

2 0 TTTTTCCCCGAGCGAGCTCCCTTGGGAGGTGGC^ 

GAGTGCCAGAGCAAGCATAGCCCAGTGGGTATGCTTCGATACCAGGGATAGTGGGCCCTG 

TGAAGCAAGCAACCAGGCTACCTGCCCAGGGATGAGTCT 

CCAGCAAGGTAAACACCTTCCAGCTCCAGAGTACCTTTCCCTCCCCrTCAGCCCT 
25 TCTGATCGATACATGACTTCACAGAGCCTGACGCCTTGCCGGCACCATGGCGGTCCT 
TACATGCCCCCTGGCrGACCTCTGTGGCACAGCAGAGTGCTGGCAACTAG^ 
ACACTGGCTAGTCrATGCCTGCCCC^CACAGAGGCTGCCATGACT 

ATG CTGACATGAGGTGGGCTCACCTCGAGCTTTCACCCAGCACTGTC TACTGGAACTTT CTG C CACT AG AAG ACAGCTGTTTT AT A 
GTC CAAGCTGTC CAGT AGGGTAGCCGCT AG CCTGTGCAGCTATTG AGCATAAGAAAC CAATTCCAGCTAGGGATGTAGCTTGGTTG 

3 0 ATACACTGCTTATCTTATGTGCAAACAGCACTGG 

C CAACACCTGGGAGGAAAACT AAAAGAACT AAGCG AGTATTACTTTTATTGTTAAA CAGTTTAAACTT AG ACG CAG CC CT ATGTG 
GTTAGAAGCCACCATATTGGATGGTAGTTTTCCACTGAGATCACCAGCCAT 

AAAGACCAACGCAGGACGCTGGTTAACTACACCAGGAACTCAGTTTAGACCTCCTAGGATCTGCATATGG 
AGCTCAATCCCTGTGGACAGAGATCTAGAAGTGCTTT 

3 5 T CACTGAG CATCCACT CATTG ACAAGCTTTGTGGTGGG CAGTGGGGTGCATGGT AGTAAGGAACACATTGCTTT C CT CCGTT CC CT 

GG AAGTGTTCACGGTGGAGGG AAGCTGTG C AGGGGAG AG T AAG AG CAAGCAGC CTG AGGAAGCAAGT C T CAAACT CTGTG CAGG CG 

GTTGTTTCCX5GTACAGACTCAGATTCTCACCAAAGCTTC^ 

TGAAGGGACTTGATTCCAGCTACTTACAGGX3TAGT(IACT 

4 0 ATATATCCTTGGTCTCATTTCTAGCCCACACGTAAGGATCGACTGTTGAGAAACAOT 

AAACCTCCCTCTTAGAAAATCACCTGCTATG CCCAGGCTCACTGGTAGTTAAG CCTAATGGTCAAGTGCACGCTATGCCTCGGCCA 

CACTGTAAGACCTGATGCTCTCCCAAAGCCCGTCTAAGGAACCCTATCTGCCCrrCCr 

CCAAGAACTGGGACTTATCTATG<^CTGGT^ACCAACACTAAAGCCAAATGCTGGGGACCTCT 

TTGCAGTGTT CTGAGAACCTCT CTACAAGGTCTTCCTCACAG CCGGATACAGCCTGATGCCAGTCACCCCC CACCATCACCAGCAG 

4 5 CAGCAGGAGCAACAGCAGCAGGAGCGGCAGCAGCCGCCTTC 

GGAAGGGAGCCCTAGAGCCATACAAGCCTTCC1AGGTCTCTCCAGCATCAAGGATTCATTCT 
CCCTTAGCCACTGCCTCTrAACCAAAAGGGTCTGGGCTAGCrCT 

AATGTCTGOTGGGGAACCAGCACTTCACCTTTAGGAAAGAGGCGTGGGGATCAATCTGTOT 
GAGGAGGAACCAAGGTCTCCTCTGTGGGTGAACAAAATGGGCCTGGTGTACACGCT^ 

5 0 GACACCATGXZAGCACACGGTGCTACCCACAATGCACTGGCTCTCCT 

CCACGAGCCTGGCGCCCGC^CCCCCGCCCCGCAGCCCGCACTCACTCGCGTGC^ 
GACTGGTGACTAGGGATCTGAGGCAGCTCACATTTCAAACTTGCTCCC^ 

CACGACCCGACGAGTXSGAGCTCCAGCCTCCGGGCGGGGGCGGCGACCAGGCCCCTCTCCCrCCCCCCG 
AGGAACACTTGTGAGAGTGGGCTCCTGGAGCGCTGTCGCCCGAGGGGGCCTCAGG 

5 5 CTCCCGCCCCCGCCAGCGAATCTTCCCAAGAGGTCATGCTAATGAGGGTCCGCAGCGGGGCC 

GCC^GCAGGCTGGGGACATCCAGTGGCAACTTGGGTTTAAAGGGCTTTCTGAAAG^ 
CATCTTACTCCCCCAGCTTAGGGGGTCCCTGTAGCCCCTtSGGATTCT 
CACTCTGCCTCCCTGACTTCGGGGAGTTCCTGTGCCCCGCAGCCGGCGGCGGGAGGGAAG 
CGCCCCTGCTCCCX3GTGCTGCCGTCTTCCTGACACTGCATCCTGTG<3TC 

6 0 CGCGACCCGTGCTTGCCCCTCCCACTCCTCTGCCCC^T^^ 

CCCTCCCITCCITTGAGGGTTCTATCCACCTAGATCACTCACT 

ACCAGTGAAAGTTCTTTGAATGAGCTCCTAACTGCCTCAATTAGTCGAGGAGGAAAGGCTC^TCAC 

CAAGAGCGGCTTTAAATTCTCTGCATCTGCACACTGATCTCTCCAGCTTGG 

GTGAGCCTCCTTCACACGCCCCTCCAGTATGAGCTCCTTCCT 

6 5 AAGTGGCAAGACAAATCACTTATCTAGACACTGGGCTCTTACTCCTTGTTTGACAGAAG 

TGCCACTTGAAATATCACITTTGGGATGGCATTTGGAAATGCAGTTC 
AATTTCTACTTCATAACTGGATTTTTCTTTTTGCC 

ACCTCCAGGCAGTGCTGAGGGACAGACTGACTGCATTCTGCTGTCCACAC 
AATGGCTTCTTTGAGGACATTCATTTACCCACTGG 

7 0 CTCCAAGAAACTAGATGAAGGCTCTCCCATCCTTCCCACTGCT 

CAG AAG CACACACGTC CCTAAGACCTACTTTCTG GTGT CAGG C TAAC TTAACAATATCAGGG CTCAGTTTCGATTAATCCAAGCAC 
GAAAGCCTCTAAATAACTAGAACCCACACCCC^TCCTCCCAAAGTTCTGTACAATGACAGTTTA 
GCCTGTTQ^GTGGCCAAGGGCTTACACACACAG^ 
AGTGGTTAAAAGAGGTT^CTTGCTCTTGCAGAGG 
7 5 C(^GGTTTAGGGAAATTCAACACCCTCTTCTAGTCCGAGGGCACCAGACACACAAGTGGA^ 
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ATTCATAC^TGATATATACATATATAAAT 

GTGAACATTCAGGCTGGAACTGAGCAAAGTTTACACCTTAGGGG 
GCTTCCCGTTGTTTTGCCTCCTCGTCCT 

ATGGAAATAATATTCAAGATACAJ3GGTTTTCCAAGGTAACCCAAAAA 
5 ACAAATTGTTTTCAAGCACCCATTCATCAGTCCAAACTGGCCC 
CGCrCAAGGCCCTCTTGGTTAGC&GCCT 
GGGCTTTCCCTGACTCGCTCCAAGTTCCACATCTCTAACTTC 
CCCACCCCACCCCACCCCGCAGTGCITGCCTTGATAAGGATCTA 

1 0 GGGGGGTAGAAGAAAGGAAGAAGAAAGAAAGGGCGAGCTGTAGCCCCATGCTGTGGCTCCTT 
GACCTGACTAAGGATCAATAGCX3GGTGGCTO 
AAATACAGTGCTGTGGCATGGCTGGCTGGCTCCGGAGTCG^ 

CCTCTAGAGGAGGGGTGAGAAGGGAGAT AATGTAGCCAGG ACAC CACAG T CC CAAGOTTGGGTCAAAGGACTGACITC CTTCAGCT 
CATGGATGAG CT AGAATCAGTTC CAATAAGACAAGGGACCAAGGCAT CT ACCTGGGATG CATGT AC CAG GGGTCAAAAG TGGGTCA 
1 5 CTTCCCACTTACAGCTTTTTCTTTTTCT 

GGAAGAAAGGAAGGAAGGCTAACTAAGTGGGCAGGCAGGCCTGTCATGCACAA 
CCAGCCTTGATGTGGCTCAGATTCTGGGCATTGGGTTCTG 

CAGAGGGATTGACAAGAATTCTGATGACCTGGAGAGATGTCACTTCTGTGTACTCA^ 

GAGATGG CAAGTGGGTGTGTACCTGGTCGT TGGG AAG CGTGGAGAAGAACAAGGGGGGACAG AAAGG C 
2 0 ATAAAGAGACTGGGTCACAGGGCACCAGAC^TACAGACAGCTAAGGATCAAATGAC 
GGCTCTGCCTCTGGGAACCACCAATCCTGGGGCrr^ 

GTACX3GAGCCTAAAAGGTCGCCTATACCAGGGACACAGTGCAGGGCCAC 
CCATTGAATGCTTGCTGGCTGCTGTATGGGTTTCATCATGTCCTGAGCACTAAC^ 

2 5 AGTTATTATGTGCCACATGGAAGGTAAAATGGGGATAGACTGACC^GGTGAAGTTTGCTAAGTATCTAAGCTACTTCTCAGCCTTG 

ATACATCCTCAGCATGCTGCACATT ATACTGTCTGGCACGGGATG CC TAG AC!TCAGCACTGATGGTGCCCACCTGGGCCACACAGC 
CAGCACAGTGTGCAGAACTCTTATATAGCTCCTAGAGTATCTCTTTTTAAA 

GAGGCAGCTCGACCTGAGATGAC^CAGGAAGGAGACCrrCATTACCTCCCTCCTCCCATTATAGTATCCCC 
CACCCCCCCCCGCCCC^CCCCCAGCACGGCTCCTCATCCCAGACCCAGGCAGCCTTCTCAGAACCC 

3 0 ATC CTAT AAT CATTT GATTTTGAAAACAACACCAG CCCCCTGT C C CT CACCTC CACT CAGCATG T CCGAGAAAC C CT ACAATTTCC 

AGGTTATTGG AG ATCCAGGAAAGCTTT CTG CAGGG CAGGCAGT GG GGGJU^G^^GGAGTGAGGTGGGCATTTAGGAGCTGGCTCAGA 
ATGGCCCTGTTAGTCTTAGGGAATTGCTTGCCCCCTACCTGCTC^ 
CCCCCTCAGTAGCACTGATGCTGATGGAGAGCCTGGCCTGCAAG 
GTTGCIGTGTTCCTGGACTTCCCTGAGGCACTCAGGAGGCAGAGGTTTAAG 

3 5 TCT ACTGG ACTTCAAATG AGTGAGAACAC C CAACAG CAGG TCCTAT AGTGAAAAACC CAACT AAGACAGTTGAG ATAGGTTG AGAA 

GTGATGGTATTGGGGTAAATTGTATGCTTCCCCAA 

GCTGAAGATCAAGTTAAGATGAGGTCATGAGTGTGGGCCCTGGTCCAGCATGGCTGGTGTCCOT 

TACAACAAGCAGACGGTACCCTCTTCAAGGAGGAGTGAGAAGGACAGTGGAGACT^ 

TAAGACACrTCTAGCAGCTTGGAGGAGCCCTCCTTCCCTGGTACCTACATAGGAGCT 

4 0 TTTTGGGCCCCCAAACTATGAGATAGTTAATGCACACCATTATGGTACIATGTGT^ 

ACCATGAGGTGATCCCTTTGAAATCTAAGTCACAGCCCATCATGATTCCAGAGGTTCTTGCTGACTCGGTCCTGATTATGGC 
GATGTGGGTCTCAGCrrTCn'GTGGCCAGTCACTGCTCTAAATCCATTTCCT^ 
GAGTTAGCCATCCTGACTTGGCTCCCTCTTGCAGGGTCTCCCTTCGCTGAAAGTCCT 
CCTTCCCACCTCACCCCGTGCTTCTTCTCTATCTTAGCTTTGCC^ 
45 CTGGCTAAAAGGCGGGATCTTTGTCAACTGGAATCCTTGTCrGATACACGCT^ 

T AAATACTCTT TGTAT AAAGGAGACAAATAG CT AC CTGAATT AATGAAGGAGCGAAG AAATGAGTGAGTGTACAC TGGCT AATCAC 
AGGCTCOVGAGATCTGC^TGTCCrGGCTACCTGTGCCTGCCTATCCAGGGACTAGGTTGC 
CT ATT TCCCAGGAAGTCTAT1TGAAC!AGGAAGCTCZAG ACTGG CCAACAAAGGGGCCGGG 
AGGCCTCTAATGAATCTTTCCCCATTGCTAGAGTTGCTTTTGAAGGGAGGGGTACAGA 

5 0 TCCTGGGAGAGGGGCTGGGGGGAGGGTATTGCTTCTCCCAGGACACAGGCCTGATGAAGGTTTCCAGA 

CACATAAGTTCCTTGAATTTGGAAATGCAATTTCCTTTCTTGCC^ 
ACTGTACACTTCCCAGTCTGTTCGAGGCAGCCTGGGGAATAGCACCTACT^ 

GACTGGATGTGACAATGCACACCAGCCAGGTGGGGGACCAGGCAGGGCTGCAACCCAGGCCCCCTAGC^ 

TCTTTCTTTCTTTTTTTTTTGGTTTTTCAAGACAGGGTTTOTCT 
5 5 TGGCCTTGAACTCAGAAATCCGCCTGCCTCTGTCTCCTGAGAGCTGGGATTAAAG^CGT^ 

CCCGTGAGCTTTGCCIAGGAAACTGCrACCTGGTGCCATCTTCCCCTCTTCTCCCTGTCT 

CTGCCTGCTTCTTATCGGTGCCATTTCCC!ACAGCGGCTTCT^ 

CTGGACGTGAGCAGACAGGCCATGKIAGGAATGTCTGTCTGGGCAAAATCT 

CTCGATGTTGAAATCAGGCTCCTTGGAGCTGAGCTCAGCTGAGCAGC^ 
60 GGACCAAGGGCAGGT(^CCGTCCTTTTCTGGGCCCTGGTTTCCTTCCATCACCAGGGAAGTCAG 

CATGGGGCCAGGCAAG AG ACTCC CAATACACAT CG TG CTGT CAAG ACCAAGAAGCTG C CTCTAAAGGAG CCAACATG CCACACTTC 
TTTGCCTTTCCCTTTCCCGGAGCTTATTAATGAGGTTCTAATTCTCTTAGACCAGGTACTCCATAAAGGCAGCAAAGCAGCTOT 
CTATTTTGTTCACTTCTGCATCCTTOATGATTTTGTGCCTGGACCAAGACAGGTACCGGACAAATG 
65 GAGCTACCAACTTATTAGTTTACTGGTAGGATGTCAGCTCTACCCT 

CGTATTATTTTTAGGATTAAAAATGCATAACCAACTTTCATTTTAAACTT^ 

TTTTCTGAAAAGTGCTATATACCGTATTCATAACCAGGGTATCAAGCATGAAACAATTTCTTAGCACCAACCGACT 
TGGCGTACTGGOSGCTCAGGGTGAGGTACGGCCAGTTGAAATGCT 

CCCCAGTACAGTCTTGTTCCCGTCTCTAAACTGTATTCTCTGCCTCTGGCCCTAGCTTGCTOT 
7 0 CAAAGCTGACCATGTCACTTGTGTGAACATCAACCTTGCTTT 

GCCATTCATGGCCTCAAGTGATGCTGTCCTTGGCTTGGATGGCT^^ 

AGAAACCTGTTGGCCATGCAGGAGTGGCAGGGCCGTGGAGGCTCTCAGTCTC 
AGGAAGAATAAGTCTCTGCCAGTATCTCAGGCTGTCAAAGAGGCCTCATTTCTGGTCC^ 
75 GTGCTGTGTCCCTCTAGAAATCfGCTACCTATTAAGAGACT 
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ACCCTCC^GAAGGAGATGATTTCCTTGATGAAGT 

GGTGCCAGCAJGATATCAAGGACCCCCCCACCAGGCCCATATCTGCTAGC^ 

CTGTAGATTTCTGTTG TCTATAATGCGCT CAGATCGATGTGTTTTGTGGTAGCATGTTG CGCAT CACCTGCTCTCTGCTTCX3GTTT 
ACTCAGTGCACGCGC^GTGTGGCAGCCCTGGGTTCCAAGCCACTGCTGCTCAATGCTCT 
5 GGGTTCCAGACAGACCTGGGAGCATGGTATGTAATTGAATTTGGCCTAAT^ 
AGCCCTGGGTGTCAGCGGAGGCTGGAAATCAGGAGCCAAGAGAGCT 
CCCCAGAGGTAGGTAGCACTGATCCrreTC!ACACAAAAGAAGCAGCTCGAGGCTCTGAGT 
CTCTGTGGCCAGGGCCTCTCGACCATGCACrcrCCTCTGCCTGGGATGTCATTCTCACAC^ 

GGGCATCCAAACAAGGGTT ACTTT CTCAG CCG AGCCTTCCTAGGTrCT CAG CCATGAG CT AG CTGCCTT CACTGTCCr ITGCAGAG 
10 GCCTCATGCTGTAGGGGGAGGAGTGCTCAGCTCACA^ 

GAGGCCCTCCTTCAAGGCCTAGAGCTGGGGGCTTCAGATGAGTCCTCAACrrTGAGGTGAGG 



AGATCAGGCTGACTTAAGCAAAGACAAGCCTCTGCGTCCCAAGGTATATGGTATATGGTATATG^ 
TATATGGGAGTTCAGTTCAGTCAATCTGAGTTGGTGGCCCACTTC 
1 5 GGCTGTCCCCAGTCTTCTAGGCATCCCTCrrCCTCTTTCAGGGAC^ 

TAGTCCCACCATTTGCTACTTGAGGGATGTCCAGCATGTCACTCACCTCCCTGAG 

GGGAGAACGAGTGTGTCATGCTTGCCCAGAGTCCACAGCATGCTGAAGACTCAAAGCATCTGCTCCCCAGCGCT^ 
GGGC CGAGCAGTGCCTGTCTAAAGTG CTGGGT AC CAGAAGTAT CTCAAATTTCTAGGTTCAGG AG AT AGTAAAATATGTGCAT ATA 
CATAT CCTGGGAAGGAAACCCGAGTCT AACAGTAAATT AAT TTCCCAAT CACAGG CAC CTT ATGTGCAAGGAGTCTGAAGGAAAAA 
20 TTTAACAACAGAGTGTACTT'ACTGTTAAAACT CTCAGGG TGGCTGAGGGTGA 

CACTGTTTTCCCACCTGTACrrGTCTCAGTGCrAGGATGGCAG 
GCCAGGCAAGCGTTCTACCAACCAAACC^CACCCTGGTCCTC 
ATTTTTCATCCTTGAGTACTCAACTGCCAGGCACGGTGGGGAAGACC^ 
AGGAGGGGTTTGAGGGCAGCCTGGATTGTATGAAATCTTGGCITAAGAAACTTTGAAAG 

2 5 TCTGATTGTTACCAGGGGCAGGAAGAGCAATGGTCACA 

TAGACCATGTTCCAGTCT^TGCTGGTCTCTTCTCAGATCTGACAGGAGATGGGCCAGTGACCCT 
TCACATCAGAGAGCAGGAACAAAAGCAGGAAGCCCAAGGCCATCACATACACTCTGTGTCATCTCCAGC 
CTCAACCTACCTGTCACCCAGGGCCTFCCrCTTT'CATAAGG 
TTGGCAAAGGAAGC^CAGAGTACTCATATAACCCCCGGGAG^ 

3 0 CATTATATT AG AGGAAATGCAG AGTG GTAACGGT AGT CTTC CT A^ CAG CAAAGGCTGGGG CTGTGTGC C CTGGT ACA 

CAGCAGCCACTCCTATGAGGTCTCCCCCGAAATGGATCATCCT^ 

CTGCTCACAGAGAATCAACCTCCAACTACACTGGAAGGGTCAGGACACG^ 

ACTTTCTTGTAGAATCTGAAGTCCITGGTCCCCAATCCC 

3 5 CTTAGCTTATGAGACCCCATCTTTAGGTAATCTGCTTAGCn'TAGTAACAGGAGTCCCACATAG^ 

AGGGACGAGTCrrGGCACCiUUUVGCTTAAAACTAACTGTC 
ATTGAAAGGGACAGACTCTCATCTCrrTCTTT^ 

CTGGAATCTCTCTCCAGGAATCACAAAGTTGCCTCTGTTCACATAAGGGGACTCAGGG 
CTGGGAGGGAGTGATTCCAAAC^TCTGGGGCACGTCTGAGAGGCGATCAGGACCATAAA 

4 0 GTAGCTATGCTATGTCTATGGTTGAGGGCATTCAGGAATGGGCAGAGCTAAAC^G 

AGAACAAGATGTGATGGGGACTTCAAGTTGGGGTACAGC^^ 

CAGAGAAAGACAAAGAAGATGGGCAGATTAGGATGGGGGAGGGGCAGAGGTGGAAAAGGAAATAGAGACTA 

TGGAGGGAGGGAGAGAGAGGAGGGCTCTCAGGAAGGAAGCCTCAGGCCCCAGGAAATGGAGCCTC 

AGTAGCATACCCTGGAGGATGTGGTCACCTAGTTTTGTCTTGACCT^ 

4 5 CTCCTGTGAGCITCAGTTTCCTCAACCAGAGAAGGAAAG 

TTCCCCACCTCCACAAGCGTCTCATGCCCTGCCCTCT 

AGCTTCGCCATGCCCCCTAACTCTATCCTCCCAGAATGTGGCTCTCXrrG^ 

CAGTGTGTGTGCTCTTTGAAGGTACAGACAATCTGGGGGCAC^ 

G OU^TTTCTGCTGACTATAATTATGAGGCAGTTTGAAAGGCAGG CTCCTAGTAATTCCTGTTTCCTGCCAGAGACAATAGTGTCT 

5 0 CTTCATTCCAAACAGGTGAGACTGGGTCCTCTG^ 

TCAACAAGCCACCATGTCCCTGGGAAGGTAATGAGGCATCTGTGAATGAC^ 
GGCJ^CTGCCATAGCCAAGATGGCCCCGCTCCCTTCCAGCC^CCT 
TGCAGAGCITCCA(^AGGAAAACA(ZAAACACCrG^ 
GATAGTAGGGTTGCCACGGAAA<ZAGJ^TCGCTGCCGCTCTAGCCGA^ 

5 5 AATTGAGGT<BATTTTCATAAAGCTACATC!AGTAGGTTT^ 

CTTATAAGTTTATAAGTGATCGGTACTCTGGTCTCCAGTGTT^ 
GGGGCTTCAGGATAAGGCCGGGGTCAGGAAGATGGTGGGGCTAGGCTGCT^ 
GCCCTTGGTTTTCTTCACTTTGTTTTCrrCATTCTGAGCTTTAAGTGGCCTATGAAC^ 
TGAAGATCACCGGAAAGTGGCAAAAGAGGGCCTATGGTCCTGGGCCAGC^^ 

6 0 GCACATACAGAGCTGGGCAGCTCAGAGAAGGC^^ 

AGAAAG AAGAGGTGGAG CCAGGGGAGAGGAG AGAGAGTG CTGTG C CTG GAGT CAG CTGGCT AAAGC T AT AC C CT CTT CTGCACTGG 

GTTCGTCC^CTTGCCGAGTCCCTGCAATGGGTTGGGTGGGATTGTGATC 
CCAGACAAGAGCTCCATGAATGTCACTGCCTCCTTGCTTCCAATCCT 
65 CTGCCTCCCACACCAGGCTCAG^CTCCTGGTCATGAAGAGAGGAGA 

CTCGACTTGGTGTCATAGGGAGTGCCCACTCTTATGCCTGGTATTTAGTCCCCCTGACCTCCA 
TGCAACCGATGCAAACMGTCCTTTGCTCTT'CAGCCCCCAGCTTTGTGTTCC 
TTCAGCATGCTGGGTAAATTTGCTAACACATAGCCATCACCAAACAATAAGGCT 
TCAAAACCTATGACCTCAAATACACACAAAGGTTGGTGACT 

7 0 AGAGCCAGGCGGACTGCTTCTGAGCCTGCACGTTTAACTGTTTCTCTTCT 

GTCAATTAGGGCCAGCCCAGACTCTCTGTGCCTCACCCTT^ 

AGAGACAAAGTGCAGG C C ACATGGAT AG CTTG AGTCATTAG CATTTTTTATTGTTTATTATTTGTTTGTG TAGC CCAGGTTGGCCT 
TGAACTTATTATCCTCCMCrGGCCCATATTCCTTGTTCTTTACAm 

CATACATGTGGGGGTCAGAGGACGACATGCAGCAGTTGGTTTTCTCCATGTACGATGTCGGCrrCC^ 
75 AGGCTTAGTGACAGGTACCTTTATCCACTCAGTCA 
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TGTGTCTGTCTGTCTCTCTGTCTCTCTOT 
GGTCTCAAACTCAGAGATTTGCTTCT 

CTATATAGATCTGGCTGTCCTGGAACTCACTGTGTAGACCAGGCTGGT 

ATTAACCATGTCTGCCACCATTTTTTTTTTAAATCTCT 
5 TGTAACCATAACATGCATGTTATAGCACACACATGGAGGTCAGAGAAC^ 

GAAACTGCAATATCACAAG CTTCCAGGTTACT CTTGTCTTGATCTC CCATCTTGT ACCAGGAACACTTGGATTAC^ T 

GCTCTG CT CATCTGTATGTGG CTGCTGGGAATT AGAATTCAGGTTTTCATTT^ 

TAAACCTCCCTCCCr X M r i T ri" lTlTT TTAAATAAAGATTTATTTATTArrCT 

GCGCCACCTTCTCCTGATCTTGTTACGGATGG^ 
1 0 GTCAGTGTTCTTAACCGCTGAGCCAT CTCT 

CGG CTGGCCTCAAT CAAGTATTTACTTTGCTGAGACTGGCCTTGAACT CCTGCATTAGCATCCCAAAGGCTGGGATT 

ACAAGTATTC!ACTACCACACCCAACAGATATTGCrTGAGTCACAGATCTGTCT 

TCACCTACACCAGGCAATGGGTG03AACCGCTGGGGTCAGGACAAGGGATGAATGGCGACT 

TTGTGGCTGAAAGGGGATATAAACTAGATTTTCTCCTAATCCAGATGCCCTTTCCTGACCTA^ 
1 5 AAGAT AGAT AATAACTGCTGATT CTTGTCTGTCT CTTC CAGAAGGT ATTT ACGAG ACTGTCGG CTTG CAACACT CGG AGAAAAAGT 

TCCCATACTAACTTGCTCACTGCCTGGCAAGCCCTGGATCCATTTCTCCCTCCT 

TGAGAATGCAAAGCCTGCCTTAAGATCAAAGCCCTGGCTTACTTGATCCTCTCA 

GTTGGCTTTATTAAGCTCTACCTCCTTGAGGTACCAC^ 

TTTGTGTTGGGAAGCTGGGGGCTGGTATAGAAAGGAGTCACCCATAGGGTG^ 

GAGAACTTGTTTTTGCCTTCAAGCCTGCCTTC 

TCTCTCTCTGCATCTTCAGCCTGTGCTTTCAGTGGGACC^GCACTGGAATCCCATCA 
AGCAGCACCGAGCATCCCCTGACTCTTCCTCCCTCTTGCACCCGTGAAATGGTGCAGATGCCTTAGA 
25 - TG G ATGG AAT AAGTGTC CATCATCCACAGGTCATGAC CCACAGG ATGGCT CACAGG G CT CCAGGC C CCCACAAG AAAACTATTG TT 
CAACGGAGACTCAGGAAGTGGCITTCATCGTCTTC 

TCT AGG CAGTGGCATCTAGTT CTTTTTTTACAACCCCCTGTGT AGAAGC CAAAT G AATAAAG CAAAACTCT CAGTATG CAGTCATA 
ATTTGTOTAGCCTGGAGCCTGTCCAGGAGATGAGAAGGCTGCAGGGGACTGTGGCA 
CCTTATCTGCTCTCTGCCACGCAGCACACACTGACCTAGCACGTGGCTTT 
3 0 AGCCCATCCrCTCCIATCCCrCACCCCAGACAGCCCTCAAGTAAGGATGGAGAA^ 

GTGGCTCC!TCACTGGTGAGAGTGAGGGGGTGAGTAACAGAGCCGCA.GGCAGGCTGAGCCCC^ 

TAACAGCCACACTTCGCTGCGGTCTCTCCCTCT 

AC^CACATCCTGGATCCCTTCCCCrCTACTCGCAGGCGCCA^ 

TAGGGTCACAGCTGCAGATGTAGACACCrrCCTACCGTGGGTGGATGCCCrCATCCCCTCCCTATCCCCT 

3 5 ATGGGCTTAGAAAGTAGACACCTGGCTCAGCCCCAGGCAGCCCTC 

TAATTCTGTGAGAGGCMGAAGAATGGCCTTCTGAC^GAAGAGGCCTGTCCCCTCTGGCTCrrGGGGCA 
GTATAGAGAAACAGCTGAGCCGCCTAGGAGTTGCTTCTGTTCTGAAC!ACCCCTAACAACrCCAATGAG 
AAGGGTGGGAACACACCCACCTCCTTTGTAGTTGATAGATCTTGTAAGTCAATATT^ 
GTACTCAAGGGCAGCTTCTGTACCTGACAACTCTAAAACTAACTAACTAACTAACT 
40 TTGACAC^CACGCATACACACACACTTA 

ACACACATTCCTAAATAAATAAAAACAATCTGCTCAGTCT^ 

TTGGACGGCCATTCGGTATGCTCTTCCTTGGTAAGCTGGTTGGTGTTTTTAAAAGGTTAACCTGACG 

ATGGCCCTTGTTTTTAGTGGCCACTGTGGCCATGGAGAAACTCAGCCCTTTCCCTGAGTGCAATGCTCTT 

TTCTAACACAGCTCTGCCTCTTCTCCAAAGTGA 

4 5 TGGGAATGCTGGGCCCACAAGTTTCCTCCCACTGGTCCTCTTTGTTGAAGAGCA^ 

TCAGCGAGATGGGAGTTGGGGGAACCAGGGTCATTGCCCCAGTCTTCCTGG 

TCCTCCTCTGTAAATGGGTGTGCTAACAACAGAG 
TATCTGATGGTTAGTCAGTTCCCACAGGCTGGCAGTAAAATCCAG 

5 0 CTG CAGAAGGAAGGAAGCCAAGC CCTCCCC CACAGCCCAG CTCTGCAGAGTTCA 

GGAAGTCTGTCCCCTGGCCTTGAGAAGGTGCCCCAGCCTGCCA 

CTCTGGGGGAACTGGCAGCCACTAACAAATCCCCTGCAATCTGCACCCACCAATCCTGCCTGCT 
TTTCACGGGGAGCTGCACACAGCTTTCTCTT^ 

AGGGAAGGCAGCTGAATGCTCTGTTGGGGTGGGC^GAGGCAGAGAGGGTGGAGAAAGGCTATGGGGGAGGGTGAAGGTAGAGATA^ 

5 5 CAGCCACGACACTATtlAGAGACCAGAAGTGCTGTCCACCAC^ 

AACCTTGTTTCCACAGCAATGGGACACTGGTATATGACCGGCCTTC^ 
CTGACTXK3TTTTGGGTGCCTTTGTCCCATCCC^ 
ATTCCAGCCCCTATCCCTGAACTGAGAGCTCTAGA 
AGGCAAGGCATGTCTCAGTGTCTCCACTAAGACCAGCTGACA^ 

6 0 TCTGGGGACTTAAAGCCAGCAAGCGACATTATTTCCACTCCCGTTCTGGGACCTGGGOT 

G AGGT CAG AGGTCAG AGAAC CCAGT AG ACT TT ACCCATCTTGGG GATG CAG ACCCAAGGTTTC CATTCAAAGG ATG CAGAACAT CT 

GCGTGATGGTCCTACTCCACGAAGAGCAAGTAAGTGCACATAACTA^ 

AGCCAGCTCTGGGCAGCCTGGCTGTGTTAAGATTGGCAAAGACCCT 

TCCCTTCTCCAAAAGCT^GGCTGGTGCCTGTCAGGATGGCCTGGGTATTCTTGATGCCTCCCACAGTAGCTC 

6 5 CAATAGTCCCTTTAAACCACCCAAGCATCCAAGCATTGGCT 

AGCAGCTGGCACCCCCACCTGTGCCCCAGCCCCATTATGGATAGCTTGGGGCGCATC^ 
CCTTCTGGGCTCTGACATTCTCTATTCTGCTATTTCA^ 
AAGCATAGAJCACCAGCGGAGGCATGCAGGCAACTCT^ 
TGAACAGAAGGCTGAACCACTGAGTGAGCGCCCAAGAGGCTGACATT^ 
70 TGGTGGAGG<MGCTGATGGGCATTGCTACAGCCTTGG 

TATCTAGGAAGGGGTGGGGGGTGTGGGAGGGTGCAGCCGGCCCCTCAGGAGCCTCAAGATG^ 

GTGCTCCTGCTCCCGTGCGCGGA.GAGGCTAAAGGCTGCAGCAGAACAAAGAGTTGGGGGCT 

GCGGCTGCCTGGGCGGCTTGCTGTAGTGTCTGGGCTGTGCTCAGCTCTGCATTCCGTCTCTGTCT 

GTGCTGCCCCCAGGCCTGAGCCTG CTCACCGTCACCACCCTGCTCACTGCCACAAC CACGCACACAGCCCAACTCTTTGGTAGCAT 

7 5 TCTTGACAAAGCATCTCCCCTCAC^GCATGATCTCATTGCAG^ 
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AGG CAGAAGGGTGGGAGAGGCGC C CAT AGATGCAG CTAGTTG CGCGTG CACACACTGCT CAG CCATT CCGTAACAGTCATACACCA 

CCCCCAGAACCCACACACATCTGTGAACACCTATAGGTCTCTGTGCACTCAAACCCCTCCACCATCACAG 

GTGTTCATACTCAACAGTCAGACATACCAGGCATGAGCATACACGAGTATGC^ 

TACAG CCACACACACAGAGATGCAGACCTTGTACACACATACC CATACAGTC^ CA 

5 AACACACATATATACATACATATACACATTACATACATATGTACATAAATACACACATAATACA^ 
ACAGACC^TGTACATACCCACAC^GCCAGAAATACCAGGCATGAATACACm 
ACTCACAATGCACACATGCACGATACTTATAGC^ 
CTTCACATAAAGACACACATGCATGCACAQCCATACACATGTA 

AAACTCGGAAAGTCACAGAAAGTCAAGGAAG AT AGATATGGCC^ CAGAGGCCCT CAGGGAG 

1 0 GAAGGACAGCAACACTGCACAGGCAACTAGACAAAGTGGAGTCTAG^ 

CCATAGTCTCTGAGCACAGAGTGCCAGGACCAAGGCCTCTTGGGAG ACTT A^ CATGTGCACCAG AAGGG CAGGTT ACATC 

TGGTCATGGACCCACTT ATCCACAGAG CTAAGGGGGAGTGATGTGCATTTTGGG CGGAAAGGAGGGGT CAGGAGATGTGTTG CAGA 

CAGACACACGGAGAACGACGGCTGGAAAGTC^CATGTGCTGGTGT^ 

ATTGAGAGAGCGAGCCCTCTGAGTTCCCAGAGGCATCT 
15 . TGGCTGAGAGATCAACCTTGGGCCCAGCAAGCCCCTCATCCCT 

GCCCTTACACACCTAGGGAAGCrrcGCGAGTCAGGACTGTTGGGTCTGAGGGGTCTGACAGGTCTGCCT 

CACTT CTG CCAAGGCACAGGCATGCTGGCT TAC CAG CTG ACAAAGGAG TT ACATGGACAAAGGAACAAACGGAC GACACTTTGCCT 
TAGGCCCAGCAOSGGAAGGGGTGCnXSTGAACTGTGACCTT'GCCCAG^ 

CAG AATGCTGAAAATGGCTCTTGG AAAGTG CAGCAT CCTGC^ CG AGAGG ATGCTTGG ATC CT AGACAGGAGTCAG 

2 0 CTGAGATGGAGCAGGCAAAGCCCTCAGAGGAGGCTGCAGTCCCGGTTCC^ 

CCCCAAGACTTTTCTGAACTTCAGTGGCCTCATCCGTAAATGTAGATAATAT^ 

TAGAG ACT ATTGT ATGAGCCATTGGG AGAGGGGTTTAAAAG AC C CAG AGGT C AGGTTT CACAGG G CATG AGGCTT CCC CCTGTACC 
AAGGACTACITCTGGCTTCCAGAATGCCCATTCrrCCCATTGGCC^ 
TATCTACCTATCTATCTACCCATCTATCCATCATCCATCCATCCATCCAT 
25 CTACCCATCTATCTATATCCGTATGGAACCCAGGGTCTCATAGATGCTAGGTGTCCTGGTTM 
GGGGTCATCTGAGAAGACAGAACTGCACrrrG 

TAATGATTGATGTGGGAGGGCCCAGGTCACTGTGGGTGGTGCCAACCCCAGGCAGGTGGTCCTGGATTG 

AGCAAGCCACGAGAAGCAAGCCAATAAGCAGCATTACTCCATGGTCTTTGCTTC^GTT 

GCCCTGACTTTCCTTCCTGATGGAATAGGGTTATCTGTAGATGGAAATAAATCCTTTACTCT 

3 0 TTTATCACAGCAATAAAAAACAG CAAAG ACACT AGACAAG CAT CTT AC CACTATG CCATG CCCC AGTG GC CTC CTTAG AGACATGA 

CACTGTCCCCTATGTCCACACTGATCTCTGGCCTATAAAGTTATC^ 

AGC CATGTTCCCATTACCTTT AGGACAGAGAACACAAC CTTTAAGACAGGGTCT ACATGATT CAGCTGCCG TTGGCTT CAG CAGCC 
TCCCTCTGGCCCTGTGCTCITTGCCrrCTAGGTTTCAGAACTACCGTCCTTGTTTTAAT 
TCAAGGACCTTTGCACATCGTGCTCCTTTCACCTGTGAAGGAATTGTCACACA 
35 TGACCCTGCTCCGAGCCTTCCTTC^CAGCACTGAGTACGGCCTGAACCTG 
GCATGTGCACACCCCCCCGACACACACACATTACA 
CACTCTCCATGGTTTACGTGTTTCTAGAACTTACCACGACCCCT 

ACTGAGGCCTAGAGATCACAGCTAGCAAGGGCACTCCCAACAGTCAGTTGTCTGTGTTT^ 
ACIACATATTTATGAGGCAGGTGTAAAAGTGGATCATTAAGAAAGCAAACA 

4 0 CCCAGGTTGCCCCCAAATCCCAGTAACTCTAATGTGTTCCCrrTCAAAACAGAATGC^ 

ACGGACTCAGGACCCCTTGCTGTGGCCACGTCTCAATAGACAATGTTTTGCAGAAGACATTTTTT^ 
GACAGGTTCTGTTTTGTTTTTGAGTGGGTTTTGAATGAATGAGTAAATGAATGAATGAAT 
CAAGGATCACTGTGCATACTAACTGCAATACTC 
CTCACTGTCCTCACTGGTGGAATTGGATAGATAACACAAGGCA 
45 ACCTGCCTTTAGGTCAGTGCCTGTGCTGGGGTGTAACAAATCTGAGTGTTGTCCCTCTGCTGGGAGAC^ 
TCTGTAAGCCATAGAGCTGATCACCCTTGGAGGCTTGCACCGCCCCTAGACCAAACIAAGGCTCCTCCC^ 
TCAAGCCCCAGACAGGTCAGTACTTACCAGO^GACTTGGGGGTGAACTTC 
CCGkGGTCCAGGTCAAACCCTTGTTGGGCAGCCACACGG 

C CCTCAGAGC TGAGC CTGAGT AGGTACACAT AC CCACAACAAT AATG AAG CAG G CAAAGGCG C CAGG CTGTGG CAGGGGT CAAAGG 
50 C CAGATGGAGTT AGGATT CT AGCATGTCT CTTTG AATGGCTT GAATG ATG AC CT AG CT AGG CATT AG CTTGACATGTAGTACCGCC 

TTCGCTAGAGTGGCTCCAGCTAGGTGCTTGCATCACCCCATAGCTGCTCACCTTACCTGAG 
CCTCGGCCTCCTGCCIACTATGTCTGGGCTTCTGTTCCTACCTCAGCAGGTACTC^ 
TTTGGGGTATCTCCTGGAGATGTCCTGCATCCCCTACTCTCCTGCATC^ 

TGCTCTTCCAGTAGTTTACAGGCAGAGTTGCCTATAGAATGGCAAAGGCCCCTGGACCAGGTGGACA 
55 AGCTCAGCCGGGGGTCTGACTTGGCACTAAGTCCTTAGAG 

GGTACTGTGTGGAACTGGGX3TGCCCC!AAGGACCyVAAGAGAGGCCCGGGAGGCTC 
CCTGGGTGACCAGGTCTGTACTCCCATCTrCATTAACA 

ACTTCACCCAGGAGAGCCACTGGGGAAGCCAAGGCTGAGTGGGAGGCTGCTCCCTTCCAGCT 
AGGCCTCTGGGCTCTGCCCTACTTGGATGCTCTTGAGGGGCCTCAGGCTGCAGAGGGTTTA 

6 0 CATCCC^CTTAGGGTGGGGGAGGGCTCACCACAAGGGTTCTATGACTGAAAGGGAAAATCCTTT 

TCTCCTTTTCCTCTCCTTGCATCCCATTCTCTCTCTCTCTCT^ 
TGATCCTTTCGAGTGCTGAGACTGCAGATGTAC 

GAGTGTAACAGAATGAGGGAGGAGGGAGCAGAGTGTCCTTTCCCAGTACCTGCCIAGACCAGAGCTCT^ 
CCCCTGCAGGCCCAAAGCCAGCTGGGATGGGAGGAAGCAGCTAATCTCACTT 
65 TCCTTCCATGCTCCCCAGACTCTTTCCCATGC^CrrCCCTGGCTCTGTTTGCTCTCAAGTCT 
TACCGGCTGCCTCAGCCTTGATCCOTAATAGGATCTGGCCTGTGK3AGGTTTCCAAGTTTCT 

AAGGTTCTGGACTTTCTGAGCTGCAGCTTAGATGCTGCCTTCGGTGTTTTCCGTTTCGAAGCACCCGGGAGCTCAG 
CACTGTGTCTGACTGGTGK^ATGCTAAGGGTTATGCCGG CCCAAGAAACAGTAGGTCTG CTGTGGCCAAGTCTGGCATGCTTTGGG 
CACAAGTTGTCTT TTGTTGCTT AAAG AAAG G GG CTTTG CT CAAGATCAT ATAGCAAAGATATG ACAGAGCAAGCAAGC CAG AGCAT 

7 0 GTGACACTACACAGCCTCACGTGCTGCCCAGCCCTTGGCTGAGTCTGAGC^ 

CATTGGTAAGGTATATTGGTAAAATGGGCJAGATTATATCTC^GACAGGCCTGTTGTATCC 

GATACATGCTAGAAACCAGCTCATCTAAGGCCGTCCrTTCTAGGCATGGCTGGCAGGAACCTAGACGTTAAGAGGGGCGG 
GGTGTTTAGTTTCAATACTTCAGAAGGTGAAACTATTGTCCCA.GAGACAAGACT 
GCTGCGTCTCTGAAGTTAGGGATGAATCTGGGGTGTGCATGTGCCCCTCACAGTCATCTCAAGTC^CT 
7 5 AGTTCCTTCACCTGTGAAACGGGGACAAATGTTCTTATGJVCTAGTAGATTGGAGTGAG 
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GGTGGCCAGTGGCCTAGAGACTCATATTTGAGTTGACT 

AAAC CAAACAAAAAAAACCCT AAAAATCAAAACCAAAC CAACAACAACAAAACCCT CCACAC CCATTTGTTCTTTGCAG^rTGAGTG 
GTTTTT AATTTAT AT CCTCAAAATAAGCCGATCTGGAGGAATTT AGAACT AGCT AC CAGAGGTCACCGAGGGAACTGTGT CAACCT 

TGaSTGTGTGTGCTGGGGAGGGCTTAGCCTCT 
TCCTACTCTGCCTATGCTGTTCACT^^ 
TGCTACCAGAGCCTGTCCCTTCCAGCTGACACCTTGGCT^ 
ACTGTGGTGGCCCACAGCAGGGAGTAGGTAAATGTCTGCTGAACGACTAAGT^ 



1 0 AGTTTGTTTAG<^C^TCCTGTGCATGGCC 

CAACATCTGGCCCTCrrCCATGGGGCTTAGGCTGGGCCCAGATCCrGCAAAG 
GACTGAAGGATAAGGAAGACAGGGCCACGCAGGTGGCTTTGAGTG^ 

TCTGTGC CATGTTTTAGATGGACTGGAACACXZ CCCTTTAGGT TCACATGGGCCTCAG CTAGAGAC CAAG ATGCTTTTAAG ACTGGG 

TGGCATGGCAGTCAGTTCTGAGTCTTTGTGTCTTCCrC^ 
1 5 TGCCACAAGCATTT CAGCCTTT CT CAGAACAAGAT AG CCTAATG TGGTCATCATG TCT CCTCTTCTAAACCAG AG CT AAG CTCGGT 

TCTTTGTCXICTCTCCTGTCTGGAGCACCTTAGGAG 

CACGCAGGCTCCACACAGTGTGATTCACACT 

GCTGGGGGACATTTAAAAGCTACAGAAGACTCCCAGGTTCTTT 

ACCCAGTTCTCTGGACACCAGGGGCCTTTGCACACGCTGCT 
2 0 TGTTGTCATATGTAGGCGTGGTTTTACATGACTA^ 

TTATTCTCAGACATTCCTCCA.CATTGCTGCTCCTTACTAAGAAAT 

TTTATCTTGGGGTTATCCAGGACTTGGCAGAGGGGACCACAAGTGATAA^ 

GAATGTCACAGGGG CT CTATACCTG CTCTTGAGTGG ACAGACAG ACAAC CAGAGCGAAGCTCAAT AAATGCTCAGCCGATAAAGG A 
ATAAATGGCCTTGGGAAAAGAAGGCTGAGTCCTCCGAGCTCC^ 

2 5 CAAGTGTGAGGAAGCCACACATCCCATAAAGACGGTCTCCATATCA^ 

GCAGCATGGGC^GCTGTTCCCCCAGCATTTGGGGCCC^aVGAAA^ 
GTGTGCTATGAAGGGGAAGTTGAAGACTCACCGTCAGGCAGCCCATGA^ 
CCACACCAATCCAGCAGOTCTGTAGCAGCATTGTGGAAGTTGGCGGGGTT 
AGGCTGCAAGAAGAGGTCAGGGAACAGAGGAGCACAGGACTAGCCAGG 

3 0 GATT CT CCAG CT TTCTCAAACAC CCTCT ATGCAAAG C CTGG C CC ATGGCAGCAGATG AG ACCAG CGGAC CTATTTC CGGGATG AGA 

ATCAGGGCFAGTAAAGGCCAGAGCCATCAGACATCCAGGTGCTCT 

GAAGGTGGTACCCGTCTCTATACTTGCGGCCAGCAGGCTGTGGCGGTGTCTTCCTAACAACCCA 

GGCTTAGTGTTOTTAGCTTCAAAGGCCAGCTCCTCAGGTCTTGATCTGAGCC 

3 5 GAAAGT CAG AATCCTGGGG CTTTCAGAG ATACTTGAGG CCCCAGG CTGCCAG CATGGAGGAG TTG ACGAGATCT CACTT CATCCTT 

TATATGGACCCTCAGCTTCCTGGGTCCTTGTCATTTGGCACTGGGACCAGGAGAGAGA^ 
CACAGCTGCGATTTGACCTAGTGGTAGGTATGAGCCCAGAGGTTATCATGTAGAAACCAGATTCA^ 
TCCAGGCCCCTTGGTATTGGTAGGATAGC^GGATCTCACTGGCCTAGATGGGCCCTGTAACrCCT 
AGCCAGGGO^GTGGGTCTTGGGGTTGGCGCCCCAGAATGA 

4 0 TCTGCCTGTGCCCCTTATTAATCTAGGATGGTCCTTTCACCTCT 

CCCATGTGGTCATGGGAATTTTGGTTCCTTGTGCITATCAGAGTATGGTTCGTGCATGGAAACGCT 
CTCAAAAAAAAACCCCCTCTCCACGAAGAAGAGACAGAGGGAGAAAA 
AGATGACAATTAAACAAGACCAGCTGGGCTTGTGTGGTCCAGGCACAGCACCT 
GGGCTCAGGGTTAGGGTTAGGTCTCAGCCTTTCAGAGGCTCCCaiCCTCCT 
4 5 TAAAGCTGC!AGCTCCTTGATCACCAAGCCAGGGAGGGGCCGCT<3CATACATAGCCA 
TGCTTCTGGTGTTCACTTCTAAATCTGCCrrTTCAGAGGCCCCAGGGAGCACT 
AGCTCCTTCn'GGCrTGCGCTCCCCTGACAACCAGGTGTTT 
GCACATGCACCTACTCTCIAGGTGACCATAGCAATGCTCTTTGGAGA 
C 

50 

MOUSE SEQUENCE - mRNA 

CGAGCGGCGGCGGCGGCGGGCX3CCCGGGAGAGCAAGCTGCCGGCCGGGCGAGCAGCX3ATCCGG 
OTCGCAGCGCXjCTGACCCrAGCCCCGGCCGGTGAGGAGCCGGAGCCGGAGTCGGAGCCG 
CCTCCCGGCTCCAGCCTGCCCTACACGGaSGTGGC^^ 
55 ACAG(ZATAGAATCTACTC^GCATCTCCTTCGCCA(^GCCTCCCTTC^ 

GAGCCCTAGCAGX3AGGATGGACCTTGAGTGAAGTGGAAGAGCTGCTGGACCACGTT 
GCCAGAATGAATTCTATGGACAGGCACATCCAGCAGACCAATG 

CTT C CACAATG CTGCTACAG AG C TGCHX3GATTGGTGTGGAGAC CCT CG AG CCTT C CAGAGGCC CT T T GAG CAGAGCCTCATGGG CT 
GCCTGACGGTTGTCAGCCGTGTGGCTGCCCAACAAGGGTTTGACCrrGGACCTGGGCTACA 

GAGCCGCCAGAACGACCCCCCTXiGGAAACTGCCCATGCAGCCC^ 

GTGATGGGTCATTTCCTTATGACTCTCTCCCTTGGCAGCAGAACACCAATC^ 

TGGGGAGTX3ACCAACACATCCCAGAGTC!AGX3TCCTCGGGAA 

CATGGCATCAGG CATGAGCAC CAG<ZAACCCCGGC71TCAACTCCCCACAGTTCGCAGGG CAACAGCAACAGTTCTCCACCAAAGCTG 
65 GCCCTGCACAGCCCTATATCCAGCCCAAC^TGTACGGCCGACCTGGCTAC^ 
GCCGGCATGGCAGCTGGCATGACZACCCTCGGGCATGAGCGGCCCT 
CTTTGGCACACACGGGCAAAGGATCCCCCAGCAGACCTACCC^^ 

CATACCCG GGAGAG CC CAACT ATGG AAAC CAGCAATATGG ACCAAACAG C CAGTTC CCCACCCAG CCAGGC CAGT AC C CT AC C C CT 
AATCCCCCAAGGCCACTCACATCTCCCAACTACCCTGGACAAAGGATGCC^ 
7 0 AGTCAACATGGGGCAGTATTACAAGCCAGAACAGTTTAATGGACAGAACAACACCTTCTCCTCCGGAAGC^ 
GCC!AAGGGAGCGTCAACAGGTGCAGCrGCTGCAGCAGCGGCCGCTG^ 
GCCCATGCCTGCTGGGGCACTAGGACCCCCAGGGTAACTGTCAAGCX3GAATTTCA 



CTCGAGGCCACGATTGCAAGCATGTGCAGTGCTTTGACCTGGA 
7 5 CCCGTGTGCAATAAAACTGCTCTGCTCGAGGGTCTGGAAGTGGATCAGTACATGTGGGGGATCCTGAACG^ 
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GTTTGAAGAGCTCACCATTGACCCCACGTGC^ 

G CATCCCCTCAAAGCX3GTTCAAAACCATGAG CC CCAG CCAGATGATCATGC CCAATGT CATGGAG ATG ATCGCCG CTCTGGGC CCT 
GGCCCGTCTCCCTACCCCCTCCCACCrCCTCCTGGGGGCACCAGCTCCAACGACTACAGCAG^ 
TGGCAAC1TTGACTTCCCCCATGGGAATCCCGGAGGGACATCCATGAACGACTTCATC 
5 CGGAC^TGCCCAACAACATGGCCGCCCTCGAGAAACCCCTCAGTCACCCCATGCAGGAAACT 
GCTCCCGGCAACCATCCACACAGCrcACCTGACOT 

CATG C CGGAAC (HTCACTGG ATCTACTGCCGGAA.CT LTI'TC CT ACCTGGACC CCC CCGACCTTC CAA 

GCAATAGCAACGATGACCTCCTCTCTCTCTTTGAGAACAACTGAAGGCCACTOT 

T ACCCCATCTGT CCCACATCCTATTCCTCCCAGG AGC CCATG CTCAGG CCT CCCACACCAGGACCT CAGGCTG AGAGCTGAGC 
1 0 GCAC&CTAGGCAGGGAAAGO^TGTGGGTCTC 

GGCCCXTTGGAAGACAAACCCAAGCI^GGGAATCCAGGCCCAGCA 
TGACTCCAGACAACCGACCTTCCAGTGTCCCAAAGTTCTTCCATGTTTTT 

CAGATGACTGGAACCCGAAAGAAGCCTGTGOTCCGGCCCCCAGGCCAGGGTAGTGGACAAGCCTTGGAGGA 
AGOCTGAGGAAGGACAOSCACACACACACACACAC^CCACAGA 
1 5 GGGTCXn'GGCTTTGGCCTCrGACTTCTAAGCCAA 

GAGAGAG TG C CACACCTTGTAGGG ACTGGCAGGAAACAGGT CAGGGCCTT C CC CAGAGC CAAG ACAATGCAGAGGAGGGCTCCTT A 
ACTCAGACCAGGGCCATGCCTTGCX3GGGAGCAGAGAGTGTCT 

MOUSE SEQUENCE - CODING 
2 0 ATGGACAGGCACATCCAGCAGACCAATGACCCTCT 

TACAGAGCTGCTGGATTGGTGTGGAGACCCTCGAGCCTTCCAGAGGCCCTTTGAGCAGAGCCT 
GCCGTGTGGCTGCCCAACAAGGGTTTGACCTGGACCTGGGCTACAGACTOT 
CCCAAGTCTGCTGCCCTG<ZTGTCCTCCTGGTGTGAAGAACT^ 
CCCCCCTGGGAAACTGCCCATGCAGCCCCCCCTCAGCTCC^TGAGCT 

2 5 CTTATGACTCTGTCCCTTGGCAGCAGAACACCAATCAGCCrCCCGGCTCCCT 

ACATCCCAGAGTCAGGTCCrCGGGAACCCTATGGCCZAATGCCAACAACCCTATG^ 
GAGCACCAGCAACCCCG<3CATCAACTCCCCA(^GTTCGCAGG^ 
ATATCCAGCCCAACATGTACGGCCGACCTGGCTACCCTGGTAGCXiXSG 
GCAGGCATGGGCATCCTGAACCCTGCCGGCATG<>CAGCTGGC^ 

3 0 GCCCCGACCACCCGGCATCAGCCCCTTTGGCACACAra 

TTCCTATTCAGAGCATAAAGAGGCCATACCCGGGAGAGCCCAACT 

CAGCC^GGCCAGTACCCTACCCCTAACCCCCCAAGGCCACTCACATCTCCCAACTACCCTGG^ 

CACCGGACAGTACCCACCCCCCACAGTCAAGATGGGGCAGTATTA 

CCGGAAGCAGCTACAGCAGCTACAGCCAAGGGAGCGTCAACAGGTGCAGCT^ 

3 5 AGGAGGCCGAGTATGTGGAGGGATGCCCATGCCTGCTGGGGCACT 

GGCTGCCTCCTCGGGCAACIACAACTCTCAATGGG 

CATTCCGGCGCATACAGCTGCCTGCTCGAGGCCACGATTGCAAGCATGTGCA^ 
TGTGAGCGAGGGACXrTGGAGGTGTCCCGTGTGCAATAAAACrG 

CC TGAACGC C ATC CAACACT CCG AGTTTG AAGAGGTCACCATTGACCCCACG TGCAGCTGGCGG C CAGTAC CCAT C AAGT C AGACC 

4 0 TCCAC^TCAAGGATGACCCCGATGGCATCCCCT 

GAGATGATCGCCGCTCTGGGCCCIGGCCCGTCTCCCTACCCCCT 
CCAAGGAAACAACTACCAGGGTCATGGCAACTTTGACTTCCCC 

GTCCCCCCCAGCrCTCGCACCCACOSGACATGCCC^CAACATGGCCGCCCrCGAGAAACCCCTCA 
ATGCCCCACGCTGGCAGTTCTGACCAGCCCCATCCCTCCATACAACAAGGTTTGCACX5TAC 
45 TCCATTACATCACAGTGGGGCTCCTCCTCCTTCCCAGCCTCCCCGGCAGCCAC 
ACCTGACCTTTAACCCCTCCTCAGCCTTAGAGGGTCAGGC^ 

CTGCCGGAACTCACAAACCCGGATGAGCTTCTTTCCrACCTGGACCCCCCCGACCrTCCA^ 
TCTCTTTGAGAACAACTGA 

50 HUMAN SEQUENCE - GENOMIC 

TTGCACACAATCTCTGTCCCCAAGAGGATGGGAGTCCAG^ 

AAAACCACATTGGAAGTAGTTATCACCATCCCCACrTTACAGATAAGGAAACTGAG 
GCTCACAGCTCTGAGTCGCAGGGTTGGGATTGAGGGCCAGGTCATGCGGCTCCAAAG 

CT CTCAG TCTGG CAGGG AGAT AAGC CGTGTACACAAAT AG CTGCAAT CCCAGGTAGAAAGTG AT AAGTG CCAAACAGGG GCG AT AA 

5 5 AGGCTCTGGGAGGGGAGGAGACTGCCCACTGCGGGAGGGGCTAGGGTGGGGGTCAGGACAGG 

GCGCAAGCTAAGACATGGGTCGCTGAATCCACAGTATCTAGCAGTGTCTGACATTGTCACAGGG 

AAAG AG AGG AAGG AAGG GAGGTG AAGAGGG AGAG AGAAAGGCAGAATGGG AG GGTG G CTATG CT CTGACC C CATTTGGCC CAGGAA 

C^GAGAAGTAGTTCAGGACTTTCAGGCCAGAGTTCCTGACAATCAGATGTTTTGC 

6 0 AGCTAGACAAGACTTCTCTGCGGGGCAGGGGAAATGGTGTGTTGGAGCCTGTTC^ 

CAGGAACTGCGCGAGCTG^nTGATGGAACCAACCAGCTGTGCCAG CTGGCTCTGGTCGG CAGCTTGAAACTGGCCATGGTGGGGGT 
" ATTTACACCGTAGACATCGGCAAATACTACAAACCAGGTTCTCCCCACTTTG^ 
CTGGCAGAGGTCTGCTATGTGAGGGAGCAAGCS^CCCGTCCCTGGAAGAGT^ 
GCTAAGGGGTTCTTTCTTAGTGAAGGGCTCAGAGCAAATGATCTC^ 

6 5 TTTAAGAAAAAAAACTAAATCAGGATAATTCATGCCGGGAAATCATGAACCACTAGACCCAGAGACTGGGAGTAGGGAAGAGACCT 

GAGAAAATGTGTGAGGGGCCACATGCTTGGATTTCT 

GGACTAGAACTTATGTCTCTGGCTGCCCATCTC^CGGTTCCTTCTAGGACA^ 

TGCACGAAGTCTGGTGTCTGTCTCAGAATATAAGCTCTGCAGCAGTGTGGACCAAATCTGTTTT 

TACAGCAAGTATTTAATAAATATTCCTTAAAGGAATCGATGTGTCT 

7 0 GTGGGGGCTGGGAATGTCATCCTCTGACACCGTC^CTTCCTGAGTCCCCACTGCAGCTTAGTGAC^ 

TCAAATCCCAACTCTGTC^CCTCCTTTGACCrrTGGCCAAACCATTATGCCTC 
TAGCAATCCCTGCCTGCCTCTTGAGGCTGCTGTGGC^CTTG^ 

AGGAGGAAGGGGGTGCCAGGAGAGCTGGCTGTGGTGGCAGTGCCAACCCAAGGGTCCCTGG 
CTGCTCTCTTAGAACTGCTGCTTGGGGTAATCTGGGTCAAACTAGCCAATAGCCTGTGGA 
7 5 ACG AAAGGGGCAGG AAGCAGAG GATGAAATGAAGATGGTAAGGGAGAGTTGG AG AAG TC CTTTTT C CT C CT AAG AC CAAAGGCAG A 
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AGGCAAACTAGCAACTTAAAAAACTGTTrTTGTCTC 

GCAAGAGGCCTTAATCTGTt^CTGAGGACT^ 

CAGACCAGTGTTTCAAACGATTTTGGAAGA 

AGGACAGGCTGGCGGGGAGGG^GGCCCAGCGCACCCCAGGCTGTGAGTGGATG^ 
5 AACCCTCCCCCAAGTGCCATCTGATTTTTGCAAAGAGGGTGAAG 

AGGGTGTCOSTGGAGGGGCrGCACOTCAGCCTGGGCCCCCAGTGGAAAG CAAAATAGAAAGGCCCTGTGTCAG CAGCCAGGCCCC 
TGGGCAGCCGGGCTCrCGGCTCCATTCTGTTCAGTGGGATTCTG^ 
ClTTCm'CrrrL'iTr rT TCCATTTrAATAGAGGGGGG^ 
TGGGACCXJTGTGTCCTTGAAAACAGCACAGTCGC^GG 
1 0 TCCAGGTCCCAC^CCTGCTGCTCTCTCATACTTGCTTG 

CCC^CCAGAGCTTGCCTTCCAGGACCCACTCAATGCCAACrCT^ 

CGGTGGCT CATGCCTATAATCC CAGCACTTTGGGGGG C CAAGGCGGGTGGATAATGAGGT CAGGAGATGCAGACCAT CCTGGTTAA 

CATGGTGAAACCCCAT CTCTACTAAAAACACAAAAAATTAG C CGGGT ATGGTGG CATAGT CCCAG CTACTCGGGAGG CTGAGGCAG 

GAGAATGGTGTGAACCTGGGAGGCGGAGCTTGCAGTGAGCCGAGATCTCACC^ 
15 TCTGTCTCAAGAAAAAAAAAAAAAAAAAAGAATGGCATTTGGTC 

GTTCTGGTCATCCTCCATCCTGTGGGCCTCTATACCTCCCCTGCTCCGGATGGCCTGTC^ 

GGGATGTAGCAGTGAGCCATCCTATGACCTGCAGCCTTGACAGAAGAGGGCCATGTAAC^ 

CATACTAGGAAATGAAACTCCCTGTGCCTCAGTTTCCTCATCTGTAAAATGAGAATACT 

TGTGTATGAGCTACTCAATGCCTGGAACT^TAGCAGGAACTTCACAAGTAATTC 
2 0 CAG CATATGGGGGTTAAGAC^U^GGATUIACGGCACATCTGGTGGAGGT AT CAGGGT AGG CTTCCAAGCAAGTGG T TG CAGATGTGGC 

CTTGGTGACTGGGAGGGGTGTGT CCCAGGAGGT AGG AAG AG CATG AG CAATGG CACAGAGGT AAG AGT ATGT ACAT AATGCTGGCA 

ATTCTGGAAGTCACCCTGCATGGCCACAGGATGAGGTGAGGGCAAGGAAGCATGAGTCTTGAGGCTGGAG 

GGAGGAGAGATGCIXBAGGATCTGAGACCTTCCCCCAC^ 

2 5 TGCIATGAGGAGATGTGGCTCTTGCACCCCCTCrTGTCCCCCCACCGAG 

AGACCTGGCTCTTGTTCCAGCTCTGGTGCTTCCCAGTGGTGCAGTCAGGGAAAGC 
CCCACAAAAGGGCTGTTGTGAGGCTCAGAGGGTGAGAGGTGCCTCATGAAGGGAGGCGTTA 
GATAGTCAC^GTTCTGAGCAGCTCCCTCCAGGTATTTTCTTGTCCCTACTCCACTAAATGG 
CAGGGCCGCCCTCAAACrrCTAAAGTGAACTGTAGCTCAGTCCAAGGGTTAAAGGCrrC^ 

3 0 GCAAAAGTGCTGGTGG AT ATCC CACACT CT CT CACTTGGCAG ATGGGG AACAGG CACAGGGAGAGTTTGTGATGTG CT CAGGCCAC 

ACAGCAAGTCAGGC^TGGCGGGCTGGGCACATAGGTGTCCCGGCTCCrrGAGGCA^ 

CTGAAGAAGGGGTTCGTGGTTGAACTGGGTCTCGAACCATAGGGCAGAACTCCATGGTAGAA 

GAAGGACATTGGAGAGTC^AGCCTTGGAGGATGGGAAGGTGTGGGCTCA 

GG CAGAGCAGGGGAG CCTAATTCGCTGAAGGACATAGATCTTCCQTGCTTG ACCCAACCACTCACCT^ 

3 5 GAGCTGGGCAGGGCCCAAATGCTTCCTGTATCCACCAAGGGGTGAACTCTAAATGTAATCTCCATTAC 

ACCCTGGTCATTCCCTGGCAAGTGGGTGAGAGTCTTTACTTCACCCCAAAGTACCTGGTAGGTTACC 
GCAGTGAGATGAGGGCAAGTCCCTATTCTCAACCTCAAGGCCCAGCCCAACCTGGAGCCCAA 
TCTGAGATCTTTATTGTTTTGTTTCGTTTTTATGAGACAGGGTCATGCCCTGTTGCCCAGTCTGGAGTGCAG 
CTCACTA(^GCCTCAGCCrCCCAGX3ATCAAGTGATC 

4 0 GC CTGGTTAATTTTTGT ATTTT TTGTAGAGG C AAGGTCTTG CT ATGTTGCC CAGG CTGGTCT CGAACTC CTGAG CT CAAGTG ATCT 

AC CCTG CCTTGGCCTCCT AAAGTGCACAG ATTACAGG CATG AG CCAC CATGCT CAG CCAGCT ATGAG AT CTTG AG CAGG TCACTT A 
ACCTCTCTGAGCCTCTTCCCTTAGC1AGGGAATGACACCTGGCTTGTGG 

AGTAAGTACTCAATGAATGAGATTTTCTTCCCACCT CTCTAGTCTCAATTTTAG ATGTTCCCTGGCTC CCrTAAGTTCCAGAAGTG 
AGAGGTGGAGAGTGGGGAOVGAGAAGGCAGCTCCTGGACAGGTGGGAATCCCCACTGTG 

4 5 GGAGGCCATACTCGCCX3CACCCTACACAAGTCCTTTGCTTCCCTCAG 

CAGATTTTCTGCCTCCTGAAACCTGGGCCCAGGCCCCAGCA^ 

ATTAACAGCACTTGAGTGTGCTCTGAAGGTTGGTACAACCTGGCGGCCCCACACTGCCCATGCCCTCCTAG 

TCTGGGCCrGCAGGGGCTGAGCAGGATGCATCTTCCA^ 

CAGATCTCCAGGCCCAGTGAGGCTGAGGCTT^GGGGGTGAGGTTGTTGG^ 

5 0 CCTTACAGGGGCTCAGAGGGGGAAAGCAGCTTGCTTGGGGCTACACAGTCTATGGA 

GAAACTCTGTTCATGGGGGCCCX^GTTCCCATCCTGACCAATCTTGGGCAAGTC^ 

GGT AGCGG CAC CTAACTTAAGACAATGTG ACAGTTCAAT CAGGG AACT GAATT C TCAATACCT AG GAGGACCCAC CACAGTACTCA 

CGACATAGTAGACAGTAAATAAGTCATAGCTTCTT^ 

CAAAGAGAGCAAGGGCCTCAGAGACCCAAGGACTGGAGCCTAG 

5 5 CTTGCACAGAAG^GGAGGGTTGAAATGATCCCTGGTTCCATGCTACTAAGTTG 

TTCCTGGGTTTCAGATGTAAATACTGGTGGGAGAGTAGGGG 
GAACATGAGTGGTCACCTCCGAGGTCACCTGCATCCCCC^ATTCCT 
CAGAAAGGTTGAGAGGGCTGGGCGTCCAGACCTAGCTGGTCATTAAGGCAACAAGAACTG 
TCACATCCCAAATCCTCAAATACCCAGAAACTCTCTTTTGGGATT 
60 ' CrGTGGATTGCTGGGAGTGGGACAGCTCTGGCTGCAGCCACCGTTOT 
CCTGGTGTACCTACCFGTGAACTTAGGCCACCCACAGGCTGA^ 

GGAAGAGCAGCAGTCATCACAGTCCATGTCCATGTGACAGG 

GGCATAGGGAGGCAGAATTTCCCCTGGGGAAAATAAAATAACGCCCCCACTCCGCCGCC 

6 5 GCCCGGGTGGACCTAAGCAGC^GTTTTCTGAGCTTTCAGGAATTCC^ 

AATGAATTCAAAGCTGGCAAATTCATTTTGCCTCCTCT 

CCGAGGTC^OSAAGCCCTCAGAAGGATGTTCTAAAATGGGGGTGGGGGGTATGTGCTCCGGCCTGCrTC 

TCCGCGCTGATCACCGCCCAAGAGGCTGGTACTTTTCACTGTC^ 

GAGCGGGCGGCCGGGCGGCAGGCGGGCGAGCAGCGATCGGGCGGCCGAG 

7 0 CCCAGCCGGCGCGGAGCAGGAGCOSGAGCCGAGCGGATCrCGGC^^ 

CGGTGGCGGCGGCGCGTCCTCCATCGGCGGCAGCGGCGCTCGCAGCGCCCGGTAAGTTTGGGG 

GAGCGGGGCGACCGGCCGGCGGCGCGGGCTCCGCTCCGCTCTCCTTGGCGGCCGGGGGCCGAGGCCCGGCGGCCGGCGAGCTCCCG 
GCTGCGCGGAGCCGGCTTGGGCTGCGCXjTGGGGGCCGGGITTGTCAGGGTGTG 
75 C^GTGCCCGCCTCCTGCCGGCGCGCCTCCMCCCTCGCTCCCTACACCCGGGGGCCGGGCAGGACCGGGAATCGGAGGAGG 
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AAGGACCGCCTCGGTCTCCTTCGCCTCCT 

TGGATTTCAGGATGGCAAGGOGTGGTGGTGTTAACTTGTGCGTCT6AA 

GGGGCGCGGTTAAGTTGTTTGTGTGGGAA^ 

GGTTTTCTCCCAGGGCTTCGCAAGCCAGAGGGGCGCTCAGGCCA^ 

GGAAGATGTGCGTGTGTGTGCGCGCGCGCGTTGCAAATACGAAAGTAATTTCCCCGTGT^ 
TGGAAGCTTGCTTCACATTCCriX3GCGTCTGGGCrrGGGG^ 

GCGCGCGTCGCCCCCTCGCTGGGGACATCTGTCCAGGAAGCCGGCCGGGCGGGGATAGGG 
CCTGGAGCGGTGGGGGCGCGGGCGAGGGGCGGCCGCCTTC^ 

CCGCCGGAAAGGCCGGGAGCCGGCTCAGCCCCGGGTTTAAAAGAC^CTCCAGGAATGTAAACAACCCG 
CGGGAGCGCGCCAGGGCAGCCGGCCO^CCCCCC^CCCCTCCCCGGGGACA^ 





GCCAAGGGTGCTTGGTACCCTTGCCTGACCGGGGAGGCCTGACGGCCGGGTCCCTG 

CCCGGGAGCCGTCAGCTTCCTGGGTAGGCAGTGCCCGAGTAACTGGGGCnT^ 

CCCGCTGCTCCTTCCCCCGCGCGCTCTCCCCC^CTCCTCTCCCCTGACTGCTT^ 

AACTTGGCAGCCCAAACAAATGTTCTCTCAGAGGTCCTCCAAGAAGCTGGCGGCT 

GCCTTTCCCTCCTCTCTGCTCTGCC^CGGACATTTGGAGCCCT 

GCCAAGGGTTTTCTGAAGGATTTTCrrTGAGCAACAGTTTGT 

GGTGCCTGGGTTTGCTTATTACTGTTTTTGAAGGTACAGATAAAGAGGCAGG 

GTGAATTCCAGCCTGCGTTCGCTGGCGCTTGTACGGTCCCGGGAGTGTGAAGCTGCTGTTGAGCTATC 

GAGCAAGCTCTTTCTTAGCCTTCTCAGCCCCGGCTGCTTTGTGAGGGGGTCCAGAGTGGGGGAA^ 

GCTCCTGGCCCCAGTGCAGGGGCCGGGTGGGTTTGTGAATAC^^ 

GAAACATCAGCATCAAACAAAAGACACATTGAAAGCCCTATCTGCTACAGAGCACGTG 

GAGGGAGATGGTGTTTTCTTCTGAGCTTTATGATTTTCGGTTGGGTTTTGTTTTCCCCCAAAGGACAGAAGC 

AATCACGAGGTTTTGAAACGCTGTTTTCACATGTAAATCTCAAAAGTTTAGACTTCTGAGAGTTG^ 

GGCTGGGTGGGGGGAGGAGAAGGGAGCAGTAGGGAGTGGATTCAAGAGGTGGATTTGG 

TTGGGAATAAGAAATTGCTGGACTTATGAATAAATAGCCTTGTCTATGCAG 

TCCCTTTTCCCTCCTAAGCAAAGGG<3GGAGGAGGGTTCTTGTTCCTCTC^ 

CGGGGTTTGGCGACAGAACCAGTITCTCCCTCACGCACGCGCGCACGCACACT 

AT C C CTGG CCCC CATAGATATGCCT CCAACAT AGG ACAGGTGTCCATGCTTG ACT CATT CCAG AAAATTGCAGGGGCAAGGACC CA 

ACTGAGAAGAGCTGC^GACITCACTTTTTTTCCTGTTTCTTCCTGATG^ 

GAGGCCTGCTGCATCCTGAGAGTTTTGTCACAAGAA 

CCTGGCCTCAGGGTGAGG(^TTGTTGTAAGAAATTCAGGCACCAACCAATTAATT 

GACCAGGAAGCAGGGCCTGCTGGTTCGTGGCTG<3GTCCTCTACATTTTCATTCATAACT^ 

CTGTCACTGCTGCCrGGTCTTTATGAGGCTCCTTAGAT^ 

CTGCAGCCTTGCCTGGCGAC^CTCTGACCTCTTAGCAGGATATAGTGCTCTCTGCTCATGCCCTTTGGGGG^ 
TTAAGAGTCCCTCAGGATGGGGAGGCAGCAATCCACCTGGCTGTAA^ 

GATGAAGGGAGGAGGCTGAGAAAAGCAGTCCTAGGCATAGGCTCTGGGGAGAGGCTAGACCCCATCATGATTGAGAGGGTCT 
ATTAC03GAGGAGATGGGCGAGAGGACCAAGGTGTCAGGAGGCTGTTGCTTCCTAT 

GACCTTAGTCTCCTGATCTGTAAAATGGGTCTGAGGGCAGAACTCCTCCGATCCCACTTGGTTGCTGGGAGAATGA 
GGCATGAGTCCCCTAGGCATGTGTCCCTTGAAGGCCTGGGAGGCTCTGGGTACAGAA^ 
AGTGCTAGAAGCATTTTGCTGAAGCTGCTCTCATCCAGACCTACTTCCCTGGCA 
CACAGGACAGGGTACTCATTACCTCCTGGAAGCCTCT 

TGCAAATCCACACCTCCTGACCCCACTCTGGGCTTGAGATGCTGGGGGGCCCTCCAGA 

CTGGGTTCAAGACCAGTGCCTAGGCAGG CCCCTAGGTAGGGGGACACATTCAAGTGGCTGCTTCTAG CCCTTATTGGGAATGACCT 

GTTGTGGGGGGTGCAGAAGGGGGAAGGGTCTGCCTGGGGGTGCTCACTGTCATCTGCCTTTGTGTGCC^ 

TTTCCTTGGAGCGGGGTATGAGAAGGAAGGGGGTGCKTGATGAGGGCTGGGGTTGGGTCGGG^ 

GACTGTGAGTGTCCACCCTGTGGATAAAGACTGTGTCCTGGGAGAGACCAGAACTGTGGAA 

ACAGTCCAGTGCCGGCAGGAAGATCAGTCCAAATAAATAATTAACCGCTACCAGCTGGCTG 

AATAACCAC^GCCCCTCTCCTTTGTATGGTACTTGGTGG^ 

AAGCT AGTT ACT AT TTGGAATT ATC CC CATTTAAGAGAGCAG AAAAC CAAGGTTCAGAG CGGT CAAACAACTGGTGGCAAAGTT AG 

GACTGAACCTCAGGCCTTACCGATACCAACCCCTGCGCTGC 

CGGTGC^GAGTCAGGATAGGCTCCAAAACTG<3GTGCTGGGGGAATGAG^ 

GCCTCACCTGTCTTCC CCCTCCC^TATGGCAG CACATGCT CTGTGTCCCTTTCTTTTGGCACAGTGTCTTA 
CAGCCAGAGCITGCTTCGAGACACAAAT^ 




CTCCACCCTTGGGCCTGGTTTTCTGTAGCTCAGCTKSAGTC^ 

AGCCGGGCAGGCTGAGCTGGCACOTCCCCAGGAAGCCTGGCCCAGCCTGGCACACCTGTC 
CCCCAGGCCCTGCCTGTCAGGCCCAGCTGCTGGAGTTACTGCAGGATTGTGCTGGGGGAGGCTGGGG 
TCTTAGATTTCAATGCTCCCCTAATCCTGTTAGTCrrGGCTGTAATTGTAACCACTTGAAGGTT 
ACAGACGGTGGGGGGACTGTGCTCTGCCCCAGCCCCAGCT^ 

TCITGAGGATGGGGGCTTGAGTGATGGTGGGTGTCGTGTTTTTCTGTAGAAACTGAGTATCGC^ 

CTTTGAGATCACCCAGCTCAAACCCTTCACTGTTC^ 

ATGGCTGGTAGCAGTGGAGCATCAAAGACTAGTGGTCTGACTTT^ 

CCAGGGACTAGTGAACCCTCCAGAGACTTCrGCCCCTAGAGATAGGGCTTGG^ 

TGGCAGTTACCTGTCTTATATTCCCCCAGCCCTGTGTG^ 

TGCACACATATGCACACATGCACACCTGCACACATGCA^ 

GGGCTTCTGCCAGGACTCCGGCTGCCTGGCCCAGGGTCCTGAGACCTGAGAGCTCTCTCCG 

T ACTTGC CCTCFTGG AG CCT CTGATTCAATG CAG ATCAGG AAATGG CCAAG CTGTTCTGATGGGG GTGTGTGTTAAAATACAT ACA 
CTGTACTTTCCCCAGCTCTTTCTTCX3AAAGT 

CTCACITCCCAATCCGCAGGGGATGACrTGGCAGGGAGGATGGGGACGGTGTGCCTTGGATm 
TCCTCTTAGCrrCGGGAGCTTCGGGGAAAGATCTGGAAAGCCCCTTCC!ACTGTGAACCTCTCTGACTCT 
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CTTGTAGCATAAAATTAGTGGGAACTGA^ 

AACACTTGGTCTCATCCAAGGTCAGCAGCGAATGGGTGA 
AACATGTATTGAGTGTGTGCTGTTGGCCTGGCACAGCCCCCCrTAGCCA 

GAGAGGCCCAGCTGAAC CTTGGGGGTGGGGTTCACTTCTGAGTG CTCAGGGTGAGGACGAAGGATGGGGAGTTTT CRGTGGCATTC 
5 CCAGACTCCCAGATCCTCTGCAGCCAGGGGAGTGTGGAACACT^ 

ACAGGGTGGTTGCATTGTGAATTTGAGAG AAGTAAAA CTCATG CCTGTAATCTCAGCACTTTGGGAGG 

CCT AGGCAGGTGGATCATGAGGTCAGGAGTTTGACACCAGCCTGGT CAAAGT AGTGAAAC CGCATCTCTG CTAAAGATACAAAAAT 

TATCTCGGCATGATGGTGGTGCACGCTTGTAGTCCCAGCTACTCGAGAGGTTGAGGCATGAGAAAC^ 

GGTTGCAGTGAGCCAAGATTGCACCACTGCACTCCAGCCTGGATGACM 
1 0 AAACAGAAACCATCTAGGGAGGGTACAAGGTTTCTCATCTGCCGG 

CCTCTCCCCrCCAGCCAGCCCTCCAGGAGCTGACCTTTGTCCTTT^ 

CTCTGTGTATGGGGAAGCAGGGAGGTGGCCTTCTCACCCCACCCTAGGAGTCACCTGCT 

GAGCACACTAGGAAGAGCGAGCTAGGCTO^GACAAAGGAGGAGATGCC^ 

COUUVCTCCrrACCCCTACTCCCCGGTGGACCTTGACCGCCGCTGGGAAGC^ 
15 AGGATCAGCAGGGCACAGAGACCTCTGGGAAGAGTCCTGAAACCCTTTCCTGGTGTOT 

CTGTCCAACCTTCTTATTTTCCAC^TGGGGAAACCCAAAGCCCAGAGAGGTGGTGGG^ 

TCACTAAGAGTGGGGGGTTCTGTGGTCTGGGTGGGGAGGGGTGGGGTTTTCCATCGAAAAATGGTCT 

GTGTTCCTGCrGTTGTTTACATACTTAAAACAACTTGCCAAAGGTCT 

6TGTTTCATGCTTTTCCCAGACTGCCTGGGTTCI!AfiCCCTCAGCT 

2 0 ACTTCTCCGACITTGGTCTCCACATCTGTAAAACAGGC^ 

AGAAGATGGAGTGCGCAAACTTCCTGGCACATAGTAAGTGCTCAAGAAAAGGTAGCCCT 

TGCCTTACCTCTGATTTACAGATGAGACGGTGAAGTCGATTGCCCAGGTCATGTGGTGCCTGGA 

GCACCTAGGCrrTCCTGATCTCCAACTCCCrGCCX3TTTTCCA.CAOT 

CCCCAGTCAGTCATAACCCAAGT ACATTGT AATT CAACCT ACT AGGG CAGCTTGTT C CC CTTGGAAAGG CXnTCGAGAGGG AGGTG 
. 2 5 TTGCATCTGGGCTTCTGTCTGAGAAGCrTATTTTTTTCATGTC 
CTCGTGTGAAGGGAAGGAGGCCACAGAGGGACTGGGCTTCT 
GGAACTTCTGGCCAGTCACTCACCCCTCCGAGCCTCAGTTTCCTCATCT 
AGTGGCTTCTCAGGGCCACCAGTCCCCAAAAGCAAAA.GAGAGGGCAGCAGAGCACC 

GCCTC GAGG AGATGGCTCAG AGT AAAT AGTTTG CTTT CACTCAGCCTTTC C C CAG ACT ACAG ACACCAC C CACC CC CG ACT1T1TTT 

3 0 TTTTTTTTTTTTTTTTTGGAGT^ 

ATCCTCCTGAGTAGCTGGGACTACAGGCATGCGCACC^ 
TGTTTCACCATATTGCCCAGGCTGATCTTGAACTC 
AGCATGAGCACGATGCCTAGCCCCACCCTTTTTTTAA^ 
• CGTAAACAGTCCCTTTCIAGCTCAGGCATTGCAGCrc 

3 5 TTCACAGAG^AGAGGAGGGGACAGTGTCTGCAGAGGCAGAAGTCTTCCCCTCAGCCT 

CCCACCCTCCAGCCTCCCACTTTTGCCGGCCTCCTTGT 
TCCCAAGAGCAAGGCCAACTGGGCAGAGGGGTGGGGCCTCTGCCATGT 
GCCIAGAAGGGGTTTGTTTCACCCGTGGGGGCTGTGGGCCCCCAGA 
GGTGGTCTGAAGCCTTGTTTCAGATCCTTAATTCAA 
40 GTAGTCCCTGGCCGTTGAATGGAGTGGAACTGTCATGCTC 
GATGTGAATAGATTTATGTTCACACCTCCTGCTT^ 

AGCGACTTGACC^GGCCACAGGGCCTGGCAOTGGGGCCGGGGGGGTCCAGGGAATGATGAGTGAGTGGCAG^ 

CTGCAGCAGCTGACrrTCACTGTGAAGCCATCCCATCTTGTCCTCTGTGAGACCTACTAGGGGTGTTGGGATT 

GAGCTGGC^CTGGGGTGAAGGAAGTCCCCGCCCAGTCCCTCTTCT^ 

4 5 GGTCAGGAGGTCAGCTTCTGCTCTGCAGGCTGCATTTTGTGGGTCCCGCCCA 

ACTGGCACTTATATTGAGGGGCACACGGTGTCT^ 

TGCTGTTGGTGGGGAGAGGCAGCCTGGGGCCTCCAGAGAGCCG 

CTGCTCGCAGTCCTC^TGTCCCCACCTTCCTGGGATC 

GCTCTCCGCCTGTCTGCCCTGTGTGCCCGG CATACCTGAGCTCCACAACCTC CCTG CGCCTCAGATC CTGGGAGAAGAGCAGAAAG 

5 0 CTCGGGTGTGACTTTTTTTTTTTTTTTTTTTTTTTGAGACTGAGTCTTGCT^ 

GGCTCACTGC^GCCrCTGCCTCCCGGGlTCAAGAGATTCTCTTGCCTCAGCCT^ 
CCATGCCCAGCTGATTTTTGTATTTTT^ 

ATCCGCCCGCCTTGGCCTCCCGAAGTGCTCGGATTATAGACGTGAG^ 
TTGGCGATGGGGCTGTGAAGCTCTGAGTTTGACGTC^^ 

5 5 GGTTGACATCTGCATTTCTGCTTGTGCT^ 

TCCTCATGGGCCTCAGAAGTCTTGGCGCCrrTCTGCCrrTGTGGATGGCCTCC 
CTGCTCCACCCGAAGGCTCTGGATGCTCCTATTTCTGGTATTCTTGGC^ 
ATCTCATCTGATGCCTXSTCATACGGATGACTTA^ 
TCCACGGACAGGCTGCTCCCTGCCTCCCCAGCAGCT^ 

6 0 CCCCrrAGTTCTCTTCTCAGGACCTTGCAGAATGTGGTGAATTTGGTCCTCAGGAAGTCTCT 




GACGTGGCAGTAGGCTGGGGAGG CCACAACAGTGGCTCACTGGGGCC CTAGGGGAAGAAAGTGGTCTGGCTTCTGCTTTTCAAGCA 
GGGTCAGACATTCCAGGAGGCTGAGAAGGCCGTCTGTCTT^ 



65 CTCACTGGAGGTAGCC^GGGGCTGTGAATAGTGCTTCCTCCCCT 

GCCCTGCCCCCTGCCACrTTAGGCGCATCTTGCCTAAGTGTCTACAGGCCCCT 
AGAGACAGAACTTCTGGCOTCCTCCTGCCAGCATCGCTGGGGGCTGTCGC^ 

TCC^GTATCACTGCATCGGGGAGTGTAATGTTTCATTACCATGTGCAATGATGGTAATGGC^GCACA 

7 0 AATCCAGGTGTTCCCACCATCTAGCCAGTGCCCAGCC^ 

GTGCCTCCATGG CTGTGGCAGTGGCACIGGCCTGGCCTGGGACT TTGCAAAGCGGCTGGGCCTTTCGGCTCTG 
GCTGTGCCCACCAGAAACACTCGTCCTGGGGGCAGGCACTCGGTGCCT 

GCCGCTGCCCTATAG CIAGGACACTTTCTCTCACTTCCTGGCTTATATGGGGCTG CCTTGCATTCCGCCTTGGGTGGCAACAGGGTC 
CTCAGGAGCCCACCTGAGACCCQACGCCCAGCTTTTTC 
7 5 CTGAGATGCAGCCTGGTGTGATCCCTTCCAGGCCCCTAAGACCC^ 
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AGTCTACTTAATGCCCTTCTCAT CCCATTTCCCAGATTAGAGCCCTGAGG CCAG AG AGGGAAGGAATCCAGCTG CAGAGCTG CAGG 

CAGGATCCCATGTCCTCCCAGCCTCACTGGCTTGGTCTCCCCCTACTTCCA 

GCTTTGTGTGGAGTGAAGGACGATGGGGGACAATGGGCAGAGTAGCAACrrGCT 

AGGCCCTGCGTCGOTCCAGTTCTGGGGAGCCCATGCACTGAATGGGCTGG^ 

GAACAAGGCITCCrrGGCTGGGCTTGAGTCCCCAGCACTCT^ 

CCCAGCTACCCAAGGAAGGTGGATTCATGTATTCCTTTGTCAAGTT^ 

TGGGGATACATTGGTG AATTTGACAGGGGACCTGCTTACATTCT CGAGACAGT CAG CAGGTCAACAACTT AATGCATGAC 

ATCATGCCTGAGTTATAATCAGTGCTTTGAAGA 

TTAGATGGTGTGGTCAGGGAAGGCTTCTCTGAAGAGGCTGCATTTGAGC^ 
CATTTGGGAGAAGAGCTCTCCAGGCAGTGGGAACAGCATGTC 

GCAGGGAGGCXAGTGTGGCTAGATGGAGTGAGGGAAGGAGAGAGTGGGGAGGAAGTCAACA^ 
GGACAGAGAGCCTTATCAAGTACAGGAAGGGCTTCTGACT 

CTGAGGCTATCAGATGTGTCCTCTAAGATCTCTGGCTGATGAGCCGAGAATAGACT^ 
TCTGGGCAGCATTTACAGGAACCCAGGAAAGGATGCTGGCXX3CTTGGACCA 

CCTGAATGTATTTTAAAGATTGACTGAGGCTGTGGCCAGGAGCCCACTGGGAATGCCrrGTCCACCTC 

GAGAACTGGCTCAAGTTG<^TGTCCTGGGAGTCTGAGCCCTTCTCCTTTTGTC 

ATAAGGTGGCCTTITAACAGTTGGAAAAATAGAGGCATAGTGGGTCTTCCTGATACCC^ 

G CATTTGGTCTCTCTCCTCCC CTCTGGCTGATTCCCTGGCATTCATCCCCTG CATGTTCATACACTCCACTCTATGCACCATAGTT 

TCCCAACATGTTAGTGATCCTCTATCCCGAGGCCTTTGCATATGCTAAGTCCTCCAC 

CCATCCTTAAGAGCCCAGCTCAAAAGCTGCCATCTTTGAACT 

AAAATTCTACTCCTCTCCCTCCCACACTCTATTGCAAGGGATCAACTGCGTGGGGCTGGGTCT 

TTCTGTTCTGTGTCCTCACAGCCTGGCCCAGGGCCAGCACAGA^ 

GGTTACTATGGAAACAAGGAAGTGTTGCAACTTCCTGAACTTGAGGTTC 

TCAACC CAAGATGG C CATGCTGAGGTTCTT ATT AAT TCAAAGT CACTTCGG ACAGG C TCTTT CCCATC CCTGAGC CT CAGCTTAC C 

CCATCAATAAGATGCAGGTGTTGGGACTCCTTGACTGTCTGCTCTAACGTTCTGTATCCTTC 

ACAGGCAGGTCCrTTGCCCTCTTTGAGCCTCAGTTTCCCTATTTTGTTG 

CTCCAGAATCCTAGGGTCAGCGGCTGGGCTCGCCGTTGATGATTGAACTGGGATTCTTG 

TCCTGCITTCCAGGGTGGTGGGGTGGGGGTGGGGGAATGAGGCAGCAGGAGGGTGGGTTGGTAGGGAT 

CTGGCCTCAACCTGCACAGGCCCCTCGGTCTCCTTTCTTCACCTTGAA^ 

TGCCGGACGCTGCCCGCTCCGCATTCTGGTGCCGTGCCATG 

CCAGAGGCTCCCTGCTGCCTGTGGTGGCAGGGCTGAGTGGGTGGGCACCATGTTCTTACTGTGTCTCTTCT 
CTCAGAAGCCAAGAGTTAGCCAAGCACAAAACCACAGCCT^ 



CGTGACGCCAGGCCCGCAGTTGAGTTCCCACCTCCCATTTGCACATGCC^TTGGGCAGAGTGGGCAGAGGCCT 
GCCCCACCCTGACTCCTGCCTGGCACTCAGGTGCCGTCTGGCCTGCACCAGACCCAGAAAGAGTG 

ATGGCTCTCTGGGCCTGCCTCCTCATCTGAAAAATGGGCAGAGGGTGCCCGCCTCCCATGGCCACCAGGAGGGGAGACAGAGGAAG 

GCAGCTTGTGATCTCCAGAGCCCAATAAACACACTTGTTAGGATTAC^ 

GAGAACCTCTTTTTCAGGAAGCAAAAGCCCAAACCTTTTCAGAATGAGACT 

CTG AG CTGTGCCCAGTGGCCAGGGACCTGAGTGGC CAT CACAACGTGCCTTTCCCG 
TTGGAGTTGGATGAAACCCCTGGGGCAGCACCG^ 

CTCATCAGCTTCCTGACCCTGACGAGTCCTTGTCTCTCTGCCCTCTGTGCATTGAGGCCCCTTCTAGCTG 

TCTACTGGAGCATTTTTACCTCCAGGATAGGTGAGGCTCCCTAC^^ 

AAAATTCTCACTTCTGTTAAGAAAGGGCCTTGCCATGGTTGACGGGGATTTGC^ 

CCCCGCCCCTCCTTACCTGGa^CTTTGTCATTAOTCCTCCCAGGGA 

AGCCTTCTCCTCCTCCCGGTACCAGCTAGGTCAGACAAGACCTGGAG 

GATCAAGTGAAAGCCCTCAGGTATCCAGATAGTTCAGTTGGCCAGAAAGCCC^ 

GTTCAGCATATAGAATCCTTTTTATTTTTATTACTTCATTTTATTTTGTAGAAACTGG^ 

CAGTGGTGTGATCATGGCTTACTGCAGTCTCGAACTCCTGGACTCAAGCGATCCTCCCACOT 

ACGAGCCATCACCCCGGCTAATTGAAAAAAAOT 

TTATTTATTTTTTATTTTTTAATT 

TTGGCCAATCTCTTTCTTTTAAAACCAGGGTTTCCTGAATTGCATTTTTGCAT^ 

TTTCTCTTTTGGACTCCATATAGCAAATAATTCTGCCTCTCCTTTTATTTGAATT^ 

AACACTGCTTTGGTTCTTCTTTGTTTTGTTAAGTGTTTTTAACTTCTTAGTGTGTT 

ATGTTCCGCACT^ 

ATGGTAAAGTGTTGGAGGAAGGGGAGCCTGGTTCTAATTCTCA^TAAAGCCACCTTGGATTAGGAGTAGGCTGACATTGT 

TCCTGGCTGAAATTGGGAAGTCATTTTGACTCCTCCCCAGGGCTGTACCCCGTGACATTC^ 

CAGAGCATCGCTTATCGGAGTATGTAGTCAGG 

TCTACATCCCCAGGCAGCAACCTTCTrc 

TGAACCTGAATAAAAGGCACCCCTGATCTATCACCTCTTCCACTCTACTTAAAACTCCTTCGTC 



CTGGAGAACTCCTAGGCATCCTGCAAACTCTGCCCAAGCCTCAGCTTGCCCTCATGCACCAGCTAGGTCAAG 
CCTTGCCCCGGTTCCCCAGGCACTGTTTCCTGACCAGTGTTGCAGCCCTTG 
GCCACCACCAGGCTCTGGCTCCCTGTCACCCAGCAGCACCAGCTTGGATCTAGCAAAC 
AACACATGGTGGCTACCCTTTCCCAGAGCTGGTCGGCTTTGTG^ 

AAAGCTACTGGGTCTGAGGATACCTGTGAGCTTGCTGTGCCTGCATTCCCGCTG 
GCAACAGCAGCCTGGGCCAGGCCACAAGGCTTCTTGGGCAAAACTCCGTGGGCAGGGCCAG 
TGTGCACCAGGOaiGTCTGTGGGCAAATGAGCTCCAGCrrTGCCATGGTGCTGGGGGAGGGC 
XGTTCAGGCCTGCCTAGGCACQGCCCTGGGTTCTTTCGGTTAAA^GTGCA^ 

AGGGTGACAGAGAGGGACACATGGGAACCTTCACCAACCTTCCCTGGGACTCGGCCCCTACAGGCTTTAAGC^ 
GGCTTAGGGGTCCCTGGG CCAGGACTACrGTTCCGACTCTGAGATCTTTTATCTGCTGCAATGCTGG CTGCT TTCTGGGACTTGGG 
GAAGGTGGGTGGGCTGTGAGCAGGTGCGTTTCCTAAGCCAGGTCTGAGAC 
TTTCAGTCCTTGCTCCCCCAGGCAAAGCTCACATCCTTGGGAACATCCTTCCTGCACC^ 
GTCGCAGa\CCCCTCTGAAGTGGGTGGTATTATCCCATTTTGCAGAGGAGGAAG 
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CCACAG CTG CTCAGGGAAT CAGGTCTTCTACTC CCTACGTGAGGCCACAC^^ IT CTG GT CCGTGG CTGTTTGT AC 

AAACGAGT ATCCTG TGG ACATGG TTGTCTGC CCCTGC CC CTC CGACAGGACAGCG CTGAGCAGGT AC CTGGGCrGGTGCCACACTT 
GGGGAGGTGGAC^CGTAGCCTCAGAGCCCGAAGCAACTGGTGGGG^ 
GAGCCTCCGCATGTCAGATCCTTCTCCCCACCCAGATGGAGCCAAGTGGGCCC 
5 TGCAAACACAAATTGTGGAAGAGTGGCATCAACAAC^CGAGGTGCT 
CAGCTCAGGGTGTTTTGTGCCn'CTCCGGTTTGCrCTAGT^ 
AAGCACTGGACCCCTCrCCCTGGGCCCTGAAAACCCGGAGGGGTACTG 

GGGGGTT CTGGGTGCAGGGAGCACTTGAG^TGCCTG CTGCATG CT CCCTGAAGACAAAAT CCCAGGGAGGATGTGTCAGTTCCX3AG 

GGCCACTGAGATCACCACCTCCOTGAGCATGAGCAAGACCCrAT^ 
10 CCACCTCCXTTG TCCCrCTCCCACTCCCC3WGACCCCCACCCCCTACCCCTC^ 

CATGTGTGCCCTTACAAGCTCCTGGCTTTCGGAAG^^ 

CTGTGGCTCTCCTTTCAAGCTGTCACGGTAAATCTATTT^ 

CTTTTAAGGCTTCTCCAACCAGGGACAACCCCAC 

GCTGTGTCTTAAGGCTCAGTTAGAATGTAAGTGGGGACCCAGGAGAC^ 
15 TGGGAAAACAATCCCTGGAGAGGGAGGGACTCCCAGGTGC!A^ 

AGGGAAGTTCCTGAAAGCAACO^CCTGGGCATC 

GTGGCrGTG^CAAGATTGCrCAAGACCCACAAGGAGACCCCGTTTGGAAG 
CCCCTCTCAGGGTGTGGGGCTGGTGGGGTAGGGGACTGGTGAC^GGAAGCTGGAGGTGG 

ATGTGGGATCCCGTTTA 
2 0 GCTGfl 



CCATCTGGAGCAGCCTGCTCACAAAACCAGAGTGAAAAGATGCT 
GGGTGCCCACTTTGAGAAGCCAGGGATGTATGTGCCCAGGGCCAGACT 
TT'GCTGCCTTTTCCATCGGGATACIACTGGCCTCA^ 
GCCCGTCTGTGCCAATAGAGGGCCCCGCCTTGGGGGGCCCACACTC^ 

2 5 CFTCTTGAGCCCAGATGAATTAATCTCrTGCAG CAATTTCAGTCTACCAAGAGACCTTCTCC^GCATCTT CTAGGGGAGGGTCCTGT 

GCTTTGGGGAAGCrGGACTGGGTGACATTCATTTCC^ CTTTCCCTGTTCAAG CCACCATGATGTGCAATGGCAG 

CTGCCTTTTTTTTTTAACTTAAAATTTTA 

TGGCCTATTTCTAGACTCGAGCTGOT 

AGGCAATATTTCTCCATTAGAAGGATGGCCAAACAAGAGTCCTCAGACCCT 

3 0 ACCTTGGCTACTCAGCAGAATAGCrGCAATGAGTACTGAAAGCTTCA^ 

TGAGCTAGAGTGGTCGGTGGGGAGGGGCCrCCTGGAGGAGGGGGCATTTAAGGGCT 
CAAGGCAAGGAGCGGGAGGAAAGGTACTTCTGGGCGGTGGAACCGGCATGACC^ 
GGCAGCTCAGCCAGCACCCATGGACTTTAGGGGAAGA^ 
GTTAGGAGGGAACACTTATTCTAGAAGTGG^ 

AGGA^GTGGCAGCAGCAGCT 

AAATGGGGAGATTCTGCAACTCCACCCIAGCCTGTCTGCCTGCTGCTGTGAGGAGTGCTGGAGACACA 
GAACTGA(^TTGCCTGTTCrrCATCCCCTTCTGGAAAACTGGTGGCTG 

4 0 TTCTCCIAGATTGGGGGTGACCCTAGGACCTGGGTTT 

CCITGGCGTTCTCCTCCATAAATGGATCCGTGCTGA 
CAGGCTTGGGCTGCGGCTAG6GGTGTCCCCCTGTGGGCM 

CTGCCCACrCCTGGAGGCCCCTGATTAGTCTACTTGGTTACAAACCC^TA^ 
45 GGCATGTGCCTTTAAGTGATGCTCGTGGCTCTTTGAAATCTGGACACCAGCTGGCA 
AAGATGGGTGTATTTTGTTTTTTTTCTTTTGGAACAGATTT^ 

CGGAAG CAGCACC C C CAAAGCCGTTTGAATACAATG CAAG CAGTTGAGGGG CTG AGC T ACCAGC CAGTAGG AGAGAGGGT CCTGGG 
GGTT CAAAG AG CAGTGGGGG AAAC!TG AGGT ACAGAG AGTAC CCAGCAAG AG AATG AATGAGTGGCAGG TCCA^ C TTCAG ATAATT 
ATTATTCTTTCTACCACCCCCCCCACCCCC^CAra^ 
50 GGTTCTCTGCACTGTGCCATGCAGTTTATGTG 
CAGTTTTATTTTATTTATTTATTTATTC^ 

GCCCAGGCTGGAGTGCAGTAGCGTGATCTCAGCTCACTGCAATCCTGCCTCCTGGGT 
AGTAACTGGGATTACAGGCGCCrGCCACCATGCCTCGCTAATTTTTGTATTT^ 
CKSGTCTTGAACTCCTGACCTCAAGTGATCCACCCACTT^ 
55 CCTGTATCCCAGTTTTACACACATGCGTAOSTGATGTGCTCAGAGTGATTAGGG 

TG AATGCAGGCCTGTGAGTTTAGATCCTGGCTC CCACTACCCTATGCACACC CTCCCATGCCCTTG CGTX3C CTCTTCAGGGCCTCT 
CTCTGGGCCATCCCTGCCGCAAGCACTCrGCAGACAGACAGTCTTCCTOT 

GGGGAGATAAAAGAGCCCGTTTGGGATGATTTTTATCAGCTCGAGCAAGTCGGTTTGACTTACAGGA 

GGGAAAGAGCTGAG TGT AAAGG CCGCAGTGAG CACTGGGTGG ATT T ATAAAT AAACAGTCGAC CGAGGG CATTGTATGT CCC CAG C 
6 0 TGTACAATGCTGGGTTCATATTGACACCACAGACCTCTTCCAC^ 

GAGAGCGCTGCTGCTGGAAATCCCAAGGAACCCTTCCTCCGACCTGGCTTCCTCGGC^ 

CCGGG CCTGTTTACCGAGGCTG CTTGGCATTGCATGCCAGGTGCAGAGCCCACGGGCGACTCAGAAGGCCACGAATO 
GGCGGCCTCTGCTCTCCCTGCAACCCGGAGGGAGCTGGAGC 

AGACAGTCTGCAGAGCTCAGCGGCCTGGGTTCAAAC CTTCT CGCACAC!TGCCACTGTCGGTTACTTTGG CTTTCTAGAGCCAGATT 

6 5 CCTTGGCCATGAAATGGGTACTGCTTACTTCCCAGGTTATTTTGAGAATGAAG 

TTGTCCCTGCCCTGCTGTGGGATGACAGAGTGATTTT<^CAAGACCAAGGCCT 

TGGGGCTAGGCCAGGTCC^CAGTGGCTCACGCCTCTAATCCCTO 

GTTCGAGACTAGCCTGGGCAACACAGTGAAACCCTXSTCTTTACCA^ 

GTCCCAGCTACTGGGGAGGCTK3 AGATGAGAAGATTGCTTGAG CCCAGGAGGTCGAGGCTGT AGTGAGCCACAGTTGCGCCACTG CA 

7 0 GT CCAGCCTG TGTGACAG AG CAAGACCCTGC CCCT C CTGGAAAAAAAAAAAAAAAG AGAG AGAG AGAAAG CAAG CAAG CAGTGG CG 

CTGGATGCTCATCAGAGGCCTGGGGCCCTTGGAC^GAGAGGGCTGTCCCrGCCCAGCTG 

AGCCTAGGTTCATATCCTGTGTCnTCCACTTGTGGGACCTTGGGCATTTGACTTCCTCT 

GT<K5GAACGATTAAJGGTTGTTGGAAAGACAAAATATGCT 

CAT AAATCAGTCATGT CT AT AACAT ACATTTCAAGGGAAAAACATC!TGTT TTG AGGGCA CTATCAAGTCAAAT ATTTT C CT 
7 5 GATTTAATTTCAATAGTACTGTTATAAAAAACAATAACCATGATACCGATAGTTAGAAGACCGCCCCT 
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CCCCTTTGCTTTTTGGCAATAACCCTTGT^ 

CCGGGTGTCAGGCAjGGTCTCTCCGGTTAGGTAAGGGAGGCCTCATCCTTCTT^ 

CCCATCTTCTCCAGGCGCCTGATCTATTACCTTC 

TTCATCTGGACACATOjTGCAGACCCCCGCACCCCGTGTGGCT 
5 CAGACITCATTGTTCTAAGCCTTTCTCAGATCCA^ 

GC^CCAGCAGACAGCAGCGCTAAAGGCACOT^ 

ATCCCCGTGAGGCAGGTGGGGCCAC^ 

TTTGCTTCAGTTXSCGGGGCTGGGATTTGGAC^^ 

GGTTGCTGGAGAAAGGAAGCTC^TGTAGAGGGTGGGTCTC^ 
10 AAGGAGTTAGAAATGAGATGT CTAGG CTGGGCACGGTGGCTCACGC CTGTAATACCAG CACTTTGG GAGGCCGAGACAGGCGGATC 

ACAAGGTCAGGAGATCG AGACCAT CCTGG CTAACACGGTGAAAC CCTGTCT CTACT AAAAAAT ACAAAAAAATTAGCCAGGTGTGG 

TGGTGGGCACCTGTGGTCCCAGCTACTAGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGC 

GATTGCGCCACTGCACTCCAGCCTGGGCGAC^GAGTGAGACTCCGTCTCAAAAGAAAAGA 

TTCAAAGAACCTCCAGCACGTGGAGGTGAGGGGTGGGAAGGGCT 
1 5 ACGCTGCCTTGAGGGGTTGATAGAGACTGGCTCCrrCAGTCAATGCCT 

CCCCCTGCTAGGCTCTGGGAATAGGACMTGGCCAGAACAGATAAGATCCCrGTCT 

CAAGACAATCACAATCAAGGACACCrTTATAAATTAT 

AGAGGTGAGTGTGGGGGCCTTGAGACAGGAAGGCCTTTCTGAGGAGGGCCGTGAGATGATGGGCT 

AGAGGAAACAGCGTGTG CAAAGGCTAG AGACAGAGAGGG GTCCCATGTGGCTCGGGG ATGAC CGGCAGGGGCAGAG CAGC TGAAAC 
TGTCGACATC^ 

GGCTATTTCTGCCTCCCACCGGCCACCTCCATCTTTGCAGCATGAAGGGAGGC 
GCTGTCTGCCACTGAATATCACCTGCTATTTCAGTGACGATCCT 

TCACCAGGGGTCCCCGGACAAGCCCAGGTTCTTCCGCCTAGGGTCAGCCCTGGCCCTGCAACCTTTGT^ 

2 5 GCTCAAAACT AGTAG CAAG CTG CCTGG AATG CAATGTG ACGAACTCGAG ATAAT AAT AAAATCAATTGG ACTG TTT CCCCAGT AAT 

T CTAGAGACTGATGCAT CATTAATGTTGG TAAT ACAGTGCCTGTCAG AT CACAAACTTCTCAATAAATGTG CAGCTGGCAAGCTCT 

GATAGAAATTGGAGGCAAGCAGAGAATGAGATTTGGGGCCT 

CTGTCACTTTGGGCTrTTGTGCTCACAGCATATGTG^ 

CACCCAACAGTGTGAGGAAGAGGAGTTCCTGGAGAGGCCTCAGCAGCACTGGTGCCCT^ 
30 tCAG^jCAGACGTAGACGGGCAAAGCCTTTGGACTTGACTCTGTGGATC^ 

GATTC^UVGATGCCACTGGGCCCTCGGAGGCCTTTGG AGG CTCTGGAGGACTTGATGGAAG CCTGGGGCTTGGGGTTGTGGGGAATG 

GTCAAATGATCATTCCTACCTAAGGGTCAGGAGAAAATGGAAGGAGTGT^ 

GAGAAGAGACTATTAGACTGGGGAGCTCTC 

TGGCCAGGCCAGGCCCCAGCGGGATTGGAGAGAAGGCAGAAGCT 

3 5 GAGCTAGACAGGCAGGCTCCGAATCCTAGCTCCCACACTTAGCAGCTGGGCAACCTCAGGG 

AGTOTCCTCACCTGTAAAATGGAGGTGGTGATAGTTGACCACCTATCTCGAAGGACTCTAGCG 
ATGAGTGCCTGGTCCATTGCAAGCACCTGATAAGTAATTGGCATTACTTTTGCATAATGAGCTCAG 

AAGCCATAC AGTGAG ATTCAGAGAGTTG AAGTGGGG CAT CCG CACT CAT ACAGCCG C CCAG AAGCAG AACTGAGATTTG AGTG CAG 
GTGATCTTCAAAACCTGCGTCTCTAAGCTGCTGCTGAGGCTG CGGTTGAGAAGCGTGGGCCTTGGGTG CCCAGGTGGAGGAGGTGGC 
40 ATGCGGCTGGGGGCCCAGCACCCTACCGAGGACAGGCCTGTGAGGTCACCGTCATTAGCAAGGAGATTGGGTGGGC^ 
CTCAGGCTTCATTTGTCAGTAGCCCTCTGGAGCTGGAGAACTTCTGA 

CC CTGGAG GCTGGAAGTGG CTT TG AGTAG CAGAGG AG CC AAATG GG AAC CATGTTG G CG GCAGATT C CAG AAGTGAGT CAG AATTT 
CAATGTTCCCAGCTCATCTGCTTGAGAGCAGGTCCTGAGACCCAGC^ 

CACCACAGTGTCTGCAAACCAGAGCGGAGGGTCCCTAGGCTGTGGGGGTGACrTGGGTGTGCGC^CA 
45 GTGGTGCACCTGCTGGGGACCCACCGTGAGGCCCCGGGAGTCGGTGTGCTGGTG 
GCACAGGGACCCCATGCTCCCATGAGCAGGGGTGCTCCTGGCCCTTCCCC^ 

cctcaggaaagaagaagcaatcttgttagactggggtgtggaagacagaggggcctg 
tctgggcccaggtctgcccccatcccctccccggctcccttcaaaccrccctga 
50 ggagcccaagtccctg cacttgttgggtggtggaggatagctggtcctgcatttcc caggtgatgtgcctgggaag catctctcca 
taaaccccctgtgacttgggcttggaggaagtgtcccttcctgc 
gtgcatttgaaaaagcaggtaggtcttctttttaccacctgg^ 
tcctaaccggtgtggttcccatcactactgcttgtatgacccga^ 

5 5 ggatgactggtcctc^cccaaccccaggagtttagagccrcactagggaacaagactgggc^ 

tggtggctgtgaa(zaagagccctgctgaggggagccccagggggacaggttcaggtcagcctacta 
gcctcccagagx3aggtggcctttgtgcttcagcccaaaggatg 

atggtggcttatgcctgtaatccagcactttgggaggtcaaggtgggagaattgctagagcccagga^ 
aacatagacaccatctctgaaaaaaaaaaaaaaaaaagagggcaggtgaggttatcaggcct 

60 GAGCTC^GTGGGGGCCTGATCACATCCCAGAGGGCCTTGAATG^ 

TGTGGGGTGAAGAGTGCTGTCCTCAGATCTGATTTGCAGACCAGTAGCTGCAGTGGCTGCTTGGAGGATGT^ 
ATGATCCCTGCTGCGTTCCAGGAGAGAGGGCATX^ 

CATGTGGCTGGTAGGGGGTGAGGATGGTGGTCATCGGGGGCACTGGGTGCTGTGGTATCTGAAG 
AGGGTGCCCTTGGCCCTCAGGGCTTTTGCACACAGCATC 

6 5 CTGGGCCAGGTGCATTTAGAAACCTGGCTGAAAACTGCAGGTCT 

• TTGAAGGAAGAGGGTGCIAGCTGATTGCAGACTGCCGAATCTTGAAG^ 

CAAGGAGAGAGGAGGACCAAGCTCTCCCCTTGATGAGTAATTCAGCATACAGTTGAAGCTCTTTG(^ 
GATCTTGGCACTGGGGATGCTACTGATTGTCTAATGGAAGGGGTTCTCAGCACAGGCAAAGGGCTGA 
GGGGCIAACTGAGTAAGTCCTCTAGGATGAGGGGGAAGTTCCCAGTCTGGAAGTGAGGGAAAAA 

7 0 TATGCAAAGGCTCAGAGTGCTGGGGGGATGCTGTCAGAGGGGCACAGCTCTTGA^ 

T CT AGGGGCAG CAAGTCAG CCAG^CTTT CTTCT CT CAAGT CAGAG AAGAAAG G C CAGGG CTACTG GAT CCTGGAATGTGTG AGGT C 
CTTGGCCCAGTGGATGTCTGGTGGCCTTGGTGTATCTrATTGGCACGG^ 
CTTGTGCAGAGGGAGGGGTAGTCTCTAGGTAGGCAACTGGC^ 
GGTGGCCAGGTGACGGCCGTCCTGGCAGGTGTOTTCCATCATCTGCC^^ 
75 CTCACACTGTCCCCGACCATAAACACACACACTGAGATTCCACCC^ 
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GTCAATTACCATCAGCTGCCACAATGCAGGGAGGCAGCT^ 
TGGGACCTCCGACCAGCTCCCAGCCTCCCCATGGCACCAGGGACCTC^ 
CTCCTGCTCTCAAGGAGGAAGAGCTGTCCCCTAGCAATAAGCT 
CCCTCCACCTACAAACAACCCAAGGGAACAAACCCAGA 
5 AGGCGTGCAGTGGGGAACGCAGCCTGTAGCCCCT 

CCTTTGCCCXGCCCTCTGCTCCATCAGCCT 

GGGGTCTCAGCAAGGCTGCCCAGGGCCAGGAGCTGAGGGGGGACAGGATGTGGT^ 
AAATATTGGAGCCACCC!AATTTCATGACTTT^ 
10 GTCCCAGGCXJCCACCrGGGAAGCAGTGCTGGCOTAGCACCAGGCGGCCTTC 
TAGCACCTCTCCGCGGCTCAGTGCCCTCATCTGAGAAGTGG 

AG CTGACAGAATCGTAGCTGTTCACTCACTTGTTCATT CAAC CAACACI^ ATTAAATGTTAGTITTATG CCAGC CAGTGATCT AAA 

CATTGG ATAGAAAG CAGGAAAC CAGCTTGAGGAGGTGCTGACC CTCCTGGGG CTAACATCATTGAGCAGGG CTG GGGAACACAGAA 

GGACAGTAATCAGCTCGTAGTACXSTGCAGTACAGTAGTAATGTCGAGCGATGTGTTACGGAGCGGCTGGG^ 
1 5 GT ATTTT AGATGAGGGCAGTCTGGG AGGACCCCTTTGAG AAACTG T AAAACAG C CAG C CACTGTAGGTGGAGGAGGTGGTGAGCGGG 

AACAGCAGGTGCAAAGGCCCAGGGGTAGGAATATGCTTGGTGCCCGAGAACTGAGGACATGGGGAAA 

GAGCCAGCTGCACGGGGCTCGAGGGCCGTGGGGATGCCTGGGGTGGGGGGCT 

GCAGGAAGGAAGTGAGAGTGGGAGGCCAAGGAGGGGCTGTTGCCCCCACCCAGGCAAAGGA^ 

ACGTOGGGAGGGTGAAOK3TGGAAATGGCAGATGGCTTGGTTGAGGGGGAGCGT 
2 0 TTGATGCCTGGGTGGGTGGGGGTGGAGAGGGCATTTGATCC^AAAGGAGGGCGGAGG 

GGGGTGATGCCAACCTGGACACAGGTTTTCGGTGTCTGTTGACATCGGAGTGGAGAGGGGAGCCCT 

CACACCAGGCCCCAGGCTGGCAAGCCCTCCTCCCTGGGGCTCTTTGCCAGCCCGGAGC CTGTGTGGG CGAGTGCTTCCCTGG 



AGAGTGGTCCACGGCTCTGCCCAAAAGGCTGGACAAGTTGGAAGGTGGTCTGTCATTCAAGCACCA 
25 AGCCCTGGTGTCACTACCAACCTTGTTCCCAGCreGGCrGACTTCT 

GTGCCTGGACTGGGATGGGAAGGACAGCAGGAGTAGCAACATGCTCTC 
CACCAGACAATGTAGTGACCCCTCCCCTCTCTGGGCCT 

TCCGTGTGAAACACAGCCTTTCGATGACGGAGATTTGGTATTGAGAGGGTCATGGCGTGAGTAAAGGCCTG 

AGTCACCTTGCAGGATCTTGGGGAGCGGATGAGTATAGACT 
3 0 TGGGAACACCAGGCTGCCTGGTTGGGTCAGAGGGACCAGATTACCATATAAGCCATTGTGCTCT 

AAAG C CTCAGTTTTG TCAT CTG CAAAATGGGTACAATAG ACGT CACT GAG CTTG C AGGGGTG AATAC CCACCAAGGT ACAGAG ATG 

GGCCAACTCAGTGCCTGGTCGTAGGATGTCTTTGATTTCC^GAGGTTGCCITCTTT 

CCAAATTTCAGGGGAGGGCTCTTTTGCTGTTGGGGCTTCCT 

CTGGAGGCCGTGAACCTCACCGTAGCCTTGGGGACTGGCCC^ 
3 5 GAGGTGCACTGGGGTTTTAACCAGAGGC!AGTAATGAGCT 

GGCCTTGATTTCCTATCAGCATTGATGTGGTTG^ 

GTGTGTGTGTGTGTGTGTCTTAACTGCCATGGGTTTTAAAGCCATGGTGGAAAGTGAGGA 
" CACCAAACrGTCTTGAAATGCTATAAATATTTTCTAAAAAGACTAGCTTCCCGAGCCC 
ACTGATTCCCGAAAATTTAGACAAATAACCTCTTA^ 
40 GTCACAGCGACTCTGCTCTGCCTAAGTGCCTCCGTGTGGCATAGAGAAGGTGGTGGCAGAGAA 
AGACCAGGGTCGGAAGCCXJGGCTTGGCCACTGACTGCT 
CAGACCTGTTACAGGCTGCCTCGATCCTGAGAGTGCCACTGAACTGAGCT^ 

TGGCATTCAGGAGTTCATGTTAATCAGGACTGAGAAAACCCCTCCGCCCCTAGTCCCOTTTAGCCCTCT 
CTGCCAGTGACTGGATACTTGCTGGTGTCTCTC 
45 raGTGCCATTGTTTGTTCTCCTTGTATTTGTTTT 

TAATATAAAGTTCTTTTAAAAATAAAAGTATTTATC 

GAAATGACAAACATCAGGAAGATGGGAGGAAATGTGACGGACTCTGCACACTGTCCAGGTTG 
TTCTGTGCCCTTGTAGAAGGAATGAGTGGGGAGGGGTTGCT^ 

AGGTGACCACCACACTCCCTAAGACGGTGGGGCTACAGAAAGCCCCATTCTGGCATCCGGGCCAAGGAAACCACTCTCGTGGAGAG 
50 TCTTGGCCAAGAAGCTGGGATTCTGTAGCAAAACCAAATCCCC^ 

TGAGGGTCTAAGCTTGACACCATCTGGCTGCTCTCTCCTAGGCCAGAGACCAAGACCAGGCCCTAGCTCTCCCAGATGTGGCTGAA 
CCCAGCAGAATGTAGTAGTTGCTCCACTAACAGGGACATCAAGTGCTGTAGGTTTTGT 

GGAGTGCTATACAGAATAGCAATTCTGAGGCTACAAGGAGCTCTGTAAGAGAGCGTGGGGACCAATTGGTAATGCC 
GTGGGAAGGAAGCCTGGGGGTACACAGGCCTTTATTTGTGACTCATATGCTCCCCCAAG 

TGCCCGTTGTGCAAGGGGACATCCCCCAACCACATTGCAGGCCCT 

TCATTCCATACCCACTCCCTGCTAGGGCTGGATATGGGGTAGGTACCCAGGAAGTGTGTTGATGAGACCCAAGAATGATCG^ 
TGGGACTGGAGAGGTGGTTCGGCCTGGCTTGGCGGCAGTCCATGTAACACTACCTCACTACAGGCATCG^ 
CCCAGATTTTCTAGGACAGCCCTGGTTTCAGATC 
60 TTTGGAAGTCCTGGCTGCTGTGGATATAGCCCTCAT 

CATCTGGCCAGGGGATGGAGCAGCCCCATTTGCTCACTTTTTTCAGGTGAGGAAATTC 
GCGGCCACACAGCGAGTTAATGCAGACATCTGGCCTGAACCTGTGATACCCATCTTCTGGCCACTT 
CAGAACTTTCTGCATCCTGCTATTTAAGCCTCATTCATTCAGTCAGCTTTTGTAGCCTAGCG^GTTAGACCTCA 
CATGCTTTGCATTTGAACTACTTCTGGCCGTGTGACTTTTGGTGAGTTCCriTTCTCTCT 

AGTGGTGTTCACAGAGGTGGATTGATCAGAGTTGCTCACAGTCT 

GCGTTCAAGTGGATGGGTGAGGTCAGAGGTGCACTGGGCACCACCGGGCGGTGGGAGGAGGGACCGTCTCCATTGCTCCCA 
CTCGGAGGCATTCAGAAAGGCTCCTCCGTTGATGGACGCTACT 
CTGGCAGATAGCAACCTCCAAAACTTCTCTGGACAGAGGATGGCA 
7 0 TGGGCTCTGCTGTAGGGTGGAGGTGGGGTTGAGAAGAGGTGAAAAGA 

GTCCTTCCTCTCAGCATCTGTCTATGCAATATAAAGAGCCCCTCCCAACACAGTCACCAGTGATCTGOT 
ATCCCAGCCCACCCGGAGACCCATTTATTCTGCCACTGGGGCAC^ 
ATGGTTGTATTATTAATAGTC^TAATTAATCAGCATC 
75 GGGAGGTCCCATTCCAGTCCCCCTTTACAGACAGGGGAAGTGAGGCTGGG 
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GTATGGACCCCAGCTCAGATCTCAGAT^^ 
CCTAGGTCCTAGGGTGCCAATTGCTGGGAGAAGCTTTCAGCA^ 
ATCTCAGGCTGAGAGTGAAGGAAGAAAGGTGGGATGGGGAAGAA 
GGCCTGATTGGGGTAAGGGTGGGGAGGTGGCCTTGGGTAGAAGGAAAGCCAA 
5 TGGGTCAGGGCCrGAGGTTGGCTGCrrTCTCTGTTCTAAG^ 
CCTGATTTATGGGGCTGTTTTGGACGGGC^ 

GAGTGGGCACACCCAGGCCCACGTGCTCTGTGTGCACCTGAGGCT^^ 
GCACG CAGGTGTGTG CTGCTTTGGGC CTGTGCG CAGATGAACACAGGTGTC 
10 TGCAGGCATCCATGTGCCTATGTCCTCTGGGCTGCCTCT 

TGTCCTCTGTGTGTCCCAGTGTGGATGTGCATTGGTGAGGGGTGTACATGCCrCTG 
TGCACACATGTGTGGGCATGC^CTGCCGCCTCATCCTGTCTTTTTCATTATG 

ACACAC CCGGTGC CCAGCGTGGCTGGGCTG CCCCC^ CCTGATC^TT AGG CT CATGGATAAGACTCTCTGT CATCGTAAATGAGTAT 
AGAGTGCCCCAGTCCAGGAGTCCTGAAGGAACTGGGGAGGTAGCCTCAT 
15 GTGCTGGAGAGGGGAGGCTGGGAGGGAGGGATGTGGTGCCCACTCCCAGGCCCCCAC^ 

CCCCACCATGCCTATGCACTCGCCCATATTCTGTGCCAGACCTTGAGGACAGCCCCX3GGTCGT 

ACCTGCAAAATGGT CTT AGCAATGGCCAC CTCATGGGGTTATTGTGAG GACAAAAGGGGTG CAGAAACTG CTAGAG AAGT CAAGTT 
CCTTCCCCCAGCAGACCTCAGTCTCCCCATCTGTAAGATGGCAATAATGACCCrrCACCAGTGGTCT 
2 0 AGATGAGAAAAAGGTTCAGTGAAGGCTGCAGGGTGGTACCCATAGGACTTGTTCC^ 
GCACTTGGGGGTGG/CATGCATGGCCTrGGGCCAG 

GGTTACTTCATCCTCATCTGCATCTCTATTATCCTTAGATTCCTCATCTT^ 

GCATGTTGGGAGAATTCAGGAGAAATGAGTGTGGAGGTTAGC^ 

TATTCTTGTTTTTGGCCTTATTGCGJVTAC^ 

2 5 CACACTAGGATACCTATTAATGAGGTTTAATCTGTGTTTCCA^ 

CTTCCAGGGAACTGAGAGGGCCATGCTGAAACTGGAAGCCTCAGCCTGCAAGAGG 
GTTGGCCGACTACAAAGGAGCTGGCTCCTGGGACTCTGAATGCTGTGCCCTCTGCAGCCACCCA^ 

GTTACATTTACCCCTGTCTCTTCTG AACCATATGGTGG CTG CACCACAGCTGGCTCAGAGGCTCTGAAAG AAGGCCCACAGGGGAA 
CATGATGTCTTGC CTGAGCCAGGTGAGGACTGAGGCCCCCACTGCCAAGCACTGCAGTCTGGGGCTTCCATTTTCACCATAG C 

3 0 TCTAAGCTGCCTGGAGGAAGGGCTTCAACTCATGAACTTCATGACCAATGGACCT 

ACAGGTATCCCAGTGTGATATCAATACATCATCTGTCATGCACTCACCTGTGCTTCCAGATGGGGT 
TCCTTGCACAGTTATGCAGATGGCTGCCAATCTACTTGCTCCCCTTG 

TAGAGCCCATGAGCAGTCCCCATCACCTGTGGGCTTCCCAGTCTGTGCCCAACTTAATGATAGCTGTTAGGG 
CAATGTCCCCATCCTAGCAGGGGCTTTACTGAAGAAACGAGGCCCTGGCACCCA 
35 CAGCCGGGTCTG CAGGGGCTCGGAGGTCATGGGGCCCAGGTGGG CAGGCAAGGGAGGAGGCAGGCCAGGCTGGAGCCAGTCCTTGA 

AGGACAAAGAGGATTTAGCTCACTGGAAGGGATGGCATTCCAGAAGGACAGA 

GAAAGGTGTACCCTCTCCCATAAGAAGTTTGGCCTGCCCTGGACCAAAGAGCAGAGTCTTTGCTCTCGATC 

GTGAAG CAGATGGCCAGGGGGTGTGGGTGACGGGGGAGGCGCACCCTCAGG CCTGAGCCAAGGGAAACCAAGTGGGCCCACACAGG 
CCAGAGCCTX3AGACCTTGGCACCAGCTACTCGGAGATCT/rGACTCA 

4 0 TGCCTGGAAATT CCCAAGCTTGCCCCTTCCCTCACCTTAGGGAG CT CCAAGAAT ACAGTCTCACTTACACTTGGCTGAGAACGTCT 

CACCCACTATCCAGGGAATGGTTTTTAAAATT^ 

CTGGGAGCTGGGGTGGTTTGTTTGAGTGTTTGGAACAAATGTGAATAGAATTCAGTCTAGTGT 

TGGGAGCTACAG CTTG ACCCAGAGTAGGACTTGGTTGGGCTAAAAAAGTAATGG CATAGGCAAAGGCC CTGGGGCATGAACATCCA 
GGGAACGCTGAGTGATTGTGGTGTGGCTGGAGTGTAGAGGGTGTGAGAGACTTCTAGAAGTGTGGACACAACTCCCAAGTGAAGGA 
45 CTCTCAGGACTTGAAGATTTGGCAGTGGGACGTGCTGGCAGTGGGGAGTCAGGGCTGGTACCTGAGTAGGGGACAGGTGTGTTTG^ 
TCTGGGTTGTAGGGTCGCTTGCCCATCCCTGTCTTGATTCTC 

GGTG^TGCAG^GA 

TGTTGGGGTAGGCTTCAGGAACCCCATGTTCCTGGCAGGCTCTGTGGACACGCCGCCTGGAAACCACTTCTC 

5 0 CATGGCTGAAGGGGCCTGGAATGTGTATCTGTCCTTGTAC^AGGAGGGGAGCAGAC^AGGCCCAGAG 

GAGCCCAGGCAGAACTCAGGGCCCAGTCTCCCAATCCTGGCT 

GGACATT TCCCCGAAACCAGGCAAAATAG AGCAGCCTGATTGT CCATGTCAAAATGTCTGGGCATTTGGGGAGGGAGGGTTCCTTT 
ATATCTTTCACAACTTAGCAAATTTAATTTCTTCT^ 
GAATGTTTCCCTGTGACAGCAGCTTTCTCTGGCGAGGGTC 
5 5 CCTGAGATCGTGAGCTCCTGAGCTCCTGGGCTCTTGAGCAAGGCA^ 

GCCCTGTGTACCAGGGCAAGGACAGGCCTAGCCTCTTTGTTCCCCGCAG CAG CCTGG CCTGCCTTCGGAGCCTCAGGTGGCCTCTG 
GTTGTGGATGGTCCCCATGCAGCGAGGCTGGGTGGTATCAGATOT 

GCTCAGTGGC^GGAGGGAGCCTGTOTTCTGAGCAGCTCTGCCAGGACTGTCTGTGGCCGAGAGACCGGCAGCATC 
GCCTCCATGGTCTGCGTCATACTGGCTTAGGTGTCCTGAGTCIAGAGCCAGGAAATGGATCTGCCATGCGG 
60 GCCCGGGCAGCTAGGGTGAACCCCAACCTCCTGCCCGTCCACACCTGGAGGGTCCTGGCCTTGTGCTTCCTT 
TGAATGTGACTGGTGCCTGGAGGAGAGGCTGAAGTTCCAGCCTGCTCCACACTCCACTTTAT 
CTGTTGTTGGOVGGGTGGGTGCTG^ 

TGCCCTTTCCAACTCGTTGTCTGTGTGATTTTCTGCTCCCC CTCCAAGAATTTACACAAGAATG ACTTTI^TGGTGAAGG CTTCCC 
TGGCACCCCCACTCTGGCCAAGGCTGCCCCCCrTTCCrrCTGAGCCCCCTCT 
65 CGTTTGGAGCCCGTTCACCTGTGTAGGTGCCTTGAGGCTTTTTTCTTTGTGGGTCCTACA 

AAAAGCAGGCTGGATTTAATTAAGGGGGTTCCTGAAGGCGGTGCTGGAGTCTCCCTTTGTTAATC 
AGTTTGCAAATGGAGAGAGGCTCACTTGCCTG^ 

GGGACTGGGGTGGCCCCCTCCATTTCTATCCAGCGGCAGACTTCCCT^ 
' ACTCTCCAGAAAACACCCCAGTGGAAACTGCATGAATGATTGATAAGAGCTGAGAGCTGTTTAATT^ 
7 0 TATCAATAATTTAATACGCTCTCTGTATGTTTTTAAGAAATAGCACCCATTTCGCCGTC 
GTGTTGGAAATGTGCTCTGAGGATGGAGGTTGAGTGAGTCGGAGAAA^ 
GCCACAGCATCTTGGCCTGGGTGTGTATGTGTGTGTGT^ 

GCAAGATAATTGCTTTTCCGGGGCCTCACTGAGCAGCA^ 
75 TGGATGGCTCTGGGGGGCCCAGGGGGGCrAGGAAGGAGCTGCCACCAGTATCACCACGGGCTGAGAATCTCCCCTT 
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CTTTGCCCCCTCTTAAAGCTTCTAAATCACTT 

GTGGGTGGTGGAGTGGAGTGAGGGACTCAGAGGGACCTGGAGGGTCA^ 
CACCTGGGAGGACTTGCTAAAAATTTAAATTTC^ 

GTTGTGGGTTTAACAATCAGCCCAGGTGATT CTTAGCAGAGGAGTTTGGG AACTATAGAGTCCAG CT CTGTT CTGGTG CTTACAGC 

TGTGGAAGGCATACTCTAGTCTAGAGTTGCTTCTGATGT^ 

GCTACAGGACAGACCGTGTrrCCTGGGATCTGATGTCTTTC 

TGGGTAAGGAGCTTCCTCTCTTAGCAAAGGGTCCTGGAGGTAGAAGGCMG 

GAGGTGGCTGCATCATCCTATTGCTGGTAAGG^^ 
1 0 GCACTGCCTTGGCCTGTCAGGGATTCCATTCCTCCCACGAGAGAGAAAACTG 

T AGATGACACTGGGGGGAACTGAGGGCTG AGGGGTTT CCCCAGAGACC^ CT ATTT AGGACAGAG CTGGAGCTGCAAATGTG 

GGGGCGAGAGGCCCTTAGAGGGGTGGCCAGGCCAAGGCCCTGGCTTTAGGGATCA^ 

TTGGATGGGGGAGGGCTTCarcCTTCrCAGTCA 

GCCGATGCCCACTCTGTCTGGCATTACAACTC^^ 
1 5 ACTACGG C CATGGGTCTTG CAATTTTGTTTTAATAGATAATTA^ TACTGGG CATCATGGCTT AC 

TGTGAAATGGGGAGTATAATCCCCCCGTAAGCCCTGGACT 

CTGTGGGAGAAGCCTCGGTCCCCAGGAGAGAATTGCCCTGTTC 

TACCCTGAAGCCTGCTCTGGCCATCACAGTATGGCTGGGGGTC 

GATAGAGCTTCTGCrTGGAGAACTCCGAGAGCTGCAGAGGCAGAAGAGAGAC^ 
2 0 AGAACTGTGTGAAT ATGGACACC CAGGCAAGG CCT GGGG AGGT CAG GGAGGGC CTCCAGGAGAAGGTG CCACCTGACTGCAGGCTT 

GAGCATGGTAGGAAGGGGAAGGGCATCCTGCGGTGGAGGTGTGATGCAGCCTGTGGGGGTGCACAGAA 

AGGGCTGGGGATGGGCTAO^GAGGCCGGCTGGCCAGGGCGGGGCrrc^ 

TTTAGTTCCATTGAAGTGTTACACCCATGCAGAAAAGTGTGTATCTCCTACAT^ 

GCTTCTGTAGCTGCCAGGCAGATCAAGAAACAGAACCTTGCAG^^ 

2 5 GGATACCGACTGTCCTGACTTCCAGCCTGCTGTGAGTTATCTCCTGCTGCCCAGCAA^ 

GCCACCATTTATGAAACCTCACAATTCTGTGGTTTGGCTGGACTTCrCCrC 

TTGAGTGTCTGCTGGACTXSGAAGGTCCyiGATGGCCTCAG 

TCGACTAAGACAGACACCTCCCTCCATGGCCrGCJ^ 

AGT CT CT AT AACT CACAAAATGTCACTCACAC CGAAT C CT ATTG AT CTAAACAACT CTCAAGGCCAG CTCT CAGCAGGTGG AGAAC 

3 0 AGACT CCG C CTCTTCATAGG AGGAACTC CAGAGG ACCC CTGCCGTATTT CATCTATTACAG GTGT AGAGTCTTTTTGCC CTT CTTG 

ACCTTTATATAAATGGATCTCATAAGTGTATTCTC^^ 

GTGTTGTTGGATG AAGTTG CAG AT CACTCATTCC CAAGCCTGTGCAGCAGTTTGTTGTG TGGAAACAC CAGTTG AC CCATTC T ATG 
GTTGGTGGCCATCTGGGAAGTTTTCATTTTCTGGCTGTTAOSA^ 
ATATGCIACATTTTTCTGTGGACATGGAGTTCCTGGGTCATGGGAGAG<3 
35 TGGTGGGTCTAGTGTCTAGTCTGTGCCCCACCA^ - 

TGTGGACACTGGAAGGGGCTTCCTGTGATGTCCCCTCTCCCCGCCCTGAGCCTCAGGCCCCCA 
GGCTGATCCCTCTTTTCTGAGCATGAGCTTTCCTGCTACACTTTTTCCACCCCAGAGGCCTCCT 
AGCTGCCTTTCATGGTTGACCTGACCCTCAACAGTCAGGGAGGCCAGG 

AGCCTCCATCCCrGAGGAGATTTGG^CTTCACTCCAGCCTTGCCTTTCACCATGGGAGGAACCCTTTCCCT 

4 0 TCCTCATTTGTAAAATGACAAGGAAAAGTATCAA.CCTCACCAAGTCCCTTTCTTCT 

ATAGAGGAAGGGATGGGGGGAGGCCTCCTGACTTCAGTCAG CC CATGTTTTTCTGTTTGTTCACrGCACACTTTCACGGCACCA 
CATCCATCAAGGAATCTCTGTGGTCCTTGTTAACCAGTT 
GGCCTGGAGCGGGAGAGTGTCCTGTGGGAAGCGGGAGGAGGCAAGGTTGGCC^ 
AGGGGCTGGTGAGAGAGGGCTCCACGGAAGC!ACTTGAGCAGAGTCTCAAAGGCC^ 
45 AGAGCTCTGAGCGGGGCACCCAGCGTGTGC(^AAGGCCGGAGCCAGGGGCTCTGGGAGGCCGGGCTGACTGGAGCCTAGTGGTGGGC 
AGGCAGGTTTGGCTGCTGCrGTGGTCTGGCTTTCAGGGCTGCGGGGCTGCTCGGCCGGTGCT 
AGCTGCGACCTTTATTTGGCAGTGGCAGGACGGTATCCAGTGTCTC^ 
GATGGCAGCCAGGACCTAGAGGTCCATGACGTGGGTGAGGGGCTGCCCGCAGG^ 

TCTCCCATAGGACTCACGGCAGCrrGTGTTCTGATTTCGTACTACTGCT CACCTCCTCCTC CAGACGCTCTCAGC AGAC 

50 TTGAGTCCTGGTCCTTCTGCAGAGGCCTGAGCAGGTAAGTC 

GACAGGGAGTGCGCGCTGCCCAGAGTGCATGGTGCTGTTTCAGGGGCCC^^ 
TTTATTTGTTCCTTGGCCTTTTCCTCTCAAAGAAAGCCACACAGTG^ 
TTTTATTCTCCTGCCAACTCCCAGTATCGCCTTGGGTAGC^ 
TAACAGTAATGATATCCTCTAACCCGACAAGCACTTAACITGTATTOT 

5 5 AGCATCA.CAGCAGCCTTATGAGAGCATTGCTGTAATCACCAGCATTGTACATGT 

ACTTGCCCCAGGCGGCACACCTGGCTGAGCCTAGGGTTCAAGTCCA^ 
CTACTCTGCCTCTGGCCTTGCTCCTAGCTTCTTTTTGGGG 

GTCCCAACTTAAGAAAAAGGGAGAGGGGCTTTTACCAGGTGACAGGCTCCTGGTGGGTCT 
TGTGGCAGGGACCAGGCCTGGCCTCTGGGTGCCTCTGAGGTGGCA^ 

6 0 GGCTTTCTTTCCCTCCGCCCCTCAAGTTCGTTTTCCTAAGCTTGTATTAAACACTTACT 

ACTGCGGTGCAAGGGAAAGGGGCOTGTCCCACTCTCTGCTCCGTGGAGAAAC^GAAAGATCC 
AGTGGCGTGGAGAGGGTTCAGAGGCTGGGGGGAGAATTCTGGGGAGGTGACCACAAGTGAGTT^ 

AG ATGG AGCAGAAC C TGG CAGGATGTGCATGCACACACAGACATG AGTGGTGGTGAGTGCATAG AT AAT ACAG AT GTGCAAATGTG 
CGCTTCCTCACAAACGTGTACACAC^CGTCGCTTCTGCATGCCACATGAATGCA 

6 5 ACATACACATGCACAAACTCGTGTTGGAACTAC^ 

CTGCATATCC!ATGCACACACAACGCAGGGTGCATTCCAACAAGAACA 
TCCAGCAGCTCCTGGGAGGCTCTGGACAGCCTGGGCCX3AGGCT 

GCTTGAGGGCCAGCCATGTGACTGACCATGGACAAGCCCTTGCCCGCTCTAGGACTCATTCCCCATOT 
TGTCCGCCTCCTATACGGTGGCCGCAAGGATGAGAGGCTGCTGTGT^ 

TCCCCTGGGGCCTGGGTACTCTCTGAATCATGGGCATGACAGTCCACCCTGCCTGC 
CCTGCTCCCGCTCTCCCACCTGCACTTCCCTXnTGACT 

ATTTGACTCCCTGCCAGGAGACTGGCA^TTGCCTCCATGAGTGCTCTAAATCCCCT 
ATTTCCAGATGAGGAAACTGAGGCACAATAAATCCAGGTGAGGTTCTCAGGGACACCTGCT 

7 5 ATCCAC^TCATCAGATGTGAACATTTATGCTCTTTTCTCCCATGATCTCCTCTCAGGGATAGG 
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T AAGGAAACTGAGGCTCAGAGAGGGCAGGC CACTCAGC CAGGAAGTAGCACAG CCAGAATTGGGTG CTT ATT AACCACACTG CACC 
CACCTCAATTCAACACTTCCTTTTCCrTTCCCTCCCTCCT 

TGGGATTATCTGGGCTTT AGTTCTGACCTAGTATTC CCAGGCA CTATTCTCTG 
GGGTCTGCTTTCTAG CATG ACCTAGT AACTTC CTCTCCACAACT CCTTTGTAAAGT ATCCTTAGGATTGTC CTTTGGAAACTTGG C 
5 CAGCAGAAAGCTCTGGGGTACCCAAGAGGCTTGGAGGAGTOT 
GCTCTGTGACCTCCTAAAGGATCAGGGTGTGGGACC^ 

TGGAAGGAAATTGGGAAGGAT CCCCGGG AGGTGAT ATTT ACGCTGGGC CTT CT AGGTTGATGTCAAAG 

GCCAAGGAAGACCCAGCAGAAGGCTCAGCTCCACCAAA^ 

TACCCAACAATACCCATCAGAGTCTCCTAGAATTAGGGCCAGCT 
1 0 CCCirrGTTCATGAATTACAAAGGATTTCAAGATGATGACA^ 

TGAAGCCAACCCTGCTTGAGTAAGTGTAGCAATTGTGGGATTTAGCGCCAA 

ACCAGGCCCACGACCTTGGGCAAGTCCTCCTCATCTCCGTAAGCCT 

AGGGTCAGTGGTGCCAGC^TCTAGCTTATCCCAAGCAGTCCAGGACAGAAG 

CGTAGTGATGTTCAAGGCTCTCCrrGACCCACCCCATTTACCTTCCAGCGCTGCCCCTC 
1 5 TITrCCAAATCQ^CCTGGCTGGGCAGTGGAGGGTATGGGGAGTCT 

TGCCCACACTCCCATCrcrrCCTTGGCAGACTCACTCCTGCAGGT^ 

CCAACCTTTACTCTGTGATACCTTCAGTAGCT 

CGATCACAC CCTGGAG AGCTCT TG AAAGGCCCT ATTC CCATTTG CTAG AT AGG AAAAC AG AGGCTCAGAGGTACAAC CATC CTAAG 
TTCCTGGGCTGAGGTCAGACCCACATCGGGTAGGGGTGTAGACAGTGGCAGCAGAGAATGGTACCTAGG 

2 0 TCATCTTGCAGCATTTGGGAAAACrGGCrTTTC 

AAGAGGGCCTCGTAGAGCGCAGAGGTCCAGCCAGCTAGAGCAGACTAGCGCCCAGCCCT 
GTTCACGTGGCTCGGCX3GACTAGAGCCAGGTATACATGGCTCTGAAGT 
GCAACCGCAGGCAAAGGCTGGCCACAGATGAGAGGGGTATCGTGTCAGG^ 
AAAGACACTCAAAACACTCTTGGTGGATGAGTGTGGAGGATGCTGGC^ 
25 CCAGCTCTGTTACCCCTTTCTTTGTGGCTTCTGATGAGCGACTTCATCTCTCTGTGCCC 

AT ATCC CTTCT AGTTGGGAGGG ATTCAACG AAGT AAG CCATGG GCACT CT CACAG C CATGC CTGGAGCATGGTT AGTGCTTT CTGG C 
ACTTTCTAGCkTGAAGAGTGGAGGTGGCAGTAGCGGGAGTGAG 
TCCACTGAGCAGCATGGTTGTGCGGTCTTTTAAGAGAAGGTGGAGAGGGA 
TATGAGGCAATCCCAAACAGCTGTTGCTCAGAGCTCTGTGGGCAGCCACCTGGGGTGAGGGCT 

3 0 AGCCCCTGGTGTGTGGCTTTAGGCTGCCGGCATGCACACCCrAA 

GGCTGGCCATGAAGACCTTGAATGCCAAGCCCAGCTCACAGCAGGCCT 

CCAGGGTCCGCATATGGCATGCCTGTCAATCTTCAGCGCCAGCTGAGAATGCACACCTCT 

TGC^GCCCAGGCCCTCCACCCCMCCTCTGCAGCCTGCTATGACTC^ 

3 5 CTTGTGTTCAAGCCTCAGTCCCIATTAAGAACTGCT 

TGTAAAATCAGAGGATTGGGGTAGGTCIAGGGATTACAAACTGCA.GTCTTC 

TGTTGTACAAATGAGAAAACTCAAAGCACAG AGAGGTTAAGAAACTTAG CCAAGGT CACACAGGTTATAGTGG AGCTAGGACTCAG 

CCCAGTTTTATCTGACACCATAATATGTGTTTTCCCTACCACAACAAAC^ 

CATTTTGTCCATGA.GCCAGCTTGGTGTTGAAGACTTCCTTGTGCT 

4 0 AGTTGAGCTGGGAGAGCACAGCTCCAGGTCCGGAGTTCTGAGTAG A CTAGGTAGGTGCAG C 

TCTTTCXZTCATGCTGTAGGAGGCAXSCCAG^ 

TAGGGTCTTCCTGGGGTCTCTGCACAACTGGGTGGCATGGGTCAGGGCAGTC^ 

AGGCAGCCCCTGAGGGCCTTGAATGCTAGGCCACAGAGTTTGGCC^ 

CAGTGGCACCTCCTTTGTGAGAGCTAGGAGGGCTGGC 

4 5 TAGTTCACCC^GGAACTGTGGCAGCAGGAAGAGCAAGGTTGGAG^CTTGTCCACT 

TCAGCAACCCAGGCACCTCTGCTGGACC^GGCGCTGGGGCTTTCTACC^ 

AGCGACCGACGATGGAGCCAGGTGCCACAATCTGCACACCTCCCCC^CCCCATC^ 

GGGCAGGCGTTGCTCAGTGACACAGTGGGATCAGAGAGGCTCAGAGA^ 

50 ATGGATGGTGTGTGCTCACTCTGGTAGGAGTGTAAGGTTGTAAAGAGTTTCCTGGGTGTTACT^ 
ACTGATGTTTGCCCCACTGGCTCTTTTTGAGCTCTGG 

GGCAGAGGACAGCCTTGGCCGTG CCTX3TGTTCCT AGGTCTGAAG CCTCAGCGACCTTGCIAGCAAGTAATAGACACTG 

ACAAGCGTGACATCTTTATTGGAATAAAAACAAAACCACJU3CCTCTOTCT 

CCACCCCACTCATGGCACAGATGAGGAAACTGAGGCCCAGAGATAGGAGGGCATGCCCC^ 

5 5 ATCAGCTGAGTGCAGGGCCACACTAAACTGTGTAGGCTACATGGGGGTGCAGAGGCTGG 

GGAGCCTTCTATCTGCAGAAAGGCAGGACTGACCCTGAAGGGCCAAG 

ACTGAGAGCTGCACGGGAACCCGCAGACCTGGCTCCGTGGAGCCGCATTTGTGTTCTGCCrCCTGCT 

AAGCAAGGGCTTATCCTGTCATCCATTTACCC^ 

ACGCCTCAGAGTCAGGGCCTAGTGTGAGGCTTATAAGGGG 

6 0 CTTCCCCTCATCAGCCCCCTCTGCCAAGGCTTAGCAGAGTGTCCAGCAGGCT 

TGGCATTACCTAGGGCCAGGGTCTAGGGGCTCTGAAGGCCTC 
CCCCAAAGAATGACCAGACTGTGAACCAAACTGAAAGCATTCCACGCA 
TGGGGCAGATAATGGGGGCTTTGTGGAGGCTAGG&CTGCTC^GGGAAA 
GCfcGAAGGAAAGAAGGTTATATTTTO 

6 5 CTTTTATCTCCTCTGAGCTTCAGGCACCTGGGGAGCCACAGCAGGGAAGTGCCTCAGTG^ 

CTGATGCTGGGGATATCAGCTGAGGGAAGTATGGGCTTCCCCAGCACCTCGGGGGCAGTTG^ 
AGGCTTCTCCCCAGTGGAGGAGAGCTGACCTAAGATTTCTGGGGTTTGCCA 
CCTGGGGCAGCCCCCACTACAGACTAAGGACCTGGGTCCCAGGATAA 
TCCTTCTTTCCTCCTTCTGCCTCCTTCCCTCCTGCTGGGGAC 
70 GATACCCTGATGCAGAGGAAATGCCATGCCAGTGCCAGGCATGCTGCCGGCGCTCTGGGTGCCT 
CCTCCCCATAGCCCTGGGCTCGGCCGGTCTGTGTTTGCAGAGCCTCC^ 
AGC^GAGCGCCAGCCATGGGCTCCTTGGACTACAAGTCCCCGGGATTGCGCAAAT^ 
GGCCCCGTCAGATGGCCCAGGTTCCTGCTCGGGAAGCCCA^ 
AAGCTTCTCTTATCCTCACAACCCCTOACTCATGTCACCCTCATAATATCAAG 

7 5 TTTCCAGAGGGGGAACrGGAGTCAGAGAAGAGAAGGCATC 
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CCTGCTCAGCCTTTTTAC^CCGAGC 

GCTGGGGGCCAGCRGGAGAGGCCACCAGTCTCTGTCTTCCCTC^ 
TGACATTTACCTGGTGGGAAGGAGGGCCTCCTCCCTCTTGAATTGAfiATGCT 
CTCAGCCAGCGCCTGCCTGCCCCCTCAGCTACCTTTTTCC^^ 
5 TGAGAACTTCCTAGGGATTGGGACAGGAGGCATTG 

TCCAGGCAGCAGGGTGATGGTCOTCTGCCTCAGGTCrCCCTCCCCCTTl^ 
CACCGCTAGGTGCTTGGGTGCVl'1'CTTTGAGCTGCCAGAGTTTCCCCTGOT 

GCAACCAAGAGGAGCCCAGGCTTCGGAGCTGGAC^^ 
10 (^TCTCAGAAATGGGCATGAATCTGACCTTTGCAGTGTTGTTGCAGAATGAATCG 

AGCCCCTGGGGCCAGGAGGAGGCGTGTGTACATCATGGGGCGTTCCTTTCTCTTGCTGTC 

TTAGATTGATTGGACCAGAAGGGGAAGGTCTTGAAGAC^ 

GCCAATCCCCAGGGCACCTCCAGAGTGGCCTGTGTGTT^ 

GGCCAGCATCTCCCGGGCTTTGGGATCATTCCAGTGGAAAAAGTAGTTTTCAGTGCT 
1 5 GTGCTT AACCCAT C CACAC CTCAATTTC CT CATCTG AAAATGGGG ATGAT AAT ACTTCCT TCCCCGTTAAGT ATTGTAAGGATT AG 
ATTAGCAACATATGCACAGTACTTAGAAAGTCTGGTGCAGGGTAACACTTGGTATGTGTTAGCT 

GTCTAGCAGTTGGG AGT AATAACAGCAG CCACCTG CCGTTTGTTT AATGG^ CT AAGCACTTT 
AAGTACATTATTTAATGTCACTCTCCCCCACCTCCCAGC^ 
ACIGAGACACAGAGAGTAGTTGAACTGAAAACCAGAGTTCA^ 
2 0 CTGGTTTTAGCATGGAAAATTCTGTGTCCCAGAAACCTGTCACTTGTGGGCA 

GTCCACTGGCCCCACTCTGAAGCCCACAGGATCCTCTrCTTGGCG CAGGGG CTTGCTGGTGCAGTCGCTCGG CCCCTTCCCACACC 

TCATAGGCCCCTCCCCCAGCCAGATTGTGCCTGCCCATCCCCCAGGGCCTGGCTCCAGTCTCCTCTCTGGCT^ 

CCCTCTGCCAGCAGGAGGGGCATGACTCATGCCTGGTGGGCTCTTTGCrCAAGGCTGTGA 

TGCCCCTCACCAGG CCTCAGTCTCCCTATTTGCACTGAGG^CTTTAGTGGCTTTGCTCC CAGCTTTCTGGAGACGTTCAGGAAACC 
2 5 TCTTCCTTGAAAGAGGGAGGGCAGAGCGAGCCCCTGCCCCCAGCCCT 

TGCATTGATGGAGTGCACTGCTGACAAAGGAGAAAGGTCAGCAGGGGGACAGAGCGGAGTTG^ 

CCCTCCCACAAGATAGGGCTTCCTGGGGGAGGGGAGCAGTGCT 

GTTCAGCAAAGCTGGGCATCCAAGCCCCCAGAGGGCTGCGGGGAGATGCCTA^ 

ACATCCAACAGTTGATTAGGCCCCCTTAACCATCTCTGGATCTGTGACCCTCTCAGCATCTCACCCCTGC 
30 GCTTCTGAATCCCTGAGCCCTTTCCAGCCCrCCCTGGCTTCCCCCATCACCTTGCATAAAGTC^ 

AAGGTCATGTGAAAGGTCTGGGGACACCCTGAGTGCCAGGCCCAGCGCT^ 
CTGAGCCTGCIAGCAGAGCTGCAGCCAGGTGCCAAATGGATGCAGTCCCCCTC 
CCATGCGGTTGCCCAGTGGATCTCATGCCCCAGCTCCCAGAGGGCAGGAGTTATCTGTCTCCC 
35 ATGGCAGGAAAGCGCCATGGTTGGTGCATAGGCGCTGGGGTCATAATATGAGGGTCTGATTCCGGCTCCATCA 
TGACCTTGGGCAGTCACITAACCTCTCGGTGCrrCAATGACATCATT^ 
AGTGGGATTCAATAAAGTGACACTTATGAAGTGTTTG 

ATCATCAGGCAGGTATCTATGAAATGAGCACCGTCTTCCAAATGGGGCCCTCCCAGTTCTTTGAAG^ 

GATTGTTCCCTCTTTGTCTTCCTCCTGCTCTAGCATCCCGCTGCACCCCCGCCTGACATCT 
40 CCCTAGCCTTGCATTCTTGC CTAATCCACCCCATTGCCATGGAATG CCTTCAGGCTATGG CCAGGAGTTCTTCCCATGTCCCCCAC 

AGCTCTGCACAGGTTCATGGACCACAGTGGATGCCTAATAAGAATTTGTTGGGTTGAAC 

AAGAACIAATGCAGCAATGACCAGCCAGGCCTCT 

TGTGCAGGAGAACAGCATCTGCCCACTCTGGCCTCTTCTC 

CCACACTGGAAGGTTTTTCAGGGGGTTTATCrCCCTCTTCTCTGAGCT 
4 5 CCCACAGTTCTCTGTTAAATTGTGTTATTATATTTAAACAAATCAATTCCA 

AGCACAAATGCCATTGATTTCTCACAGCCCTGAAAACCAGTTGATGAATAGCACCAGGC 

CTCTGCATGGCCCCCTCTGCTTCTCCCCTGTGCTC^ 

TTCGGGCTGAAGTCATGTGATAAAGGGAGACCCTCTG 

TGGAAC AAT CAT AG TGG AG AT TTGT ATTG ATTT TGGCCAAAG G CACAACT T ACAG AGCAGAGAAG TGG TG C GGTG T G AT CTT TGTC 
50 TTCCCTTACAAGTTCAGAGCGCGTTCCTGAGAGTGATGGCAGATAGCAGCTC^ 

AAGACCTTTGAGCACTGCTGTTTTAAAGGGAGGCTCCAGAGGTTTGAGAGCTGTCT^ 
AAATCCTTCAGAACCTGCAGTGTTCAAAGTCACGGTTAAACC 

CTTCTCTGAGCTTTCTAATGACCCCGGGGCTTGGGTGGGCAGGTAATATTGTTCCCC^ 
GTCACTGGAGAGAGGGAAGGGCTGCTTGTCC^CTCCAAGTCCTGAGTGCAG 
55 CATGGCAGGGGAGCTACTCTGATCACCAGCCTrTTCGGCCGGGCTTTCTGTCTGTCTGAGC 
CTCAGCTGCTGCCTGCAGCATACCACACAAGCCTGTGTATTGGGCIX3GAGTGAGGCA 
GGAGAGTGCCTGCCCCACCTAAATTCCTAAAAGTTTGGACAACCCATGCCCGTAAAAGTATACTACAT 
AAATTTGCTGCCTCTCCCCTATTCTAATTCA^ 



6 0 CAGGCTTGGCGAGGCAGTGGGGTGGGGGCAGGGCCAGGGACTGTGTT CTAGGGAAGATGGGCCCCTCACC CCTGAGTTCAGGGGTA 

GCTCAG^GGCCCCACTTCCCITTTTTCTCCCTTCCTCCTTCCCTCCC^ 
TCCTGCTCCGATGAGCTGGACCAGCTGCCTCrTAATGACCT 
TAAGAAAAGAACGGTCGGTGGAAAGAGGCTGTCCACTCCCTTGATTCAGACCT 
AGAATAGCATCCAAGACCCTGCTTGCTTCTCCAGCACCCACC 
65 CCTCCCTGCCTTTTCACATGCAGGTCCTTTGGCTACTTCCCACCTTCTCAGAAAACTCC^ 
CCTCTTCCTCAGGACTGCITCCTCTGGGTTCCTGCCTCTGTACTCTC^ 

CTGTGTTGGCCTCCATTATTCCTGGATCTGGTTTCTCCGGAGTTCCGGGAGGAAAGGAAGAATATT 

GG GTACAG CACAG AG AAGTGC TCAGTG AAAGT AT AGGGC CAG CAAAGAAGAG AG AAGT AG GG ACCAGGG AAGAG AAAG AG ACAG AC 
AAGAGTATAAAAACAGCTGCTCCC^GCCCCTCATACTGCCCTAGCCTGCTTCCTGCCAGGGAG 

7 0 GGCCTGAAATAGACACATGGGGACCTTGGTAGTGCATTTTCATAGCTCAAGAC 

TGAAGCTCCCT CTCCTCCCAGGGCCTTGCCTCTG CAGCCJAAGACCCCTCCTCCAGCAGTG CCCAGACCTCACCCCTCTGTTTCTGT 
T AACTTTG TTT TATAAArAAATCATATAAATAGG CT CCT CTG ATT C CTCGGAG CATG CAGTGTTTAAGGACTGGGGCATTG C CG CA 
GG CATAATTGTAC TAC CGAAT CCATCAG CTGCCT AATTACCATT CATTTTGCATTTATTT CTGCTAAT TAAAAAGAG AGAAACTCC 
TAACAACTTTTACATAAATACTATATTTGTTCTATGTAAAGATGGATGCT 
7 5 GCTGTCACACAGGATGGCCCTCTCTCCCATTTGTACCCC^ 
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GGAGGCCCAC CTGGGGGGTT CGAGGGCCTGTCTGAAGCCCGGGGTCAC CAGGAGCAC CAGATGC CACTGGGGCT CAGGGCGGGGTA 
GGGGGAAGGTGCTGCAGTC TGATTGGTGGAGCTGGG AGGAAC CT GTGGCTCACTCAC CG CTT CTGTATAGATGGAGAAACTGAGGC 
CCAGAGAGGCCTAGCACTTCCCAGCAAAACAGTGACCAGGGACAGAGTCAGACTGCTA^ 
ACTGTTGTATGCTGTTGTCAGGAGGAATTTGCAGG^ 
5 TGCAGAGGAAAGGAGCACTTAGCCC^TGGCCCAAACrrcAACC 
GGGCAGCCCTGGGCTTTGCCCCACGGCACCCTGACrrGCAOT 

gccttgcaccctgtgggctatccggacacatgtgagggtgtcaaattttaacact 

aatat aattaatacatcaaacccatgattt cacagat catagtgcttttgatgaggctaagtaaaaaaaaaagtaattgg ttt aaa 
gtgatataaataattaaatagtttttttttttgagatggagtttc^ 

ttttgtatttttaatagagaagt^ 

cctccaaagtgctaggattacaggcatgatccactgcacctggttgatacatttttaagtacataaag^ 
acacggcttacagtacctaataaggtgggttatttgataact^ 

ggtcagggtcagc^gtgacctgggggaaggtgggccaggtctgcagccaccactgccccatccca 
15 cagattgtgcccttctctccccagcccrgcagttggtcagaattagcccacaagagagcagtca 
cactgggactctgttgggggaggccaggaggttggacattaccccatt^ 

tggacatgcggtgcaccrgtttcccctggcattgataggtgagacactgactggcccctgttctctttct 
aatgcccagttctcgagttaaagggataacagctttgacaaggccaggtgttgtct 
gcgtggcccattggcccattggagatccccaaggctttggggtccaagggccagggttccaggcctggctgctg 
2 0 tcagtctcctcctctataagatggggacagtt 

gtacagccggcagtgagcccagttccaggggccctctcggcc^^ 
cctctcatcagcccctttatcgccctgtacacaatcaggggaagctgggaaggccgct 
gagaaaacaaaggcactgctcctaggggggcaaaaggc^ 
atctgccccacccctccagaaggatagtctcagaactcatgtaggtcaccacrrct 

2 5 ggccaccttttcttatctgtaaaatgagagaattggacaacaggccatgtgacttttggcaaaccttcccc 

agggtctccttggggcaggcccagctcgtctctaggaccccccagcctgtcccaggaa^ 
gcaaggaccccagccctgccttgggcagggtgaggtaaacaggaggcccctgtttcctgccgag 
cagtgcatctgtgtggaatgaatgaattatttcatcagaagggggcctactcagagtcagccctaggttg 
ctctcagggaaggagacccttctggtcacttgagggaggtggagagaa 

3 0 ctgggttcacatcttggttgtgtgaccrtggactggtttcttaaccttctgtttcctcatct 

ctcataggttgtggtaagattaagtggattaatttacataaagtgct^ 

t catcact cttatt atg attag agggaggctt acaggg aag cacatg aagatt ag gcat cctggcctct cctggg cacag c cctg g 
tg aaggctgggaattgctgggagctat agagtgttctaggtgggagg gg cag cagt t tg caaacagg aacattt ct atgtaagt at 
tt ctggt aaactgtt aaacaggggc ct catgagaaaggggcatggatt c ct aagaat atag g aat ttgttgtg attt ctttt ctg a 

3 5 TTCTAAATATATATTTG TAATGTTAAATAATTTTT AAAACAGGG CCT CCAAAATTC G CAACCTG AG CTG 

TTCCTGATTGTTTCTGTTATTGTGATTGTATTTCTGTGGC^ 

CCTTGGGGTCCCAGGGACCTGTCTTCTCACCCTATGGACAGCGGTGTTATGTGCAGGCTGGCCT 
CACCCCAGATGGAGGGAGCTCTGAGGGAGCAGACAAAGGAAAATGG^TACCTCAGGGAAGGTCTCTG 
TGCAGGACACaGAGCCCTGTGCCCaGAGGCTTAGTGGCCAGCCCTGCTGCrrCCTACT 
40 CCTCTCCCTTCCCACTCCCCCTCGCTATGTGCC^ 

AACAGTCACCCCTACAGTGGTCXnTGAGTGGCTGGCTGTGC^ 

GCCCTCTGGGAGCTCTCCCGGTAACTGGGACCACAAGATTTGCA^ 

CCATCTCTCTCCGAGAGGAGACAGGAGAGCCCAGAGGTCGTGAGGGCM 

GCTGGGGGGGGGGTGCTCCTAGGGTGGGCGCrreTGGGCCCTGGCGCCTCTGCCAGCAGCAGGGCCTCT 
45 . GGACATTTATAACTCACAGCTGTGCGGTCCTGGGCCCAACTGA 

CACTCCCCATCCCGACCCCCACACrCTCCTTCACCTCCTGACACCCACATCCTGTTCTCA 

GGCCAGGGGCCCAGCCAGGGTCTCCAACCTGTGGGCAGAGTGGGAAA^ 

CCCACCATTGCCAGCCTCCTCTCAAAGCAGA.CACAGGGAGGCA.GAGG 

AGAGCGCCAAGAGCATAGGTTGTAGGGCTCAGGGGAATGTTCTGCGCCTCCAGATCTACCT 
5 0 CTGTGGACCCACTCCAGCCAAAAGGAGCCTGACCGGAGGGGCAGAGTGAACCAGAGGCCTCAGAGA^ 

TTGGGCCCTGGGCCTTGGGCTGGCCCTCCTTTGGGACACCCCGTCCCCAAGATGCTTCGTGC 

CTAGTTCAAGTGTGGGGCCAAAAATGGTCTTGCCCAGTGCTAGAAAGAG 

TCTTGCACAACGTTTGCCAAAGCTTGGCCCAGGTAAATCTGGTAGTCTACAGC 

5 5 TACATTTTTG TTTTACTAGTTACAATTTATT AAAG AGT ATCAGAAATATATAACT C CTAT CAAAT CAATTATTTCAAG G GCATG 

ATTGCTTTGGACAAGGCCAGATTTATTGGAGCTTAAAAAAAAAGT^ 
ATCTCAGCTCAGACCAAGAATCCTAAAGAGCGTGGAGGAAGAGCTGAAGTTTGG^ 

TGAGCTGGTTTTCCCGAGGTGCGTACACACATAAAATGGCTCCGAGTGTTGGTCAGG^ 

6 0 ' GAATTCAGGCAGGGGTCACAGAGTGGCTTTGGA&GTGGCTGCCGGCCGGCC^ 

GAGAGGAGGCCACGGCAAGGGCTTTGTACACCCGGCTAAAAATACCCCCGAC^ 

AGG AAAGTGATG CTGT T CTG CTTTT C CT ACT AAGAAGAGAAAG ACIAACACAAAGTCC C CAG TC CTGG AAGAAAGAAAACTT AT ATT 
TATATATTTTTCAAAATCCCACTAGGAGGAATTGGCTTGCTGGTTGGAGTGTGGCTGGG 
ATCAGGCATCCCGGGGTGTGCTTAAGTGACCCAAGAGGCCGCTGAGATTAGCrTGGCGGCAGGGACAG 
65 CGGCACACCGGCCTGCTCTCACA(^CGTCTCATCTCTCCCCTGTGTACCCAGGA 

ATG CTGCAAGGGGTGTGAGGAGAGGAGCCGCTGTTTTTCACTGAGCTG CCATACCC CG AAAGGTAACACCAGCCCCCG ATGCCCAG 

GCC^TACCATCACCTTTGCCTAAGCCTGGGCAGGGGCAAGCCCTAGC^GCCCAG 

TCAGCTGCTTCTCACAGCCTTTCCTGGGCCTGGCTGCAGGGCCrr 

70 AGTTGTGTCTGATTTGGTTGAGCCATCTTAGCAGCCTGGCTCAGTTGGAACTC^ 

ATGCTGGATTCTAAGGCTACGAGGTCCACAGTGCCATGTGGGGCTTGAGAAGGGA 

TGCTTCACTGGGGATGTGGCTGTGGTCTCCrrTGTGGCAGCATTC^ 

TTATCTGCATTTTCAGACCACGAGAACAAGGCAC^GGGAAGTTAATGGTGCCT 

ACAGCAGCTTGGCTTCATCCTATTTGCCATACCAGCTTCTATGCTTTTACTTC 
75 CTGCTTTTGAGGCCAAGCAACCACTAGTCTGTAT^ 
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TGTC CATCTCT CCAT CCATCCCTClTI'Crm'AGTCCTTCCACACT CATC CAC CCATT CATCCCTCCACCACCTAT CCGTCCTTCC 
ATCOM^ACTTACXCACCCATTCCCCCCATCCACC 

CAGGCCAGGC C CGCATTAAGGGCTGGGGTTTTTCTTTTTG'lTrT TUTITT TTTGAGG CAGGGTCT CATTCTGTCACCCAGGTTGG A 
GTGCAGTGACACGAACATGATTCACTGCAG CCTCGACCTCCCAGGCTGAAG CGATTCTC CCACCT CAG CCT CCCGTGTAG CTGGGA 
5 CTACAGGCATGCACCGCCACGCCCAGCTGATTTTTAAATTTT^ 

AGACAGAGCCAATTTAGCCCTGCCCTGAiGAGGGTGTGGTGTGCAATGG 
AGGCAGTGCTCAAAGCCTGTGTGGTGACCATGACGACACA^ 

GGGGAGGAGGAGGCGG CATTG CT CATGCATTCATTCCCCAGGTGTTGACCG AGAAA CTTT AAGTTCAGAGAT 

ATAGCAGGGAATAAAGCAGACAAAACTCCCTGCCCTTACGGCGCTTGTACTCT 
1 0 GTGAATAGGTCAGCTGACGCTAAGTTTTAAG 

CTAGCCAAGGAAAGCTTTTTTTTTTGAGACA 

CTCAGCCCCCTGAGTAGCTGGGACTACIAGGCACACA 

ATGTTGCCCAGGCTGGTCTCGAACTCCTGGACTTAAGT^ 

CACTCTCCCTGACCC^AGAAAGGCTTCTGTTAAGTTGGC 
1 5 G ACCTGT TCAAAGAAATG CAAAT AATCCAGTGTGGCTGGGAGGATGTCAGGATAGGGGGCGAAGTCAGAGAGATC^ 

ACTCCTGTAGGCGGACAAGTGTGGAGCAGAGAGATAGCTGGGAAGCTGTTGCCCACAT 

AAGGGGTAGTGG AGGAAG AGGTGGG AAGCCAT CAAAGG CTGAATATATTT CAG AGACC CAGCAGGCAGG CT CGCT AGTGAATT AGA 
GGTGGAGGATTTAAAAAGACGGAGACAGTCAGGGAAGGCCCTTGGGT^ 

GCTCAGC CAGGGACAATGG CCGGGG AG GGCTG CGTGAAGCTGAGGAGTT CTGTTT CAG ACACATTG AGTTTGAG ATGCCTTTG AGA 
2 0 CGTTCAAGTGGAAAGATTGAGGGG G CAG CTGG AAATATG AGT CGGGAAGG C1AGAGAGAGGTCCCAGACTAGAGAT ATAAATTTGGG 

AGTTGTCGTGGATTGATGGTGTTGCAAGTCGTGAGGCAGATGAGATCACTCGAGAGTGTGTGGCGCT 
AATGTGAGTCACGGGGCACCCAGCCGTTTAGGGGAGTGAGTGGATAAGGGGTCTGCAGAGGAG^ 
GGGAAGftGAAAGCAGGCTCTGAGTGGACAAGCAAGACT 

C CTACACAAAG GCTCAGAGG CCTC AAT AAAC CAGCGAT ATTTGGAGAGGACT AATGGAGAGACAGGAG AG ACCACACCGC CTT CCT 

2 5 GCCCACACTTGGCCACACAGGGGCTAGGGGCTGTGGGC^GCGCATGCCTC 

CCTGGCCAAG<^TCTTGCCCTGATGTCTGTCCACTTTCCCACTTTOT 

TCCH'CCCCCTTGTCCAAAAACGCAGGGCTCCAGGTG^ 

AGCACCTGAGCATGGTGTGGATCCTTTATCCTAAAAGATC 

TTAGGGGAG CT CTCTGCCTTTGGTGGG CAATCATATACGTGCGTGTCAGTGTTGTGTCACTTCATTCT^ 
30 CTCTTTCTCAGCCTCTACTGCAAACAT CAGCAGTGCCCCCTCCAGG CCAGGCCCCGGGCTCGGTAGGATCCCGAGGAACAAGTGTG 

CTCCTGACCTTCCAGGAGCTCCCCACCGCTCCATCTCTCCCTTGGCrrCCTGAGTGTACCTTTTTGGTGCTCAGGAGCCTGAGC 
ACTTAGAATTCCAGCTTTGCCACCCACCAGTAGTATGTCTTGGGGTGCTC 

CTCCACAGCAGAGTTGTGGGTCTTGTCAGATGCTCTTCCCCTTCAGCGTGCAGCTCTAGTCCCA 

3 5 GACCACTCrrGAGGTCAACCAGTGTTTCCTAAGCACCTGTGTTGTGC 

GGAGGGTTTCTGraCTGAGAACrGGCAGTCCAGATGGGGTGGTGGGGGG^ 

GCACITGCAGGTGCCCAGC^GAGAGAACGGCCTTGACT 

GCCGGGCAGGTGTGCAAGGTGTGATAGGGGCTGGGGGGTGGGGGTGGAGA 

AGTCCAGCCCGTCTGCAGAGTGAGTGCGGTCTGACATGGCCAGGACGTGAGAGGAGGCTC!AGGAG 

4 0 AATGGAGCCAGGCAGGGACTTGTGGACCTCCTCTCCCTCGGGAAAGTCAGGGG 

GG CAGCAGCAGCAGCTGCTCTGATGATGCATGGCTTCCGTGGC^GAGCGGTGGCAGGAGG 

GCGGAGGGGACGAGTAATCCCGTGCCTTTAGACTC^CTTTTCTTTAAGCCTGTCCCTATCC^TTCTCTCCCT 

CTTTGGTGACATCAGGCGGGACTGGACTGGACTTACCTCCATTGAACTGATGGGTAAAACAGGGTT 

GATGCCATAGCTCACAAGGGGCAGGAACTG CATCCGTCTGACTCCAGAACC CATG CTCTTATAATCCTCTACTATCTCCCAGACTG 
45 TCTTAGACTATCCAGGCTTATAGACTGCCTCCCCACTGGCCCCATTTTG 

TTGCCTAGGGCCACGCAGCAAGTCAAGOACAGCACTGGCACCAGAA CTGTTTGTGTC CCCACCCCACCCCACAAG 

CA.GGCACAGGGATCCCCCTGGTCCCTGCCCATTTCTGCCTGGAGCACCCACTGTGCTTGGTCTTCCTC 

CCGGTGGAAGGCACCAAGTTCCrCATCAGAAGGCTCCATTGTGCCTA . 
GCCTTCCCAGTGAATAGTTTCCGTGTAAAGTTAATTTGC^ 

5 0 TTTTACCGCAGTAATTC CC!ATTTGATTTGTCATCTCCTGTTAGGTATTCAATGTTCGGTGGG CGGX3CTTGCTTTCCGTCTGCCTCT 

CAGTTTTCCAAGGCCCTGACCTCCTCTCTGCTGTGTGTCTGTCT^ 
CTCATTTCCCACCTCCTCGCC^TCCCTCC^TCTATCTCCT^ 
TCCCCTTCTGTTTTCTTCTTCTCTGTCTCCCCT^ 
TCTGTATCTTCTTTCTGTATCTTCCTTTCTTCTCTCTATTCCGCT 

5 5 CTCCACTGACAAGACTCTCTCCTGCCTCCTCTGCTCACA^ 

CACCIAGAGGAGCCCTGTGCAGCCCTGAGTTGTTGGGGTGGGGGATTGGTGAAGAACACA 

AAATCCCACC CCACCCCTTGTTAACTGTGTGACCTTC CGGTTAGTTCACATTGATTTCTGTGCTGCATCCATAAAATGGGGTGATA 
ATTTTACCTGCCTCCACAGTAAAGCCCTTCCCGAGGCTTCATTCT 

6 0 AGAGTCATGGAGTTATTGTGAGAAGTCAGTGAGATAACCTATGTGAAGGGCT 

TTTTACTTTCTTAAGAGACAAGGTCTTGCTTTGTCACCCA.GGGTGGAGTGCAGTGGCT 
CTGACCCTCCCACCTCAGCCTTCTGAATACCTGGGACTATAGATGCATTCCAGCATGCCTG 
GACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACrrCCTGGGCTCA^ 
TTATAGATGTGAGCTGCCACACCCRGCCCAGGAATTA 
6 5 CTGAGATGCATCnX3CAGTGAGTTTCTG^ 

AGAGCTGGCATTTCCCGCTCCCCACTCTCCACCCCTCAGCCTT^ 

GGCGTCCTTCTGTCTTAGTGGGCCCATCTCAGGCTGTGAGGACAAGCCTGCCACCCTGGGGTAGTAGCGGAGACACAGAT 
TG CTGG AAAAGTGCTC!AAGACT CTGG CCCGCAGACCTTTCCCAAGCAACAGACC 
GACACCCCCTTCTTCTACCCTCCCTCACCCCCTAGATGCCCC^ 
70 CCAGGTGCAGCATGTTGGCAGCTCTX3CCATTTATCACA 

GGGCTTTGGGATTCTCCCTGAGTTTCGGGATTGTCCCTGT^ 



GCTTCCGCCCCGACrrCTGCTGGGCTTCCCACCCAGCAAACCCCTGGCACA^ 
TCTTCTGGGTGAAGAAGGGCAGCCGGTCACTTGTCCTTCAGAGACACAGAA 
75 TGTGCGTGTGTGTAGTGTACGTGTGTGTGTGTGTGTGTGTGTGTGTATATCCCTGTCCCTGGCCAAGGAAACCTTAGCCATCACGT 
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GGCTGGTTTCTCTCGCTATTTGTCCAAGTAAG CAGTTQ\GGCC3VGCTGCAGCX!AGGAGGCCGTGACACAGAG CCAGGGAGAAGGGC 

CACATTGGTGTGGGTGGTGGCAGCGAGGATCGCCATGGGCATC^ 

CTGGTATGGGGAGGCCTGGGCTGGGGAATGGCCTTGTTCTGATCCT 

CrTGTCAAGAAAAGG CACAGAGCITGAT AACAG AGAACT CTCTCAGCC CTAAATAGAGAGGGTGGG CAC CTT CTGACCAGGGGGAGC 
5 CGAGAAAGACAGAGTCCTCAC CAGAGAAAGGGGCACACATGGGCACAACCACCATGTGTATGG CC CCTGGAGTGTG CTTTAGAGCC 

CTGGCCTCCXjTGGCAGGAGTGTGGAAGCGA 
GACCTGGGCTAGGCAGGGATAGGGGTTGTCAGAACACAGACCTC 
TCCCATTGGATGGGCAGCATCACCTTrCATTGCAAAACCCCT 
CAGGAGGATGCAACCCCTACATTGTTGCAGGGCTCAGGAAGAGCAGTC 
1 0 TTCTAGCCAGGGAGAGTGTCCCCTTCCTCTAGGGAGGTGTGAGCCAGACCCTGGGCTTCCAA 
ACACAGAGGCCCTGGGCTC^GACACCTGGGGGCCTTCTCCTAGC 

GTGGGGCCT C CC CAGG CCTTCAT CG AGGTTTGTCAGCCTGCATACTGTGTCTTG CATAGATGACACCAT CTCTGGGGG CCTGGAGG 
AGGAGGGCTGTGGTGGCrTTTTGGGGCCATGCTTCTGA^ 
TTCCCTCTTTCCTTCCTTTGACAAACATTTCCT^ 
15 TGCATGATCACGGCaSAACTGGAATCCAGCAACATTCCAGGGTGGGCCCAGGG 
TCTTGGCTAGGAGGCAGGGCATGGCTTGACCCCACAGGCTCCCCTGGG1TCT 

ACX3GCAGAGCCCACCAGGTGCCTGAG ACAAGGGGGTGT AG AG AG CCAG CAT CAAGGC CACACAGAGCAT CATCAGACCTTGTCT CT 
GTTGTGATGAGGTCTAGGGTATTCAATCTGGAAGAGTCCCACAGGCrrCGTGTGCT 
AGGGGTGGAGCTGGGTGGCCAGGGCTGAGGGTGACGCAGATCCGCCATCATC 
2 0 TTTTTAATAAAGCTGCTGGGTCAGCATCCC 

CCCTGCTTGT^^ 

• ATCAAGCTGGGTCTGTACGTGGCCCCGCCAGGGCCTGTTTGATTGGGGTAATC^ 
AGGGTGTGGCTAAAGTCAGTTCCTGCCTAAGCTGGGCTGACCCACCTTCTCACCACT 
25 TGTGAGTTTGCCTTCATGAGGAGGCCCTTGGGCIXK:crrTTGGTA 

CCTGTCCCTGGGGTAATCTCGAGCAAGGCTGGGCCCCAGGCAGGCCCACACCC 
AAGAGGCTTTTGTCCCrGGTGGGCrGTCGGCCAGCTGGAA 

CCTGGGAGAGCATGAATGGGCAGGAGAGGGGACTGGAGAGGGCTGAGGGCCTCTGTT 
CCAACCTCTGCACTCCCACCATACCCCCAACCTGTGGACAGGTGAGCCTCC^GTTCCTGAGGCC^ 
30 CTGGAGGCCATGGGTAGCCTTGCAGCCCCTCGGCCAA 

CCAAGTCAGGGTGGGGTGtAAGGGGTCTTTCTCTAGGCGCTGTGCCT 
CCCGTGGGGTGGCACGCCTTCTGCAAGAGCrCGCCCTCCAGG 

CTCTCATCCTGGTCTCTTGGGCATG ACTTCTCCCAGGGGACACT CTGACTTGGAAAGGCAGGGATTCAAT CAGGGGCC 

CCCGCTCAGCTGTGTCTGAAGTGTGCCCTCCTGGGAAGA 
35 TGGGAGGCCTAGAGGTAAGAGGCCAAAGGCCACTAAGCAGGCCAGCGTGGCCTGCCCACTC 
CTGAGGAGGCTTGCTCGGCCCTCCCTCTCCCCTGGGCTCGGTGGGGCCAGTGGG 

CTCAGGGTGGCCTGGGCCTGAGCCAGGGATGAGGCGAAACCCAACCCTGGCCTGGCCCCGGGCCGCAGCCGTGC^ 
CTCGGCGAGTGGCraCCCGCCCACCCAAC^GGAAGCCCTGCCCAGAC^CC^^ 

TTCTCCTATTTCCCATGAGAGCTG CTG CCAGGAATCCTGCCCTCTTGAGAAAGGAAGCCCGACCTCGGG CCGCCCTGCCTCTGGCC 
40 TGCGCTGACCCCCTCCCTCGGCCTCCCCGTGGTGG 

TCTCCTCTCAGACAAAGCCTGTGACCCTTGGGGACCTAGCAGGAGAAACCTC^ 
GGCTCTTG^TCTGGGATCCTCCACCAACCTGCTGGTGCACCTGGGCAAGTCACTCAG 

GGGAGAGGCAGACGCTCTGCCCAGCAATGAGCATCCCATGGGATAAAGGGTTTGGGGTGGGTTGAATGTGTGCCCT 
CCAGGCTTGGGGGGTGATAACACAAAACCTCCCCATCCACCCCrrcTCT 

TCCCACCACCCTCCCAACCCCATCCTCTTCCrrCCCTCCTGGTC 
CCCACCTCAGTATCAGCAGGGGTAGCAGCTTAGGGAATCAACrGTGT 
GAAAGACCAGAGTCGTTGTGACATTGGATGGGTTAAGACAGATCCG^ 
GCAGCAGGGAAGACCAGGCACCTGAGCAGCCAAGCCCCTGCC^ 
5 0 CTGCTCTG AG CCTCAGTGTCTGCATCTGGCAAGTGGACCTTTCAGAG CTTAAGATTTG 

GGGACACAGTGCTGGGCCCCCrcCGGCCCTTCCCTTTGAATCAGCOT 

TAGATTTCGTAGGCAGTGGGGAGCC^CTGAAAGTTCTTAGAGGAGTAGTGAGTTCATGGGGAGAG 
GGCACACTCATCCGGTGTTGGAGCCCTGATTCA 

GGCTIGGGAGGCCTCAGTTTTCCTCCTGTAGCTGGGTGCCTGTCCGCAAGGCT 
55 GGCACCACCACCCCTCCTTCTTCCTCTGGTGCCCCTGAGTCCCAC^ 

GCCCCACTGCCTCTCCCTCCCrCCATGGCTTTCTAGCrrCTGAATGGGAGTGGACTCTC 



CGTGCTGTGOT3GCTCCTCCCGCCTGTCTGCCTCATCT 
CTCTGCTCTGTTTATAAGAGTTCAAAACGTAGGATGGAGTAGTC^GGGGA^ 
6 0 GTGACCTTGGGCAGGTCCCCTTCCTCTTCITCTGAGCCAATTTCCCCC^ 
CATGGCCCCAGATGAGATCCGAGGCAGCTTTGAAAGATAGC 

AGGCTGAGGCAAGAGGAGCACTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAACATGGCAAAACCCT 

AAAAATTAGCTGGGTGTGGTGGTGGGCGCTTGCAGTCCCAGCTACTTC 

CGAAGCTACAGTGATCAGAGATTACACCACTGCACTCC^ 

6 5 AAAAAGATAACCCACCCCAAACAAGTTAAATCAGATCTCCGGGGCGGGGCCTGCGGCCAGG 

TCAGACGKSGCTGCTTTCTCCGCCCTGCCCGGATCCCC^GGCAGCACTC^ 

TGAGTACAGCT CTACCACTTTCTAATACTGGGCAAGTG AATTCATG CCTCTGAGCCT CTGTTTCCTCATCTGTGAGATGGGGCTAA 

(^CCACCCACCTTATAGAATCTGTACCAGTGTCAGCCACCATGTT 

CCCATTCATTCTTCCAGAAAATATTCGGGGATTCCTGGACC^ 

7 0 ACCATGGGGGTCCAGCCTTTTAGGCCTTTCCAGGGCC^ 

TGAGAATCT CACCGTGT CX5CTG GACTGAGG G AACC CCTCTTTC TGAAAGAACG AATGAGACCC CAG ATT CAAGGG AG AAGAAGGGG 

TGGG CAG ATGCTGG CT CAAACAGG GCITGGCAGT AGACAGAACAGAATAAGGAC C CT CCT CCTTGT AGTC CATCACTGGGCTGGACT 
ATGTCCTCCATCCCGTGGTCCrrCCTGGCAGCCCAGGGCATTGGCAGGCATCX3TTCCT 
7 5 TCAAGGGATGGGAGGCACTTGCCTAGGT CAAGGG CTGGATT^ 
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GCTTCCCTGGGAATCTGCCCTGGTGGGACTAAAACGTGA^ 

CCTG CAGACCACCACTGG CTGTCCTTG CGGGAGAGAC CTGATGTTC CTGT AAATAGCAAGTGCT CCTGTTGCCTGGGCT CTGAAAG 
CCAGAGGTTGAACCATGCACrTGTCACCCACTAAACCTTCT 
GGGACTTGGCTAAGGCCATGTGGCTTGTGAGCCGGC^ 
5 CACCACCCGCTAGCAAGAACAGGACTTTCCTTTOT 

ACGTGTGGCCAGAATCCCAAAGGGATGCCTGTTTGGGAA 
AGAACACTC^TGCrrCAGTGGTTTGTGTGTGCCACC^CAG 
TATGATGTCAGTTCATCTAATCC1TGAGCT 
CAATATTGGACCCCCTGGGTCCTCAGTTGGACCT^ 
1 0 TTTGTGTCCCCTTCTCTGTAGGGTGGCAGGCTTGC^ 
GCCCATTTAATGCTGCAGGGCATTTAGGATTGG 
GGGCCAGGTGGTTGGC^CCAAGCAGAGTGGGCAGAGGCAAGAGG 
AACCCTGGCATTGCAAGTGGOU\GGGCCCAGGTGGGTC 

GACCrCGGTCCCTGGAAAAGGGATGGCTGTGCCCACCCTATGGGGCCTGATGAGTGGTGGGTGTG 
1 5 TTTTGCACCTGGAGGATGCTTGACACCTTAGG 

GAAGCCCAGCTGTGGGGGTCTATGAAGCCTCTCCTGAGTTGGTCTCGGACT 

GTGATACGGGGTT CAGTGCCC C CATCCCAG CAGGACC C CAAGT CT ACACTG GGCAGC^C CCTGAGCCAGTGAT AATAATGACTTA 
TGG CCC CTGTTT ACGGAGCATTTCCTGTGCACGTGCATATGGGC CTAAT CITG TT ATGTCCTT CAACAACTCTGTGAG AGTAGTG A 
CT CTTAGT ATC CCCCATTTT ACAG ATG AGG AAACTGAC CCAAGTT C CTAGC CCAGT ATCACGCAJ3CT AAGT AAGTGT CAGAG CTGA 

2 0 GGCCTCTCTGATGCATCGGAACCTAGATGCATTG^ 

AGAAATTGTTCATTTGGTGACTTGGGGAGAAGGGAGCCCC^TCCCTCTTCCTCAGG 
CCTCTTTGATTGCTTGGAGGTGTTGGCTGGGGCAGTC 
AAAGGAAAGGGAAGCCAGTCCCTGACTGGGCCTGAGTCATTTC 
GTGTGATCTTGG<rrGGGCCCTTCCCTCrCTCrrGGTCTCAGTTTGTCTOT 
25 C CGGAGG CCAGACAA 

HUMAN SEQUENCE - mRNA 
TTTTCACTGTCTGTGGACATTAAAAAAGCG^ 

GCGATCGGGCGGCCGAGCGAGCGAGCAACGCCGGCGCAGCGCGGTC 
30 GCGGATCTCX3GCGCCCTCGCTGCGCTCCTCCCGGCCCGAGCCTGCCCTACCCGGCGGTGGCGGCGG 

GCGGCGCTCGCAGCGCCCGTGATTTCGTACTACTGCTGGGGCTG CCACCTCCTCCTCCAGACG CTCTCAGCAGACTTGAGTCCTGG 
TCCTTCTGCAGAGGCCTG AG CAGGAGGAAGAGGAGGAGGCCCGTTGGCGTCGG ACCAATGCTG CAAGGGGTGTGAGGAGAGGAGCC 
GCTGTTTTTCACTGAGCTGCCATACCCCGAAAGCAGGATGGAGC^ 

GGACACTGGTGGTTTGCAGATCACTGAGGCTGGACAACGTTCATGGCTCTCGGGTAGAACCTAGTGA 

3 5 ATGGACAGGCACATCCAGCAGACC^TGACCGACTGCA 

GTCGGGTGGCAGCCCAGCAAGGCTTTGACCTGGACCTCGGCrrACAGA 

CCG AAGTCIX3CCGCCTTGTTGTCCTCCTGGTGCGAAGAGCT CGGC CGC CTGCTGCTGCTCCGACATCAGAAGAGCCG CCAGAG CGA 
TCCCCCTGGGAAACTCCCCATGCAGCCCCCTCTCAGCTCCATGAGCTCCATGAAACC 

4 0 CCTATGACTCTGTCCCTTGGCAGCAGAACACCAACCAGCCTCCCGGCTCCCTTTCCG 

ACATCCCAGAGCCAGGTCCTTGkjGAACCCTATGGCC^ 
GACCACCAGCAACCCAGGCCTCAACTCCCCACA^ 
ACATCCAGCAGAGCATGTATGCCCGGCCCAACTACCCCGG 
GCAGGC^TGGGCATCCCTCCGCACACCAGGCCGCCTGCTGACTTCACT 

4 5 AGCAGCAGCCACAGCTACAGCCACAGCCACX^ 

CGGTCTGTTCCTCTTTCCAGATGGGTCCCACCCAGGCGTATAACAGCCAATTCATGA^ 

ATGGGGGGCAGCATGAACCCCGCGAGCATGGCGGCTGGCJV^ 

CCGGCCGCCCGGCATC^CCCCTTTGGC^^ 

CTATTCAGAACATAAAGAGG CXIATACCCTGG AG AG C C CAAC TATG GAAAC CAG CAATATGGACCAAACAG C CAGT TCCCCACC CAG 
50 CCAGGCCAGTACCCAGCCCCCAACCCCCC 

CGGGCAGTACCCGCCCCCCACGGTCAACATGGGGCAGTATTA(^^ 
GCAGCTACAGTAACTACAGCCAAGGGAATGTCAACAG 

GGGAACCCCACACCCCCC^TGACCCCTGGGAGCAGCATCCCTCCATACCTGTCCCCCAGCCAAGACGTC^ 
TGACATGAAGCCAAATATGAGCGCXCTGCCACC^ 

5 5 GCGTGGTGCTGGAGCCCTTCCGCCTGGAGCACA 

CTGATGTGGAGGTCTGACCTGGAGCTGCAGTTCAAGTGCTACCACCACGA 

GCAGGTCAGCGTGAACGCCACGCCCCT(^CCATTGA 

GCCAGCCGGGCCGCAACACCATCCAGATCACCGTCA^ 

TCCGTCCGCTCTGTGCTGCAAGGACTCCTCAAGAAGCGC^ 
60 CAGCGTGGCTGCCTCCTCGGGCAACACGACCCTCA^^ 

CCATCACATTCCGGCGCATCCAGCTGCCTGCTCGAGGACACGATTGCAAGCATGTGQAGTGCTTTGATCTG 

CTGAATTGCGAGAGAGGGACCTGGAGGTGTCCTGTGTGCAATAAAACCGCTCTGCTGGAGGGCCTGGAGG 

GGGAATCCrTCAATGCCATCCAACACrrCCGAGTTTGAAGAGGTCACCATCGATCCCA 

CGGACTTACACATCAAGGACGACCCTGATGGCATCCCCTCCAAGCGGTTCAAGACCA 
65 GTCATGGAG ATGATCGCAG CXTCTGGGCCCCGGCCCGTCCCCCTATCCCCT CCCGCCT CCCCCAGGGGGCACCAACTCCAACGACTA 

C^GCAGCCAAGGCAAOVACTACCAAGGCCATGGCMCTTTGACTTCC 

TGCACGGGCCCCCCCAGCTCTCCCACCCCCCXSGACATGCCCAACAACATGGCCGCCCTC^ 

GAAACTATGCGACACGCTGGCAGCTCTGACCAGCCC 

AGGGCCTCCATTACATCACAGTGGGGCTCCTCCT 
70 ATCCACACAGCGACCTGACCTTTAACCCCTCCTCAGCCTTAGAGGGTCA 

TGACCTCCTGTCrCTATTTGAGAACAACTGAGGGCCACCCGGTCGGGGCCATCCCTCCACACTCTGCAT 
ACAC^CTTTTCCACCTGGGAGCCTGTGCCCTCAGACCGCCCCGCACCAGA 
CAGCCCTCTCAGAACAGAGGGGTAGGGAGGGTGCACCAGTGCACCA 
7 5 ACCCTTGGCTTGGGCCC^TGCCCAGCGCAGGCCTGAAGA^ 
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CGGCTTCCCGGTCCCTCCGCGTGTGCCGATTCCAGATGACC^ 
CCTCCCTGCTTCCTGGTCCAGAGCCTCCCTCCAGTGACTC 
GGAGGAGCACTGGCAGTTGGTGGCAGTGAGACCAGCCACCCACC^ 
GTCTCTCGGGGGCCAGGGGTCATCGGTTTGACCCCTGACCT 
5 GGACAAGAGTCTGTGTTTGAGAGGGGGTCTGCCATTCCTGCTTGGGGACTC 
ACGTCCCTGAGGCTCCACCTCCAAGCTCAGACAGGGCC 

TTTTAAACAGCAACAATTTAAAGCTATGAA^ 

1 0 T^CAATCAGATGGCAAGGG<^TGCGTGGAAAGGCCGGGGAG 



GGGAGGGGGGCGTGTTTCTGGGCCTOCCCCAGACACTGCCCTTGGC^ 
TCACCCAAACTCCTGCTCACTC^VAGCAAAAGCAGCCT 

GCCTCCTGGGGAGCCTGTCCTGTGTTCACrTTGTTTCAGGCTGGTCTGTGCCCC 
15 GTCCCGTCACrrGGGGTCCCATCrcTAAATTCTTTGCGCCCTTCCCGG 

GGGTGGTCCCCAGCACTCCTCrGTGGGTTTTACCGGAAAGGTGG 

GTTTTCTGTAGGAAAGCTGCCATTGCCCCGGCCCCTTT^ 

GTATGCAAATCAATTATTTTAAGAATCGCTTTTGTAAATATm 

CATGACCTGGTTATAGCCrTTGCTGGTGTTTTTAAAA 
2 0 ACAAAAAAAGCAACCAGGGCTATTTOTACAGTTGAAGGG 

CTCGCAAGTGGCCACGCCCTGTGTCAGACCATCTGGGA^ 
TCTTGCGTTGATACTTTGCCCCAGAAAGGCCTGGGATTCAT^ 

CCACGGCTTTCCAGTGCGGAGAGTCGAGATGCTCCCTGCAGCCCAGGCCCCGGGCAC CTGCAACCATCT CTGGGCTCAGCACC 

2 5 TGAGGCGGGTTTCCTGGX3TCCCCTCTCCAGCAAGCCTCCACCAGCAAGCT CTTC CCTTCCGGCTGGCTCTGAACCG 

TGCGTGGTGC CTACAG CCTGCAGT OTGGAG ACAAGCTCITCCX3GAGTGCTCTGGGAGCCAGG CCAGGGTG TGAGGG AGGTG CAG AG 

GCATCCGGGGCGGGAGCAAGCCCCAGGTTGTGACAGGTGCAGGTAGACAACGCCC^ 

ATCCTTCCCTGATCCTTTCGGACGACTACTTGGAGCCAT^ 

CAAGCATCCACCCGGCCCACAGGCATGTTTCTGCCGCCACT 

3 0 TACAGG CAAT CAC CC C CAT CTT CTT TGGTTTGAAGCTTTATCCATGT ATCATGTTC CGTGT AGC CATTTTATTTTTT AAG AAACTG C 

TAATACTTTCTCCCTAATGGAAGCCCTGATCCCCCAGAGAGCTACAGGTCTGCTCCCG^ 

AGGGCCGTGTCAACAG CAGCG ACT CAAGGGACGTGTGTACATATG TAAATGAGAAAT AGAGACGTGTCAACAG ATG CATTCATTTC 
TCTTGGAATGTGTATTGTTTTTATTTTGCGAAACAAAACAAA 

3 5 TGTTCTGTTTTTATTTT AC CT ACATGT ACT ATTT AG CTTCAGTGT ACTAGT CCTG CCACCTGTGTATTT TTAGGGTG CTATGGAAA 

TAATGAAAAGAAACGGGGATTTCAGAAGAAAATTGTA 
CACACGTACACAC!ATAAACACACCCACCAGTGCAGCCTGAAGTAACTC 
AATGCAATCATTTCATACTTTAAACTGGTCAAAAAACTAATTGTGATTTCTAGTCTTG 
CAACCTCTAATATTTATCTAATAAATATGTATTCAGATGAA^ 

4 0 AAATT ATTTGACTATTG CT AGACATTTCTAT ACTCTGTTGT AACACTGAGGT A CCATGTTAATTTTTTT CT AAAT A 

AATTGACAAAAACAAAGGTT 



45 



50 



55 



60 



65 



70 



75 



HUMAN SEQUENCE - CODING 

ATGAATTCTATGGACAGGCACATCCAGCAGACCAATGACCGACT 

CAATGCCGCCACGGAGCTGCTGGACTGGTGCGGAGACCCACGGGCCTT 

CGGTGGTCAGTCGGGTGGCAGCCCAGCAAGGCITTGACCTGGACCTCGGCTACAC^ 

CC^GAGCGATCCCCCTGGGAAACTCCCCATGCAGCCCCCTCTCA^ 
GGTCGTTCCCCTATGACTCTGTCCCTTGGCAGCAGAACAC 

G TAAC CAACACAT C CCAGAGCCAGGTCC TTGGGAAC C CTATGG C CAATG C CAACAACC CCATG AAT C CAGGCGGCAACCC CATGG C 

GTCGGGCATGACC^CCAGCAACCCAGGCCTCAACTCCCCACAGTTTGCGGGGC^ 

CTCAGCCCTACATCCAGCAGAGCATGTATGGCCGGCCCAACTACCCC^ 

AACGCCCCCGCAGGC^TGGGC^TCCCTCCGC^^ 

AGTGGCAGCAG CAGCAG CCACAG CTACAG CCACAG C CACGG C CACTGTGG CAG C C CTG CAGG AGACACAG AACAAGG AT ATAAACC 
AGTATGGACCGGTCTGTTCCTCTTTCCAGATGGGTCCCACCCAGGCGTATAACAGCCAATTCATG 
CCTGCCTCCATGGGGGGCAGCATGAACCCCGCGAGCATGGCGGCTGGCATGACGCCCTCGGGGATGAGC 
GAACCAGCCCCGGCCGCCCGGC^TCAGCCCCTTTGG^ 

AGTCCCTT CCTATTCAGAAC!ATAAAGAGGCCATACCCTGGAGAGCCCAACTATGGAAACCAG CAATATGGACCAAACAGC CAGTTC 

CCC^CCCAGCCAGGCCAGTACCCAGCCCCCAACCCCCCGAGGCCACTCACCTCCCCCAACTACCC^ 

GCCGAGCTCCGGGCAGTACCCGCCCCCCACGGTCAACATGGGGCAGTAT^ 

TCTCGGGAAGCAC^TACAGTAACTACAGCCAAGGGAATGTCAACAG^ 

CCTGTTCCAGGGAACCCCACACCCCCCATGACCCCTGGGAGCAGCATCCCT^ 

CTTCCCGCCTGACATCAAGCCAAATATGAGCGCTCTGCCACCACCC^ 

TGCGGGATGGCGTGGTGCTGGAGCCCTTCCGCCTGGAGCACAACCTGGCGGTCAGCAACCATGTGTTCCA 

CACCAGACGCTGATGTGGAGGTCTGACCTGGAGCTGC^GTTCAAGTGCTACCACCACGAGGAC 

CGCCTCGGTGCAGGTCAGCGTGAACGCCACGCCCCTCIACCAT^ 

AGCACGTGTGCCAGCCGGGCCGCAACACCATCO\GATC^CCGT 

CACCGGCCCTCCGTCCGCTCTGTGCTGCAAGGACTCCTCAAGAAGCGCC^ 

GAATTTCAGCAGCGTGGCTGCCTCCTCGGGCAAC 

TGAAGTGCCCCATC^CATTCO^GCGCATCCAGCTGCCrrGCTCGAGGACACG 



GTACATGTGGGGAATCCTGAATGCCATCCAACACTCCGAGTTTGAAGAG 

CCATCAAGTCGGACTTACACATCAAGGACGACCCTGATGGCATCCCCTCCAAGCGGTTCAAGACC^TGA 

ATGCCCAATGTC!ATGGAGATGATCGCAGCCCrGGGCCCCGGCCCGTC^ 

CAACGACTACAGCAGCCAAGGCAACAACTACCAAGGCCATGG 
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ATGACTTCATGCACGGGCCCCCCCAGCTCTCCCAC^^ 
CCCATGCAGGAAACTATGCCACACGCTGGCAGCTC^ 
CAGCCAGTCAG<X^CTCCATTACATCACAGTGGG 
CCAGC^CCATCCACACAGOSACCTGACCTTTAACCCCTCCTCAG^ 
5 CCGGAGCCTTCGCrGGATCTCCTTCCCGAACTCACAAATCCTG^ 
TAGTAACGATGACCTCCTGTCTCTATTTGAGAACAACTGA 
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Table 112 

MOUSE NOMENCLATURE 
ICSGNM Lfng 
Celera mCG14497 

5 

HUMAN NOMENCLATURE 
HGNC LFNG 
Celera hCG18436 

10 MOUSE SEQUENCE - GENOMIC 

GGAACC^GGCATGGCAGGAAAGCCACCCTGGAGAGCCCCTC 

CTGTGGGATCCCACGTCCTGTATGACTTGCCCAGTTTATCA^ 

GAGCTGTGTGTGCTGATGGTTTCCATGCTGGGGAGGGTCrcTGAGGAGGTCCTG 

GCAGGGGCTCTACCACTGAGCCACGCCCCCAGCCCCTCACT 
1 5 CCCTCATTGGGGGGGTTATAGG<2AGGGGCrCTACCACTGAGCCACGCCCCCAG 

CAAATCACCTTGAATGAAACCCACCGGGC^TAGGCTGGGCCCTCCAGACAGCAAGGAAT 
GGAGCTTCAGAAGATCACCTCCCTTCCATGTTAGCCTCGGCTGGTCT 

CCCTGAGCCACCTGGCAATGCCAACTCC^CAGTGCCTACACCTCCTGCAAGACTAAACTCTTCCT 
2 0 AAGCAAGGAAAGACAACAAAAG CTGGTCT AGG CAGTTT C CCAG C C CTTACAG CTTTCTC CATCTT AAGGCT C CAAGGACACAGACC 

TAGAGGCTCAGGTGAGCACACTGACTAACAAGGACTTGCCATATCCC^ 
GCCTGCTGTGGCTCTGTCCTCTCCTCCCAGGTGGCATCACGGCCCTGGACT 
GGTGCTACCTTCACCCATGTCTAAGCATACAACCTCT 

ATGGGTCAACAAG ATGATTCAACAGGTT AAAGTG CTTG CTGCCAAGCTTAAAGAC CTG AGTTTGATCG CTGG G ACCAC CATGG CGG 

2 5 GAGG AAAAGCC AACTCCTGCAAGTTGT CCT CTGAT CT C CACATACAT ACATGG CATG CACATACATGTATACAAGT AAAGAG ATCT 

AAAAACAACAATTTTT CTTTTTAATT CACCACATTAAG C CAAGAACAGTGC GTTCAT AC CCATGATG AGG CT AACC CATCACTCGG 
GAAATCTGAGGCAAAAGGATTGCTTCGAATTTTAGGCTAGCCTAGGGTACATCTGGAGACCT 
GCGAATGAGTTGGCATGGACAAAGTACCTGCTGTCJ\AAGCATAAG 
CTCACCTGC&ATCCTGAGGCTGGGGAGGGCGTACAGGTGGGTCCCT 

3 0 CTAGGTTTAGTGAGAGACCCTGCCTCAAAAAGTAAGGGGCTGAGGGATAGAGAAAGATACCTO 

ATGAATTC^C^VGATGATCTCTCTCTCTCTCTCTCTCrCrCrCTCTCTCTCGCTCGCTC 

TGTGATGGACG AATGGGT GAAG CAG AAGGG AGCATGGGTG AGG CAGACAG G AAAG T GAG AAGTAAGCAGAG C CCATCACTGACTCT 
CTGACGCTGTGCTGATAGCTTGTCn'CrTTGGAGGTGTGCrrT 

AGCCTGGATG AT CTCAG ATG CCATTT CTATTT CAT CCTTAGG AAGGTGAAG AAAGC TGG CCT CATGGT AAAG AC CTG ACATC CCAG 

3 5 TTACTCGAGGGATGGAGGCAGGAGGATTGCCAGTTCGGGAAACACT 

AGATGGCTCAGTGGGTAAGAGCACCCC^CTGCTCTTCCGAAGGTCCGAAGTTCAAATCCC^ 

TCCGTAACAAGATCTGACGCCCTCTTCTGGAGTGTCTGAAGAGAGCT^ 

TAAAAAAAAAAAAGTC^TAAGAGAGCTGGAAGTATGGCTCAGTGGTAGAGCCCCTGCCT 

GTG GCTCAGTGGT AG AAT AT CTACTT AGCAT ATTG AG GGTTTAAATT CAACTCCT ATT ATTTAGT ACT ACTJ^ 

4 0 AAAACAAAGAAGGAG AAG CATCAGAT C ACGTTC CTG AGG AAG AG CTT ACAGGGAG AG ACT C CATCATTCAGT AAGCAGAGT CCT AT 

TTGJVGACTCTGACTATCCAGGCCAGAGCCTCCTCCCACTGTCTCTGGGATCCT^ 
AGGCTCCAGAACCAOTGGGCTCACCCCTGCCTGCCTGCTGC^ 
TTGTGGTGGCTCTG^CGTCTCCCTGGAGGGTGTCTGGCCTCTGCAGGATC 
ACTGGGTATCGCAAATC^CATTAAGCCAGCCCTTCACCCCTCCATGCA^ 
45 CTGCTTAGGCAGGGATGCTTTTGGCTCCCGGGCAGTTGAGTTCTCCCCATC 

OTCAGTAGGTGTCCCTGACACACCTTCTTATTCCCAAGACCTTG 

CGCATGTCTGC!ATGTGTGCATGTGTTCATGTGTGCTCTGCCCCCTGA(^ 

GCTCTGCTTTAGCCATGGGCTCAGCCTTGATGGTTTCTGCTCTCATAGCACT 

5 0 TTTCTAGTGTCTAGCCATCCACTOTTTGTTTGX3TTGGTTTTGGTTTTT . 

AACTCACTCTGCAGACTAGGCTG<3CCACGAACTCAGAAATTCACTTGCCT 
CCACTGCCCGGCTTCCZATCTGTTGTTTACACATCCTCTTAC^ 
AACCATCCATTTGACATCC CAT ATGTCCCTC CATC CTCTCACCC^ 
CCAAGCTTCCTTCCATGCATCTATCCATTTATCCATC^ 
55 ACCCACCCACATCATCTACACATCCTCCCATGCAGCAGA 

TCCCATCCATCCACCTCCCACAC^TTCCATTC^CACACCCCC^ 

CACATCC^CTCACCCTCCCACC(^TCTAGCCACCCCATTCACGCACCTACAGTCTGTCTGTCCATC 

GACCCTTGAACCCTGCTTTGTGGCAGATCTGGATGAGAAGTGAAATGTTACCCGCCT 

TCTTGTTTACCAAGTAAGTAAGTAAGTAAATACCAAGAGGAGGGGTTACAAGG^ 
60 GATACCCCAAACCAGAGTGCCCCCTGAAGTTTTGCCC^GC<^CTTG<5TCAAAG^AACCAGACTCTAA 

TCAATCAAGCAACTGG^TCCATCGAACTGTGCCTGCAAAGCGC 

GAAGCCCTCAGCCTACAGAGAGTCGGGAACTCGCCCAGGACCAC^CAGAATGTG 

GAGCGGGCAGATGGGTGGATCTGCATGGGGCGGCTGTCTCTTGC^ 

AGTTGATGGGAATCAGC!TGCTGTGTTAATTCTACCCTTGTATTCCTCCATGCTTT 
65 CTGGCTGTACTGAACTTCTAGCCAACCCCTTCCCTTCCCCCCTCT 

CATTTTGATTGAGCCCCGTCTCTGCTGTATGTGTACCAGATTTAAAAAAAAAAAAAAAAAA 

ATTGTACTGCCTTGCAGTtiXSCTTTAGG&ATACATT^ 

GTGTGTATGGCACACAGGTACACIATGTGTGCCAGTAGAGACCAA^ 

TTTTGAGACAGGGCCTCTCACTGGACTGAATCAACTCAACGGGTAAGCAAGGGTAGCTGCCA^ 
7 0 . TCTAACTCCTCAGTGCTACGACTATAAGCACACA.CCCATCATGCCT 

TTTGCCAACTGAGGCTATGTCCCCTACTTTTGGTTTTTTTGAAACAGGATTTCA 
TCCTGCCTCTGCCTTTTAAGTGTTGGGACTACAGGTGTGC^^ 

GGCTGGTTCCC^GACTGCC^GCTCCAAGAGGTGGTGGGTGTGGGCTGCTTTGGATGGC^ 
TGGTCTCCATAGATCTTCCCAG^ACCCTGCATGGGAGGGCAGGCAACTGTCGGGGCCTC^ 
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GATGGCCAGTT CCAACAATTCCCAGGAGACTACAAGACT AGGC CAG GGAAGAACCTGTGT CCTCAGGACTTCGTGAGTTTGACGTC 
AGCTGGAGAGCCCTGGCrCATTCTACAGGGGTACTCAGTTTCAAC^ 

TC!AGAGTGCAAAAACTACACCTTG AGT CK3AGAGCT CTGCTCTGCCACAAAATGACAGGGGAC C CAACTAGGTCC 

CCGATGTCAGTAAAAAAGGAAAAGTTAATTTTAGTGACTTGAAATGAAAGCTGTGACT 

5 CAAGATTCAAATCCTGCTCAGTGAGTGTGGGGGAAAGATTGTATATGGGTTCTGAT^ 
GGGACCAAGGGG ACAGGTACCCAGTAG CGTCAGAAAAACAACTGTG AATGTCT 
AGTTTCTCTGGAATCTATAATATGATGGTCACATCCrGGATAGTTTCTTGTTCTCT^ 
ATTTTATTTTATCITATGTATCTAAGTA 

CCTGGAAG CTTCTAT ATAG ACCAGGCTGGCCTTG AACT CACAGAGAT CTGC CTGC CTCTGCCTC CCAAGTGCT GGGATTAAAGGTG 
1 0 TACATGACCACACTAGGCCTGATGCTCTCTCTCCCTTTAACATCATCCATG 

AAATCCC^GGACTGTAAGGAAAAGGAATGAAACTGAGGCCCAAGAACTCACAGCCTC^ 

OTGCCTAGAATCCCTCAGTGAGGGGCTGGGGGCGTGACTC^GTGGTAGAAC^ 

TGGCTCAGTGGTAGAACACCTGCCTAGAATCTCCCAATGAGGGGCTGGGGGCGTGGCT 

CCAGTGAGGGGCTGGGGGCGTGGCTCAGTGGTAGAGCCCCT 
15 GAGCCCCTGCCTAGAATCCCCCAGTGAGGTGCTGGGGGCATGGCTCAGTGGTAGAGCCCCTGCCTAGAATCCCCCAG 

GGGGGCGTGGCTCAGTGGGAGAGCCCCTGCCTAGAATCCCCCAGTGAGGGGTTGGGGGCATGGCT 

TGCACCCTAGTATTTCCCCTCCCAGAACITTTTCrGAGGACCCTAGGGGCTTCCCGAA 

ATTTTCTGTCCTGGGATGAGCGCCCTACCCTCCCATC 

CCATGAGCTCAGTGCAGGAAGCAATAGGGAGAGGCGAGCTGGT^^ 
2 0 ATTCCAATCTTCTGACAGCGAAGAAACTGAATCACTCTTGCCAGAGCCTGCAGC 

CAGGCJATGGACACTCCACCAGCCCCTCTCT^ 

TGACATGGTC CCAGG CTTGGGATG AAGG CG CACCAAG ACTGACAAC CAGCTATGACTTGGTCAGGTGGGTATGGAAGG AAGGCATA 
GGTAGTTTGTTTCCTCTCCTGCCGTCAGAGACTTGATAAATGCCCACGAAGGTATCTTCCCTTTGGTTTAAATG 

2 5 AAAGTTAGGGTGATTAGGGGACTTGAGCTATTGGGAAATGAA 

CAAGACATGTGGATACCAGGAGAGTOTGAATTCATAGGTGAC 

TGAGAGAGAGACAGAGAGAGACAGAGAGACAGAGAGACAGAGACAGAGAGACACAGAGA 

CAGG AGAGGG GTCT CAAGG AGG AGCCT T AGAC CTAAAAC ACT CGG AAG ACACAT AGAT AGTTTTTGGTGGGAAT GG ATTAG C C CCT 
GAGGCTGAGGCTGGAACTGGGGTGAGTTTATTTGGAGAGCAGTCCAGCAGATAGCA 

3 0 CAGGCTCTGGCAGGAGGAGGAGGGGTTTATAGCAGGAGGTGACGCCATGGGAGCTTTGTCACTCG 

TTAGGCATCTGCCTACCAGAGGTGGTGCCTCCTA 

CCCTCCCACCACAGAGGGGTTGCAGTGTCCCACAGGGCCCTCAGAGCAAATGGAC(^^ 
TGGGGGGTGGGGTGGGGTGGGGAGAGCCCTGGCTACACATCAATTCA^ 

GCT CAGG CTG G ACT CAAACT CACT ATTTG C CAAGGAATATTTGGTG CAG AGATG AACCG AGG CGT CATACATGCTAGGCAAACAC T 
35 XTACCAACTTTGCCAGATGAGGTACATTTCAGGC^ 

CTGCCTTTAAATGTTCAGCAATCGTCCAAGCCTGCTACATGGGCTATCATGTCTGTCrrTCATTTTGAAAA^ 
AGGTGTGCGTATGTGTATGTGCX3TGCGTGCGTGCGTGTGTATAAATGTG 
GGCAACATATTGTTGGTTCTTTCTGACCATGTGGGTCCTGGGAAATAAACTCAAATCT 
GAACCATCTGGCCAGCCCCIX3CaGTAAGTCTTTACCTGATCCAGCCTGATGCCTTCT^ 

4 0 AG AAGAGTGT AGTAGAGAAATAATT AG AAATG AT ACT CCG GT CT AAAAT ATAAACGCT AG GGAGT CGG CGACAGAAGG AAC AGCAG 

TTTATCCGTGGTGGCAGGGTGGGCTCAGACTTCCTTTGAAGGGGGACGCGGAAACTG 
GGAGAGCCGCCCCCAGCTGCGCTCCCGGAAGATGCAGCAGGCTCGGGCT 

TCCGGACCCGTAGCCTCTCACCCTACCATTAGGCGCACATTCCTGCCGCCCCCGCTCGTGCTAGGCGCAGGCAAG^ 
TGGGAAGGGGGCCGTGTGCCGGGGAGGCJGGGTCTGGGATGGCGCIACAGCTGGG^ 
4 5 GAACAGAAGGTAGCCTTTCATCCTTCGAGGAAAGAGGCTGCTTGAGTGTCTCCTCGCT 

ACCCTCCCCATTTAGGAACATGATGCCAAGACAGGCCTG CCTGGGGGAGGAGGG AAGGATGGAC CGACGGAGGGACGTAGAATTTT 
GTCACTGATACCCTTTTG CCTGJACTTCAGGTCCACTCCTAAGACATAAGTATCAGTAAGGG CAGCTCAAAATTAAT CGCTCAGGA 
CAGACTGGAGGCTACCACTTTGGCTC!AGTCAGCACCGT^ 

GCCAAGGATGTCGCCCGCGG AGGAGCAC!TGGCCCTATCACCCAGGTCCCAGCTTCGGGGG CGCGTCCTCCACGTG GAGCCGTGG CG 
50 GGCGTTGGTCAGGCTGGGTGTCTCCC^GAGCAGCTCCGGAGGGTGACTCGGCGCAGGCGGGACAGGOT 
GTGCCGTGGGTACCCCAGCTGGGTGGTTCCCAGCCCCTTGTCCGTATCCCAC^ 

CAAGGCTAGCAGCCAATTAGCG CGTGGAGGCGTGGCGGGCAGGGG AGGTTTAAGAGTTCCTGAAGCG AGAGG CGCGCAGCTGGACC 
GGGAGCCTCGGGCAGCCGGACCAGTTGGCACTGGGATAGATATTACGTGC^ 
GCCTGCTGCTGG^GCTGGTGGGCGCGCTGTTGGCTTGTCT 
55 CGCGGACGGCGCGCGCTGCGTAGCCTGGCGGGCTCCTCTGGAGGAGCT^ 
CCTCACCCGCGAGGTGCATAGCCrCTCCGAGTACTTCAG^ 

AAGTTTCACCGCKSCGCGGCTCGATCTGCTGTTCGAGACCTGG^ 
AGGGGGATCAAGTGGGGATC^CCCTCTGGTCCAACTGGTGACTG 
• 6 0 GGGAAACGATGACACTGATAAATCCCATTACAGCCCCGTGTGGCT 

CGTGTTGCCCAGGGTCACCCCAGCCACATGCrCTGTGTCTGGATGAGTCGCCTCCAG 
AAACACTGGAGTGAATGGCCATATTTGTTCCTAAACTTTTTCGAGCAAATTGGCTCAGTTCCATC 

TCCTTCTTCCTGGGTGGAAGGTACGCCCCCTTGAAAGGGAGGGAGGAAAAGGCATAGCACACCCCTGGACCAGGCAAAGTGAAAGA 
AATGGGTGCTAGTTCTCCAGACTTGCrrCAGAACCCTGACTTGAGTAG 

6 5 ATAATAAAAAG ACACACCC CACTTTG AAAATCTCC CTG G AGG CTACAACAGGACCGT TGGGG CGGG AGAG CAG AGTGCACTG CGTG 

TGTGTGTGTGTGTGTGTCTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTAGGGG^ 

ACTGACTTTCCTTCCTGACCCAGCCCCAGTGAGACTTCGCTTT^ 

TCAAAGAGGGGCCCTGAGAACCCCACCGTAGCCTCCTGAGAAT^ 

GCAGCGGGGGGCCATGGGTTTTGTTTGG CTGTGGTGCCGTTAGGATTCCCAGCGCTAGG CTGCTCTCACGCCGGCCCCTTGGGCAA 
70 GGCAACAAAGCCCCTTTCTCCAG CCGAGAAAGGCGTCGGCCTCT^GTG CTGG CGTG 

CTGGCC!ACTCAGGTTGCCTGGTGATGAGGGGGCCTCTGTGTCCTCCCC^ 

TTCTGAGAAACTCTTATTCAGGGATTTGGGACTC^ 

GCTCTCAGAAAAGGGAAATACTTTCTAGTGAGGTATTATAAAGTGGACTGTAGAACTTTGGAGGCTCT 
75 TTCTGAATAAGGGGGCCCCCTACCCAGAAAATGCTGCATTTACTG 
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CTTGTGTGCTATGATCAGGGGCTCTTGGAACCTGTAGCAAAGGGCCTGCTCCC 
GGTGAGAGAGGCTCCATTAACCTGGATTGGGGATGAATGGAAGGAAG 
AGGGTCCAGGAAGGGCAATCTTGGCACCAGGCATGAC^ 
GGTCAGGGGTACAGGCAGAGCGCCTGGCCTCTGGCCCGCTGCTCCTCC 
CG CCACAGCCACAATGG CTGGGG CAGAACTTGG CATCCTGTG TTTCTTTGAAT 



GGGGACAGACTGAGATCAGGGTCTCCCATTACTTCTTCCTGA^ 

CCAAGGAGAAAAGAAGATGGT ACT CACTT CCCAGGGACACTTGGG AGAGGAGAGTC CATCCCT AG CA.CT TGGCTGTGCACAGGCAA 

GCTGTGTTGTGCCCATTTCACAGAGTTTCCGGT^ 
1 0 CACCAGGCAGAATACCC^CACACCC 

aUUVGGTGACTTTAATCTCCGTAGCCTTGGGCTGGGTTTT^ 

ATTAAGAAAGCATTTTCCTTTTAATCCAGCAGC^ 

TGCTAGGGTGCGGGGGGGGGGGGGTGTGCTXX3ATGCAAGCGAGTCTCT 

GGAGGGGGTGCATACTl^CAGATGATATAGATCCCACATTTCATCTCCTAACCT 
1 5 ATCAGCCTTGGGTAAAGGAGTAAGGCTGCTCGAGGCATGGTGTGCCACCCTTGTAGACCT 

C^TGCTGACCTCCGTGTTTGCATGCTCACACCCAGG^ 

TTGGGGAGCCTCCTGCCrrCTGCTGTGAGCAGCCCCCA^ 

CGCTGGGCGGGCCGGCGGAGGAAGGAACATTCCTTCCCGTGTGCCG 

GGCGCCCCCCTCCTCTTCCGCAGTGCAATCCTGGCAGGGGGTGGCCCAGCGCTGGGCC^ 

2 0 AAAGCCTTCTGT AAGCGGGG AACAGTTTT C CTCT CTGAAGC CT CAC CCGGCT GCTCAG AT AC C CCAGTGGGC CTGGAG ACATGAAT 

CCTTTTCTGCTACTACCATGTTGACCGCCGTGACCTCCCAGAACCT 
CAGGAGGAAGTCCAGGCCAGAGTCGCGAGCGACCAAT^ 

GTTAACCAGGCCAGCTGTATGGAAATGACAGCCCGGCCACTCOTGCTTAACTCG 
GCTGGCXSGACAATGGCTGGTGATTCATCCTCCCTTC^ 
25 GCCCGACCCGAGTGAGTGAGGCTCaGCITCCa^AAG^ 

CAGGCTGGGCGGGCTGGCAGGCTGTGTCAATAA^GGTGTGTGCCCCGC 
TTACCIX3GCTGGTGGGGGGGGCATCCCACCTGGCTCGTGCCTCCCAGCTGCCTGT 
AGAGTGAGCCCACCTCCCC(^GC(ZAGCCAGCCCGC<^TCCAGCTGCCACTCCCCCTGGGC^ 
TGGAGAATTCAGGCAGTTCAGTGCAGGGTCGGGGAG 

3 0 TGGAAGGCTCAAGATCCTGTCTCTTCTTCCAGACGTTCATCTTCACTGATGGGGAGGAC 

TCCCAGGCTTGGGGGTGGGGATGAGGGTGGGAGGCGCTGGT^ 

CCTCGGGGCTAAGAGGCCAGTTCACTCGGCITGTTTGCTTCATGCCCCGCCCCACC^ 
AGACCCTTTCAGTCCTCTGGGGACCCTTTGAGTCCCGCTTAGCAGCAACAGGTGGC^ 
GAATGGGCAGCGGTGGCTACCCTGGGGAGGGGGCTAGGCTGGGCAGGGCTGGGGCTAAGGTTCGA 

3 5 CTTGTGCGGACCAGGCCTGCTTACGGGCCCTCCCCGTCCACAGGCAATGTGGTGCT 

GGCTCTGTCCTGCAAGATGGCTGTGGAGTATGACCGA 
ATCCGTGCCCCTTCCCTGGa^GCCCGGCAGCCCTCGACCC^ 
GTTCTGCCACGTGGATCATGACAACTACGTCAACCTCCGGGCGCTGCT 
ACATCGGCAAGCCCAGCCTGGACAGGCCCATC(2AGGCCACAGAACGGATCAGCG 

4 0 A(2ACACCCTGAGGTGTTAGGGAGGGGCTGAGCCACCTTTGCAACC^GGAG 

CTGTCCACTTTTGGTTTGCCACCGGAGGAGCTGGC!TTCT 
GTGTCCCCTGCCTAGTTGCCACTACCCCTGACAGCAGACTGTTCCT 
TCTGGCTTGGGCTATTCTCTCCCTGCCTTAGGTAGCTGTGTTTTCTG 
GACTCTGGCTCATGGCGGGGCG^GGCATAGTGGACTCCCTCTTTTTGGGGCC^ 

TCCCCGATGACTGCACCATTGGCTACATTGTAGAGGCTCrrGCTGGGTGTACCCCTCATCC 

GAGAACCTGCAGC^GGTGCCCACCACCGAGCTTCATGAGCAGGTGCGC^TGTGGCCCCCAGGCCT 

AACAGTTGAGGGACAGAAGGGAGAGGGTGCCTCCAAGGAAGGGACAATTTTTA 

TCTGAGACAGGGTTT CT CTGTGTAGCTCTGGTTGTCCAGAACTACCTTTGTAGACCAGGCTGG CCTCGAACTCAGGGATCAACCTG 
50 CCTCTGCCTCTGAGAGCTGGATTATAGGCATACTGATAC^VGGGCTTCACT 

CCCCCCATATGCTAGGAATCAGCTGTGTGTCACC!ACACCTGATTCAGAGTCGTTTAGGGGT 

TTATTTTTTTATTTACTTAGAGATGGGTCTCTTATAGCCCAGGCTAGCCC^ 

CTGATCCTCCTGACTCCACTTTCCTGGGCTAGGGTTGT^ 

GGGTCACTGTTTTAAATAGGGATATCAGAGACTCCTCATTCTCAGGTC 
55 CTCTCTAGCTCCTCCAAAACAGAAAGTCAAATAGATTCTTCTAAGTTGCr 

CCCTTATGGTCGTAGGCCCCACCATGGGCTTC!ATCCCCAGGTG^ 

GCCAGGCIAGGACTGCCACCTCTCCAGCATCATC CTGTTGC 

GAACGCAGTCCACATCAAGGGACCATTCTCTGTGGA^ 

GGTTGGGGGGGTGGCAGGAACCCCATGGTTGGGACTGCCAGAGGGCCCATGGC^ 
60 TCCCAACAGGTTCCGCTCTGTCCATTGCC!ACCTGTACCCAGACACACCCTGGT^ 

TGAAACTCTGTCCCTGGGCGCCCCTGGTATCCAAAGGGCCCAGGGACCTTTATTGTATGCCCTGGCCTTGGTGTCCGAG^ 

TCTTCAGGGAGTGGCAGGTCTGTCCCTAACTACACTCTGAGCACC^ 
CAGTGGGGGTGGGTGAGGCCTTGGGGCITCCACTCAGGGGTCCGTGGGTGCTGATTGTAG 
6 5 CTATCTGGAG CCTTCCCTGTC!AGCTGAGCTGTGCCCAGTCC]TAGGGAAGCTGATTT CCTGGACCCCTTGTGAGC 

TCTGGCCCGGGCGCTGCCTGGTGCCATTCTGGTGAACAATGCTCCCCCACCCCCTGCCCTCCRGGGTGGAGGCG 
C CAC C CTC!TG ATGCC CT ATCTTTGT CTC CCACACCAAGTGC^ C TC CAAAGTAGAGAGAAA 

GTCGCCCATAGTGGGTGTGCCTGTACIATATTGTGACAGTA 

AACTCCTCACTCCCCTACCCCTGGGCTGTATTCTGTGTTCTTTTTGCAAAGACCTTAACTAGG 

GTCTTGCTTGGGAGTTGGGGCTAAGCCACCTGTCCIATCAA 
TGAC^TAGAGAAGAAGGCAATTGGTAAGAATTCAGCAAGTTAGCCGGGCATG 

CAG AGG CAGGGGGATT TCTG AGTTCGAG GC CAG CCTGGTTT ACAGAGTG AGTT C CAGG ACAG CCAGG ACTAT AT AGAGAAACCCTG 
TCTCGAAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
75 XG£K3AGGGAAACTAGTGTTAAAATAGATGAAGTATGGCCAGCTTC 
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TGGGAATTGGAJCAGGGCTGATC^ 

AGGTGCAGTGGCTAGTGGGGGCTGGCC2U3AGACCCATT^ 
AAAGTCAGGCAGCCAGGCTGCTGAGGTGGATGACCCCCACCCCCCCTAGATCTT 
CCTCTCATCTGTCATGAGATACACCTGTCATCT 
5 AGGACTCGGATCTTTCTGAGGCTCTTCAGGGACCTITCCT 

CCTTCCCTGTGGTCTCCTTGTTTCCGCTCCrTTACAGAGAAGCA 
TGCTCTGACCTGACCCCGGACTCACGGGGCTAGGTCCTCATTACCTGCC^ 
ATTAGAGATCTGTCCTGCACTCCACCAGACATTGCTCACAGCT 
GTAACTGCTCTTGCCTTGTGGGGTGACTGGCCAGAGCCTGACCCCACCTI^ 
1 0 TGCCK3GGATTTTCAGAGTAGGGGATCCCCA 

CTTCTAGATGAGGAAGCrGACATTTGTAAGGGTGGAAAGGTCACGGTGGAGCAGTCT 



GTACAGCGGCACCAACCTCGCCGCCGGCATGGAGCTGGGGGCCCCAGCCAGGGTGAGCTTCCC^ 
GTCCTGAGCCTGGAGGGCAGAACCTCAGCACGGAGCAGGA^ 
1 5 TG AGGC TGTTGAGGTAGGAGG ATTGTGAGGGT CGTGAGAT CAGAG C CAG CATGGGTT ACCAAGTG AGGTCAAAAAAATCATCT AAG 
AAACAG AAGC CAAGTAGGGAGGGCCCAGG AAGGGTCTAGCCAGTGGCTT CAACAAC T CCCAGGT AG AGCAGG CGGAAGAGTT T CTC 
CTGAGGAGGGAAGCAAJGCTGACTGACTGACTGACTGACTGACTGACTGAOT 



GGCGTGCGCCZACCAATGACTGGCTGCCTGGCCGCCTCAGTGAGGGTCTTAGCGAGGAGCCCTCCTCTCTC 
2 0 GGTATCCTGTCCTGGGGCTTCCAAAAGTTTATAAAACGCCAGGTACACTG 

TAAAAGT 



GTCTGGGAGAGGGTGGCAGGGTAGGGTTGAAGCTTTCAACTTCCCTCTCTGAACTTCCCTCCCCTCTG 
TCCTGGAAGAGGTGCCTCACTGCTGGAGCCGGAAGGCCAAAACCACAGCGTGTGAGCAG 

2 5 AGCTTGGCACAGCCCAGACAAATG CACCCCAGGGTGGGGTTTGGAGGGACCTGGTGAGGAGAAGGCTGGGAAGGG CAGTGAGCAAA 

GC CATTTCGTCTCTC CCTGAGGCTGG ACCTCTGATGGTAGAAGTC^GAG CCCGCTGGGTGGCTTCAGCCTCT^ 

GAGGTCT CAAGAGGAAGCAGAAGGCCCTGGGCAGACAGCAATTTGTCAGACCCATTGGCCTAG CCTGTGGCACATTCTCCTAGGGC 

CACCTGACCCCAGCCTTGACCTCTCCCCCACCAAAACAACAAAAAAA 

TAAATTTTTTTTTTTAAGTTGGG G CTGG AGAG ATGGTT C CATG ATTGTTGCTCTTGCACAG CAAGG AC CCAGGTT CG ATTC C CAG C 

3 0 ACCCACAGGGCTGCCCATAACTGTCTCCAATCCTAGGGGATGTAACACCATCCTCTGGC 

TACATACAGACAAACGCCTATGCATGTAACTTATAAAAAAAGGTTAAAAATATTGGGC^ 

GAT ATAT AAG TACACTGTAC CTGAC TTCAG AC ACAC CAGAAG AGGGCGT CAGATCTCATT ACAGGTGGTTGTGAG C CAC CATGTGG 
TTGOTGGGAGTTAAACTCAGGACCTCTGGAAGAGCAGTCAGTGCTCTTAACCTCTGAGC 
TTGTGTGTCATGTGCAAGTTAGGTCACTAGACn'AACAATACGTACCTTAGCCATCTGCGTC 
35 TTTTTTATTGGTGTATTTCCACTGTACACAGTAATTT 

CACACAC ACACACATCCATGAGAAC CG ACAATATG GCT CAGT CAGT AGAG TGCTTAC TAAG CATG AACCAAG TC CTG GGTT CAAAT 
GAGTATCACATGAACATCTTTGGATACATAGAATCCAAGGCCAGCCTGGACTCCAT^ 
CTTCAGTGGTTAAGAGCACTGGCTGTTCTTCCT 
40 TGGCCTCTGCCX3GTATTGCATGCATGTGATGCAAAGAC 

AATAAGAGGGGGGGAATGTATTAAAACTAACCAGCCTTGCTTCCTGTC 

CAAGAGGCACCCCATATTGAAGGTTGCTTAGOTATTG^ 

CACCATGGTCCTTGGCTGCCGCCTGGTG^CCACAGTTGATAA<ICT 

ACCCTCCCCCAGATCTAGGGOTACAACCAGGAAGAACTGC^GCCTTTGCITTGGCGTAAGTTC 
45 CTTCTGATTTATTGGTCTTTCTCCCTTAGTAGTGCTGAAAATAGAGCCATGGGTCTCG(^ 
GAATACATCCCTGGCCTTCTAATGTCTGATTTATGCAATAAGGGAGTCAAGTAGAAGG 
GTGTGGCTCAAGCCGCAGGTGCCAGCAGAAGCATTCTCAGCM3AA 

GTGGATTCAGACTGGCAAGTACTCTCCTGATGGGGGGACCAGTGCCTCCTGCTGGACCCTCGC 
TTCACATACAGCTCCCTCACTGGGGGCCTCTAGGTAGGACC!ACTGAGCCAC!ACTCTAG 
50 CTCTCCCACTTAGCCACGCCTCCAGAGCCTCCCTGGGGAATTCTAGGCAGGGCTCTACCACTCAGGGGCTCT 

G CCCCAGCCCCTCCCTGGAG CCATGCTTT AGTACAGAGCTTTCTGATTGGTC CTTGGCTTCTATACGTCCTTCATCCCTT CCTGAC 

TCAGCAGCTAGAGCAATTCAGTCGAAATGAATACTAGATAGTGTCACTCrrACGGCTCCT 

ATT^GCAACCCTGTGTCCCTAGCCTGAGCTACTCTGTGTACAGTGT^ 

G AGAGAG AGAGAGAGAG AG AG AG AG AGAGAG CTGTTTCT TAAAATTGTTT ATTTT AAAACATG A CA 

5 5 CAGAAATT ACATGG ACAAG AACTTT AGTTATTTGCCAAG AAAAATAAAGT TC CC CACAGAAAT AAAAATCAAACC C CAAACC CAGT 

TTAAAAAATGAAAGCACAGCAACAGTAACACACAGCCATGGCCCTACA 

GACATAGT CCACTGGGGG CAG ATG CAGG CAG AGGG AC CGT AACTGGGAGGCAAT CTCACAAAATCACAT ATGGGAGGAAGGG TG CG 

GCACACCACCGCTGTGATGJW^AACAAAGAGACCCTGGTCACCTGTGGCCCACCAG 

GACCGGGTAGGGCAAGCTTTGGAGCTG^GGTCAAGAATTGCATAGGCTTGAAGGATGC 

6 0 TAAACCAGAGAGTGTTGGTTGGGGTGGGGGCCTCCAGTCCTATCAGTTTTCTCAGCCAGATCCCTCCTCCTGGCTCGGCAAGGGAT 



CAGGCAGCTGGCACTGGACTGCTCACTCCCTGGGGATGTGGGGGTGGACCACTCGGATTGAGTA 

AGGAAAACGAAATGAAGCCAACCGCAGTGGTAGGAGCCGTCTCCTCCGCAGGGGTCAGGGCGAGAAACCCCTGCTT^ 

TGCTTCCTTTCTTCAGCCTCGGTGAGTGAAACGTCACAGATGGGGGCA^ 

6 5 GTTCACAGCCACGTCCGGGCAGTX3CCCXJCCTCT 

TGCTTCTGAG&GXSCCCCAGG&CACACCCAGGC^ 

CATAGTGGGGAGGCTGGCTGCAGCTGGCICTGCTCT^ 

CTG^CTGATGTTGTCGTXSCTTAGGAGCTACAGATGCCC 

TCCCAGACACCAGCACAAAGAAAGGG'ATTTTGGGGAAGGAAAGCAACGTGGAGCCT 

7 0 GTTCTGGGATCCGTAGCTGCCAGGTT'GCTCCTUjGAGGGCCACAGGTCTGTAAACCACTGGG 

TGCTG CTCCACTGGCCCGTAGCTGCCCATCTCACAGACCAGAGCTCACACTGG CTGGCTGGGCCAGGGTAGAGGCTCCAGGTCCTG 
ATTTGGAAGCTAATTCAGCCC1AGAAGCAC!AGCAAACATCA(^ 

GGGGGCTGAGGCCACATGGACTTCAGTCCCCGGACCCCAGGACATGCTCCTCCTGGGGC!^ 
GCCCTTTGAGTAGCCTGTGG&TCTGGCCCCTCC^ 
75 TTAGGGTCACGCTTGCTGGCTCAGGGTCAGCCAGGAGTGCCCATGGCAGCT 
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GGGGGGGGCCITCTGAGGCTCGGGGCACATTCACRGCCGTCACAGGTC 
GTCCTGGGTGAGAATGAAACCGAGAGGAAAGTGGGC^TGTCCTC 
GTCAGTGCAAAAACAAAGGGGAAGACAAAGCTGCCACTCTTC^ 
CACTGAGAGGCCAGGGAGTGACCTGGGCTCAGCTTGGCACCTC 
5 GGGGAGGAGGTGTAAGAAGAGCCCCACCCTAGAGGCAGCTCAGAACCAACAG 
CAGGCGCCGGACGTTGCAAAGACCAGAGAGTATGAAGGGGCCGT^ 
TGTGAATGGAGATGGGATTCTOVGGAGTGTGGCGGTAGGGCAGTACCAGGAAAGGCT 

GAGTGCAAGGGG CAGCATG AAGGGT CACCTGTAGAAGGAACG CAGG CAGGGTGGAAGCACCCGG CACAGAGCAGGGCTCAGGGAGC 

CCAAGAGGACAGAAGGGCZAGGGGAGACGGAGGCAGGCAGGGTCA 
1 0 CCTCACTTCACAGGCTGGCCCGGAGCTCATGCTQGGCACC^ 

GCCTAGGGCTGGCCCGGTGTGGCTTGTGGAAGGTCCAGCAGGGCGGAAGCAGCGGTGOT 

GTGGGGGCAGATGGCTGGCCGGTGTGCGGTCTA^STGCCCGCTGCCGCTGGAGTCGGGGCGAGGCT 

CTCTCATGCTGGAGGTGTACTAGAGGAGAGAGGGAAGGAGGGAGCAGGGTCAGGTGAGTTCCT 

GCCTCTTTGATGTGGTCCTTGGTAGGGTGAGCCACAGCCGACCCTTGGGGAGAGCAT 
1 5 CTGGCCCAGGGCTGGACCAAGGGCTGAGTACAGGTTAG 

TGGATCACCCAGGCAAGTTCCCTTCCCAGTGAAGGACACCCAGTCCCCTO 

TTTGCCACCTCCAGTAATTCAG CTCAAAGGCTCTGAATGCATCACACACGCTGAG CACAGGAAGGCCACATGGACCCCATCT CTAT 
ACCTCTGCCTCAGAGCTGTGCTGGGGTCAGGGCA^ 

2 0 CCCTTGGATGGGAAAACTAACTAAAGGATGTGTGACTCGG 

CGCCTTAACAACAGC^CCACAGGCTCCAAGGTTTGGGCAGTGTGTCCACGTGCA 
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTCTGGCTGTCTGCCTGCTTGCCTGCCTGCCT 
TGTGCCAAGGCCAGGTAGTGAGTGTGCAAGAGTTTGCATGCCTGA^ 
TGGATCCCCACTCACCCCACCCCAGGGAGCACAGGCTGGCTCCA^ 

2 5 GTGCTTTCCTGAGACATTCTAGGTTAAGGTTGCAAGCGGCCT 

GCCATGTCATTCCATGGACATGCAGAGGTGATAAGGAACAGGGGGTTGGGATGGGACAAOT 

MOUSE SEQUENCE - mRNA 

CTGGCACTGGGATAGATATTACGTGCGGC<X3CCGGCCACCATGCTC CAGCGGTG CGGCCGGCG CCTGCTGCTGGCGCTGGTGGGCG 
30 CGCTGTTGGCTTGTCTCCTGGTGCTCACGGCCGACCCGCCACCGACTCCGATGCCCGCTGAGCGCGGACGGCGCGCGCTGCGTAGC 
CTGGCGGGCTCCTCTGGAGGAGCTCCGGCTTCAGGGTCCAGGGCGGCTGTGGATCCCGGAGTCCT 

CGCGTCCCCCCGCCGAAGTTCTGTCCCCTOSCGACGTCrT^ 

CTGCTGTTCGAGACCTGGATCTCGCGCCACAAGGAGATGACGTTCATCTTCACTGATGGGGAGGACG 

3 5 AGGCAATGTGGTGCTCACCAACTGCTCCTCGGCCCACAGCCGCCAAGCTCTGTCCTGCAAGATC 

TTGAGTCTGGGAAGAAGTGGTTCTGCCACGTGGATGATGACAACTACGTCAACCTCCGGGCGCTGCTGCGGOT 
CCCCACACCCAAGACGTGTACATCGGCAAGCCCAGCCTGGACAGGCCCATCCAGGC CACAGAACGGATCAGCGAG CACAAAGTG AG 
ACCTGTCCACTTTTGGTTTGCCACCX3GAGGAGCT 

GAGGACACTTCATGAGCACGGCAGAGCGC^TCCGGCTCCCCGATGACTGCACCATTGGCTA 

4 0 CCCCTCATCCGGAGCGGCCTCTTCCACTCCCACCTAGAGAACCTGCAGCAGGTGCC 

GAGCTATGGGATGTTTGAGAACAAGCGGAACGCAGTGCACATCAAGGGAC 

MOUSE SEQUENCE - CODING 
45 ATGCTCCAGCGGTGCGGCCGGCGCCTGCTGCTGGCGCTGGTGGGCGCGCTGTTGGCT^ 

ACCGACTCCGATGCCCGCTGAGCGCGGACXJGCGCGCGCTGCGTAGCCTGGCGGGCTCCT^ 

GGGCGGCTGTGGATCCCGGAGTCCTCACCCGCGAGGTGCATAGCCTCTCCGAGTACTTC^GTCTACTC^CCC^CGCGCGCAGAGAC 

GCGGATCCACCGCCCGGGGTCGCnTCTCGCCAGGGCXjACGGCCATCCGCGTCCCCCCGCCGAAGTTCTGTCCCCTCG 

CATCGCCGTCAAGACCACCAGAAAGTTTCACCGCGCGCGGCTCGATCTGCTGTTCGAGACCTGGATCTCGCGCCACAAGGAG 
50 CGTTCATCTTCT^CTGATGGGGAGGACGAAGCT 

CGCC^AAGCTCTGTCCTGCAAGATGGCTGTGGAGTATGACCGATTC^ 

CAACTACGTCAACCTCCGGGCGCTGCTGCX3GCTCCTGGCCAGCTATC 

ACAGGCCCATCCAGGCCACAGAACGGATCAGKZGAGCACAAAGTGAGACCTGTCCACT^ 

TGCATCAGCCGAGGGCTGGCCCTAAAGATGGGCCCATGGGCCAGTGGAGGACACTTCATC 
55 CGATGACTGCACCATTGGCTACATTGTAGAG^CTCTGCTGGGTGTACCCCrrCATCCGGAGCGGCCTCTTC 

ACCTGCAGCAGGTGCCCACCACCGAGCTTCATGAGCAGGTGACCCT 

ATCAAGGGACCATTCTCTGTGGAAGCTGACCCATCCAGGTTCCGCTCTC 

TCGCTCCG CCATCTTCTAG 

60 HUMAN SEQUENCE - GENOMIC 

GGTGGAGGCTGGAGTGGGGGGCGCIAGCTGAGACCAGAAAGAAA 
GGACTCTCCACCCAGAGAGACGCCATAGGCCAGGTTGTTGGGGAC^^ 
CCAGCCAGCTGCCCACCCCTTGGCCTGGGCATGTCCAGCTTTTGCTCAG 
GAGAAGCAGGGCTGGCCAAAAGACCGCCATGCCTGAAGGTTC^ 

6 5 CTTGGAGCCTCAGCGCCTCAGAACrrrTGGGACTTCCTGGCATATGGGAGGCACCCGGAGG 

GGGTGGCTGGAGACACCTGCTGCTGGCCATGTGGGCCTGGGAGGCCTTCTGGGGCCTCCATCTTCCCATCTGTGAAATC 
GACCTAGATGCTCTTCAAGGTCCCTTCTTCCAAGTCTTCTGGC^ 

GGACTGAGTACTCTGTGGGTGCCGGGCTGTGCTGAGGGCTGGACGGGACCCAGGAACAGACAACAGGTA^ 
GCGAGGAGACTTCCCACAGGGGTGGAGACGCTGAGCACTCCAGGCIAGACA 

7 0 GCAGGGCCTGGGGGGTT CGGCAGGCAG AGTGGG CCAGGGGGTGK3CCTGAGGACX3CCAGGGGCAGAGCCAGC^ 

GTGGGGAGCCAAGGGAGGGTGGCAGAGCCGAG^CCCATCTCTAAGATATGGGGACAGCCCTCAGA 
GCTCAGGACAAGCAGTGGGCATGGAACTGAGATGCAGGAAGGCCGAGGTCTGG^ 
GTGGGCGGCCCTCAGCCCTGCCCGGAGATCCCTGGTCCTACTGAGTCAGCCACCGGGGAGAAGATAAGG^ 
GGGGCCTGACGTTGCTGTTGGGTGAAGTGCCCGCCCAGAATGAC^ 
7 5 CTCCGCAGCTTCCTCTGGCAGTGGGTGGATGTATAATC^ 
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GCTGAGCCCCAGCTGGGTCCGACCTCCTGGGTACC^ 
TGATGGTGGACAGAGCCACTCCCAAGAGTGTTAGAGACCTGGGG^ 



CCGGAGCCCCCTCCAGCCrCCTGCCTCACTCT^ 
5 TGCAGCCTGGGGGGCTGTGGCAGGTGATGGGAGCAGCTGTATTTGAGGGCAGGGTCTGT 
AGGAGGGACCAGGGAGTGTAACCTGGCAGCCCGCAGCAGTTCCGGCTGCT 
AACCCCGGCTGAGCAAACAGCnrTTCCAGGCGGCC^ 

CCTGGAGG CTGTGGGGGTGGTGGGGGATGGAGGGGAC CAAGCTTTGC CAT CC CTG CCCTCAC CAAG GC CTGGAC CACAGC CTGAGC 
AGCCCAGCCCCTGCCCCTGCCGCAGCCTCCTC 
1 0 AGGGCAGCCTCACrrGTCCTGGACGCCCGGCCACTGCATCCTCTGCCCTTGTGTGACCT 
CCGGGCCrCATGGCCAGACACAAGCCCACCTTGTG 

GCCAGGGGAGGGCCCGCCATCTCTTGGATGGTTTCTGCGCCGTGGCCACT 

TTCTGTGCATGGGGGG CTGTGGGGCAG CCTCTCTGGGCCCTATAGGCCAG ACCCTACTGTCTGTTCAGCCCCAG CCCGGTGGCGAC 
AAATGAAAGTGTCTCCAAACATTCACTGTCAGGTGT^ 
1 5 GCAGATGGCGTGC^CAGAAACGGGCTGCAGAGAGAGGACGGTGCCTCCAATGCTTGGA 
CCTCTGCCGCCCTOAGAGAGACTCTGCCCCACCrCCCCACACCCGCCTCCAC 
CACGGCCCTGCCTTCTCTCTGCAAATCCTTCCCAGACTAAGCTGCAACACTGGCA^ 
GAGGCCAAGGCGGCTCCTGGCCACCCTCGCAGCTTCCTCCCTCCCTGGCCACAAGGACCCAAA 
CAGGAAGGCTCAGGCTGGACACGTATTGTATGAGGTGGGCCTC^ 

CTCACCTCCATGAATGCCCCA<^CACCTGTGCC^ 

CTCC^CCTGCTTCACACACCCCTGCC^ 

ATCTTTTCCTGGAAGACTTTCCAGAAGTCCCCTGCCT 

TTTGGGCTGGACCCATTTTTCTAACCCCTGTTGAATCCTCACCTACTACCTTC 

2 5 ATGCTCAGGAAGGCTGTGGGATGAGCACATGAAATGGATGGAAGGATGGACGGACAGATGGACAGAT 

TGGATGAGTGGAGCCCAACACCAGCTCTCAGGTCCT 
CCTCCCCTTCTTCTGCTTCCAGAGCAGCCTGGAATATCCTTTGATGCT 

CCCATGGTAGCGATGGGGAGCCTGGGATCAGGGAGGTACACAAGCCCAGGATCACATAGCAAGCAG 

3 0 TAGGGTGGGGGGTCATGCCTCGGGCCTGACAGGCAACACCGTGGCCACAGCCCACCCCC 

GCCATTCTTGGCAAGACAAAGGCCGGCTGGGGCAGGGTGGCTCTGGTGCCTCCCTGGTG 
AAGTGGATCCTGGTGGCTGOSCTCTGATGCTGGGGGCT^ 
CAGAAGCGCCAGAAAGAGGCGCAGCTGCCAGCCCAGGCCTGCCGTCCT 
CrTGCTGTTCCTGCTGTGGGCTTAAGGGTGTGCCACCT 
3 5 AGACCCCCTGAGGCCTTCTGGAGGTGGTGTTTGGAAAT 

CTTCAGCACAGCCCTCCCCCAGGCCCCAGGCCCTGTGCGGGCTCAGAGGCCA 

CTATGCTCCCCAGGGCTGGGCATTTGGGAGACAGAGTCAGAGAGGCGCCTTTGAGCCCTCTGTGTATCT 
TCCTTCCATCCAACCTCTACCAACCAATGTTCATCAAGCACCT 
AT^CATCACGGTCCTGCTGTCAAGCAGCTGACA^ 
40 CTATCCATTAGACCATCTGTTGGTCCATCTGTC 
ATCCATCCATCTATCCATCCATCCATTC^ 
CCATCCATCCGTTAGTCTGTCTGTCCTT^ 

ATCCATCCATCCATTCATCCATCCATGCATCCATCTGTCCATTCATCCATGCATCC^ 
CCATCCATCCATCCATCCATCCATCCATC^ 
45 GTCTGTCCATTCACCTATCCTTCCATCCATC^ 

CCATCCATCTGTCCATTCATCTGTCTATCCATCCATCCATCCATCTGTCCATCCGTC 
GCCCATCCATCTGCCCCTCCATCTAACT^ 

CCATCTGTCCATCTGTCTATCC^TTCATTC^TCCATCCAACCATCCATCCATCTGCCAATCTAT 

GTATGT CCATT CCC C C CAC CATCAGTT CAAC CAGTGTTTACTG AGCAC CTAC TTTGTG CCAGG CCCTG ACAAGCAGCAAG AGG CAG 

5 0 CCTCTTCCCCCAGGGACACAGACGTAAAGGA 

GAGGACGTTCTAGGAGGGGTAGCCCTTGGTGGGCCCTGCAGGGTG<^ 

AGGC CAAGTCGGCTGAACTCTAATGGGC CTGG£CTCTCTGTCCATG CCTG ACCCCAGAGTGCAGGCAAAAGG^ 
GGAAAGGCCGTGGCTTCACGGCTGCAGAGCACCAACCC 
TTACAGAGTCAAGTAACGTCCCAAGACCCACAGAAGGGCACTGGCCC^ 
55 AGGCAGGGTTGGGAATGGTGGGGGTGCTACrrCACAGCTGGAGTCTGCTAGTCTCAGGGCAGTGCCCACTCCAGCTGCA 

AATCAGCTGTGCCACCCTGCGGTATCCCTCCCCCGGCCTTGGGAGCCGGCTGTGCTGAGCCCAGCCAACCTTCCCATCACC 
TCCTGCCACCAAGGGCTG CTCACAGAAGCTCCTGGGTCGGGACACT CACTAGGCCACGCCTGCCTTTTGGAAATC CATGGGCCCCC 
TGGCATCTGGGGGCTTCAAACACTGCCAGTTCTGATG CCTGGCCCCAG CCTGGGGGTTGG CATTCCTGCGTCTGTGTCAGGAGAGG 
ACAGGGAGGTGCAGGCTTGCCCGAGCCACAG CCAC CACCATGCTGAGT CCTGTCCAGGAGTTTCCATCTCTGGGCACTGCTACTGG 

6 0 CCCAGAGGAAGCTGGTTCCCAAGACACTGAGTC 

CGAGTTAATGGGTCACCCATTATGTCTCCACAGATCTTCCCA-GGACCCTGCAGGAAGGCAG^ 
CGGGGGAGGGAAGGCAGATGGCCACCTCAAAGCCAGCCCCGAG 

TGGCTTTGGGGACCTGGTGGCACTGACTCCAGCAGGAGACCCCCCGGGCTGAGCCCTGGCAGTTA^ 

GGGACAGAACACTGTCTCAGCCTGATGGCGGCTAGAGGAGGAGGGTAGGGCCCCTGGTCATGTGGGTCCTGGCTCCGTGGG 
65 TTTCCACCTGACCTCAGCAGGCAGGTCTCTCGGGCCAT^ 

CCCTGCAGAGGGGAAGACTAGGATCCAAGAAGCCCATGAAGCCATGCGCCAGGGCCAGAGCT 
TCATCCTTGTGTTACAAGTGAGGAAACTGAGGCTCAGGGAG^ 
GGGTTCGGAAGGCAGGCTGGGCCCTGGGGCAGCCTCTATCCCCAGCCCCT 
GGCAACTCGGTCAGCCCTCTCTAGCCCCCTCCCCAGCTCCCCTTTGTCCTGT 

TGGGGAAGCGGGCX3GGGGTCGCTAGGCGGGCCZACCCCCTCAACAGTTTTCAGACCTCAA 
GACAGCAAAGAAATTGATTCACTCTTGCCAAAGC^ 

GCAGCAGCATC^GGTCCCCCACTCCTCCCATTCCTGGCTACAGAGAGACGCGGGAACCCC 
CTGCCAGGCTGGGCCATCCACCTCTGTATGGTC7VCTCCCTT 

7 5 CACGACCTGCTGGGTCAAAGGCACCGCAGTCAGGGGTAG CTGGGGAG CAGGAGTTGCGGGGGTGGGGCGG CTCAGGGCAGGAAGCC 
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CCATCCCCTGGCCTIX3GGAACTGTGGGGAGACGGACT 

TGCCACTGATGTAAAGTCCAGGTACAGAGCACAACCCGTGCACATATTTGGGGGCTTGGCC^ 
GGGATTGGGCCTOU^GAAACGGGGCTGGGACTAAGTCACGA^ 

T ATGG AGGGTTTTGCT CTTTGTGTGGGTGTGTGAGTGGCTATG AT CCT AGTTGT CCAGCTGCTGCGTGG AGAATAAGTAAAGCAGG 
5 TAGGATAGGAGTGGGTGGGATGGGGGTGGAjGGCTGCCCAAGTGATTCAG 

AAACAATTCAGATATGGGAGAGCOTGAAGTCAGCGGTGTGACTGAAGCACGTC 
GCTGGGCTGGGACTCAGCTGACTTTGTGGGGGCAGCCAACCCAGTC^ 
AGCTGCATGGAGGCTCCAGGGCTAGGCTCTGrGCTGATCGGAAJGGG 
GGCACAGCCAGCACCATCTGCTGGGGCATCTGCTTTCGGAGGTGGTGACTGCT 
1 0 CCCATCCCCCACACCACCCAGCGGGTACAGGG 

CGGGGGGAGAGGGTCTGACAGTGGGGC CAGACATTTATCCAGTGAATTTTTT CAGAG CAC CTACTATGT CCCAAAACCTGTTCCAG 

GTCTGAATGACACCAGGTCGCTGCTGTCTTGCGGCTGACATTCAGGTAGAGGTAAGGAGAG 

TAAAATGCCATCTGAGGCGTGAACATAACTCCGAGG 

ACCCTGCTTAGGAGGGCACGGGAAGACAGCGCAGCCACGGGCAGAGCCGCCTCCGT 
1 5 GGCTCACACCCTGCTCAGGCGACCCTTGGCCACTTCCGGGCCCCTCGCCCCTCACCCT 

CCGCTCGTGCGAGGGGCGGGCCCGGGCCCGTGGGAAGGGCCGGGGATGGGGGCCGTGTGCCGGGGAGGCGGGGTGGGG 
CAGCTGGGCCGCATCTGGCCCGAGCGCAAGGAGACCAAATGGGAC^GAAGGCGGCCTTTCA 

GTTTCT CCTCGCCATGGGGCG C CTG AGG ATTTCGGGGAG C CAAC CCT CCCG ACGGAG GG C CAGGATCGAGGAAATCAG C CAGGGAG 
GAAGGAGTGGAATTTGGTTCTGGTGCTTTCTCGTTAGAACCTCGTGTCCCCACCT 
2 0 CAAATAACGGCTCCCCTGAACCGAGCGCTCGGGAAGACCCAGGCGCC^ 
AGCCGTGCTCAGGAGGGGAAGCGCACAGCGA 

CTGCCCGCTGCGCCGAGGGCGCGCCCCTCCCGTGCCGGGACGCCCGGTGCATCCCCGGACGGCGCTGGCC^ 
TCAGGGACACCGGCGCGGGCCGGGCCCGTCCCCAAGCCCCTCGGCGGCCCGCAGGGGCTACCCGGGGCGGG 
GCACGTGCGCGCCGGGGGCGGGGCCGCTGCGCCCTCTGGGCAGTTCCCAGCCCCCAGCCGCGGCCCCACOT 

2 5 GAGCGGCXIACCGGCGCGAGGCCCCGCCCCCGAGCCCGGGAGCCAATGAGCGCACGGGGGCGGGACCGGGGCGGGGAGG 

CGCGGGAGACTGGTGCTGCGCTGCTGGACTGCGCCGCCGGAGCGACGGGCTTC^ 

GCGCCGOSCGGCC^CCACCCCACCATGCTCAAGCGCTGC^GCCGGCGCCT^ 

GCTGGTGCTCACOSCCGACCCGCCGCCGCCTCCACTGCC^^ 

30 AGCCTGCTCACCCGCGCGCGCAGAGATGCGGGCCXIGCCGCCCGGGGCTGCCCCCCGCCCCGCCGACGGCCACCCG^ 
CGAGCCGCTCGCGCCCCGAGACGTCTTCATCGCTGTCAAGACCACC^ 
CCTGCaTCTCGCGCCACAAGGAGATGGTGAGCCCCCCGCGGCCT 
GCTGGTGGCAGTGTCCCATGGGAGTCAGGCTGCATCCCC^TCCAGCCACT 
CACCCAGTTTGCCTGCTGGGGCCACTCTCCCGTTACAGTTGTGTTACTGTCCCCGGGGCCCCG 

3 5 TGGATGGCTGCAGGACCCTACACCAGTCTCCAGTGCTTGGGGTTGGTGCCCTGTGATA 

TTCATAGGGCATTATGTCCTCAAACTTCC1ACAAGCAAATTCT 

CTCTTCCTTTAGAGAAGTGAGGGGTTCTGACCTCTGGGCCCriTTCCTCATCACAGTGTTAAGGTACG 

GCTGGTTGGTGGGTCCTGGGGAGTACATCTCAAAGAACTCCAGGTTTTGGG 

TCTCTGAGGCCTGCAACCTTGGGCACAGCCCACTTCAGTTGGGCATCAGCTTCTTACA 

4 0 GAACATTTCCCTCGAGGCCAC^GCAGGACGGTTGGGC 

GGTGTCTCTCGGCTGGGAGATGCAGAGGCATTTGCTGTGAGGGTGGGGGCTCAGGGAAGACCCTG 
ACCCCGGAGACCCAGGGCCTTCTGACCTGGCGCTGTCCTGTTTTGGGGCACTGTGGGTGGG^ 

AGCGTCCCATTCCCAGGGGCTCAGCGGGCTGGCGGGAGGGGGCGGCGGGGGCCGTGGGTTTTGTTTC 
45 TCCC!AGCGCCGGGCGGCTCTCACGCCGGCCCCTTGGGCA^ 

TGTGCAGCTACGCGCTCCGTGGGAACCGCTGCTGGCGTGCTGGCC^CTCAGGTT 
AGCATCTCTTACTGACAGTCCTTTTGG 

TCTTGGGCATCCTTCAGCCCCACTCTGGGCrrACCACCAGCTCTGAGCACCTC 
GGGATGTGGTCAGAGGGCTGGGCTGTGTGATCCTGAGCAAGACCTGGGGCCTCTCAGACCCT 

5 0 ACATGGTCCTTCC!AGGAGCACGTTGTAAGAGACTGGCAAAGCCTGCAG 

GGGGCCATCACTCCCTTGAGCCCTGAGGTCCCTACrGGAGGGT^CCCCACC^GGGGACAGGTTGAATCTC 
CACCAGCACACAGCAAGCAAAGACCCCCTTGTGGTGGGTGATGGGCAGGGGTCCTGGAA^ 

AGCCCTGCACGGAATGACTCAGACATGGTCTGATGGGGGAGATAGAAAGATGTCAGGGCTGTGCGGGGGGAAAACCCCGA^ 
GTGTGAGGCAACAGTGGG CATCTGAC CCACATG CTGGG GG AAGGGAGGG AG G AGGGAAGACTT CCTG G AGGAGG CG C CATTTG AGC 
5 5 TAGACTAAGAGTAAAAAAGTGCCCAGAAAGGGCACTC 



TGGCCTTTGGCCCAGAGGTTCCCCGCCCCTCCCAGGAGCAGCCAGGCTGCC^ 
AACTTGGCATCCTGTGTTTCTTTGAATCGCGGCTCCAGGAACA^ 
CCCTC&TCC^CCCGGCTCTTCCTCCTGC^CCCCGTCTTCGTCTG^ 
60 AGGGACTGGGCTGGTCTAGCACCGCCTCGGCCCCAGGGTATGACCTCAGGCT(^ 
CTTGGGCCTCAGCTGGTAATGGCACCCACCrrCCGAGGGCTGTCAGGGAGAGGAA^ 

GCCCAGGGGAGCTCTGTTATGCCCATTTTACTGAAGACGAGACTGAGGCrrCTGAGCGGCAGGTGAGGGCTATAGCCCAGC^ 
TTTTTCCTCCCCTCCrrGGTGGCTCAGXSTGTCGGGAGACAGTTCGCCCGGCACCCCCGCACCCCATA^ 
GTTCTAGAGGCCTCCCTGCTCCCTGAGCTrGAAClAAAGGTGACTTTAATCTCCGTAGCCTTGGG 
65 ATCTCGGGAGAACAGGCCTCCCTTGATCTCATTAAGAAAGCATTTTCCTTTTAACCCAGCGGCGCAGTGGAGCGTCTGCAGGGGCT 
GGCCCTCCGCCTCCCACGGCATTCCTGGGCTGAGATTCTGG<3GAGGCCM^ 

G<3GTGTCTATGGGTGCTGCAGGAGGGGTGCATGCATGGAGAGGCGGAGACCCAG CCTG CCACCCCCTTAG CTCTGGGCCTCACCCT 
AGTGAGCATGTGCCCAGGGCTGGGTC^GGCTGGATAAGGATCTGA^ 

70 GCCATGCCAGGGTGCGCCGGGCGGCACGCGCTGGGCGGGCCGGGCAGAGGAAGGAACATTCCT 
CGG£CGCCG£CGACTCCTGGCCCCGCCCTGGCGGCCCCCTCCTC 

CAGGTGGGCTGTGGCCTTGGGCAAGCCTGGGTTTCCTTGCCTGAGCAGTGGGAGGGGTGCTGGGCTGC^ 
TGACCTGGGGGCTGGGGACACCTGAGGTTTCCTGCCACAACCCTGCn^ACCGTTGGCAACT 
CTrCAGAGTGGGTAGCTTGTCCTGCAGGTCACCCAGGACAGGGGTGCTGGCCCAA 
75 GCTGCACCCTACGCGCGGGACCTCATGCCCAG 



2297 



WO 03/008583 



PCT7US01/51291 



CTTAACTCGGCCGAGTTAAGGCCGGCCCGGCGCCCCTCTCCC 

CTCCCTCCAGGCCCTTTGTCCGCCAGGGCTTGGCAGCCGCCCAGCGA 

TCCCACAGCCGACCTCCTCCGCCTGTCCCTCCCGCCACCCCGCCTC 

GCTGTGTCAATAAGGGTGTGTGCCCTGCCAGGTGCACGGG^ 
5 TGGCTCGTACCTCACCGCGCCTGGACGACCTTGGGTGCAGCT^ 

AGTGAATGCGTGAGCTCTGGCCCAGATGGCACTGAGAXGGGGCACT 

GAAGGGTGGGTGTCAGTGTGACGCAGCTGCAGCTGCAGC^ 

GATGAAGGGCTGCCTGCTGAAGGCCGATTTTCTCOT 

ACGGGTGAGCCCTGGACTTGGGGCGGGAGGGGGCCCAGGCCTC 
10 GGGCCGAGAGGTCACCAAGGGCAGGACAGGGAGGG 

CAATTCTCMGCAAATGAAGCCCATTCAGCCCCCCGGGTCC 

GCTGCCTGAATGGGCGGGGGCGGGGCCGCCCTGGAAGGGGCTrc 

GGGA.GGCTACGGCCGCCGGGCCCAGCTTGAACrAGGGGGCTTTTCTCGAGCCCCTGGG 

TCTCI^GAGCCXK5CTC^^CCTACTCACAGCCGCTCCCCTGTC^ 
15 CCGCCAGGCGCTGTCCTGCAAGATGGCCGTGGAGTATGACCGCTTC^ 

CCATCTCCCTGCCCGAGCCTGGCAGCGCCCGCCCCGGCATGCCCT 

TGGACGATGACA&CTACGTCAACCTGCGGGCCCrGCT 

CCCAGCCTGGACAGGCCCATCCAGGCCATGGAGCGGGT 

CACGCX5GAGCGCACACCCX3GAGTGGGGGTGGGACAGTGAGGGGCAGCAGCGCCTC 
2 0 GTCCTGTCCACTTCTGGTTTGCCACGGGCX3GCGCTGGCTTCTGCATCAGCCGT 

TGAGTGCCCrrGCACAGGTTAGGCCAGCCCGGTCCCAGGCTCCTCGCCACTGTGGGGCCT 

GTGTCCCCAGCCTCCTGTGTGGCACTGCCCACTTACTTCCTATATTCCACTTC 

CGGCTGCCCCCAAGCCTGACCTGCTCAGAGCAG^ 

CCAGATTCCCTCCACAGAGAGCCACGGAGCACAGGAGCTGTGCAGGGAGTGTGCCCTGG 

2 5 QCAGCCCAGAGCTCTCCTCAGGGCTCCTCTCCC 

CTCCCGCCTCTGCTCACTGGTCTGGGCCCTTC^ 
ATGACTGCACCATCGGCTACATCGTGGAGGCCCTC 

CTGCAGCAGGTGCCCACCTCAGAGCTCCACGAGCAGGTGCACCATCCTCCG 
GTGTGGCTGCCGAGAGGGGCGCAGTGGGGTGGGGCACTGTTCTAA^ 
30 TCACTCCTGCCATGAGCTCAAAGCTGTTTATGGCGGGTTG 
ATTTTGTCCACGCTGGQ^TGCAGCCAGCCCCA^ 
CCAGGTCCAAGGGAGGCCAGGGCAGGGCCGTTGCCT 

CCTCCCGGCATCACTCCGCCCGCTCCCCACAGGTGACGCTGAGCTACGGTATGTTTGAA 
GGGCCCTTCTCGGTGGAGGCCGACCCATCCAGGTAAGGAAA^ 

3 5 GGGCCTCGTGGAGCTGCAGCAGGGTCTCTCTAAGCG 

GGCCGCCAGTGTTGTGGGACTGCAAATGGGAGCTCAGCACCTGCCTGCC^ 
TCCGCTCCATCCACTG£CACCTGTACCCGGAC^ 

CCCTGGGCGCCCCTGGTATCCAAAGGGCCCAGGGACCCTGTTGCGCTGCCCrGGCCTCG^ 
CTGTGCGTGTGCGTGTGCGTGTGCGTGTGTGTGTGTGTGTACTGC^ 

4 0 TGGAGGGGCGGGCACCAGCGCCACTTATGTGCCTCTGCTCCGAGGGCCAGT 

GTCGGAGGCCACTCCGAGGGCAATTCTGTTAGGATTTTTGGATCT 

ATGCGTAGGTGCCTTTCTCTTCCTGCTCACCACAAGCTCTGTGCTGGGGGTACCTGTGCCCrG 

CCCCAAGTACGACTCACTGAGCCATGCTCATTGCAGGGGAGGCTGGGTCTGGGGGTGCGTGACCCAATCCCCTTCCTCCTGGCCTG 
GACGCTGTGGCTTAAGAGTAACAGCAGCCACCGCCCAGTTCCAGTGGCCCCACGAAG 
45 AACAGTGGCACCCCTCCCZAGCTCTTCTGAGTGGGGAGTCTTCCAGGCCTCCTC^^ 

GAGTCAGAGCTCAG CATTTAATCTCCT CTCCAAAGTAGAGAG CAAG TCGCC CACAG TGGGCGTGT CTG TAAATATTG TGACAGTAT 

TTTTTTACTGTGCTGTTTTTTTTGAAAGGGGATGGGTAAAAAGTAGGGTGTTCTTGTTT^ 

GGGTCTTATTTTATCTTATCTTTTCTGTGGAT 

GACTGTCTGGCACTCTGTGTATTTATGCGTTC 

5 0 GCTGGGCCCAGCATGGCTCACCTGTCCCGTGGGCTGTGT 

ATAAGGGAAAAGTTCTCAGGGAATTGAAGTGTTGTTGCTATGGTGACGTCCTT 
CTGGGGGTCCTGGTCTITGGCrGGAGGTGGGGCCTAAGCCCCAGGGCAGCCAGG 

GGCCTTGGGTGCACCGCATGGAGCAGGTAGATAGGAAACGCITAGAGACAACTATTTTGGAACGAAACCTG^ 
CCTGCTGGTGACTACAGGTGCGGCCCACCCAGGGTCAGGGTGGTC^ 
55 GGATGCTGGGGGCTCTAGGTCCCAGCCAAAAGCACAACCCCAAGGCTCTGGCCTGGTCTG^ 
GAGGGACAGTGACTCCTGTGACCGGGCCTGGGGCCTCCTTA^^ 

CTCACTTTG CAGGTG^jGGAAACTGAGGCACGGAGCAGCTGATGCCCAAGGTC CAGCAG CCATGCTGGTGGGCTGGGG AGGCGCTCA 

GACTCCAAGCCAGGCAGGCCGTGGGTCCGGCCGCCCCAC^ 

CGACCCCGCTCAGGG<3TGGTGGGAAGAGTCCTGGAGAGC^ 

6 0 AGATCCAGACGCTTGTTTTTTAAATACAGCAGCTCTGCGGTGCTTTT^ 

GACGGGTGCTGCAGCCTCTGCGGGGATTCCCrCCCCTCACCCTCCTGGCAGCCCCGGGGCGGCTCACTTCCT 
CCCACCCCTTCCCTGGGGCTTACACCCTTGGCCTGGCACTGCCCTGGC 

TTCCCCCGACCCCAGGGCTCCAGGTCCCCCTCCTGGTGCCTCTGCACCCCCCACTCTGCTCCAAGGAGACGACACACTGTGAAA^ 
TGCCTCTCTATGGGGGGGTGCCTTGCCCTCCCCCGTCGCTGCCCCCTCCAGAAGAGTGCTGACGTG 

6 5 CCCCTTACTCTAAGCCrGGCTGGGGGCTACCCTGGCCTTGGGGGCGGCTGGCAGGGCTGTGACCC 

TGAGTCAGCAGAGCAGCTGCCTGGGGGGTGTGGACAGTGATGGGGATGGCGG 
CCCCCGTCTGGGTGGCTGTGGTCCTCCCTGAGCACCCACCATGTGC 

ACGAGGAAGCTGAGACTCTGGGGTGGGGCGGGGGGGGTCGTGACX3GAACAGGCTGCGGGCAGAGGCTGCTGGCCTCCT 
GGCTGACTCAGAGGCTTGGGGCCTTCTGCCCTGGC^ 
70 GGGCCCCGGGGGCAGAGGTGTGGCTGGACCCAGGGGTCGGAt^ 

GTCCCCATTAATGTTCATATGTCCCTGTGTGTGGAGCGGGGTGGGAGTGGGXK3CT 
AAGTGGGGTGGGGAGGAAGCCCGGCTCCTCCCGAGGCCTCGACCT 
GGTGCTGGAGGTCAGGGGGTGTTTTGGAAGGTGCACAG^ 
C^CTGCCCTCAAAAGGGGAAGCAGGCCCAGCAGTGTCTTAGGGGAATGAGGCA 

7 5 CCCAGAAGGTGACTCACCCGGCAGGGGAGGTGAAGTC 
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GGTGCCATCACATCAGATTAGAGGTCTGCT 
AGGCCCCCTTTCCCAAACTTCTGATCATTAATGGGGCTTTCAAAA 
AGGGTGGCAGACCT(^CCGCCACCCCTGCTTCCGTTCTGCCAGGCG^ 
GGGCTGCrTAGGGTGGGGCGCCTCCCCACTCAGT^ 
5 TGCCTCTGAGCTGAAGCAGGAAGGCTTTGTC^ 

GTGCAGAGGTGGGATCCACTGGGAGGACTG<3AAGCCTG^ 
ATC^CAGACAACTAGGGCTCTCCACAGGCTCAGAGGG^ 
TGGGGTCAGAGG<3AAAGCAGGGACCAGGGCCAGATCCAT 
CATGTGCCCACACAGCCTC^GCCTTCTTAGATATTCrrGCAGGGTGACCCCCT 
1 0 ATTGGACACACAGGAAGGCCCTTGGGGCCAGGCGTGG 

CTCCC^CCCCAC^TTTAGGCTTCAGGCCCCACAGCTGCCTC^ 

CCTGTTCAGCGTCAGCCTTCX5CCCAGGAGTGCAGGCT 

AATGTCCAGCCCACCGTGGGCACTCAAa3CAGG03GCTGTGCT 

CACC CTGCACCGGCCCACGCGGCCATCACACTCGGAGCACACG CGCTCCGATGTGG CTGCACTGTTGGCACACGCACAGGCGCGCA 
15 CGTGGGCGCATACACGCCGGGGCGCCCTGACCTCCCCGCACX3CCCTCCCGCAGCCX3CCCCGGCCT 
ACGGCTGGGTGTGCTCGCTCACGCCGTCCCGCX3ACGTTGCTCCAGCCGGCGCCCGGAGCT 
CTCTGCCTGCCAGCGTTTGCCAAGCGGTTrCCATGGTGAC^ 
CCACGGAGGGGGCGAGGCGAGGCAGGAGGTCTCTGCGTCTTC^ 

2 0 GTCAGTCATTCAACAAACTTTCTGGAAGAGCCACTCTGTGCTC^ 
GGCTCGGAGTCGAATCCTGAGGCTAAACTGACGTCCCGC 
GCACCAAGGCCCACCGAGCACnTTTCTCAAAGGGGGGCT 

TCCCCTGAATCACCACGCCCTTTGCTGTGTGACTTTG CCCCAACTCCTGTCTAGAGGTGG CGTCTACTTCCCCATCCTTCAATCTG 
AGCCTGTCTTATGACTTGCTGTG<3CCAATAAAAGGTGACA^ 

2 5 GCCTCTGCCTGCTCTCCCCGACTCCTGCTATAGCACGAGAACGGGCTGGGCTGCTTTGCTGGAAG 

GGTCACCCTAGCTGAGGCCATCCTGGACrAGCTAGCTCCCAGCTGCCCACA^ 

CTAG CAATG AG CAGCAGTACCACCAGG CCCTTT AAT ACTAAGCACA CTATTTGTTACTCAGCACT 
AGCTACCTGATACCCCTCTCrCTAGGGCCCTGGCCAGCAGCTGCCCAAC 

GGGGTGCTAAG AACCAGGC CATGGTGACAGCTGG ATT C CG CTGGCT AACAGTGC CAGAGG GTG C CGGGG ACAGAG CT TGTGGGAGG 

3 0 GGGTTCCCATGTCAAGTCCATGGCAGGAGCTACTGAAGCAGAAGCACGAGTCT 

GTGAAATGGGCCCATCACCCTGTCCTGTCTCCCGGGGACCCCTGGTAAGGTAGCGTGTGTGGCA 

GCCAGCCCCGGGTTGCAACCTAGGGAGAGAGGCCCATTGCTGCCT^ 

GCTTACAGACTGGAAACAGTGGG<3AGAGGATGA^ 

3 5 TTTTTTACATTTGACTATTTTCTTGTTTATTGGTATTTCTTTGGGAGATTTTTTTTTTTTT 

CTTTTTTTTTTTATTTTTTTTTTTTTTATTGATCATTCTTGGGTGTTTCTCGCAGAGGGGGATTTGG 
TGGAGGGAAGGTCAGCAGATAAACAAGTGAACAAAG^ 

i GTCC CTGGGT ACTTG AG ATTAGGGAGTGGTG ATGACT CTTAACG AG CATG CTGC CTTCAAG CATC TG TTTAACAAAG CACAT CTTG 
' CACCGC CCTT AATC CATTT AACCCTGAGTGG ACACAG CACAT GTTTCAGAG AGCACGGGG TTGGGG AT AAG GTG ACAGATCAACAG 
40 GATCCCAAGGCAGAAGAATTTTTCTTAGTACAGAACAAAATGAAAAGTCTCCCATGTCrAC^ 

ATCCGATTTCTCAATTTTTTCCCCACCCTTCCCGCCTTTCTATT^ 

CGCTG<3GCACACCrCCCAGACGG<MTCGTGGCCGGGCAGAGGG 

CACCTCCCGGACGGGGCGGCTGGCCGGGCAGAGG^CTCCTCACTTCCCAG 

GGATGGGGCGGCTGGCCAGGCGG<3GGGCTGATCCCCCCACCTCCCTCCCGGAa3GGGCGGCrGGCT 
45 CACCTCCCTCCCGGACTGGGCGGCTGGCCGGGCGGGGG^^ 

GGGCTCCTCACTTCCCAGTAGGGGCGGCCGGGCAGAGGAGCCCCTCACCTCC^ 

CCCCCCACCTCCCTCCCGGACGGGGCGGCTGCCGGGCGGAGAC^ 

CCTCACTTCTCAGACGGCGCGGTTGCCAGGCAGAGGCTTTCCrCACTTCT 

CCAGACAGGGTTGCG<3CCCAGCAGAGGCGCTCCTCACATCCCAGACAG<Ma3G 
5 0 GGGCGG CGK3GGAAGAGGCGCTCCTCGCITCCTAGATGGGATGGCGG CCGGGAAGAGGCGCTCCTCAC TTCCTAGATGGGATGGCGG 

CCGGGC AGAGACG<rrCCTCACTTTCCAGACTGGGCAGCCAGG CAGAGAGGCT CCTCATATCCCAGACGATGGGGGGCCAGGCAGAG 

ACGCTC CTCACTTCCCACACGGGGTGGCGGCTGGG CAGAGGCTGCAATCTTGGCACTTTGGGGGGCCAAGGCG<3GCA 

TGGAGCTTGTAGCGAGCCGAGATCACGCCGCTGCACTCCAGCCTGGGCACCATTGAG 

CCCAGCACCTCGGGAGGCCGAGGCTGGCGGATCACTCGCGGTTAGGAGCTGGAG 

AGGCAGGGAGGTTXSCAGTGAGCCGAGATGGCAGCAGT^ 

TGGAGAGAGAGGGAGACGGAGAGGGAGAGGGAGAGG<KjGAGCGGGAGGGGGAGGGGGAGGGAGAGGGAGAGGGAGAGGGAGAGGGA 
GAGGCTGTGTGTCAGAGGTTTTCCrCGGAGTTTGGTTCTTGTTGCCCAGGCT 
CTCTCCCCTTTTTGGAAATATTGTACTTCCTGAGCTAACTGCACTTACGAGGTA^ 
60 TTGTTTTTTGAGACAGAGTCTCAT^CTGTCACCCAG<3^ 

GTTCAAGCGATTCTCCTACCTCAGCCTCCTGAGTAGCTGGGATGAC^ 

GTAGACAAGGTCTCACCATGTTGGCCAGGCTGGTCCTGAACTCCTGACCTCAAGTGATCCACCC 
GGATTACAGGCGTGAGCCACTGCGCCCCGGCTCGCTCCTTTTTTTTTTTTTTTGAGACAGAGTCT 
GTGCAGTGGTGTGATTTTGGCTCACTGCA\ACCTO 
65 TTACAGGTGCCCACCACCATGCCTAGCTAATT/rTTGTATTT^TAGTGGACGGGGTCT 

CCTGAG CTCGAGAGATCCGCCCACCTCGG CCTCCCAAAGTGCTGAGATTACAGGCGTGAGCCACCACGCTX^GCCCTGCAGTCTCT 
TTTCACTGGGTACTGCGGTTCIAGTGTTACTGCCTA 

GCTGAGAG CCAG G CTCTCCAGAT CC CTGTGGGAGGGC CTTGTAT CTGAGT CCAGTTTTC CT CCAG CACG AG GC C TGG CATTTAATT 
ACCTGCTATCTTGTCAGCTGTTCAGTGTTCCTGGCCTTTGGATATTTCACCC^ 
70 CCAGGCACCTAGCCTGCTGTCACCGGGAAGGGACATGTGCCCCGAGGCGGGGCTGA 
AGCCCAOSGCTGTGCCACCCGGGAGGTGCCCTTTTGTTGGAGGG 
TTGAGGGGTCAGGGGGACAGAAGTCAGGGCCAGTCCTGCCCAAAATGGGGAC 

AC CTAAAGGT CT ACACTCTTAGGGT AAGAATGAAAC G GAATTAG CC CTGCTCC CTCCACACG AAAG ACACTTGG TG CTAAGAG AAG 
CAAAAAGAAAAAGTTCTTCAGAAGTTAGGGC TACAGCTCTT CT CT AGCACAGATTTC GTGCT AG CACAGATTTGAGGTG CTGCC 
7 5 AAGAATGCAGAGAACTCCAAGCACTCAATGCGTTGTTTGTTTTGACATA 
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GTGATCACGGCTC^CTGCAGCCTCGAACTCCT 

ATACCACCATGCC CAG CT AATTAAAAACAAAAGAGAT ACGGCAGGTGGATCAC CTG AGGTT AGG AGTTTGAG AC CAGCCTGGCCAA 

ACTGGTGAGATT CTGTCTCTACT AAAAATACAAAAATT AGCCAGGCATG ATGGTGGGCG CCT GTAAT CCCAG CTACTGGGG AGGCT 

GAGGCAGAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGT 
5 GACTCCATCTCTCAACAAAAAAAAATGTGTAGAGATGGGGTC^ 

ATC CTC CT ACCTCGGC CTCCCGAAGTGTTGGAATTACAGGCGTGAG CCACTG CGCCCAGCCCCCAGGAATAAi i'i"iAACCAAAATA 

AATTGGCCGGGCGGAGTGG CTCACG CCTGTAATCAC^GCACTTTGGGA CGA 

GACCAGCCTGGCTTAACATGGTGAAACTCCGTTTCT 

CAGCTACTCGGGAGGCTGAGGCAGGAGAATCACT 
1 0 AGCTTGGGCAACAAGAGCX5AAACTCCATC^ 

ATATCCACATGCGAATGAATGACACI^ACCCrTGTCTCAGATCATTAT^ 

AATGCCATTTATTTTGAGTAGAAATAAGTCATT^ 

GGCTTCCCCAGAGGATCCACCGGGCTGGGCrGGAGCCCTGGGGCT^ 

CCTGAGCCCQU3TGGCAGACTCTGGTTCTGCTCAGTAGCAG 
1 5 TGCAGGAGGGACTGGGGACTCTTTTCCACG 

CATGGCCAGCAGAGCCTCAGGCTCCTGGTCCCCT^ 

AGGCAGTGTTGGGGCTGCCCCTGGCCrCCCG^GGCTGCTC 

TTCTGCGCCACAGGGCGGTCGCAGTCAAACAASGCACAAAAGG 

GGTGAGTGGCTGGGCTGGGGCCACCTCGGGTAGGGCCACGGCATAGGGGC^^ 
2 0 ACACGAGGGCCAGC!TCCAGGCCCTGGGCGCGGACCTCCCAGTCCAGGTCTCGGCTCG^ 

TCCGTGTCCTGGGCCGCGTCGGCGTGGCCGTCCCX3CAGCCIACTCAGTGAAGACTTG 



GAGGCTCCCGCAACAGGACTCCGGTAC^TGATGGGGGCTGGCCCCGGACATGCAGTCTAGGGCCAC^ 
CATGCTGCGCAGTGAGCAGACGGCCCTCCAATCCT 
2 5 TGGGGAGACAGAAGGCGAAGCGGACAGGTCCCA^ 

GCAGCAGCTGCTCCCACGCTGCATGCTGGCCGCTCGACCACCCGCAAGCA 

HUMAN SEQUENCE - mRNA 

ATGCTCAAGCGCTGCG^CCGGCGCCTGCTGCTGGCGCTGGCGGGCGCGC^ 
30 GCTOCerCC^CTGCCCGCCGAGCGCGGCCGGCGCGCGCTGCGCAGCCTGGCGGGCCCCGCGGGGGCTGCCCCGGCGCCCGGGCTGG 



GATGCGGGCCCGCCGCCCGGGGCTGCCCCCCGCCCCGCCGACGGCCACCCGCGCCCCCTGGCCGAGCCGCTCGCGCCCCGAGACGT 

CTTCATCGCTGTC!AAGACCACCAAAAAGTTCCACCGCGCGCGCCT CGACCTGCTGCTGGAGACCTGGATCTCGCG CCACAAGGAGA 

TGACGTTCATCTTCACTGACGGGGAAGATGAGGCCCTGGC CAGG CAC^CGGGCAACGTGGTCATCACIAAACTGCTCGGCCGCCCAC 
35 AGCCGCCAGGCGCTGTCCTGCAAGATGGCCGTGGAGTATGA 

TGACAACTACGTGAACCTGCGGGCCCTGCTGCGGCTGCTGG 

TGGACAGGCCCATCCAGGCCATGGAGCGGGTCAGCGAGAAC3UVGGTGCG 

TTCTGCATCAGCCCTGGGCTGGCTCTGAAGATGAGCCCGTGGGCCAG^^ 

GCCTGATGACTGCACCATCGGCTACATCGTGGAGGCCCT 
4 0 AGAACCTGCAGCAGGTGCCCACCTCAGAGCTCCACGAGCAGGTGACGCT 

CACGTGAAGGGGCCCTTCTCX3GTGGAGGCCGACCCATCCAGGTTCCGCT 

TCCCCGCACTGCCATCTTCTAG 

HUMAN SEQUENCE - CODING 
45 ATGCTCAAGCGCTGCGGCCGGCGCCTGCTGCTGGCGCTGGCGGGCGCGCTGCTCGCCTGCCTGCTGG 

GGGCGGCGGCGGCGGCGCCCGGGGCGCTGGTCCGCGACGTGCACAGTCTrGTCCGAG^ 

GATGCGGGCCCGCCGCCCGGGGCTGCCCCCCGCCCCGCCGACGGCCACC(^CGCCCCCTGGCCGAGCCGCTCGCGCCCCGAGACGT 
CITCATCGCTGTCJAAGACCACGAAAAAGTTCCACCGCGCGCGCCTCGAC CTGCTG CTGGAGACOTGGATCTCGCGCCACfiAGGAG A 
50 TGACGTTCATCTTCACTGACGGGGAAGATGAGGCCCTGGCCIAGGCACACGG^ 
AGCCGCCAGGCX3CTGTCCTGCAAGATGGCCGTGGAGTATGACCGC!TTC^^ 
TGACAACTACGTCAACCTGCGGGCCCTGCTGCXSGCTGCTGGCCAGCTACC^ 
TGGACAGGCCCATCCAGGCCATGGAGCGGGTCAGCGAGAACAAG 

TTCTGCATC^GCC^TGGGCTGGCTCTGAAGATGAGCCCGTGGGCCAGCGGGGGTCACTTC^ 
55 GCCTGATGACTGCACCATCGGCTACATCGTGGAGGCCCTGCTGGGTGTGCCCC^ 

AGAACCTG CAGCAGGTGCCCAC CTCAGAGCTCCACGAGCAGGTGACGCTGAGCTACGGTATGTTTGAAAACAAGCGGAACGCCGTC 

ClACGTGAAGGGGCCCrrCTCGGTGGAGGCCGACCCATCCAGGTTCCGCTCC^ 

TCCCCGCACTGCCATCTTCTAG 

60 
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CLAIMS 

We claim: 



5 1. A recombinant nucleic acid comprising a nucleotide sequence selected from the 
group consisting of the sequences outlined in Tables 1-112. 

2. A host cell comprising the recombinant nucleic acid of claim 1 . 



10 3. An expression vector comprising the recombinant nucleic acid according to claim 2. 

4. A host cell comprising the expression vector of claim 3. 

5. A recombinant protein comprising an amino acid sequence encoded by a nucleic acid 
is sequence comprising a sequence selected from the group consisting of the sequences 

outlined in Tables 1-112. 



6. A method of screening drug candidates comprising: 

a) providing a cell that expresses a carcinoma associated (CA) gene comprising a 

20 nucleic acid sequence selected from the group consisting of the sequences outlined in Tables 
1-1 12 or fragment thereof; 

b) adding a drug candidate to said cell; and 

c) determining the effect of said drug candidate on the expression of said CA gene. 

25 7. A method according to claim 6 wherein said determining comprises comparing the 
level of expression in the absence of said drug candidate to the level of expression in the 
presence of said drug candidate. 

8. A method of screening for a bioactive agent capable of binding to an CA protein 
30 (CAP), wherein said CAP is encoded by a nucleic acid comprising a nucleic acid sequence 

selected from the group consisting of the sequences outlined in Tables 1-112, said method 
comprising: 

a) combining said CAP and a candidate bioactive agent; and 

b) determining the binding of said candidate agent to said CAP. 

35 

9. A method for screening for a bioactive agent capable of modulating the activity of an 
CA protein (CAP), wherein said CAP is encoded by a nucleic acid comprising a nucleic acid 
sequence selected from the group consisting of the sequences outlined in Tables 1-112, said 
method comprising: 
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a) combining said CAP and a candidate bioactive agent; and 

b) determining the effect of said candidate agent on the bioactivity of said 
CAP. 

5 10. A method of evaluating the effect of a candidate carcinoma drug comprising: 

a) administering said drug to a patient; 

b) removing a cell sample from said patient; and 

c) determining alterations in the expression or activation of a gene comprising a 
nucleic acid sequence selected from the group consisting of the sequences outlined in Tables 

10 1-112. 

11. A method of diagnosing carcinoma comprising: 

a) determining the expression of one or more genes comprising a nucleic acid 
sequence selected from the group consisting of the sequences outlined in Tables 1-1 12, in a 
first tissue type of a first individual; and 
15 b) comparing said expression of said gene(s) from a second normal tissue type from 

said first individual or a second unaffected individual; 

wherein a difference in said expression indicates that the first individual has carcinoma. 

12. A method for inhibiting the activity of a CA protein (CAP), wherein said CAP is 
20 encoded by a nucleic acid comprising a nucleic acid sequence selected from the group 

consisting of the sequences outlined in Tables 1-112, said method comprising binding an 
inhibitor to said CAP. 

13. A method of treating carcinomas comprising administering to a patient an inhibitor of 
25 an CA protein (CAP), wherein said CAP is encoded by a nucleic acid comprising a nucleic 

acid sequence selected from the group consisting of the sequences outlined in Tables 1-112. 

14. A method of neutralizing the effect of an CA protein (CAP), wherein said CAP is 
encoded by a nucleic acid comprising a nucleic acid sequence selected from the group 

3 o consisting of the sequences outlined in Tables 1-112, comprising contacting an agent specific 
for said CAP protein with said CAP protein in an amount sufficient to effect neutralization. 

15. A polypeptide which specifically binds to a protein encoded by a nucleic acid 
comprising a nucleic acid selected from the group consisting of the sequences outlined in 

35 Tables 1-112. 

16. A polypeptide according to claim 15 comprising an antibody which specifically binds 
to a protein encoded by a nucleic acid comprising a nucleic acid sequence selected from the 
group consisting of the sequences outlined in Tables 1-112. 
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17. A biochip comprising one or more nucleic acid segments selected from the group 
consisting of a nucleic acid of the sequences outlined in Tables 1-1 12 or fragments thereof. 

18. A method of diagnosing carcinoma or a propensity to carcinoma by sequencing at 
least one CA gene of an individual. 

19. A method of determining CA gene copy number comprising adding an CA gene probe 
to a sample of genomic DNA from an individual under conditions suitable for hybridization. 
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Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-19 (partially): Invention 1 

A nucleic acid comprising the nucleotide sequence SEQ ID 
N0:1 submitted in electronic form; other subject-matter 
refer ing to said nucleic acid. 



2. Claims: 1-19 (partially): Inventions 2-1612 

Idem as subject 1 but limited to each of the nucleic acids 
comprising the nucleotide sequences SEQ ID NO: 2 to SEQ ID 
NO: 1612, submitted in electronic form. 

For the sake of conciseness, the first subject-matter is 
seperately defined, the other subject-matters are defined by 
analogy thereto. 



